WO2021057141A1 - Pipelined instruction reading method and apparatus based on fpga - Google Patents

Pipelined instruction reading method and apparatus based on fpga Download PDF

Info

Publication number
WO2021057141A1
WO2021057141A1 PCT/CN2020/098522 CN2020098522W WO2021057141A1 WO 2021057141 A1 WO2021057141 A1 WO 2021057141A1 CN 2020098522 W CN2020098522 W CN 2020098522W WO 2021057141 A1 WO2021057141 A1 WO 2021057141A1
Authority
WO
WIPO (PCT)
Prior art keywords
fpga
chip
length
operation instruction
code
Prior art date
Application number
PCT/CN2020/098522
Other languages
French (fr)
Chinese (zh)
Inventor
潘国振
魏长征
闫莺
Original Assignee
支付宝(杭州)信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 支付宝(杭州)信息技术有限公司 filed Critical 支付宝(杭州)信息技术有限公司
Publication of WO2021057141A1 publication Critical patent/WO2021057141A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/38Concurrent instruction execution, e.g. pipeline, look ahead
    • G06F9/3867Concurrent instruction execution, e.g. pipeline, look ahead using instruction pipelines

Definitions

  • One or more embodiments of this specification relate to the field of blockchain technology, and in particular to an FPGA-based pipelined instruction reading method and device.
  • Blockchain technology is built on a transmission network (such as a peer-to-peer network).
  • the network nodes in the transmission network use chained data structures to verify and store data, and use distributed node consensus algorithms to generate and update data.
  • TEE Trusted Execution Environment
  • TEE can play the role of a black box in the hardware. Neither the code executed in the TEE nor the data operating system layer can be peeped, and only the pre-defined interface in the code can operate on it.
  • plaintext data is calculated in TEE instead of complex cryptographic operations in homomorphic encryption. There is no loss of efficiency in the calculation process. Therefore, the combination with TEE can achieve less performance loss. Under the premise, the security and privacy of the blockchain are greatly improved. At present, the industry is very concerned about the TEE solution.
  • TEE solutions including TPM (Trusted Platform Module) in software and Intel SGX (Software Guard Extensions) in hardware. , Software Protection Extension), ARM Trustzone (trust zone) and AMD PSP (Platform Security Processor, platform security processor).
  • one or more embodiments of this specification provide a pipelined instruction reading method and device based on FPGA.
  • an FPGA-based pipelined instruction reading method including: FPGA chip
  • the on-chip processor determines the code program to be executed.
  • the on-chip processor is formed by the FPGA chip loading the deployed circuit logic configuration file on the FPGA structure to which it belongs, and the code program corresponds to the area to which the FPGA structure belongs.
  • the smart contract called by the transaction received by the blockchain node; the on-chip processor reads the data contained in the code program according to the preset length and parses out the non-deterministic data contained in the data segment read each time.
  • the end bit of the operation instruction is long, so that the data segment read next time is adjacent to the end bit.
  • an FPGA-based pipelined instruction reading device which includes: a determining unit that enables an on-chip processor on the FPGA chip to determine the code program to be executed, and The on-chip processor is formed by the FPGA chip loading the deployed circuit logic configuration file on the FPGA structure to which the FPGA structure belongs, and the code program corresponds to the smart contract of the transaction call received by the blockchain node to which the FPGA structure belongs; parsing unit , So that the on-chip processor in the process of sequentially reading the data contained in the code program according to the preset length, parses out the end bit of the non-fixed-length operation instruction contained in the data segment read each time, so that the next The data segment read this time is adjacent to the end bit.
  • an electronic device including: a processor; a memory for storing executable instructions of the processor; wherein the processor runs the executable instructions In order to realize the method as described in the first aspect.
  • a computer-readable storage medium on which computer instructions are stored, and when the instructions are executed by a processor, the steps of the method described in the first aspect are implemented.
  • Fig. 1 is a flowchart of an FPGA-based pipelined instruction reading method provided by an exemplary embodiment.
  • Fig. 2 is a schematic structural diagram of a blockchain node provided by an exemplary embodiment.
  • Fig. 3 is a schematic diagram of forming a functional module on an FPGA chip provided by an exemplary embodiment.
  • Fig. 4 is a schematic diagram of reading operation instructions in a pipeline manner according to an exemplary embodiment.
  • Fig. 5 is a block diagram of an FPGA-based pipelined instruction reading device provided by an exemplary embodiment.
  • the steps of the corresponding method are not necessarily executed in the order shown and described in this specification.
  • the method may include more or fewer steps than described in this specification.
  • a single step described in this specification may be decomposed into multiple steps for description in other embodiments; and multiple steps described in this specification may also be combined into a single step in other embodiments. description.
  • Block chains are generally divided into three types: Public Blockchain, Private Blockchain and Consortium Blockchain.
  • the public chain is represented by Bitcoin and Ethereum. Participants who join the public chain can read the data records on the chain, participate in transactions, and compete for the accounting rights of new blocks. Moreover, each participant (ie, node) can freely join and exit the network, and perform related operations.
  • the private chain is the opposite.
  • the write permission of the network is controlled by an organization or institution, and the data read permission is regulated by the organization.
  • the private chain can be a weakly centralized system with strict restrictions and few participating nodes.
  • This type of blockchain is more suitable for internal use by specific institutions.
  • Consortium chain is a block chain between public chain and private chain, which can realize "partial decentralization".
  • Each node in the alliance chain usually has a corresponding entity or organization; participants are authorized to join the network and form a stakeholder alliance to jointly maintain the operation of the blockchain.
  • the nodes in the blockchain network may use a solution that combines the blockchain and the TEE (Trusted Execution Environment).
  • TEE Trusted Execution Environment
  • TEE is a secure extension based on CPU hardware and a trusted execution environment that is completely isolated from the outside.
  • TEE was first proposed by Global Platform to solve the security isolation of resources on mobile devices, and parallel to the operating system to provide a trusted and secure execution environment for applications.
  • ARM's Trust Zone technology is the first to realize the real commercial TEE technology. With the rapid development of the Internet, security requirements are getting higher and higher. Not only mobile devices, cloud devices, and data centers have put forward more demands on TEE.
  • TEE has also been rapidly developed and expanded. Compared with the originally proposed concept, the TEE referred to now is a more generalized TEE.
  • server chip manufacturers Intel and AMD have successively introduced hardware-assisted TEE and enriched the concepts and features of TEE, which has been widely recognized in the industry.
  • the TEE mentioned now usually refers more to this kind of hardware-assisted TEE technology.
  • SGX provides an enclave (also called an enclave), which is an encrypted trusted execution area in the memory, and the CPU protects data from being stolen.
  • enclave also called an enclave
  • the CPU protects data from being stolen.
  • a part of the area EPC Enclave Page Cache, enclave page cache or enclave page cache
  • the encryption engine MEE Memory Encryption Engine
  • the first step in using TEE is to confirm the authenticity of TEE.
  • the related technology provides a remote certification mechanism for the above-mentioned SGX technology to prove that the SGX platform on the target device and the challenger have deployed the same configuration file.
  • the TEE technology in the related technology is implemented by software or a combination of software and hardware, even if the remote attestation method can indicate to a certain extent that the configuration file deployed in the TEE has not been tampered with, the TEE itself depends on the operation The environment cannot be verified.
  • a virtual machine for executing smart contracts needs to be configured in the TEE.
  • the instructions executed by the virtual machine are not directly executed, but actually executed corresponding X86 instructions (Assuming that the target device adopts the X86 architecture), which poses a certain degree of security risk.
  • this specification proposes a hardware TEE technology based on FPGA implementation.
  • FPGA implements hardware TEE by loading circuit logic configuration files. Because the content of the circuit logic configuration file can be checked and verified in advance, and the FPGA is configured and operated completely based on the logic recorded in the circuit logic configuration file, it can be ensured that the hardware TEE implemented by the FPGA has relatively higher security. At the same time, by improving the instruction reading method of the code program in this specification, the execution efficiency of the code program can be improved.
  • Fig. 1 is a flowchart of an FPGA-based pipelined instruction reading method provided by an exemplary embodiment. As shown in FIG. 1, the method is applied to the FPGA structure and may include the following steps 102 to 104.
  • Step 102 The on-chip processor on the FPGA chip determines the code program to be executed.
  • the on-chip processor is formed by the FPGA chip loading the circuit logic configuration file deployed on the FPGA structure to which it belongs, and the code program corresponds to the The smart contract called by the transaction received by the blockchain node to which the FPGA structure belongs.
  • the FPGA chip contains a number of editable hardware logic units. After these hardware logic units are configured via a circuit logic configuration file, they can be implemented as corresponding functional modules to implement corresponding logic functions. Specifically, the circuit logic configuration file can be burned to the FPGA structure based on the form of a bit stream.
  • the above-mentioned on-chip processor is formed by the deployed circuit logic configuration file, and by further deploying other related functional modules, the FPGA structure can be configured as a hardware TEE on the blockchain node. Since these functional modules are completely configured by the circuit logic configuration file, it is possible to determine the logic and other aspects of the information realized by the functional module configured by checking the circuit logic configuration file to ensure that the functional module can be configured according to the complete user’s requirements. Needs to be formed and run.
  • the above-mentioned on-chip processor is used to implement virtual machine logic.
  • the virtual machine logic may include the execution logic of the Ethereum virtual machine or the execution logic of the WASM virtual machine, etc. This specification does not limit this.
  • the circuit logic configuration file can be deployed locally to the FPGA structure.
  • the deployment operation can be implemented in an offline environment to ensure safety.
  • the user can remotely deploy the circuit logic configuration file to the FPGA structure.
  • the FPGA structure can obtain the contract address of the smart contract called by the exchange by parsing the to field of the transaction, and obtain the code program of the corresponding smart contract based on the contract address. If the transaction is encrypted and submitted to the blockchain by the transaction initiator, the FPGA structure needs to decrypt the transaction to read the information in the to field. Wherein, by loading the above-mentioned deployed circuit logic configuration file, a decryption module can be formed on the FPGA chip, so that the transaction can be decrypted by the decryption module.
  • the FPGA structure can maintain a node private key, and the node public key corresponding to the node private key is disclosed. Then, on the one hand, the transaction initiator can obtain the above-mentioned node public key, on the other hand, it can generate a symmetric key by itself, and implement a digital envelope encryption operation on the plaintext transaction content based on the node’s public key and symmetric key: The key encrypts the plaintext transaction content to obtain the ciphertext transaction content, encrypts the symmetric key with the node public key to obtain the ciphertext symmetric key, and the above transaction includes the ciphertext transaction content and the ciphertext symmetric key.
  • the aforementioned decryption module can decrypt the ciphertext symmetric key contained in the exchange based on the node private key to obtain the symmetric key, and then the decryption module can decrypt the ciphertext transaction content based on the symmetric key to obtain the plaintext transaction content , So as to read the information in the to field in the plaintext transaction content, and determine the contract address of the smart contract called by the exchange.
  • the FPGA structure needs to interact with the blockchain node, such as sending the contract address to the blockchain node and receiving the code program returned by the blockchain node. If the code program corresponding to the contract address is deployed in the local space of the FPGA structure, compared to interacting with the blockchain node, the FPGA structure can obtain the code program from the local space to save resource efficiency and shorten the waiting time.
  • the local space may include on-chip storage space formed on the FPGA chip, or external storage space outside the FPGA chip.
  • the external storage space may include an external DDR plugged into the FPGA structure.
  • Step 104 In the process of sequentially reading the data contained in the code program according to the preset length, the on-chip processor parses out the end bit of the non-fixed-length operation instruction contained in the data segment read each time, so that The data segment read next time is adjacent to the end bit.
  • the on-chip processor performs a reading operation on the code program, it always reads data segments of the same length (that is, the aforementioned preset length), so that the on-chip processor can improve the efficiency of data reading.
  • the on-chip processor can read a data segment in each clock cycle, and implement pipelined processing operations for the operation instructions contained in the read data segment, so as to execute a data segment in each clock cycle as much as possible Contains an operation instruction.
  • the on-chip processor cannot read the data segment based on the above-mentioned preset length alone, otherwise the operation instructions may be truncated.
  • the length of the first operation instruction is 3B
  • the length of the second operation instruction is B
  • the length of the third operation instruction is 2B. If the above-mentioned preset length is 2B, the first operation instruction cannot be completely intercepted.
  • the above-mentioned preset length should not be less than the maximum length of a single operation instruction in the code program to ensure that the data segment read each time must contain an operation instruction completely; and, assuming that the maximum length of a single operation instruction is 5B, follow
  • the preset length is 5B to read the data segment
  • the data segment read for the first time contains not only the first operation instruction, but also the second operation instruction and part of the third operation instruction. If the second data segment read operation is performed after the data segment read the second time, the second operation instruction cannot be read, and the third operation instruction cannot be read completely, so that the code program cannot be executed correctly.
  • the on-chip processor analyzes the data segment read each time to determine the end bit of the contained operation instruction, so that the data segment read next time is adjacent to the end bit instead of the last read.
  • the data segments are adjacent. For example, in the above example, although the data segment with a length of 5B is read for the first time, it can be analyzed to determine that the length of the first operation instruction is 3B. Then, when the data segment is read for the second time, it can be ensured from the first operation instruction. Read the data segment with a length of 5B at the beginning and back to ensure that the read data segment contains the second operation instruction; similarly, the second operation instruction is determined by analyzing the data segment read the second time The length of is B.
  • Each operation instruction contained in the code program contains an operation code, which indicates the type of operation to be performed. Further, some operation instructions may include operands, that is, these operation instructions include associated operation codes and operands, and the operands are used as parameters when the corresponding operation codes are executed; among them, the operands contained in the operation instructions can be one or more One, usually one or two. It can be seen that because the operation instructions may or may not include operands, and the number of operands contained is not fixed, the length of different operation instructions is not fixed, thus forming the above-mentioned non-fixed-length operation instructions. In addition, there are other factors that may cause the length of the operation instruction to be non-fixed.
  • each operand is usually fixed, for example, it can be 4B or 8B based on different numeric types.
  • the operand is an encoded operand, such as LEB (Little-Endian Base) encoding is usually used in the wasm bytecode program, the length of the operand after encoding will change, usually 2B or 4B. The maximum possible is 5B.
  • the length of the operation code contained in each operation instruction is fixed, for example, the length of each operation code in the byte code is 1B.
  • the on-chip processor reads the data segment, the first data segment must start from the start address of the first operation instruction, and because the length of the opcode is fixed, the on-chip processor can parse out the non-indication contained in the data segment read for the first time.
  • the operation code of the fixed-length operation instruction and determine whether the operation code has a corresponding operand and the number of corresponding operands according to the analysis result; among them, the on-chip processor determines that the non-fixed-length operation instruction contains an operand based on the operation code.
  • the last bit of the last operand is determined as the end bit of the non-fixed-length operation instruction; and the on-chip processor determines the non-fixed bit based on the opcode.
  • the long operation instruction does not include an operand
  • the last bit of the opcode is used as the end bit of the non-fixed length operation instruction.
  • the on-chip processor can determine the end bit based on the above method to ensure that the data segment read next time must start from the start address of the second operation instruction to ensure that the on-chip processor can successfully parse the second operation instruction.
  • Operation code and determine whether the operation code has a corresponding operand and the number of corresponding operands according to the analysis result, and then determine the end bit of the second operation instruction; and so on.
  • Fig. 2 is a schematic structural diagram of a blockchain node provided by an exemplary embodiment.
  • an FPGA structure can be added to the blockchain node to implement hardware TEE.
  • the FPGA structure can be an FPGA board as shown in FIG. 2.
  • the FPGA board can be connected to the blockchain node through the PCIE interface to realize the data interaction between the FPGA board and the blockchain node.
  • FPGA boards can include FPGA chips, Flash (flash memory) chips, and dense tube chips; of course, in addition to FPGA chips in some embodiments, they may only include parts of the remaining Flash chips and dense tube chips. , Or may contain more structures, here are just examples.
  • no user-defined logic is programmed on the FPGA chip, which is equivalent to the FPGA chip in a blank state.
  • Users can burn circuit logic configuration files on the FPGA chip to form corresponding functions or logic on the FPGA chip.
  • the FPGA board does not have the capability of security protection, so it usually needs to provide an external security environment.
  • users can implement the programming of the circuit logic configuration file in an offline environment to achieve physical security isolation. Instead of implementing remote programming online.
  • the corresponding logic code can be formed through FPGA hardware language, and then the logic code can be mirrored to obtain the above-mentioned circuit logic configuration file.
  • the user can check the above-mentioned logic code. Especially, when multiple users are involved at the same time, multiple users can check the above logic code separately to ensure that the FPGA board can finally meet the needs of all users and prevent security risks, logic errors, fraud and other abnormalities. problem.
  • the user can burn the circuit logic configuration file to the FPGA board in the above-mentioned offline environment.
  • the circuit logic configuration file is transferred from the blockchain node to the FPGA board, and then deployed to the Flash chip as shown in Figure 2, so that even if the FPGA board is powered off, the Flash chip can still save the above-mentioned circuit logic. Configuration file.
  • Fig. 3 is a schematic diagram of forming a functional module on an FPGA chip provided by an exemplary embodiment.
  • the hardware logic unit contained in the FPGA chip can be configured to form corresponding functional modules on the FPGA chip.
  • the formed functional modules can include such Figure 3 shows the on-chip cache module, plaintext calculation module, key agreement module, decryption verification module, encryption and decryption module, etc.
  • the circuit logic configuration file can also be used to transmit the information that needs to be stored to the FPGA board.
  • the preset certificate can be stored on the FPGA chip, and the authentication root key can be stored in the secret tube chip (the authentication root key can also be Stored on the FPGA chip) and so on.
  • the FPGA board can realize remote key agreement with the user.
  • the key agreement process can use related technologies. Any algorithm or standard can be implemented, and this specification does not limit it.
  • the key agreement process can include: the user can generate a key Ka-1 at the local client, the key agreement module can generate a key Kb-1 locally, and the client can generate a key Kb-1 based on the key Ka- 1 Calculate the key agreement information Ka-2, the key agreement module can calculate the key agreement information Kb-2 based on the key Kb-1, and then the client sends the key agreement information Ka-2 to the key agreement module, The key agreement module sends the key agreement information Kb-2 to the client, so that the client can generate a secret value based on the key Ka-1 and the key agreement information Kb-2, and the key agreement module can be based on the key Kb -1 generates the same secret value as the key agreement information Ka-2, and finally the client and the key agreement module respectively derive the same
  • the key agreement information Ka-2 and key agreement information Kb-2 are transmitted between the client and the key agreement module via the blockchain node
  • the key Ka-1 is controlled by the client
  • the key Kb-1 is controlled by the key agreement module, so it can ensure that the blockchain node cannot know the final secret value and the configuration file deployment key, so as to avoid possible security risks.
  • the secret value is also used to derive the business secret deployment key; for example, the secret value can be derived as a 32-bit value, the first 16 bits can be used as the configuration file deployment key, and the last 16 bits can be used as the business secret deployment Key.
  • the user can deploy the service key to the FPGA board through the service secret deployment key.
  • the service key may include the node private key and the service root key.
  • the user can use the business secret deployment key on the client to sign, encrypt the node private key or the business root key, and send it to the FPGA board, so that after the FPGA board is decrypted and verified through the decryption verification module, Deploy the obtained node private key or service root key.
  • the FPGA board can be implemented as a TEE on the blockchain node to meet privacy requirements. For example, when a blockchain node receives a transaction, if the transaction is a plaintext transaction, the blockchain node can directly process the plaintext transaction, if the transaction is a private transaction, the blockchain node transmits the private transaction to the FPGA The board is processed.
  • the transaction content of a plaintext transaction is in plaintext form, and the contract status generated after the transaction is executed is also stored in plaintext form.
  • the transaction content of a private transaction is in the form of cipher text, which is obtained by encrypting the content of the transaction in plain text by the transaction initiator, and the contract state generated after the transaction is executed needs to be stored in the form of cipher text to ensure the protection of transaction privacy.
  • the transaction initiator can generate a symmetric key randomly or based on other methods.
  • the business public key corresponding to the above-mentioned business private key is disclosed, then the transaction initiator can perform transaction content in plaintext based on the symmetric key and the business public key.
  • the transaction initiator encrypts the plaintext transaction content with a symmetric key, and encrypts the symmetric key with the business public key.
  • the two parts obtained are included in the above-mentioned private transaction; in other words, the private transaction includes Two parts of content: the content of the transaction in plaintext encrypted with a symmetric key, and the symmetric key encrypted with the business public key.
  • the encryption and decryption module can use the business private key to decrypt the symmetric key encrypted with the business public key to obtain the symmetric key, and then the encryption and decryption module
  • the symmetric key is used to decrypt the plaintext transaction content encrypted with the symmetric key to obtain the plaintext transaction content.
  • Private transactions can be used to deploy smart contracts, then the data field of the plaintext transaction content can contain the contract code of the smart contract to be deployed; or, the privacy transaction can be used to call the smart contract, then the to field of the plaintext transaction content can contain the called The contract address of the smart contract, and the FPGA board can retrieve the corresponding contract code based on the contract address.
  • the on-chip cache module can be used to cache the contract code and/or the contract state involved in the contract code.
  • the FPGA board may have an external DDR, and the contract code and/or contract state involved in the contract code can be stored in the external DDR.
  • the contract code and/or the contract state involved in the contract code can also be stored at the blockchain node.
  • the storage space of the external DDR is often larger or even much larger than the storage space of the on-chip cache module, so the external DDR can help to achieve more data cache.
  • the FPGA board can contain both on-chip cache module and external DDR. For example, the relatively more popular contract code can be cached in the on-chip cache module, and the relatively less popular contract code can be maintained in the external DDR.
  • the on-chip cache module and external DDR can be considered as the local space of the FPGA board, and the amount of resources and time consumed for data interaction with the local space is much less than that between the blockchain nodes
  • the data interaction process helps to improve the execution efficiency of smart contracts.
  • the plaintext calculation module formed on the FPGA chip is used to implement virtual machine logic in related technologies, that is, the plaintext calculation module is equivalent to the "hardware virtual machine" on the FPGA board. Therefore, after the contract code is determined based on the foregoing plaintext transaction content, the contract code can be passed into the plaintext calculation module, so that the plaintext calculation module executes the contract code.
  • the plaintext calculation module is equivalent to the on-chip processor formed on the FPGA chip in this specification.
  • FIG. 4 is a schematic diagram of reading operation instructions in a pipeline manner according to an exemplary embodiment.
  • the code program of the contract code includes several operation instructions
  • the first operation instruction includes operation code P1 and operand Q1
  • the second operation instruction includes operation code P2 and operands Q21, Q22
  • the third operation instruction includes operation code P2 and operands Q21 and Q22.
  • One operation instruction includes operation code P3 and operands Q31, Q32
  • the fourth operation instruction includes operation code P4
  • the fifth operation instruction includes operation code P5 and so on.
  • the plaintext calculation module can be set to perform a reading operation in each clock cycle, and each time a data segment with a length of 88b is read. For example, as shown in Figure 4, the plaintext calculation module reads a data segment with a length of 88b in the first clock cycle C1. Since this data segment is the first data segment, the data segment must start with an opcode, making the plaintext calculation module You can directly read the data of the first Byte of the data segment, that is, the operation code P1, and analyze and determine the operand corresponding to the operation code P1.
  • the plaintext calculation module can determine based on the analysis result: there is one operand in the opcode P1, that is, the above-mentioned Q1, and the length of the operand is 2B; and the plaintext calculation module can determine the location of the data segment read by the clock cycle C1 based on this In the second clock cycle C2, the plaintext calculation module can read a data segment with a length of 88b starting from the next bit of the end bit.
  • the plaintext calculation module can analyze the data segment read by clock cycle C2 in a similar manner to the above, and determine that the opcode P2 contained in the data segment has two operands with a length of 2B, that is, the aforementioned operand Q21. , Q22, which determines the end bit of the operation instruction contained in the data segment read in clock cycle C2, and then in the third clock cycle C3, the plaintext calculation module can read the length from the next bit of the end bit The data segment of 88b.
  • the plaintext calculation module can accurately analyze the end bit of the operation instruction contained in the read data segment, the data segment in the code program can be sequentially intercepted according to a fixed length in each clock cycle, which is helpful to improve the code
  • the read efficiency of the program speeds up the execution of the smart contract.
  • the user may want to update the version of the circuit logic configuration file deployed on the FPGA board.
  • the authentication root key contained in the circuit logic configuration file may be known by risky users, or the user wants to update the version on the FPGA board.
  • the deployed functional modules are upgraded, etc. This manual does not limit this.
  • the circuit logic configuration file that has been deployed in the above process can be referred to as the old version of the circuit logic configuration file, and the circuit logic configuration file that needs to be deployed is referred to as the new version of the circuit logic configuration file.
  • the user can generate a new version of the circuit logic configuration file through the process of writing code and mirroring. Further, the user can sign the new version of the circuit logic configuration file with his own private key, and then encrypt the signed new version of the circuit logic configuration file with the configuration file deployment key negotiated above to obtain the encrypted new version of the circuit Logical configuration file. In some cases, there may be multiple users at the same time, so the old version of the circuit logic configuration file needs to deploy the preset certificates corresponding to these users to the FPGA board, and these users need to use their own private keys to pair the new version of the circuit. Sign the logical configuration file.
  • the user can remotely send the encrypted new version of the circuit logic configuration file to the blockchain node through the client, and the blockchain node will further transfer it to the FPGA board.
  • the decryption verification module formed on the FPGA chip in the foregoing process is located on the transmission path between the PCIE interface and the Flash chip, so that the encrypted new version of the circuit logic configuration file must first be successfully processed by the decryption verification module before it can be
  • the Flash chip is passed in to achieve a credible update, and the Flash chip cannot be updated directly without bypassing the process of decryption and verification.
  • the decryption verification module After the decryption verification module receives the encrypted new version of the circuit logic configuration file, it first decrypts it with the configuration file deployment key deployed on the FPGA board. If the decryption is successful, the decryption verification module is further based on the preset certificate deployed on the FPGA chip , To perform signature verification on the decrypted new version of the circuit logic configuration file.
  • the decryption and signature verification module will trigger the termination of the update operation; and if the decryption is successful and the signature verification is passed, you can It is determined that the obtained new version of the circuit logic configuration file is from the aforementioned user and has not been tampered with during the transmission process.
  • the new version of the circuit logic configuration file can be further transmitted to the Flash chip to update and deploy the old version of the circuit logic configuration file in the Flash chip.
  • the above-mentioned plaintext calculation module, on-chip cache module, key agreement module, encryption and decryption module, decryption verification module, and storage in the FPGA chip can also be formed on the FPGA chip. Enter the preset certificate, and store the authentication root key to the secret management chip and other information.
  • the formed plaintext calculation module, on-chip cache module, key agreement module, encryption/decryption module, decryption and signature verification module, etc., the implemented functional logic can be changed and upgraded, and stored in the deployed preset certificate, authentication root Information such as keys may also be different from the information before the update.
  • the FPGA board can remotely negotiate with the user to obtain a new configuration file deployment key based on the updated key agreement module, authentication root key, etc., and the configuration file deployment key can be used for the next renewal Update process. Similarly, a reliable update operation for FPGA boards can be continuously implemented accordingly.
  • the FPGA board can generate certification results for the new version of the circuit logic configuration file.
  • the above-mentioned key agreement module can calculate the hash value of the new version of the circuit logic configuration file and the hash value of the configuration file deployment key negotiated based on the new version of the circuit logic configuration file through an algorithm such as sm3 or other algorithms.
  • the calculation result can be used as the above-mentioned authentication result, and the key agreement module sends the authentication result to the user.
  • the user can verify the authentication result on the client based on the maintained new version of the circuit logic configuration file and the configuration file deployment key negotiated accordingly. If the verification is successful, it indicates that the new version of the circuit logic configuration file is successful on the FPGA board. Deployed, and the user and the FPGA board successfully negotiated accordingly to obtain a consistent configuration file deployment key, thereby confirming the successful completion of the circuit logic configuration file update deployment.
  • Fig. 5 is a schematic structural diagram of an FPGA-based pipelined instruction reading device provided by an exemplary embodiment.
  • the terminal interaction device may include: a determining unit 501, which enables the on-chip processor on the FPGA chip to determine the code program to be executed, and the on-chip processor is loaded by the FPGA chip to the FPGA.
  • the circuit logic configuration file that has been deployed structurally is formed, and the code program corresponds to the smart contract called by the transaction received by the blockchain node to which the FPGA structure belongs; the parsing unit 502 enables the on-chip processor to execute In the process of reading the data contained in the code program by length, the end bit of the non-fixed-length operation instruction contained in the data segment read each time is parsed, so that the data segment read next time is adjacent to the end Bit.
  • the preset length is not less than the maximum length of a single operation instruction in the code program.
  • the parsing unit 502 is specifically configured to: make the on-chip processor parse out the operation code of the non-fixed-length operation instruction contained in the data segment read each time; make the on-chip processor be based on the When the operation code determines that the non-fixed-length operation instruction contains operands, the last bit of the last operand is determined according to the number of operands contained and the length of each operand as the non-fixed-length operation instruction The end bit of the operation code; if the on-chip processor determines that the non-fixed-length operation instruction does not contain an operand based on the operation code, use the last bit of the operation code as the end of the non-fixed-length operation instruction Bit.
  • the operand is a post-encoded operand after LEB encoding.
  • the reading of the data contained in the code program by the on-chip processor includes: the on-chip processor sequentially reads the data contained in the code program at a frequency of reading once per clock cycle.
  • the code program includes a bytecode program.
  • the bytecode program includes a wasm bytecode program.
  • a typical implementation device is a computer.
  • the specific form of the computer can be a personal computer, a laptop computer, a cellular phone, a camera phone, a smart phone, a personal digital assistant, a media player, a navigation device, an email receiving and sending device, and a game control A console, a tablet computer, a wearable device, or a combination of any of these devices.
  • the computer includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.
  • processors CPU
  • input/output interfaces network interfaces
  • memory volatile and non-volatile memory
  • the memory may include non-permanent memory in a computer readable medium, random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of computer readable media.
  • RAM random access memory
  • ROM read-only memory
  • flash RAM flash memory
  • Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology.
  • the information can be computer-readable instructions, data structures, program modules, or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, disk storage, quantum memory, graphene-based storage media or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices. According to the definition in this article, computer-readable media does not include transitory media, such as modulated data signals and carrier waves.
  • first, second, third, etc. may be used to describe various information in one or more embodiments of this specification, the information should not be limited to these terms. These terms are only used to distinguish the same type of information from each other.
  • first information may also be referred to as second information, and similarly, the second information may also be referred to as first information.
  • word “if” as used herein can be interpreted as "when” or “when” or "in response to determination”.

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Storage Device Security (AREA)

Abstract

A pipelined instruction reading method and apparatus based on an FPGA. The method may comprise: an on-chip processor on an FPGA chip determining a code program to be executed, wherein the on-chip processor is formed by the FPGA chip loading a circuit logic configuration file that has been deployed on an FPGA structure to which the FPGA chip belongs, and the code program corresponds to a smart contract called by a transaction and received by a blockchain node to which the FPGA structure belongs (102); and during a process in which the on-chip processor sequentially reads, according to a preset length, data contained in the code program, parsing the end bit of a non-fixed-length operation instruction contained in a data segment read each time, so that a data segment read next time is adjacent to the end bit (104).

Description

基于FPGA的流水线式指令读取方法及装置FPGA-based pipelined instruction reading method and device 技术领域Technical field
本说明书一个或多个实施例涉及区块链技术领域,尤其涉及一种基于FPGA的流水线式指令读取方法及装置。One or more embodiments of this specification relate to the field of blockchain technology, and in particular to an FPGA-based pipelined instruction reading method and device.
背景技术Background technique
区块链技术构建在传输网络(例如点对点网络)之上。传输网络中的网络节点利用链式数据结构来验证与存储数据,并采用分布式节点共识算法来生成和更新数据。Blockchain technology is built on a transmission network (such as a peer-to-peer network). The network nodes in the transmission network use chained data structures to verify and store data, and use distributed node consensus algorithms to generate and update data.
目前企业级的区块链平台技术上最大的两个挑战就是隐私和性能,往往这两个挑战很难同时解决。大多解决方案都是通过损失性能换取隐私,或者不大考虑隐私去追求性能。常见的解决隐私问题的加密技术,如同态加密(Homomorphic encryption)和零知识证明(Zero-knowledge proof)等复杂度高,通用性差,而且还可能带来严重的性能损失。At present, the two biggest challenges in enterprise-level blockchain platform technology are privacy and performance. It is often difficult to solve these two challenges at the same time. Most of the solutions are to lose performance in exchange for privacy, or do not consider privacy to pursue performance. Common encryption technologies that solve privacy problems, such as Homomorphic encryption and Zero-knowledge proof, are highly complex, have poor versatility, and may also cause serious performance losses.
可信执行环境(Trusted Execution Environment,TEE)是另一种解决隐私问题的方式。TEE可以起到硬件中的黑箱作用,在TEE中执行的代码和数据操作系统层都无法偷窥,只有代码中预先定义的接口才能对其进行操作。在效率方面,由于TEE的黑箱性质,在TEE中进行运算的是明文数据,而不是同态加密中的复杂密码学运算,计算过程效率没有损失,因此与TEE相结合可以在性能损失较小的前提下很大程度上提升区块链的安全性和隐私性。目前工业界十分关注TEE的方案,几乎所有主流的芯片和软件联盟都有自己的TEE解决方案,包括软件方面的TPM(Trusted Platform Module,可信赖平台模块)以及硬件方面的Intel SGX(Software Guard Extensions,软件保护扩展)、ARM Trustzone(信任区)和AMD PSP(Platform Security Processor,平台安全处理器)。Trusted Execution Environment (TEE) is another way to solve privacy issues. TEE can play the role of a black box in the hardware. Neither the code executed in the TEE nor the data operating system layer can be peeped, and only the pre-defined interface in the code can operate on it. In terms of efficiency, due to the black box nature of TEE, plaintext data is calculated in TEE instead of complex cryptographic operations in homomorphic encryption. There is no loss of efficiency in the calculation process. Therefore, the combination with TEE can achieve less performance loss. Under the premise, the security and privacy of the blockchain are greatly improved. At present, the industry is very concerned about the TEE solution. Almost all mainstream chip and software alliances have their own TEE solutions, including TPM (Trusted Platform Module) in software and Intel SGX (Software Guard Extensions) in hardware. , Software Protection Extension), ARM Trustzone (trust zone) and AMD PSP (Platform Security Processor, platform security processor).
发明内容Summary of the invention
有鉴于此,本说明书一个或多个实施例提供一种基于FPGA的流水线式指令读取方法及装置。In view of this, one or more embodiments of this specification provide a pipelined instruction reading method and device based on FPGA.
为实现上述目的,本说明书一个或多个实施例提供技术方案如下:根据本说明书一个或多个实施例的第一方面,提出了一种基于FPGA的流水线式指令读取方法,包括: FPGA芯片上的片上处理器确定待执行的代码程序,所述片上处理器由所述FPGA芯片加载所属FPGA结构上已部署的电路逻辑配置文件而形成,所述代码程序对应于所述FPGA结构所属的区块链节点收到的交易调用的智能合约;所述片上处理器在按照预设长度依次读取所述代码程序所含数据的过程中,解析出每次读取的数据段中所含非定长操作指令的结束位,以使下次读取的数据段相邻于所述结束位。In order to achieve the foregoing objectives, one or more embodiments of this specification provide technical solutions as follows: According to the first aspect of one or more embodiments of this specification, an FPGA-based pipelined instruction reading method is proposed, including: FPGA chip The on-chip processor determines the code program to be executed. The on-chip processor is formed by the FPGA chip loading the deployed circuit logic configuration file on the FPGA structure to which it belongs, and the code program corresponds to the area to which the FPGA structure belongs. The smart contract called by the transaction received by the blockchain node; the on-chip processor reads the data contained in the code program according to the preset length and parses out the non-deterministic data contained in the data segment read each time. The end bit of the operation instruction is long, so that the data segment read next time is adjacent to the end bit.
根据本说明书一个或多个实施例的第二方面,提出了一种基于FPGA的流水线式指令读取装置,包括:确定单元,使FPGA芯片上的片上处理器确定待执行的代码程序,所述片上处理器由所述FPGA芯片加载所属FPGA结构上已部署的电路逻辑配置文件而形成,所述代码程序对应于所述FPGA结构所属的区块链节点收到的交易调用的智能合约;解析单元,使所述片上处理器在按照预设长度依次读取所述代码程序所含数据的过程中,解析出每次读取的数据段中所含非定长操作指令的结束位,以使下次读取的数据段相邻于所述结束位。According to the second aspect of one or more embodiments of this specification, an FPGA-based pipelined instruction reading device is proposed, which includes: a determining unit that enables an on-chip processor on the FPGA chip to determine the code program to be executed, and The on-chip processor is formed by the FPGA chip loading the deployed circuit logic configuration file on the FPGA structure to which the FPGA structure belongs, and the code program corresponds to the smart contract of the transaction call received by the blockchain node to which the FPGA structure belongs; parsing unit , So that the on-chip processor in the process of sequentially reading the data contained in the code program according to the preset length, parses out the end bit of the non-fixed-length operation instruction contained in the data segment read each time, so that the next The data segment read this time is adjacent to the end bit.
根据本说明书一个或多个实施例的第三方面,提出了一种电子设备,包括:处理器;用于存储处理器可执行指令的存储器;其中,所述处理器通过运行所述可执行指令以实现如第一方面所述的方法。According to a third aspect of one or more embodiments of this specification, an electronic device is proposed, including: a processor; a memory for storing executable instructions of the processor; wherein the processor runs the executable instructions In order to realize the method as described in the first aspect.
根据本说明书一个或多个实施例的第四方面,提出了一种计算机可读存储介质,其上存储有计算机指令,该指令被处理器执行时实现如第一方面所述方法的步骤。According to the fourth aspect of one or more embodiments of the present specification, a computer-readable storage medium is provided, on which computer instructions are stored, and when the instructions are executed by a processor, the steps of the method described in the first aspect are implemented.
附图说明Description of the drawings
图1是一示例性实施例提供的一种基于FPGA的流水线式指令读取方法的流程图。Fig. 1 is a flowchart of an FPGA-based pipelined instruction reading method provided by an exemplary embodiment.
图2是一示例性实施例提供的一种区块链节点的结构示意图。Fig. 2 is a schematic structural diagram of a blockchain node provided by an exemplary embodiment.
图3是一示例性实施例提供的一种在FPGA芯片上形成功能模块的示意图。Fig. 3 is a schematic diagram of forming a functional module on an FPGA chip provided by an exemplary embodiment.
图4是一示例性实施例提供的一种采用流水线的方式读取操作指令的示意图。Fig. 4 is a schematic diagram of reading operation instructions in a pipeline manner according to an exemplary embodiment.
图5是一示例性实施例提供的一种基于FPGA的流水线式指令读取装置的框图。Fig. 5 is a block diagram of an FPGA-based pipelined instruction reading device provided by an exemplary embodiment.
具体实施方式detailed description
这里将详细地对示例性实施例进行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本说明书一个或多个实施例相一致的所有实施方 式。相反,它们仅是与如所附权利要求书中所详述的、本说明书一个或多个实施例的一些方面相一致的装置和方法的例子。The exemplary embodiments will be described in detail here, and examples thereof are shown in the accompanying drawings. When the following description refers to the drawings, unless otherwise indicated, the same numbers in different drawings indicate the same or similar elements. The implementation manners described in the following exemplary embodiments do not represent all implementation manners consistent with one or more embodiments of this specification. Rather, they are merely examples of devices and methods consistent with some aspects of one or more embodiments of this specification as detailed in the appended claims.
需要说明的是:在其他实施例中并不一定按照本说明书示出和描述的顺序来执行相应方法的步骤。在一些其他实施例中,其方法所包括的步骤可以比本说明书所描述的更多或更少。此外,本说明书中所描述的单个步骤,在其他实施例中可能被分解为多个步骤进行描述;而本说明书中所描述的多个步骤,在其他实施例中也可能被合并为单个步骤进行描述。It should be noted that in other embodiments, the steps of the corresponding method are not necessarily executed in the order shown and described in this specification. In some other embodiments, the method may include more or fewer steps than described in this specification. In addition, a single step described in this specification may be decomposed into multiple steps for description in other embodiments; and multiple steps described in this specification may also be combined into a single step in other embodiments. description.
区块链一般被划分为三种类型:公有链(Public Blockchain),私有链(Private Blockchain)和联盟链(Consortium Blockchain)。此外,还有多种类型的结合,比如私有链+联盟链、联盟链+公有链等不同组合形式。其中去中心化程度最高的是公有链。公有链以比特币、以太坊为代表,加入公有链的参与者可以读取链上的数据记录、参与交易以及竞争新区块的记账权等。而且,各参与者(即节点)可自由加入以及退出网络,并进行相关操作。私有链则相反,该网络的写入权限由某个组织或者机构控制,数据读取权限受组织规定。简单来说,私有链可以为一个弱中心化系统,参与节点具有严格限制且少。这种类型的区块链更适合于特定机构内部使用。联盟链则是介于公有链以及私有链之间的区块链,可实现“部分去中心化”。联盟链中各个节点通常有与之相对应的实体机构或者组织;参与者通过授权加入网络并组成利益相关联盟,共同维护区块链运行。Block chains are generally divided into three types: Public Blockchain, Private Blockchain and Consortium Blockchain. In addition, there are many types of combinations, such as private chain + alliance chain, alliance chain + public chain and other different combinations. Among them, the most decentralized one is the public chain. The public chain is represented by Bitcoin and Ethereum. Participants who join the public chain can read the data records on the chain, participate in transactions, and compete for the accounting rights of new blocks. Moreover, each participant (ie, node) can freely join and exit the network, and perform related operations. The private chain is the opposite. The write permission of the network is controlled by an organization or institution, and the data read permission is regulated by the organization. In simple terms, the private chain can be a weakly centralized system with strict restrictions and few participating nodes. This type of blockchain is more suitable for internal use by specific institutions. Consortium chain is a block chain between public chain and private chain, which can realize "partial decentralization". Each node in the alliance chain usually has a corresponding entity or organization; participants are authorized to join the network and form a stakeholder alliance to jointly maintain the operation of the blockchain.
不论是公有链、私有链还是联盟链,区块链网络中的节点出于隐私保护的目的,均可能通过区块链与TEE(Trusted Execution Environment,可信执行环境)相结合的解决方案,在TEE内执行收到的交易。TEE是基于CPU硬件的安全扩展,且与外部完全隔离的可信执行环境。TEE最早是由Global Platform提出的概念,用于解决移动设备上资源的安全隔离,平行于操作系统为应用程序提供可信安全的执行环境。ARM的Trust Zone技术最早实现了真正商用的TEE技术。伴随着互联网的高速发展,安全的需求越来越高,不仅限于移动设备,云端设备,数据中心都对TEE提出了更多的需求。TEE的概念也得到了高速的发展和扩充。现在所说的TEE相比与最初提出的概念已经是更加广义的TEE。例如,服务器芯片厂商Intel,AMD等都先后推出了硬件辅助的TEE并丰富了TEE的概念和特性,在工业界得到了广泛的认可。现在提起的TEE通常更多指这类硬件辅助的TEE技术。Regardless of whether it is a public chain, a private chain or a consortium chain, for the purpose of privacy protection, the nodes in the blockchain network may use a solution that combines the blockchain and the TEE (Trusted Execution Environment). Execute received transactions within TEE. TEE is a secure extension based on CPU hardware and a trusted execution environment that is completely isolated from the outside. TEE was first proposed by Global Platform to solve the security isolation of resources on mobile devices, and parallel to the operating system to provide a trusted and secure execution environment for applications. ARM's Trust Zone technology is the first to realize the real commercial TEE technology. With the rapid development of the Internet, security requirements are getting higher and higher. Not only mobile devices, cloud devices, and data centers have put forward more demands on TEE. The concept of TEE has also been rapidly developed and expanded. Compared with the originally proposed concept, the TEE referred to now is a more generalized TEE. For example, server chip manufacturers Intel and AMD have successively introduced hardware-assisted TEE and enriched the concepts and features of TEE, which has been widely recognized in the industry. The TEE mentioned now usually refers more to this kind of hardware-assisted TEE technology.
以Intel SGX技术为例,SGX提供了围圈(enclave,也称为飞地),即内存中一个 加密的可信执行区域,由CPU保护数据不被窃取。以第一区块链节点采用支持SGX的CPU为例,利用新增的处理器指令,在内存中可以分配一部分区域EPC(Enclave Page Cache,围圈页面缓存或飞地页面缓存),通过CPU内的加密引擎MEE(Memory Encryption Engine)对其中的数据进行加密。EPC中加密的内容只有进入CPU后才会被解密成明文。因此,在SGX中,用户可以不信任操作系统、VMM(Virtual Machine Monitor,虚拟机监控器)、甚至BIOS(Basic Input Output System,基本输入输出系统),只需要信任CPU便能确保隐私数据不会泄漏。因此,围圈就相当于SGX技术下产生的TEE。Taking Intel SGX technology as an example, SGX provides an enclave (also called an enclave), which is an encrypted trusted execution area in the memory, and the CPU protects data from being stolen. Taking the first blockchain node using a CPU that supports SGX as an example, using the newly added processor instructions, a part of the area EPC (Enclave Page Cache, enclave page cache or enclave page cache) can be allocated in the memory, and through the CPU The encryption engine MEE (Memory Encryption Engine) encrypts the data in it. The encrypted content in EPC will be decrypted into plain text only after entering the CPU. Therefore, in SGX, users can distrust the operating system, VMM (Virtual Machine Monitor), and even BIOS (Basic Input Output System). They only need to trust the CPU to ensure that private data will not leakage. Therefore, the enclosure is equivalent to the TEE produced under SGX technology.
不同于移动端,云端访问需要远程访问,终端用户对硬件平台不可见,因此使用TEE的第一步就是要确认TEE的真实可信。例如,相关技术中提供了针对上述SGX技术的远程证明机制,以用于证明目标设备上的SGX平台与挑战方部署了相同的配置文件。但是,由于相关技术中的TEE技术是以软件或软硬件结合的方式实现,使得即便通过远程证明方式可以在一定程度上表明TEE内所部署的配置文件未经篡改,但是TEE本身所依托的运行环境却无法被验证。例如,在需要实现隐私功能的区块链节点上,TEE内需要配置用于执行智能合约的虚拟机,该虚拟机所执行的指令并非直接执行,而是实际上执行了对应的若干条X86指令(假定目标设备采用X86架构),从而造成了一定程度上的安全性风险。Different from the mobile terminal, cloud access requires remote access, and the end user is invisible to the hardware platform. Therefore, the first step in using TEE is to confirm the authenticity of TEE. For example, the related technology provides a remote certification mechanism for the above-mentioned SGX technology to prove that the SGX platform on the target device and the challenger have deployed the same configuration file. However, because the TEE technology in the related technology is implemented by software or a combination of software and hardware, even if the remote attestation method can indicate to a certain extent that the configuration file deployed in the TEE has not been tampered with, the TEE itself depends on the operation The environment cannot be verified. For example, on a blockchain node that needs to implement privacy functions, a virtual machine for executing smart contracts needs to be configured in the TEE. The instructions executed by the virtual machine are not directly executed, but actually executed corresponding X86 instructions (Assuming that the target device adopts the X86 architecture), which poses a certain degree of security risk.
为此,本说明书提出了一种基于FPGA实现的硬件TEE技术,FPGA通过加载电路逻辑配置文件而实现硬件TEE。由于电路逻辑配置文件的内容可以被预先查看与检验,并且FPGA完全基于电路逻辑配置文件中记载的逻辑而配置运行,因而可以确保FPGA所实现的硬件TEE具有相对更高的安全性。同时,本说明书中通过改进对代码程序的指令读取方式,可以提升针对代码程序的执行效率。To this end, this specification proposes a hardware TEE technology based on FPGA implementation. FPGA implements hardware TEE by loading circuit logic configuration files. Because the content of the circuit logic configuration file can be checked and verified in advance, and the FPGA is configured and operated completely based on the logic recorded in the circuit logic configuration file, it can be ensured that the hardware TEE implemented by the FPGA has relatively higher security. At the same time, by improving the instruction reading method of the code program in this specification, the execution efficiency of the code program can be improved.
以下结合实施例说明本说明书提供的一种基于FPGA的流水线式指令读取方法,以提升代码程序的执行效率。The following describes an FPGA-based pipelined instruction reading method provided in this specification with reference to embodiments, so as to improve the execution efficiency of the code program.
图1是一示例性实施例提供的一种基于FPGA的流水线式指令读取方法的流程图。如图1所示,该方法应用于FPGA结构,可以包括以下步骤102至104。Fig. 1 is a flowchart of an FPGA-based pipelined instruction reading method provided by an exemplary embodiment. As shown in FIG. 1, the method is applied to the FPGA structure and may include the following steps 102 to 104.
步骤102,FPGA芯片上的片上处理器确定待执行的代码程序,所述片上处理器由所述FPGA芯片加载所属FPGA结构上已部署的电路逻辑配置文件而形成,所述代码程序对应于所述FPGA结构所属的区块链节点收到的交易调用的智能合约。Step 102: The on-chip processor on the FPGA chip determines the code program to be executed. The on-chip processor is formed by the FPGA chip loading the circuit logic configuration file deployed on the FPGA structure to which it belongs, and the code program corresponds to the The smart contract called by the transaction received by the blockchain node to which the FPGA structure belongs.
FPGA芯片上包含若干可编辑的硬件逻辑单元,这些硬件逻辑单元经由电路逻辑配置文件进行配置后,可以实现为相应的功能模块,以用于实现相应的逻辑功能。具体的,该电路逻辑配置文件可以基于比特流的形式被烧录至FPGA结构。例如,上述的片上处理器等即为通过已部署的电路逻辑配置文件而形成,而通过进一步部署相关的其他功能模块,可以将FPGA结构配置为区块链节点上的硬件TEE。由于这些功能模块完全由电路逻辑配置文件进行配置而形成,因而通过检查电路逻辑配置文件即可确定由此配置得到的功能模块所实现的逻辑等各方面的信息,确保功能模块能够按照完全用户的需求而形成和运行。其中,上述的片上处理器用于实现虚拟机逻辑,譬如该虚拟机逻辑可以包括以太坊虚拟机的执行逻辑或者WASM虚拟机的执行逻辑等,本说明书并不对此进行限制。The FPGA chip contains a number of editable hardware logic units. After these hardware logic units are configured via a circuit logic configuration file, they can be implemented as corresponding functional modules to implement corresponding logic functions. Specifically, the circuit logic configuration file can be burned to the FPGA structure based on the form of a bit stream. For example, the above-mentioned on-chip processor is formed by the deployed circuit logic configuration file, and by further deploying other related functional modules, the FPGA structure can be configured as a hardware TEE on the blockchain node. Since these functional modules are completely configured by the circuit logic configuration file, it is possible to determine the logic and other aspects of the information realized by the functional module configured by checking the circuit logic configuration file to ensure that the functional module can be configured according to the complete user’s requirements. Needs to be formed and run. The above-mentioned on-chip processor is used to implement virtual machine logic. For example, the virtual machine logic may include the execution logic of the Ethereum virtual machine or the execution logic of the WASM virtual machine, etc. This specification does not limit this.
用户生成电路逻辑配置文件后,如果位于FPGA结构所在地点处,则可以在本地将该电路逻辑配置文件部署至FPGA结构,譬如可以在离线环境下实施部署操作,以确保安全性。或者,在FPGA结构处于线上环境的情况下,用户可以将电路逻辑配置文件远程部署至FPGA结构。After the user generates the circuit logic configuration file, if it is located at the location of the FPGA structure, the circuit logic configuration file can be deployed locally to the FPGA structure. For example, the deployment operation can be implemented in an offline environment to ensure safety. Or, when the FPGA structure is in an online environment, the user can remotely deploy the circuit logic configuration file to the FPGA structure.
FPGA结构通过解析交易的to字段,可以获得该交易所调用的智能合约的合约地址,并基于该合约地址获取相应智能合约的代码程序。如果该交易由交易发起方经过加密后提交至区块链,那么FPGA结构需要对该交易进行解密,以读取to字段的信息。其中,通过加载上述已部署的电路逻辑配置文件,可以在FPGA芯片上形成解密模块,从而通过该解密模块对交易进行解密。The FPGA structure can obtain the contract address of the smart contract called by the exchange by parsing the to field of the transaction, and obtain the code program of the corresponding smart contract based on the contract address. If the transaction is encrypted and submitted to the blockchain by the transaction initiator, the FPGA structure needs to decrypt the transaction to read the information in the to field. Wherein, by loading the above-mentioned deployed circuit logic configuration file, a decryption module can be formed on the FPGA chip, so that the transaction can be decrypted by the decryption module.
例如,FPGA结构可以维护有节点私钥,且该节点私钥对应的节点公钥被公开。那么,交易发起方一方面可以获得上述的节点公钥,另一方面可以自行生成一对称密钥,并基于该节点公钥和对称密钥对明文交易内容实施数字信封形式的加密操作:通过对称密钥对明文交易内容进行加密、得到密文交易内容,通过节点公钥对该对称密钥进行加密、得到密文对称密钥,而上述的交易包括密文交易内容和密文对称密钥。相应地,上述的解密模块可以基于节点私钥对交易所含的密文对称密钥进行解密、得到对称密钥,然后解密模块可以基于对称密钥对密文交易内容进行解密、得到明文交易内容,从而在明文交易内容中读取to字段的信息,确定交易所调用的智能合约的合约地址。For example, the FPGA structure can maintain a node private key, and the node public key corresponding to the node private key is disclosed. Then, on the one hand, the transaction initiator can obtain the above-mentioned node public key, on the other hand, it can generate a symmetric key by itself, and implement a digital envelope encryption operation on the plaintext transaction content based on the node’s public key and symmetric key: The key encrypts the plaintext transaction content to obtain the ciphertext transaction content, encrypts the symmetric key with the node public key to obtain the ciphertext symmetric key, and the above transaction includes the ciphertext transaction content and the ciphertext symmetric key. Correspondingly, the aforementioned decryption module can decrypt the ciphertext symmetric key contained in the exchange based on the node private key to obtain the symmetric key, and then the decryption module can decrypt the ciphertext transaction content based on the symmetric key to obtain the plaintext transaction content , So as to read the information in the to field in the plaintext transaction content, and determine the contract address of the smart contract called by the exchange.
如果合约地址对应的代码程序可以部署在区块链节点处,则FPGA结构需要与区块链节点进行数据交互,比如向区块链节点发送合约地址,并接收区块链节点返回的代码程序。如果合约地址对应的代码程序部署在FPGA结构的本地空间,则相比于与区块 链节点进行交互,FPGA结构可以从本地空间获取代码程序,以节省资源效率、缩短等待时长。其中,本地空间可以包括FPGA芯片上形成的片上存储空间,或者FPGA芯片之外的外部存储空间,比如该外部存储空间可以包括插接在FPGA结构上的外接DDR等。If the code program corresponding to the contract address can be deployed at the blockchain node, the FPGA structure needs to interact with the blockchain node, such as sending the contract address to the blockchain node and receiving the code program returned by the blockchain node. If the code program corresponding to the contract address is deployed in the local space of the FPGA structure, compared to interacting with the blockchain node, the FPGA structure can obtain the code program from the local space to save resource efficiency and shorten the waiting time. The local space may include on-chip storage space formed on the FPGA chip, or external storage space outside the FPGA chip. For example, the external storage space may include an external DDR plugged into the FPGA structure.
步骤104,所述片上处理器在按照预设长度依次读取所述代码程序所含数据的过程中,解析出每次读取的数据段中所含非定长操作指令的结束位,以使下次读取的数据段相邻于所述结束位。Step 104: In the process of sequentially reading the data contained in the code program according to the preset length, the on-chip processor parses out the end bit of the non-fixed-length operation instruction contained in the data segment read each time, so that The data segment read next time is adjacent to the end bit.
片上处理器每次对代码程序实施读取操作时,始终读取相同长度(即上述的预设长度)的数据段,使得片上处理器可以提升数据读取效率。例如,片上处理器可以在每个时钟周期读取一个数据段,并针对所读取的数据段中包含的操作指令实现流水线式的处理操作,从而尽可能地在每一时钟周期执行一个数据段所含的一条操作指令。Each time the on-chip processor performs a reading operation on the code program, it always reads data segments of the same length (that is, the aforementioned preset length), so that the on-chip processor can improve the efficiency of data reading. For example, the on-chip processor can read a data segment in each clock cycle, and implement pipelined processing operations for the operation instructions contained in the read data segment, so as to execute a data segment in each clock cycle as much as possible Contains an operation instruction.
由于代码程序所含的操作指令并非固定长度(即非定长),使得片上处理器无法仅基于上述的预设长度来读取数据段,否则可能导致操作指令被截断。例如,首条操作指令的长度为3B、第二条操作指令的长度为B、第三条操作指令的长度为2B,如果上述的预设长度为2B,则无法完整地截取首条操作指令,因而上述的预设长度应当不小于代码程序中单条操作指令的最大长度,以确保每次读取的数据段必然可以完整地包含一条操作指令;以及,假定单条操作指令的最大长度为5B,按照预设长度为5B读取数据段时,第一次读取的数据段不仅包含了首条操作指令,还包含了第二条操作指令和第三条操作指令的一部分,如果后续直接在第一次读取的数据段之后实施第二次数据段读取操作,则无法读取到第二条操作指令、无法完整地读取到第三条操作指令,从而无法正确的执行代码程序。Since the operation instructions contained in the code program are not of fixed length (ie, non-fixed length), the on-chip processor cannot read the data segment based on the above-mentioned preset length alone, otherwise the operation instructions may be truncated. For example, the length of the first operation instruction is 3B, the length of the second operation instruction is B, and the length of the third operation instruction is 2B. If the above-mentioned preset length is 2B, the first operation instruction cannot be completely intercepted. Therefore, the above-mentioned preset length should not be less than the maximum length of a single operation instruction in the code program to ensure that the data segment read each time must contain an operation instruction completely; and, assuming that the maximum length of a single operation instruction is 5B, follow When the preset length is 5B to read the data segment, the data segment read for the first time contains not only the first operation instruction, but also the second operation instruction and part of the third operation instruction. If the second data segment read operation is performed after the data segment read the second time, the second operation instruction cannot be read, and the third operation instruction cannot be read completely, so that the code program cannot be executed correctly.
因此,片上处理器通过对每次读取的数据段进行解析,确定出所含操作指令的结束位,可使下次读取的数据段与该结束位相邻、而非与上次读取的数据段相邻。例如在上述的示例中,虽然第一次读取了5B长度的数据段,但是可以解析确定首条操作指令的长度为3B,那么第二次读取数据段时可以确保从首条操作指令之后开始、向后读取长度为5B的数据段,以确保所读取的数据段包含第二条操作指令;类似地,通过对第二次读取的数据段进行解析确定出第二条操作指令的长度为B,第三次读取数据段时可以确保从第二条操作指令之后开始、向后读取长度为5B的数据段,以确保所读取的数据段包含第三条操作指令,以此类推。可见,基于上述方案可以在操作指令具有非固定长度的情况下,确保片上处理器每次都可以按照固定的预设长度正确地读取数据段,以 提升对操作指令的读取效率、加快对代码程序的执行效率。Therefore, the on-chip processor analyzes the data segment read each time to determine the end bit of the contained operation instruction, so that the data segment read next time is adjacent to the end bit instead of the last read. The data segments are adjacent. For example, in the above example, although the data segment with a length of 5B is read for the first time, it can be analyzed to determine that the length of the first operation instruction is 3B. Then, when the data segment is read for the second time, it can be ensured from the first operation instruction. Read the data segment with a length of 5B at the beginning and back to ensure that the read data segment contains the second operation instruction; similarly, the second operation instruction is determined by analyzing the data segment read the second time The length of is B. When reading the data segment for the third time, you can ensure that the data segment of length 5B is read from the second operation instruction and backward to ensure that the read data segment contains the third operation instruction. And so on. It can be seen that, based on the above solution, it is possible to ensure that the on-chip processor can correctly read the data segment according to the fixed preset length every time when the operation instruction has a non-fixed length, so as to improve the efficiency of reading the operation instruction and speed up the operation. The execution efficiency of the code program.
代码程序所含的每条操作指令均包含操作码,该操作码指示了所需执行的操作类型。进一步的,部分操作指令可以包含操作数,即这些操作指令包含相关联的操作码和操作数,操作数作为相应操作码执行时的参数;其中,操作指令所含的操作数可以为一个或多个,通常为1个或2个。可见,由于操作指令可能包含或不包含操作数、所含操作数的数量不固定,导致不同操作指令的长度不固定,从而形成了上述的非定长操作指令。此外,还存在其他因素可能导致操作指令的长度非固定。例如,在智能合约所采用的字节码中,每个操作数的长度通常是固定的,譬如基于不同数值类型可以为4B或8B。但是,如果操作数为经过编码的编码后操作数,譬如wasm字节码程序中通常采用LEB(Little-Endian Base)编码等,则编码后操作数的长度会发生变化,通常为2B或4B,最大可能为5B。Each operation instruction contained in the code program contains an operation code, which indicates the type of operation to be performed. Further, some operation instructions may include operands, that is, these operation instructions include associated operation codes and operands, and the operands are used as parameters when the corresponding operation codes are executed; among them, the operands contained in the operation instructions can be one or more One, usually one or two. It can be seen that because the operation instructions may or may not include operands, and the number of operands contained is not fixed, the length of different operation instructions is not fixed, thus forming the above-mentioned non-fixed-length operation instructions. In addition, there are other factors that may cause the length of the operation instruction to be non-fixed. For example, in the bytecode used by smart contracts, the length of each operand is usually fixed, for example, it can be 4B or 8B based on different numeric types. However, if the operand is an encoded operand, such as LEB (Little-Endian Base) encoding is usually used in the wasm bytecode program, the length of the operand after encoding will change, usually 2B or 4B. The maximum possible is 5B.
每条操作指令所含操作码的长度固定,譬如字节码中每个操作码的长度为1B。片上处理器读取数据段时,首个数据段必然从首条操作指令的起始地址开始,并且由于操作码的长度固定,使得片上处理器可以解析出首次读取的数据段中所含非定长操作指令的操作码,并根据解析结果确定该操作码是否存在对应的操作数、所对应的操作数的数量;其中,片上处理器在基于操作码确定非定长操作指令包含操作数的情况下,根据所含操作数的数量和每一操作数的长度,确定最后一个操作数的末位,以作为非定长操作指令的结束位;以及,片上处理器在基于操作码确定非定长操作指令不包含操作数的情况下,将该操作码的末位作为非定长操作指令的结束位。然后,片上处理器可以基于上述方式确定出的结束位,确保下次读取的数据段必然从第二条操作指令的起始地址开始,以确保片上处理器可以成功解析第二条操作指令的操作码,并根据解析结果确定该操作码是否存在对应的操作数、所对应的操作数的数量,进而确定出第二条操作指令的结束位;以此类推。The length of the operation code contained in each operation instruction is fixed, for example, the length of each operation code in the byte code is 1B. When the on-chip processor reads the data segment, the first data segment must start from the start address of the first operation instruction, and because the length of the opcode is fixed, the on-chip processor can parse out the non-indication contained in the data segment read for the first time. The operation code of the fixed-length operation instruction, and determine whether the operation code has a corresponding operand and the number of corresponding operands according to the analysis result; among them, the on-chip processor determines that the non-fixed-length operation instruction contains an operand based on the operation code. In this case, according to the number of operands contained and the length of each operand, the last bit of the last operand is determined as the end bit of the non-fixed-length operation instruction; and the on-chip processor determines the non-fixed bit based on the opcode. When the long operation instruction does not include an operand, the last bit of the opcode is used as the end bit of the non-fixed length operation instruction. Then, the on-chip processor can determine the end bit based on the above method to ensure that the data segment read next time must start from the start address of the second operation instruction to ensure that the on-chip processor can successfully parse the second operation instruction. Operation code, and determine whether the operation code has a corresponding operand and the number of corresponding operands according to the analysis result, and then determine the end bit of the second operation instruction; and so on.
图2是一示例性实施例提供的一种区块链节点的结构示意图。基于本说明书的技术方案,可以在区块链节点上添加FPGA结构以实现硬件TEE,譬如该FPGA结构可以为如图2所示的FPGA板卡。FPGA板卡可以通过PCIE接口连接至区块链节点上,以实现FPGA板卡与区块链节点之间的数据交互。FPGA板卡可以包括FPGA芯片、Flash(闪存)芯片和密管芯片等结构;当然,在一些实施例中除了包含FPGA芯片之外,可能仅包含剩余的Flash芯片和密管芯片等中的部分结构,或者可能包含更多结构,此处仅用于举例。Fig. 2 is a schematic structural diagram of a blockchain node provided by an exemplary embodiment. Based on the technical solution in this specification, an FPGA structure can be added to the blockchain node to implement hardware TEE. For example, the FPGA structure can be an FPGA board as shown in FIG. 2. The FPGA board can be connected to the blockchain node through the PCIE interface to realize the data interaction between the FPGA board and the blockchain node. FPGA boards can include FPGA chips, Flash (flash memory) chips, and dense tube chips; of course, in addition to FPGA chips in some embodiments, they may only include parts of the remaining Flash chips and dense tube chips. , Or may contain more structures, here are just examples.
在初始阶段,FPGA芯片上并未烧录用户定义的任何逻辑,相当于FPGA芯片处于空白状态。用户可以通过向FPGA芯片上烧录电路逻辑配置文件,以在FPGA芯片上形成相应的功能或逻辑。在首次烧录电路逻辑配置文件时,FPGA板卡不具有安全防护的能力,因而通常需要外部提供安全环境,比如用户可以在离线环境下实施对电路逻辑配置文件的烧录以实现物理安全隔离,而非在线上实施远程烧录。In the initial stage, no user-defined logic is programmed on the FPGA chip, which is equivalent to the FPGA chip in a blank state. Users can burn circuit logic configuration files on the FPGA chip to form corresponding functions or logic on the FPGA chip. When programming the circuit logic configuration file for the first time, the FPGA board does not have the capability of security protection, so it usually needs to provide an external security environment. For example, users can implement the programming of the circuit logic configuration file in an offline environment to achieve physical security isolation. Instead of implementing remote programming online.
针对用户所需实现的功能或逻辑,可以通过FPGA硬件语言形成相应的逻辑代码,并进而对该逻辑代码进行镜像化处理,即可得到上述的电路逻辑配置文件。在烧录至FPGA板卡之前,用户可以针对上述的逻辑代码进行检查。尤其是,当同时涉及到多个用户时,多个用户可以分别对上述的逻辑代码进行检查,以确保FPGA板卡最终能够满足所有用户的需求,防止出现安全性风险、逻辑错误、欺诈等异常问题。For the function or logic that the user needs to implement, the corresponding logic code can be formed through FPGA hardware language, and then the logic code can be mirrored to obtain the above-mentioned circuit logic configuration file. Before programming to the FPGA board, the user can check the above-mentioned logic code. Especially, when multiple users are involved at the same time, multiple users can check the above logic code separately to ensure that the FPGA board can finally meet the needs of all users and prevent security risks, logic errors, fraud and other abnormalities. problem.
在确定代码无误后,用户可以在上述的离线环境下,将电路逻辑配置文件烧录至FPGA板卡上。具体的,电路逻辑配置文件被从区块链节点传入FPGA板卡,进而部署至如图2所示的Flash芯片中,使得即便FPGA板卡发生掉电,Flash芯片仍然能够保存上述的电路逻辑配置文件。After confirming that the code is correct, the user can burn the circuit logic configuration file to the FPGA board in the above-mentioned offline environment. Specifically, the circuit logic configuration file is transferred from the blockchain node to the FPGA board, and then deployed to the Flash chip as shown in Figure 2, so that even if the FPGA board is powered off, the Flash chip can still save the above-mentioned circuit logic. Configuration file.
图3是一示例性实施例提供的一种在FPGA芯片上形成功能模块的示意图。通过将Flash芯片中所部署的电路逻辑配置文件加载至FPGA芯片,可以对FPGA芯片所含的硬件逻辑单元进行配置,从而在FPGA芯片上形成相应的功能模块,譬如所形成的功能模块可以包括如图3所示的片上缓存模块、明文计算模块、密钥协商模块、解密验签模块、加解密模块等。同时,电路逻辑配置文件还可以用于向FPGA板卡传输需要存储的信息,比如可以将预置证书存储于FPGA芯片上、将认证根密钥存储于密管芯片中(认证根密钥也可以存储于FPGA芯片上)等。Fig. 3 is a schematic diagram of forming a functional module on an FPGA chip provided by an exemplary embodiment. By loading the circuit logic configuration file deployed in the Flash chip to the FPGA chip, the hardware logic unit contained in the FPGA chip can be configured to form corresponding functional modules on the FPGA chip. For example, the formed functional modules can include such Figure 3 shows the on-chip cache module, plaintext calculation module, key agreement module, decryption verification module, encryption and decryption module, etc. At the same time, the circuit logic configuration file can also be used to transmit the information that needs to be stored to the FPGA board. For example, the preset certificate can be stored on the FPGA chip, and the authentication root key can be stored in the secret tube chip (the authentication root key can also be Stored on the FPGA chip) and so on.
基于FPGA芯片上所形成的密钥协商模块,以及部署于FPGA板卡上的认证根密钥,使得FPGA板卡可以与用户实现远程的密钥协商,该密钥协商过程可以采用相关技术中的任意算法或标准来实现,本说明书并不对此进行限制。举例而言,密钥协商过程可以包括:用户可以在本地的客户端生成一密钥Ka-1、密钥协商模块可以在本地生成一密钥Kb-1,且客户端可以基于密钥Ka-1计算得到密钥协商信息Ka-2、密钥协商模块可以基于密钥Kb-1计算得到密钥协商信息Kb-2,然后客户端将密钥协商信息Ka-2发送至密钥协商模块、密钥协商模块将密钥协商信息Kb-2发送至客户端,使得客户端可以基于密钥Ka-1与密钥协商信息Kb-2生成一秘密值,而密钥协商模块可以基于密钥Kb-1与密钥协商信息Ka-2生成相同的秘密值,最后由客户端、密钥协商模块分别基于密钥 导出函数从该相同的秘密值导出相同的配置文件部署密钥,该配置文件部署密钥可以存在FPGA芯片或密管芯片。在上述过程中,虽然密钥协商信息Ka-2、密钥协商信息Kb-2是经由区块链节点在客户端与密钥协商模块之间传输,但是由于密钥Ka-1由客户端掌握、密钥Kb-1由密钥协商模块掌握,因而可以确保区块链节点无法获知最终得到的秘密值和配置文件部署密钥,避免可能造成的安全性风险。Based on the key agreement module formed on the FPGA chip and the authentication root key deployed on the FPGA board, the FPGA board can realize remote key agreement with the user. The key agreement process can use related technologies. Any algorithm or standard can be implemented, and this specification does not limit it. For example, the key agreement process can include: the user can generate a key Ka-1 at the local client, the key agreement module can generate a key Kb-1 locally, and the client can generate a key Kb-1 based on the key Ka- 1 Calculate the key agreement information Ka-2, the key agreement module can calculate the key agreement information Kb-2 based on the key Kb-1, and then the client sends the key agreement information Ka-2 to the key agreement module, The key agreement module sends the key agreement information Kb-2 to the client, so that the client can generate a secret value based on the key Ka-1 and the key agreement information Kb-2, and the key agreement module can be based on the key Kb -1 generates the same secret value as the key agreement information Ka-2, and finally the client and the key agreement module respectively derive the same configuration file deployment key from the same secret value based on the key derivation function, and the configuration file deployment The key can be stored in the FPGA chip or the secret management chip. In the above process, although the key agreement information Ka-2 and key agreement information Kb-2 are transmitted between the client and the key agreement module via the blockchain node, the key Ka-1 is controlled by the client , The key Kb-1 is controlled by the key agreement module, so it can ensure that the blockchain node cannot know the final secret value and the configuration file deployment key, so as to avoid possible security risks.
除了配置文件部署密钥之外,秘密值还用于导出业务秘密部署密钥;例如,秘密值可以导出32位数值,可以将前16位作为配置文件部署密钥、后16位作为业务秘密部署密钥。用户可以通过业务秘密部署密钥向FPGA板卡部署业务密钥,譬如该业务密钥可以包括节点私钥和业务根密钥。例如,用户可以在客户端上采用业务秘密部署密钥对节点私钥或业务根密钥进行签名、加密并发送至FPGA板卡,使得FPGA板卡通过解密验签模块进行解密、验签后,对得到的节点私钥或业务根密钥进行部署。In addition to the configuration file deployment key, the secret value is also used to derive the business secret deployment key; for example, the secret value can be derived as a 32-bit value, the first 16 bits can be used as the configuration file deployment key, and the last 16 bits can be used as the business secret deployment Key. The user can deploy the service key to the FPGA board through the service secret deployment key. For example, the service key may include the node private key and the service root key. For example, the user can use the business secret deployment key on the client to sign, encrypt the node private key or the business root key, and send it to the FPGA board, so that after the FPGA board is decrypted and verified through the decryption verification module, Deploy the obtained node private key or service root key.
基于部署的节点密钥、业务根密钥和FPGA芯片上的加解密模块、明文计算模块,使得FPGA板卡可以实现为区块链节点上的TEE,以满足隐私需求。例如,当区块链节点收到一笔交易时,如果该交易为明文交易,区块链节点可以直接处理该明文交易,如果该交易为隐私交易,区块链节点将该隐私交易传入FPGA板卡进行处理。Based on the deployed node key, service root key, encryption and decryption module and plaintext calculation module on the FPGA chip, the FPGA board can be implemented as a TEE on the blockchain node to meet privacy requirements. For example, when a blockchain node receives a transaction, if the transaction is a plaintext transaction, the blockchain node can directly process the plaintext transaction, if the transaction is a private transaction, the blockchain node transmits the private transaction to the FPGA The board is processed.
明文交易的交易内容为明文形式,并且交易执行后所产生的合约状态等同样采用明文形式进行存储。隐私交易的交易内容为密文形式,由交易发起方对明文交易内容进行加密而得到,且交易执行后产生的合约状态等需要采用密文形式进行存储,从而确保交易隐私保护。例如,交易发起方可以随机或基于其他方式生成一对称密钥,同样上述的业务私钥对应的业务公钥被公开,那么交易发起方可以基于该对称密钥和业务公钥对明文交易内容进行数字信封加密:交易发起方通过对称密钥加密明文交易内容,并通过业务公钥对该对称密钥进行加密,得到的两部分内容均被包含于上述的隐私交易中;换言之,隐私交易中包含两部分内容:采用对称密钥加密的明文交易内容、采用业务公钥加密的对称密钥。The transaction content of a plaintext transaction is in plaintext form, and the contract status generated after the transaction is executed is also stored in plaintext form. The transaction content of a private transaction is in the form of cipher text, which is obtained by encrypting the content of the transaction in plain text by the transaction initiator, and the contract state generated after the transaction is executed needs to be stored in the form of cipher text to ensure the protection of transaction privacy. For example, the transaction initiator can generate a symmetric key randomly or based on other methods. Similarly, the business public key corresponding to the above-mentioned business private key is disclosed, then the transaction initiator can perform transaction content in plaintext based on the symmetric key and the business public key. Digital Envelope Encryption: The transaction initiator encrypts the plaintext transaction content with a symmetric key, and encrypts the symmetric key with the business public key. The two parts obtained are included in the above-mentioned private transaction; in other words, the private transaction includes Two parts of content: the content of the transaction in plaintext encrypted with a symmetric key, and the symmetric key encrypted with the business public key.
因此,FPGA板卡在收到区块链节点传入的隐私交易后,可由加解密模块通过业务私钥对采用业务公钥加密的对称密钥进行解密、得到对称密钥,然后由加解密模块通过对称密钥对采用对称密钥加密的明文交易内容进行解密、得到明文交易内容。隐私交易可以用于部署智能合约,那么明文交易内容的data字段可以包含待部署的智能合约的合约代码;或者,隐私交易可以用于调用智能合约,那么明文交易内容的to字段可以包含被调用的智能合约的合约地址,而FPGA板卡可以基于该合约地址调取相应的合约代码。Therefore, after the FPGA board receives the private transaction from the blockchain node, the encryption and decryption module can use the business private key to decrypt the symmetric key encrypted with the business public key to obtain the symmetric key, and then the encryption and decryption module The symmetric key is used to decrypt the plaintext transaction content encrypted with the symmetric key to obtain the plaintext transaction content. Private transactions can be used to deploy smart contracts, then the data field of the plaintext transaction content can contain the contract code of the smart contract to be deployed; or, the privacy transaction can be used to call the smart contract, then the to field of the plaintext transaction content can contain the called The contract address of the smart contract, and the FPGA board can retrieve the corresponding contract code based on the contract address.
片上缓存模块可以用于对合约代码和/或合约代码所涉及的合约状态进行缓存。在一些情况中,FPGA板卡可以存在外接DDR,可以将合约代码和/或合约代码所涉及的合约状态存储至外接DDR。当然,也可以将合约代码和/或合约代码所涉及的合约状态存储至区块链节点处。相对而言,外接DDR的存储空间往往大于甚至远大于片上缓存模块的存储空间,因而外接DDR可以有助于实现更多数据的缓存。当然,FPGA板卡上可以同时包含片上缓存模块和外接DDR,例如可以将热度相对更高的合约代码缓存于片上缓存模块,而将热度相对更低的合约代码维护于外接DDR中。以及,相比于区块链节点而言,片上缓存模块和外接DDR可以认为是FPGA板卡的本地空间,与本地空间进行数据交互所消耗的资源量和时长远小于与区块链节点之间的数据交互过程,有助于提升智能合约的执行效率。The on-chip cache module can be used to cache the contract code and/or the contract state involved in the contract code. In some cases, the FPGA board may have an external DDR, and the contract code and/or contract state involved in the contract code can be stored in the external DDR. Of course, the contract code and/or the contract state involved in the contract code can also be stored at the blockchain node. In contrast, the storage space of the external DDR is often larger or even much larger than the storage space of the on-chip cache module, so the external DDR can help to achieve more data cache. Of course, the FPGA board can contain both on-chip cache module and external DDR. For example, the relatively more popular contract code can be cached in the on-chip cache module, and the relatively less popular contract code can be maintained in the external DDR. And, compared to blockchain nodes, the on-chip cache module and external DDR can be considered as the local space of the FPGA board, and the amount of resources and time consumed for data interaction with the local space is much less than that between the blockchain nodes The data interaction process helps to improve the execution efficiency of smart contracts.
FPGA芯片上形成的明文计算模块用于实现相关技术中的虚拟机逻辑,即明文计算模块相当于FPGA板卡上的“硬件虚拟机”。因此,基于上述明文交易内容确定出合约代码后,可以将该合约代码传入明文计算模块中,以由该明文计算模块执行该合约代码。该明文计算模块相当于本说明书中在FPGA芯片上形成的片上处理器。The plaintext calculation module formed on the FPGA chip is used to implement virtual machine logic in related technologies, that is, the plaintext calculation module is equivalent to the "hardware virtual machine" on the FPGA board. Therefore, after the contract code is determined based on the foregoing plaintext transaction content, the contract code can be passed into the plaintext calculation module, so that the plaintext calculation module executes the contract code. The plaintext calculation module is equivalent to the on-chip processor formed on the FPGA chip in this specification.
明文计算模块执行合约代码的过程,可以分解为读取并执行合约代码所含的各条操作指令的过程。例如,图4是一示例性实施例提供的一种采用流水线的方式读取操作指令的示意图。如图4所示,假定合约代码的代码程序包括若干条操作指令,第一条操作指令包括操作码P1和操作数Q1,第二条操作指令包括操作码P2和操作数Q21、Q22,第三条操作指令包括操作码P3和操作数Q31、Q32,第四条操作指令包括操作码P4,第五条操作指令包括操作码P5等。假定代码程序为wasm字节码程序,那么每一操作码的长度为1B,每个操作数均采用LEB编码、长度通常为2B或4B且最大不超过5B,同时每条操作指令最多包含2个操作数。据此,可以确定本实施例中的每条操作指令最多包含1个操作码和2个操作数,且每条操作指令的长度最大为1+2×5=11B,即88b。The process of executing the contract code by the plaintext calculation module can be broken down into the process of reading and executing each operation instruction contained in the contract code. For example, FIG. 4 is a schematic diagram of reading operation instructions in a pipeline manner according to an exemplary embodiment. As shown in Figure 4, assuming that the code program of the contract code includes several operation instructions, the first operation instruction includes operation code P1 and operand Q1, the second operation instruction includes operation code P2 and operands Q21, Q22, and the third operation instruction includes operation code P2 and operands Q21 and Q22. One operation instruction includes operation code P3 and operands Q31, Q32, the fourth operation instruction includes operation code P4, and the fifth operation instruction includes operation code P5 and so on. Assuming that the code program is a wasm bytecode program, the length of each opcode is 1B, and each operand is coded in LEB. The length is usually 2B or 4B and the maximum is not more than 5B. At the same time, each operation instruction contains at most 2 Operand. Based on this, it can be determined that each operation instruction in this embodiment includes at most one operation code and two operands, and the maximum length of each operation instruction is 1+2×5=11B, that is, 88b.
因此,明文计算模块在读取操作指令的过程中,可以设定在每个时钟周期实施一次读取操作,并且每次读取长度为88b的数据段。例如图4所示,明文计算模块在第一个时钟周期C1读取长度为88b的数据段,该数据段由于是首个数据段,因而该数据段必然以一个操作码开始,使得明文计算模块可以直接读取该数据段的第一个Byte的数据即操作码P1,并解析确定出该操作码P1对应的操作数。明文计算模块基于解析结果可以确定:操作码P1存在1个操作数即上述的Q1,且该操作数的长度为2B;以及,明文计算模块可以据此确定出时钟周期C1读取的数据段所含的操作指令的结束位,进 而在第二个时钟周期C2中,明文计算模块可以从该结束位的下一位开始读取长度为88b的数据段。Therefore, in the process of reading the operation instruction, the plaintext calculation module can be set to perform a reading operation in each clock cycle, and each time a data segment with a length of 88b is read. For example, as shown in Figure 4, the plaintext calculation module reads a data segment with a length of 88b in the first clock cycle C1. Since this data segment is the first data segment, the data segment must start with an opcode, making the plaintext calculation module You can directly read the data of the first Byte of the data segment, that is, the operation code P1, and analyze and determine the operand corresponding to the operation code P1. The plaintext calculation module can determine based on the analysis result: there is one operand in the opcode P1, that is, the above-mentioned Q1, and the length of the operand is 2B; and the plaintext calculation module can determine the location of the data segment read by the clock cycle C1 based on this In the second clock cycle C2, the plaintext calculation module can read a data segment with a length of 88b starting from the next bit of the end bit.
由于准确地分析出了时钟周期C1内读取的数据段中的结束位,使得明文计算模块在时钟周期C2读取的数据段必然以第二条操作指令的操作码开始。因此,明文计算模块可以通过类似上述方式,对时钟周期C2读取的数据段进行分析,确定出该数据段所含的操作码P2存在两个长度为2B的操作数,即上述的操作数Q21、Q22,由此确定出时钟周期C2读取的数据段所含的操作指令的结束位,进而在第三个时钟周期C3中,明文计算模块可以从该结束位的下一位开始读取长度为88b的数据段。Since the end bit in the data segment read in the clock cycle C1 is accurately analyzed, the data segment read by the plaintext calculation module in the clock cycle C2 must start with the opcode of the second operation instruction. Therefore, the plaintext calculation module can analyze the data segment read by clock cycle C2 in a similar manner to the above, and determine that the opcode P2 contained in the data segment has two operands with a length of 2B, that is, the aforementioned operand Q21. , Q22, which determines the end bit of the operation instruction contained in the data segment read in clock cycle C2, and then in the third clock cycle C3, the plaintext calculation module can read the length from the next bit of the end bit The data segment of 88b.
可见,由于明文计算模块可以准确分析出所读取的数据段中包含的操作指令的结束位,使得每个时钟周期内均可以按照固定长度依次截取代码程序中的数据段,有助于提升对代码程序的读取效率,加快对智能合约的执行速度。It can be seen that because the plaintext calculation module can accurately analyze the end bit of the operation instruction contained in the read data segment, the data segment in the code program can be sequentially intercepted according to a fixed length in each clock cycle, which is helpful to improve the code The read efficiency of the program speeds up the execution of the smart contract.
基于一些原因,用户可能希望对FPGA板卡上部署的电路逻辑配置文件进行版本更新,比如该电路逻辑配置文件所含的认证根密钥可能被风险用户获知、再比如用户希望对FPGA板卡上部署的功能模块进行升级等,本说明书并不对此进行限制。为了便于区分,可以将上述过程中已部署的电路逻辑配置文件称之为旧版电路逻辑配置文件,而将需要部署的电路逻辑配置文件称之为新版电路逻辑配置文件。For some reasons, the user may want to update the version of the circuit logic configuration file deployed on the FPGA board. For example, the authentication root key contained in the circuit logic configuration file may be known by risky users, or the user wants to update the version on the FPGA board. The deployed functional modules are upgraded, etc. This manual does not limit this. In order to facilitate the distinction, the circuit logic configuration file that has been deployed in the above process can be referred to as the old version of the circuit logic configuration file, and the circuit logic configuration file that needs to be deployed is referred to as the new version of the circuit logic configuration file.
与旧版电路逻辑配置文件相类似的,用户可以通过编写代码、镜像化等过程生成新版电路逻辑配置文件。进一步的,用户可以通过自身持有的私钥对新版电路逻辑配置文件进行签名,然后通过上文协商出的配置文件部署密钥对签名后的新版电路逻辑配置文件进行加密,得到加密后新版电路逻辑配置文件。在一些情况下,可能同时存在多名用户,那么旧版电路逻辑配置文件需要将这些用户对应的预置证书均部署至FPGA板卡中,且这些用户需要分别采用自身持有的私钥对新版电路逻辑配置文件进行签名。Similar to the old version of the circuit logic configuration file, the user can generate a new version of the circuit logic configuration file through the process of writing code and mirroring. Further, the user can sign the new version of the circuit logic configuration file with his own private key, and then encrypt the signed new version of the circuit logic configuration file with the configuration file deployment key negotiated above to obtain the encrypted new version of the circuit Logical configuration file. In some cases, there may be multiple users at the same time, so the old version of the circuit logic configuration file needs to deploy the preset certificates corresponding to these users to the FPGA board, and these users need to use their own private keys to pair the new version of the circuit. Sign the logical configuration file.
用户可以通过客户端远程将加密后新版电路逻辑配置文件发送至区块链节点,并由区块链节点进一步将其传入FPGA板卡。前述过程中在FPGA芯片上形成的解密验签模块位于PCIE接口与Flash芯片之间的传输通路上,使得加密后新版电路逻辑配置文件必然需要优先经过解密验签模块的成功处理后,才能够被传入Flash芯片以实现可信更新,无法绕过解密验签的过程而直接对Flash芯片进行更新。The user can remotely send the encrypted new version of the circuit logic configuration file to the blockchain node through the client, and the blockchain node will further transfer it to the FPGA board. The decryption verification module formed on the FPGA chip in the foregoing process is located on the transmission path between the PCIE interface and the Flash chip, so that the encrypted new version of the circuit logic configuration file must first be successfully processed by the decryption verification module before it can be The Flash chip is passed in to achieve a credible update, and the Flash chip cannot be updated directly without bypassing the process of decryption and verification.
解密验签模块在收到加密后新版电路逻辑配置文件后,首先通过FPGA板卡上部署的配置文件部署密钥进行解密,如果解密成功则解密验签模块进一步基于FPGA芯片 上部署的预置证书,对解密后的新版电路逻辑配置文件进行签名验证。如果解密失败或者签名验证未通过,则说明收到的文件并非来自上述用户或者遭到篡改,解密验签模块将触发终止本次的更新操作;而在解密成功且验签通过的情况下,可以确定得到的新版电路逻辑配置文件来自上述用户且传输过程中未遭到篡改,可以将该新版电路逻辑配置文件进一步传输至Flash芯片,以针对Flash芯片中的旧版电路逻辑配置文件进行更新部署。After the decryption verification module receives the encrypted new version of the circuit logic configuration file, it first decrypts it with the configuration file deployment key deployed on the FPGA board. If the decryption is successful, the decryption verification module is further based on the preset certificate deployed on the FPGA chip , To perform signature verification on the decrypted new version of the circuit logic configuration file. If the decryption fails or the signature verification fails, it means that the received file is not from the above-mentioned user or has been tampered with, and the decryption and signature verification module will trigger the termination of the update operation; and if the decryption is successful and the signature verification is passed, you can It is determined that the obtained new version of the circuit logic configuration file is from the aforementioned user and has not been tampered with during the transmission process. The new version of the circuit logic configuration file can be further transmitted to the Flash chip to update and deploy the old version of the circuit logic configuration file in the Flash chip.
新版电路逻辑配置文件被加载至FPGA芯片后,同样可以在该FPGA芯片上形成诸如上述的明文计算模块、片上缓存模块、密钥协商模块、加解密模块、解密验签模块,以及向FPGA芯片存入预置证书、向密管芯片存入认证根密钥等信息。其中,所形成的明文计算模块、片上缓存模块、密钥协商模块、加解密模块、解密验签模块等,所实现的功能逻辑可以发生变化和升级,所存入部署的预置证书、认证根密钥等信息也可能区别于更新前的信息。那么,FPGA板卡可以基于更新后的密钥协商模块、认证根密钥等,与用户进行远程协商得到新的配置文件部署密钥,该配置文件部署密钥可以被用于下一次的可新更新过程。类似地,可以据此不断实现针对FPGA板卡的可信更新操作。After the new version of the circuit logic configuration file is loaded into the FPGA chip, the above-mentioned plaintext calculation module, on-chip cache module, key agreement module, encryption and decryption module, decryption verification module, and storage in the FPGA chip can also be formed on the FPGA chip. Enter the preset certificate, and store the authentication root key to the secret management chip and other information. Among them, the formed plaintext calculation module, on-chip cache module, key agreement module, encryption/decryption module, decryption and signature verification module, etc., the implemented functional logic can be changed and upgraded, and stored in the deployed preset certificate, authentication root Information such as keys may also be different from the information before the update. Then, the FPGA board can remotely negotiate with the user to obtain a new configuration file deployment key based on the updated key agreement module, authentication root key, etc., and the configuration file deployment key can be used for the next renewal Update process. Similarly, a reliable update operation for FPGA boards can be continuously implemented accordingly.
在完成更新部署后,FPGA板卡可以针对新版电路逻辑配置文件生成认证结果。例如,上述的密钥协商模块可以通过诸如sm3算法或其他算法对新版电路逻辑配置文件的哈希值、基于新版电路逻辑配置文件协商得到的配置文件部署密钥的哈希值进行计算,得到的计算结果可以被作为上述的认证结果,并由密钥协商模块将该认证结果发送至用户。相应地,用户可以在客户端上基于所维护的新版电路逻辑配置文件和据此协商的配置文件部署密钥对认证结果进行验证,如果验证成功则表明新版电路逻辑配置文件在FPGA板卡上成功部署,且用户与FPGA板卡之间据此成功协商得到了一致的配置文件部署密钥,从而确认成功完成了针对电路逻辑配置文件的更新部署。After completing the update deployment, the FPGA board can generate certification results for the new version of the circuit logic configuration file. For example, the above-mentioned key agreement module can calculate the hash value of the new version of the circuit logic configuration file and the hash value of the configuration file deployment key negotiated based on the new version of the circuit logic configuration file through an algorithm such as sm3 or other algorithms. The calculation result can be used as the above-mentioned authentication result, and the key agreement module sends the authentication result to the user. Correspondingly, the user can verify the authentication result on the client based on the maintained new version of the circuit logic configuration file and the configuration file deployment key negotiated accordingly. If the verification is successful, it indicates that the new version of the circuit logic configuration file is successful on the FPGA board. Deployed, and the user and the FPGA board successfully negotiated accordingly to obtain a consistent configuration file deployment key, thereby confirming the successful completion of the circuit logic configuration file update deployment.
图5是一示例性实施例提供的一种基于FPGA的流水线式指令读取装置的示意结构图。请参考图5,在软件实施方式中,该终端交互装置可以包括:确定单元501,使FPGA芯片上的片上处理器确定待执行的代码程序,所述片上处理器由所述FPGA芯片加载所属FPGA结构上已部署的电路逻辑配置文件而形成,所述代码程序对应于所述FPGA结构所属的区块链节点收到的交易调用的智能合约;解析单元502,使所述片上处理器在按照预设长度依次读取所述代码程序所含数据的过程中,解析出每次读取的数据段中所含非定长操作指令的结束位,以使下次读取的数据段相邻于所述结束位。Fig. 5 is a schematic structural diagram of an FPGA-based pipelined instruction reading device provided by an exemplary embodiment. Referring to FIG. 5, in a software implementation, the terminal interaction device may include: a determining unit 501, which enables the on-chip processor on the FPGA chip to determine the code program to be executed, and the on-chip processor is loaded by the FPGA chip to the FPGA. The circuit logic configuration file that has been deployed structurally is formed, and the code program corresponds to the smart contract called by the transaction received by the blockchain node to which the FPGA structure belongs; the parsing unit 502 enables the on-chip processor to execute In the process of reading the data contained in the code program by length, the end bit of the non-fixed-length operation instruction contained in the data segment read each time is parsed, so that the data segment read next time is adjacent to the end Bit.
可选的,所述预设长度不小于所述代码程序中单条操作指令的最大长度。Optionally, the preset length is not less than the maximum length of a single operation instruction in the code program.
可选的,所述解析单元502具体用于:使所述片上处理器解析出每次读取的数据段中所含非定长操作指令的操作码;使所述片上处理器在基于所述操作码确定所述非定长操作指令包含操作数的情况下,根据所含操作数的数量和每一操作数的长度,确定最后一个操作数的末位,以作为所述非定长操作指令的结束位;使所述片上处理器在基于所述操作码确定所述非定长操作指令不包含操作数的情况下,将所述操作码的末位作为所述非定长操作指令的结束位。Optionally, the parsing unit 502 is specifically configured to: make the on-chip processor parse out the operation code of the non-fixed-length operation instruction contained in the data segment read each time; make the on-chip processor be based on the When the operation code determines that the non-fixed-length operation instruction contains operands, the last bit of the last operand is determined according to the number of operands contained and the length of each operand as the non-fixed-length operation instruction The end bit of the operation code; if the on-chip processor determines that the non-fixed-length operation instruction does not contain an operand based on the operation code, use the last bit of the operation code as the end of the non-fixed-length operation instruction Bit.
可选的,所述操作数为经过LEB编码的编码后操作数。Optionally, the operand is a post-encoded operand after LEB encoding.
可选的,所述片上处理器读取所述代码程序所含数据,包括:所述片上处理器按照每个时钟周期读取一次的频率,依次读取所述代码程序所含数据。Optionally, the reading of the data contained in the code program by the on-chip processor includes: the on-chip processor sequentially reads the data contained in the code program at a frequency of reading once per clock cycle.
可选的,所述代码程序包括字节码程序。Optionally, the code program includes a bytecode program.
可选的,所述字节码程序包括wasm字节码程序。Optionally, the bytecode program includes a wasm bytecode program.
上述实施例阐明的系统、装置、模块或单元,具体可以由计算机芯片或实体实现,或者由具有某种功能的产品来实现。一种典型的实现设备为计算机,计算机的具体形式可以是个人计算机、膝上型计算机、蜂窝电话、相机电话、智能电话、个人数字助理、媒体播放器、导航设备、电子邮件收发设备、游戏控制台、平板计算机、可穿戴设备或者这些设备中的任意几种设备的组合。The systems, devices, modules, or units explained in the above embodiments may be implemented by computer chips or entities, or implemented by products with certain functions. A typical implementation device is a computer. The specific form of the computer can be a personal computer, a laptop computer, a cellular phone, a camera phone, a smart phone, a personal digital assistant, a media player, a navigation device, an email receiving and sending device, and a game control A console, a tablet computer, a wearable device, or a combination of any of these devices.
在一个典型的配置中,计算机包括一个或多个处理器(CPU)、输入/输出接口、网络接口和内存。In a typical configuration, the computer includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.
内存可能包括计算机可读介质中的非永久性存储器,随机存取存储器(RAM)和/或非易失性内存等形式,如只读存储器(ROM)或闪存(flash RAM)。内存是计算机可读介质的示例。The memory may include non-permanent memory in a computer readable medium, random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of computer readable media.
计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带、磁盘存储、量子存储器、基于石墨烯的存储介质或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定, 计算机可读介质不包括暂存电脑可读媒体(transitory media),如调制的数据信号和载波。Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology. The information can be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, disk storage, quantum memory, graphene-based storage media or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices. According to the definition in this article, computer-readable media does not include transitory media, such as modulated data signals and carrier waves.
还需要说明的是,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、商品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、商品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、商品或者设备中还存在另外的相同要素。It should also be noted that the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, commodity or equipment including a series of elements not only includes those elements, but also includes Other elements that are not explicitly listed, or they also include elements inherent to such processes, methods, commodities, or equipment. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, method, commodity, or equipment that includes the element.
上述对本说明书特定实施例进行了描述。其它实施例在所附权利要求书的范围内。在一些情况下,在权利要求书中记载的动作或步骤可以按照不同于实施例中的顺序来执行并且仍然可以实现期望的结果。另外,在附图中描绘的过程不一定要求示出的特定顺序或者连续顺序才能实现期望的结果。在某些实施方式中,多任务处理和并行处理也是可以的或者可能是有利的。The foregoing describes specific embodiments of this specification. Other embodiments are within the scope of the appended claims. In some cases, the actions or steps described in the claims may be performed in a different order than in the embodiments and still achieve desired results. In addition, the processes depicted in the drawings do not necessarily require the specific order or sequential order shown in order to achieve the desired results. In some embodiments, multitasking and parallel processing are also possible or may be advantageous.
在本说明书一个或多个实施例使用的术语是仅仅出于描述特定实施例的目的,而非旨在限制本说明书一个或多个实施例。在本说明书一个或多个实施例和所附权利要求书中所使用的单数形式的“一种”、“所述”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义。还应当理解,本文中使用的术语“和/或”是指并包含一个或多个相关联的列出项目的任何或所有可能组合。The terms used in one or more embodiments of this specification are only for the purpose of describing specific embodiments, and are not intended to limit one or more embodiments of this specification. The singular forms "a", "said" and "the" used in one or more embodiments of this specification and the appended claims are also intended to include plural forms, unless the context clearly indicates other meanings. It should also be understood that the term "and/or" as used herein refers to and includes any or all possible combinations of one or more associated listed items.
应当理解,尽管在本说明书一个或多个实施例可能采用术语第一、第二、第三等来描述各种信息,但这些信息不应限于这些术语。这些术语仅用来将同一类型的信息彼此区分开。例如,在不脱离本说明书一个或多个实施例范围的情况下,第一信息也可以被称为第二信息,类似地,第二信息也可以被称为第一信息。取决于语境,如在此所使用的词语“如果”可以被解释成为“在……时”或“当……时”或“响应于确定”。It should be understood that, although the terms first, second, third, etc. may be used to describe various information in one or more embodiments of this specification, the information should not be limited to these terms. These terms are only used to distinguish the same type of information from each other. For example, without departing from the scope of one or more embodiments of this specification, the first information may also be referred to as second information, and similarly, the second information may also be referred to as first information. Depending on the context, the word "if" as used herein can be interpreted as "when" or "when" or "in response to determination".
以上所述仅为本说明书一个或多个实施例的较佳实施例而已,并不用以限制本说明书一个或多个实施例,凡在本说明书一个或多个实施例的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本说明书一个或多个实施例保护的范围之内。The foregoing descriptions are only preferred embodiments of one or more embodiments of this specification, and are not intended to limit one or more embodiments of this specification. All within the spirit and principle of one or more embodiments of this specification, Any modification, equivalent replacement, improvement, etc. made should be included in the protection scope of one or more embodiments of this specification.

Claims (10)

  1. 一种基于FPGA的流水线式指令读取方法,包括:A pipelined instruction reading method based on FPGA, including:
    FPGA芯片上的片上处理器确定待执行的代码程序,所述片上处理器由所述FPGA芯片加载所属FPGA结构上已部署的电路逻辑配置文件而形成,所述代码程序对应于所述FPGA结构所属的区块链节点收到的交易调用的智能合约;The on-chip processor on the FPGA chip determines the code program to be executed. The on-chip processor is formed by the FPGA chip loading the deployed circuit logic configuration file on the FPGA structure to which it belongs, and the code program corresponds to the FPGA structure to which the FPGA structure belongs. The smart contract called by the transaction received by the blockchain node;
    所述片上处理器在按照预设长度依次读取所述代码程序所含数据的过程中,解析出每次读取的数据段中所含非定长操作指令的结束位,以使下次读取的数据段相邻于所述结束位。In the process of sequentially reading the data contained in the code program according to the preset length, the on-chip processor parses out the end bit of the non-fixed-length operation instruction contained in the data segment read each time, so that the next read The fetched data segment is adjacent to the end bit.
  2. 根据权利要求1所述的方法,所述预设长度不小于所述代码程序中单条操作指令的最大长度。The method according to claim 1, wherein the preset length is not less than the maximum length of a single operation instruction in the code program.
  3. 根据权利要求1所述的方法,所述片上处理器解析出每次读取的数据段中所含非定长操作指令的结束位,包括:The method according to claim 1, wherein the on-chip processor parses out the end bit of the non-fixed-length operation instruction contained in the data segment read each time, comprising:
    所述片上处理器解析出每次读取的数据段中所含非定长操作指令的操作码;The on-chip processor parses out the operation code of the non-fixed-length operation instruction contained in the data segment read each time;
    所述片上处理器在基于所述操作码确定所述非定长操作指令包含操作数的情况下,根据所含操作数的数量和每一操作数的长度,确定最后一个操作数的末位,以作为所述非定长操作指令的结束位;When the on-chip processor determines that the non-fixed-length operation instruction includes an operand based on the opcode, it determines the last bit of the last operand according to the number of operands contained and the length of each operand, As the end bit of the non-fixed length operation instruction;
    所述片上处理器在基于所述操作码确定所述非定长操作指令不包含操作数的情况下,将所述操作码的末位作为所述非定长操作指令的结束位。In the case that the on-chip processor determines based on the operation code that the non-fixed-length operation instruction does not include an operand, the last bit of the operation code is used as the end bit of the non-fixed-length operation instruction.
  4. 根据权利要求3所述的方法,所述操作数为经过LEB编码的编码后操作数。The method according to claim 3, wherein the operand is an encoded operand after LEB encoding.
  5. 根据权利要求1所述的方法,所述片上处理器读取所述代码程序所含数据,包括:The method according to claim 1, wherein the on-chip processor reading the data contained in the code program comprises:
    所述片上处理器按照每个时钟周期读取一次的频率,依次读取所述代码程序所含数据。The on-chip processor sequentially reads the data contained in the code program according to the frequency of reading once per clock cycle.
  6. 根据权利要求1所述的方法,所述代码程序包括字节码程序。The method according to claim 1, wherein the code program includes a bytecode program.
  7. 根据权利要求6所述的方法,所述字节码程序包括wasm字节码程序。The method according to claim 6, wherein the bytecode program comprises a wasm bytecode program.
  8. 一种基于FPGA的流水线式指令读取装置,包括:A pipelined instruction reading device based on FPGA, including:
    确定单元,使FPGA芯片上的片上处理器确定待执行的代码程序,所述片上处理器由所述FPGA芯片加载所属FPGA结构上已部署的电路逻辑配置文件而形成,所述代码程序对应于所述FPGA结构所属的区块链节点收到的交易调用的智能合约;The determining unit enables the on-chip processor on the FPGA chip to determine the code program to be executed. The on-chip processor is formed by the FPGA chip loading the circuit logic configuration file deployed on the FPGA structure to which the code program corresponds The smart contract called by the transaction received by the blockchain node to which the FPGA structure belongs;
    解析单元,使所述片上处理器在按照预设长度依次读取所述代码程序所含数据的过程中,解析出每次读取的数据段中所含非定长操作指令的结束位,以使下次读取的数据 段相邻于所述结束位。The parsing unit enables the on-chip processor to parse out the end bit of the non-fixed-length operation instruction contained in the data segment read each time during the process of sequentially reading the data contained in the code program according to the preset length, and Make the data segment read next time adjacent to the end bit.
  9. 一种电子设备,包括:An electronic device including:
    处理器;processor;
    用于存储处理器可执行指令的存储器;A memory for storing processor executable instructions;
    其中,所述处理器通过运行所述可执行指令以实现如权利要求1-7中任一项所述的方法。Wherein, the processor implements the method according to any one of claims 1-7 by running the executable instruction.
  10. 一种计算机可读存储介质,其上存储有计算机指令,该指令被处理器执行时实现如权利要求1-7中任一项所述的方法的步骤。A computer-readable storage medium having computer instructions stored thereon, which, when executed by a processor, implement the steps of the method according to any one of claims 1-7.
PCT/CN2020/098522 2019-09-25 2020-06-28 Pipelined instruction reading method and apparatus based on fpga WO2021057141A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910913484.1 2019-09-25
CN201910913484.1A CN110750303B (en) 2019-09-25 2019-09-25 Pipelined instruction reading method and device based on FPGA

Publications (1)

Publication Number Publication Date
WO2021057141A1 true WO2021057141A1 (en) 2021-04-01

Family

ID=69277143

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/098522 WO2021057141A1 (en) 2019-09-25 2020-06-28 Pipelined instruction reading method and apparatus based on fpga

Country Status (2)

Country Link
CN (1) CN110750303B (en)
WO (1) WO2021057141A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110750303B (en) * 2019-09-25 2020-10-20 支付宝(杭州)信息技术有限公司 Pipelined instruction reading method and device based on FPGA

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102866982A (en) * 2012-09-14 2013-01-09 复旦大学 Eight-bit complex instruction set central processor based on FPGA (Field Programmable Gata Array)
US20190034917A1 (en) * 2017-12-29 2019-01-31 Intel Corporation Tracking an Electronic Wallet Using Radio Frequency Identification (RFID)
CN109886682A (en) * 2019-01-31 2019-06-14 阿里巴巴集团控股有限公司 The method and node, storage medium that contract calls are realized in block chain
CN109949156A (en) * 2019-02-28 2019-06-28 矩阵元技术(深圳)有限公司 A kind of data processing method and server of block chain contract
CN110008735A (en) * 2019-01-31 2019-07-12 阿里巴巴集团控股有限公司 The method and node, storage medium that contract calls are realized in block chain
CN110750303A (en) * 2019-09-25 2020-02-04 支付宝(杭州)信息技术有限公司 Pipelined instruction reading method and device based on FPGA

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW357318B (en) * 1997-03-18 1999-05-01 Ind Tech Res Inst Branching forecast and reading device for unspecified command length extra-purity pipeline processor
GB2376098B (en) * 2001-05-31 2004-11-24 Advanced Risc Mach Ltd Unhandled operation handling in multiple instruction set systems
US7568070B2 (en) * 2005-07-29 2009-07-28 Qualcomm Incorporated Instruction cache having fixed number of variable length instructions
US7676659B2 (en) * 2007-04-04 2010-03-09 Qualcomm Incorporated System, method and software to preload instructions from a variable-length instruction set with proper pre-decoding
CN101609392B (en) * 2008-06-16 2012-07-04 群联电子股份有限公司 Method and system for transmitting and allocating data string and controller thereof
CN102567528B (en) * 2011-12-29 2014-01-29 东软集团股份有限公司 Method and device for reading mass data
CN108513669A (en) * 2017-12-29 2018-09-07 深圳前海达闼云端智能科技有限公司 Crowd funding information processing method and device based on block chain, storage medium and electronic equipment
CN109559227A (en) * 2018-11-29 2019-04-02 咪咕文化科技有限公司 A kind of method of commerce, device and the storage medium of transregional piece of chain network

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102866982A (en) * 2012-09-14 2013-01-09 复旦大学 Eight-bit complex instruction set central processor based on FPGA (Field Programmable Gata Array)
US20190034917A1 (en) * 2017-12-29 2019-01-31 Intel Corporation Tracking an Electronic Wallet Using Radio Frequency Identification (RFID)
CN109886682A (en) * 2019-01-31 2019-06-14 阿里巴巴集团控股有限公司 The method and node, storage medium that contract calls are realized in block chain
CN110008735A (en) * 2019-01-31 2019-07-12 阿里巴巴集团控股有限公司 The method and node, storage medium that contract calls are realized in block chain
CN109949156A (en) * 2019-02-28 2019-06-28 矩阵元技术(深圳)有限公司 A kind of data processing method and server of block chain contract
CN110750303A (en) * 2019-09-25 2020-02-04 支付宝(杭州)信息技术有限公司 Pipelined instruction reading method and device based on FPGA

Also Published As

Publication number Publication date
CN110750303A (en) 2020-02-04
CN110750303B (en) 2020-10-20

Similar Documents

Publication Publication Date Title
US20210064784A1 (en) Managing a smart contract on a blockchain
WO2021103794A1 (en) Method for realizing highly efficient privacy-preserving transaction in blockchain, and device
TW202113645A (en) Managing a smart contract on a blockchain
WO2021057181A1 (en) Fpga-based key negotiation method and device
CN110245947B (en) Receipt storage method and node combining conditional restrictions of transaction and user types
WO2020233631A1 (en) Transaction type-based receipt storage method and node
WO2020233635A1 (en) Receipt storage method combining conditional restrictions of multiple types of dimensions and node
WO2021057168A1 (en) Method and apparatus for realizing virtual machine operation on the basis of fpga
WO2021057182A1 (en) Trusted update method and apparatus for fpga logic
WO2020233624A1 (en) Receipt storage method and node employing transaction type in combination with event function type
WO2021057166A1 (en) Method and apparatus for implementing external call in fpga
WO2021057180A1 (en) Fpga-based privacy blockchain implementation method, and device
WO2020233619A1 (en) Receipt storage method and node in combination with user type and transaction type
WO2020233627A1 (en) Receipt storage method and node based on multiple types of dimensions
WO2020233632A1 (en) Receipt storage method and node based on event function type
WO2021057184A1 (en) Efficient operation method and apparatus for security intelligent contract processor based on fpga
WO2020233634A1 (en) Method and node for receipt storage combining transaction and event type condition restrictions
WO2021057167A1 (en) Method and device for transaction processing for fpga-based secure smart contract processor
WO2021057124A1 (en) Fpga-based privacy block chain implementing method and device
WO2021057272A1 (en) Method and apparatus for implementing contract invoking based on fpga
WO2021057273A1 (en) Method and apparatus for realizing efficient contract calling on fpga
WO2021057221A1 (en) Method and apparatus for realizing state update based on fpga
WO2021057141A1 (en) Pipelined instruction reading method and apparatus based on fpga
WO2020233423A1 (en) Receipt storage method and node based on transaction type
WO2020233422A1 (en) Receipt storage method and node based on user type

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20867322

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20867322

Country of ref document: EP

Kind code of ref document: A1