CN113806342A - System for extracting data at bottom of ether house block chain - Google Patents

System for extracting data at bottom of ether house block chain Download PDF

Info

Publication number
CN113806342A
CN113806342A CN202110824446.6A CN202110824446A CN113806342A CN 113806342 A CN113806342 A CN 113806342A CN 202110824446 A CN202110824446 A CN 202110824446A CN 113806342 A CN113806342 A CN 113806342A
Authority
CN
China
Prior art keywords
data
module
formatting
analyzing
secondly
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110824446.6A
Other languages
Chinese (zh)
Inventor
郑程
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Lianyin Technology Co ltd
Original Assignee
Xiamen Lianyin Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen Lianyin Technology Co ltd filed Critical Xiamen Lianyin Technology Co ltd
Priority to CN202110824446.6A priority Critical patent/CN113806342A/en
Publication of CN113806342A publication Critical patent/CN113806342A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/252Integrating or interfacing systems involving database management systems between a Database Management System and a front-end application
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a system for extracting data at the bottom of an Ether house block chain, belonging to the technical field of Ether house data extraction; the system comprises a levelDB bottom layer connecting module, a data structure analyzing module and a data formatting module, wherein the levelDB bottom layer module has the following main functions: firstly, directly accessing a level DB database of an Etherhouse; secondly, controlling a database lock during multi-thread analysis; the data analysis module has the main functions as follows: firstly, analyzing block header data; secondly, analyzing transaction data in the block; thirdly, calling Log data in the analysis block; the data formatting module has the main functions as follows: a decimal language of the first hexadecimal trading value; secondly, formatting a transaction address; thirdly, formatting a transaction calling function; the invention leads out the Ether house block chain data by directly accessing the Ether house node bottom level DB and formatting binary data. The invention can greatly shorten the time for exporting the data of the Ethernet house block chain and provide an efficient and reliable data source for the data analysis on the Ethernet house chain.

Description

System for extracting data at bottom of ether house block chain
Technical Field
The invention provides a system for extracting data at the bottom of an Ether house block chain, and belongs to the technical field of Ether house data extraction.
Background
Ethereum (ethernet in english) is an open-source public blockchain platform with intelligent contract functions, and provides an ethernet Virtual Machine (ethernet Virtual Machine) with decentralized center through its dedicated ethernet currency (ETH, abbreviated as "ETH") to process point-to-point contracts. Billions of levels of historical transaction data and contract calling data are stored on an Ethernet main chain, compressed binary block data needs to be analyzed into a data input high-performance database with clear fields, strong indexing and strong analyzability to efficiently and comprehensively analyze the data and records on a block chain, and accordingly, the capacity of performing big data mining analysis and artificial intelligent clustering on the data is accelerated.
There are two distinct pieces of data in the ether house; permanent data and temporary data. An example of persistent data is a transaction, which will be recorded in a transaction trie after it is fully validated, and will never change. An example of temporary data is a balance on a particular etherhouse account address. The balance of the account address is stored in a state trie and is changed whenever the transaction for that particular account changes. In an etherhouse, where persistent data and temporary data (such as account balances) are stored separately, the etherhouse uses a trie data structure to manage the data.
EtherFang uses level DB as its underlying data storage database, level DB is an open source key-value pair database, besides database main functions, also provides forward and backward iterations of data, ordered mapping from key to value, custom comparison function and automatic compression function. Data may be automatically compressed using the open source Google compression/decompression library "Snappy". Snappy aims at faster compression speeds than for maximum compression. The LevelDB provides an important storage and retrieval mechanism for managing the state of the ethernet network. Because of the complexity of the Etherhouse data structure, the existing Etherhouse ETL tools in the market all cross the level DB and directly derive the historical data by requesting the RPC interface carried by the Etherhouse node program.
The existing ether mill data export technology is to export historical data by requesting an RPC interface carried by an ether mill node program, so that the efficiency of an extraction mode is low, a server is greatly loaded, the export period of the full data is long (about one month long), and the requirement of quickly analyzing the ether mill data cannot be met.
Disclosure of Invention
The system for extracting the bottom data of the Ethernet workshop block chain can solve the problem of low extraction efficiency of the existing Ethernet workshop data export technology.
In order to solve the above problems, the present invention provides the following technical solutions: the system comprises a levelDB bottom layer connecting module, a data structure analyzing module and a data formatting module, wherein the levelDB bottom layer module has the following main functions: firstly, directly accessing a level DB database of an Etherhouse; secondly, controlling a database lock during multi-thread analysis; the data analysis module has the main functions as follows: firstly, analyzing block header data; secondly, analyzing transaction data in the block; thirdly, calling Log data in the analysis block; the data formatting module has the main functions as follows: a decimal language of the first hexadecimal trading value; secondly, formatting a transaction address; thirdly, formatting a transaction calling function;
the operation mode is as follows:
a: data extraction, namely directly analyzing the Ethenhouse LevelDB and formatting binary data to obtain the bottom layer data of the Ethenhouse, wherein different extraction methods need to be selected in the extraction process, so that the operation efficiency is improved as much as possible;
b: cleaning and converting data;
c: loading data;
the invention has the beneficial effects that:
the invention leads out the Ethernet workshop block chain data by directly accessing the Ethernet workshop node bottom level DB and formatting binary data. The method can greatly shorten the time for exporting the data of the Ethernet workshop block chain (the general one-month period is shortened to 2 days), and provides an efficient and reliable data source for data analysis on the Ethernet workshop block chain. The invention can improve the extraction efficiency of the data at the bottom layer of the Ethernet block chain and restore the original format of the data at the bottom layer of the Ethernet block chain to the maximum extent.
Drawings
Fig. 1 is a flow chart of ethernet bay data export of a system for extracting data at the bottom of an ethernet bay block chain according to the present invention.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
According to the illustration of FIG. 1: the invention provides a system for extracting data at the bottom of an Ether house block chain, which comprises the following steps: the system comprises a levelDB bottom layer connecting module, a data structure analyzing module and a data formatting module, wherein the levelDB bottom layer module has the main functions as follows: firstly, directly accessing a level DB database of an Etherhouse; secondly, controlling a database lock during multi-thread analysis; the data analysis module has the main functions as follows: firstly, analyzing block header data; secondly, analyzing transaction data in the block; thirdly, calling Log data in the analysis block; the data formatting module has the main functions as follows: a decimal language of the first hexadecimal trading value; secondly, formatting a transaction address; thirdly, formatting a transaction calling function; the operation mode is as follows: a: data extraction, namely directly analyzing the Ethenhouse LevelDB and formatting binary data to obtain the bottom layer data of the Ethenhouse, wherein different extraction methods need to be selected in the extraction process, so that the operation efficiency is improved as much as possible; b: cleaning and converting data; c: loading data;
and deriving the Ethernet block chain data by directly accessing the underlying level DB of the Ethernet node and formatting binary data. The method can greatly shorten the time for exporting the data of the Ethernet workshop blockchain (the general one-month period is shortened to 2 days), and provides an efficient and reliable data source for data analysis on the Ethernet workshop blockchain. The specific mode is that firstly, data extraction is carried out, the Ethenhouse LevelDB is directly analyzed and binary data is formatted to obtain the bottom layer data of the Ethenhouse, different extraction methods need to be selected in the extraction process, therefore, the operation efficiency is improved as much as possible, then the data is cleaned and converted, and finally the data is loaded.
The present invention and its embodiments have been described above, and the description is not intended to be limiting, and the drawings are only one embodiment of the present invention, and the actual structure is not limited thereto. In summary, those skilled in the art should appreciate that they can readily use the disclosed conception and specific embodiments as a basis for designing or modifying other structures for carrying out the same purposes of the present invention without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (2)

1. A system for extracting data at the bottom of an Ethernet house block chain is characterized in that: the system comprises a levelDB bottom layer connecting module, a data structure analyzing module and a data formatting module, wherein the levelDB bottom layer module has the following main functions: firstly, directly accessing a level DB database of an Etherhouse; secondly, controlling a database lock during multi-thread analysis; the data analysis module has the main functions as follows: firstly, analyzing block header data; secondly, analyzing transaction data in the block; thirdly, calling Log data in the analysis block; the data formatting module has the main functions as follows: a decimal language of the first hexadecimal trading value; secondly, formatting a transaction address; and thirdly, formatting a transaction calling function.
2. The system for extracting data at the bottom of an etherhouse blockchain according to claim 1, wherein: the operation mode is as follows:
a: data extraction, namely directly analyzing the Ethenhouse LevelDB and formatting binary data to obtain the bottom layer data of the Ethenhouse, wherein different extraction methods need to be selected in the extraction process, so that the operation efficiency is improved as much as possible;
b: cleaning and converting data;
c: and (4) loading data.
CN202110824446.6A 2021-07-21 2021-07-21 System for extracting data at bottom of ether house block chain Pending CN113806342A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110824446.6A CN113806342A (en) 2021-07-21 2021-07-21 System for extracting data at bottom of ether house block chain

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110824446.6A CN113806342A (en) 2021-07-21 2021-07-21 System for extracting data at bottom of ether house block chain

Publications (1)

Publication Number Publication Date
CN113806342A true CN113806342A (en) 2021-12-17

Family

ID=78893120

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110824446.6A Pending CN113806342A (en) 2021-07-21 2021-07-21 System for extracting data at bottom of ether house block chain

Country Status (1)

Country Link
CN (1) CN113806342A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030084057A1 (en) * 2001-11-01 2003-05-01 Verisign, Inc. High speed non-concurrency controlled database
US20180019867A1 (en) * 2016-07-15 2018-01-18 Mastercard International Incorporated Method and system for partitioned blockchains and enhanced privacy for permissioned blockchains
WO2018144302A1 (en) * 2017-01-31 2018-08-09 Rush Thomas Jay Blockchain data-processing engine
CN109325855A (en) * 2018-08-16 2019-02-12 北京京东尚科信息技术有限公司 Block chain network, dispositions method and storage medium
CN109389500A (en) * 2018-09-29 2019-02-26 重庆邮电大学 Data trade platform based on ether mill
CN110609869A (en) * 2019-09-10 2019-12-24 连连银通电子支付有限公司 Block chain-based data storage method, related equipment and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030084057A1 (en) * 2001-11-01 2003-05-01 Verisign, Inc. High speed non-concurrency controlled database
US20180019867A1 (en) * 2016-07-15 2018-01-18 Mastercard International Incorporated Method and system for partitioned blockchains and enhanced privacy for permissioned blockchains
WO2018144302A1 (en) * 2017-01-31 2018-08-09 Rush Thomas Jay Blockchain data-processing engine
CN109325855A (en) * 2018-08-16 2019-02-12 北京京东尚科信息技术有限公司 Block chain network, dispositions method and storage medium
CN109389500A (en) * 2018-09-29 2019-02-26 重庆邮电大学 Data trade platform based on ether mill
CN110609869A (en) * 2019-09-10 2019-12-24 连连银通电子支付有限公司 Block chain-based data storage method, related equipment and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
王泓机等: "针对区块链应用的查询优化模型", 《计算机工程与应用》, pages 34 - 39 *
郑泽洲: "以太坊的存储层技术分析之三:以太坊区块读写LevelDB相关代码分析", pages 1 - 5, Retrieved from the Internet <URL:https://blog.csdn.net/wxid2798226/article/details/83689615> *

Similar Documents

Publication Publication Date Title
CN110019218B (en) Data storage and query method and equipment
CN110489445B (en) Rapid mass data query method based on polymorphic composition
CN106844507B (en) A kind of method and apparatus of data batch processing
CN103593422B (en) Virtual access management method of heterogeneous database
CN107247811B (en) SQL statement performance optimization method and device based on Oracle database
CN112347071B (en) Power distribution network cloud platform data fusion method and power distribution network cloud platform
CN109508355A (en) A kind of data pick-up method, system and terminal device
CN102591864B (en) Data updating method and device in comparison system
KR20190063835A (en) System for processing real-time data modification of in-memory database
CN104317957A (en) Open platform and system for processing reports and report processing method
US8600990B2 (en) Interacting methods of data extraction
CN112817958A (en) Electric power planning data acquisition method and device and intelligent terminal
US20090248691A1 (en) Interacting methods of data summarization
CN113961546B (en) Real-time query library design method supporting online analysis and statistics
CN105447069A (en) BW platform based account checking platform data synchronization method and system
CN113806342A (en) System for extracting data at bottom of ether house block chain
CN115098486A (en) Real-time data acquisition method based on customs service big data
CN114722045A (en) Time series data storage method and device
CN109800271A (en) A kind of information collecting method based on big data
CN112507213B (en) Method for recommending optimized system scheme based on behavior big data analysis
KR101024494B1 (en) Extraction method of modified data using meta data
CN114519071A (en) Generation method, matching method, system, device and medium of rule matching model
CN117349327A (en) Memory data acquisition optimization method and system
CN111352933B (en) Index system is swiftly established to big data database in high in clouds
He et al. Research on the SCADA/EMS system data warehouse technology

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination