CN113220716B - Device and method for improving real-time database data processing performance - Google Patents

Device and method for improving real-time database data processing performance Download PDF

Info

Publication number
CN113220716B
CN113220716B CN202110498192.3A CN202110498192A CN113220716B CN 113220716 B CN113220716 B CN 113220716B CN 202110498192 A CN202110498192 A CN 202110498192A CN 113220716 B CN113220716 B CN 113220716B
Authority
CN
China
Prior art keywords
data
real
database
data reading
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110498192.3A
Other languages
Chinese (zh)
Other versions
CN113220716A (en
Inventor
王智微
丁辉
刘鹏飞
王毅
张宪岭
颜刚
时标
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian Thermal Power Research Institute Co Ltd
Huaneng Group Technology Innovation Center Co Ltd
Huaneng Nanjing Jinling Power Generation Co Ltd
Original Assignee
Xian Thermal Power Research Institute Co Ltd
Huaneng Group Technology Innovation Center Co Ltd
Huaneng Nanjing Jinling Power Generation Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian Thermal Power Research Institute Co Ltd, Huaneng Group Technology Innovation Center Co Ltd, Huaneng Nanjing Jinling Power Generation Co Ltd filed Critical Xian Thermal Power Research Institute Co Ltd
Priority to CN202110498192.3A priority Critical patent/CN113220716B/en
Publication of CN113220716A publication Critical patent/CN113220716A/en
Application granted granted Critical
Publication of CN113220716B publication Critical patent/CN113220716B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Debugging And Monitoring (AREA)

Abstract

A device and a processing method for improving the data processing performance of a real-time database comprise the following steps: the real-time database is used for storing production real-time data collected from each power plant; the data reading node module is internally provided with a data reading node program, is arranged in the computing server, is provided with a plurality of data reading nodes, reads the production real-time data in the real-time database in parallel and stores the read data into the shared memory database; and the data reading node monitoring module is internally provided with a data reading node monitoring program and is used for starting the data reading node program and distributing the measuring point table of the real-time database to each data reading node module. The invention writes the mass data read from the real-time database into the shared memory by simultaneously operating a plurality of data reading node programs, the data processing program takes out and processes the data from the shared memory, and then writes the processed result into the real-time database.

Description

Device and method for improving real-time database data processing performance
Technical Field
The invention relates to the technical field of development of computer server sides, in particular to a device and a processing method for improving the data processing performance of a real-time database.
Background
In developing a business system based on real-time data, a real-time database is required as a storage medium for the data. The common databases on the market are PI, rython, etc., and all these databases provide SDKs with data access to the service system. Given the access pressure on databases, none of these SDKs generally support multi-threaded parallel access. Therefore, in some cases where the calculation needs milliseconds and the amount of data is large, the performance of real-time data access cannot meet the services in the conventional manner.
Disclosure of Invention
In order to solve the problem of low performance of real-time data access in a conventional manner, the invention aims to provide a device and a processing method for improving the data processing performance of a real-time database.
In order to achieve the purpose, the technical scheme adopted by the invention and the beneficial effects of the invention are as follows:
an apparatus for improving real-time database data processing performance, comprising:
a real-time database 1 for storing production real-time data collected from each power plant;
the data reading node module 2 is internally provided with a data reading node program, is arranged in the computing server, is provided with a plurality of data reading nodes, reads the production real-time data in the real-time database 1 in parallel, and stores the read data into the shared memory database 4;
and the data reading node monitoring module 3 is internally provided with a data reading node monitoring program and is used for starting the data reading node program and distributing the measuring point table of the real-time database 1 to each data reading node module 2.
The data reading node module 2 consists of four parts of functions, including measuring point table and configuration acquisition, real-time data reading, cache queue and memory database writing;
the measuring point table and the configuration acquire the measuring point table of the real-time database 1 required to be read from the node monitoring module 3 through the web service, and the configuration of the reading interval and the number of the write-in threads of the memory database;
the real-time data reading is configured by the obtained measuring point table and the reading interval, and the data in the real-time database 1 is periodically read by the measuring point name;
the buffer queue passes the test roll name; a value; writing the read data into a cache queue in a timestamp mode;
and the memory database writing creates a plurality of threads according to the obtained number of the memory database writing threads, and each thread takes out data from the cache queue and writes the data into the memory database 4.
The shared memory database 4 is used for storing the real-time data to be processed read by the data reading node module 2 and providing a high-concurrency and high-performance data reading interface for the data processing module 5;
the data processing module 5 is provided with a data processing program inside, reads data from the shared memory database 4 for processing, and writes the processing result into the real-time database 1 again.
And the data processing module 5 reads data from the shared memory database 4 for data cleaning or calculation.
A processing method for improving the data processing performance of a real-time database comprises the following steps;
step 1:
deploying all programs except the real-time database 1 on one server;
step 2:
starting a shared memory database 4;
and step 3:
configuring the number of nodes needing to be started in a data reading node monitoring module 3, and configuring a measuring point table of a real-time database 1 needing to be read by each node;
and 4, step 4:
the data reading node monitoring program module 3 starts all the data reading node program modules 2 and distributes the measuring point table of the real-time database 1 to different nodes;
and 5:
the data reading node program module 2 writes the read data into the shared memory database 4;
and 6:
and the data processing program module 5, wherein the data processing program 5 writes the data processing result into the real-time database 1.
The invention has the beneficial effects that:
the invention reads the data in the real-time database 1 in parallel by deploying a plurality of data reading node program modules 2. The read data is then stored in the shared memory database 4. The data processing program 5 reads data from the in-memory database and performs data cleaning or calculation. In the data reading process, parallel access is simulated in a multi-node mode, and the data reading performance is improved. In the data processing process, the overhead caused by network access and file access is reduced by using the shared memory database 4. And finally, the calculation function of millisecond level required by the business system is supported.
Drawings
Fig. 1 is a schematic deployment view of the present apparatus.
FIG. 2 is a schematic diagram of a data processing flow according to the present invention.
FIG. 3 is a data reading node module program architecture diagram.
Fig. 4 is a schematic diagram of a conventional data processing flow.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings.
The specific process of a device and a processing method for improving the data processing performance of a real-time database is introduced as follows:
the SDK for data access of the real-time database has the characteristics that only one access instance can exist in each data reading node program, and even if the multithreading technology is used, the access instances are finally restrained in a form of locks and are guaranteed to be used by threads in a queuing mode in a concurrent state. However, if a plurality of data reading node programs are started simultaneously, each data reading node program can have its own access instance, so that a plurality of access instances can read data in parallel, and the data reading efficiency is improved.
The data of each data reading node program is isolated by using a process. For security reasons, the data processing program cannot directly read the data of the clients. The data of these clients needs to be written into a common database. Here to improve read write performance; network and file access expenses are reduced, and the memory database is selected to be used for storing the data.
The data processing program reads data from the shared memory database 4 in a multithreading mode, and performs processing such as cleaning and calculation on the data. The results are then written to the real-time database 1.
As shown in fig. 3: the data reading node module 2 consists of four parts of functions, including measuring point table and configuration acquisition, real-time data reading, cache queue and memory database writing;
the measuring point table and the configuration obtain the measuring point table of the real-time database 1 required to be read from the node monitoring module 3 through the web service, and the configuration of the reading interval and the number of the writing threads of the memory database;
the real-time data reading is configured by the obtained measuring point table and the reading interval, and the data in the real-time database 1 is periodically read by the measuring point name;
the buffer queue passes the test roll name; a value; writing the read data into a cache queue in a timestamp mode;
and the memory database writing creates a plurality of threads according to the obtained number of the memory database writing threads, and each thread takes out data from the cache queue and writes the data into the memory database 4.
In summary, the specific implementation steps of the present invention are as follows:
step 1:
according to fig. 1, all programs except the real-time database 1 are deployed on one server;
step 2:
starting a shared memory database 4;
and step 3:
configuring the number of nodes needing to be started in a data reading node monitoring program 3, and configuring a measuring point table of a real-time database 1 needing to be read by each node;
and 4, step 4:
the data reading node monitoring program module 3 starts all the data reading node program modules 2 and distributes the measuring point tables of the database 1 to different nodes;
and 5:
the data reading node program module 2 writes the read data into the shared memory database 4;
and 5:
and the data processing program module 5 is used for writing the data processing result into the real-time database 1 by the data processing program 5.
Compared with the conventional data processing flow (figure 4), the invention (figure 2) improves the processing performance of the real-time database data. Support is provided for the millisecond-level computing function required by the business system.

Claims (5)

1. An apparatus for improving real-time database data processing performance, comprising:
a real-time database (1) for storing production real-time data collected from each power plant;
the data reading node module (2) is internally provided with a data reading node program, is arranged in the computing server, is provided with a plurality of data reading nodes, reads the production real-time data in the real-time database (1) in parallel, and stores the read data into the shared memory database (4);
the data reading node monitoring module (3) is internally provided with a data reading node monitoring program and is used for starting the data reading node program and distributing the measuring point table of the real-time database (1) to each data reading node module (2);
the data reading node module (2) consists of four parts of functions, including measuring point table and configuration acquisition, real-time data reading, cache queue and memory database writing;
the measuring point table and the configuration acquire the measuring point table of the real-time database (1) required to be read from the node monitoring module (3) through the web service, and the configuration of the number of the read intervals and the write threads of the memory database;
the real-time data reading is configured by the acquired measuring point table and the reading interval, and the data in the real-time database (1) is periodically read by the name of the measuring point;
the buffer queue passes the test roll name; a value; writing the read data into a cache queue in a timestamp mode;
and the memory database writing creates a plurality of threads according to the obtained number of the memory database writing threads, and each thread takes out data from the cache queue and writes the data into the memory database (4).
2. An apparatus for improving data processing performance of a real-time database according to claim 1, wherein the shared memory database (4) is configured to store the real-time data to be processed read by the data reading node module (2), and provide a high-concurrency, high-performance data reading interface to the data processing module (5).
3. The device for improving the data processing performance of the real-time database according to claim 2, wherein the data processing module (5) is internally provided with a data processing program, reads data from the shared memory database (4) for processing, and rewrites the processing result into the real-time database (1).
4. An arrangement for improving the performance of real-time database data processing according to claim 2, characterized in that the data processing module (5) reads data from the shared memory database (4) for data cleansing or calculation.
5. The processing method of the device for improving the data processing performance of the real-time database is characterized by comprising the following steps;
step 1:
deploying all programs except the real-time database (1) on one server;
step 2:
starting a shared memory database (4);
and step 3:
configuring the number of nodes needing to be started in a data reading node monitoring module (3), and configuring a measuring point table of a real-time database (1) needing to be read by each node;
and 4, step 4:
the data reading node monitoring program module (3) starts all the data reading node program modules (2) and distributes the measuring point tables of the real-time database (1) to different nodes;
and 5:
the data reading node program module (2) writes the read data into the shared memory database (4);
step 6:
and the data processing program module (5) is used for writing the data processing result into the real-time database (1) by the data processing program (5).
CN202110498192.3A 2021-05-08 2021-05-08 Device and method for improving real-time database data processing performance Active CN113220716B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110498192.3A CN113220716B (en) 2021-05-08 2021-05-08 Device and method for improving real-time database data processing performance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110498192.3A CN113220716B (en) 2021-05-08 2021-05-08 Device and method for improving real-time database data processing performance

Publications (2)

Publication Number Publication Date
CN113220716A CN113220716A (en) 2021-08-06
CN113220716B true CN113220716B (en) 2023-03-03

Family

ID=77091751

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110498192.3A Active CN113220716B (en) 2021-05-08 2021-05-08 Device and method for improving real-time database data processing performance

Country Status (1)

Country Link
CN (1) CN113220716B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114547018B (en) * 2022-04-24 2022-08-16 西安热工研究院有限公司 Method and system for automatically cleaning waste points of SIS real-time database
CN114579408A (en) * 2022-05-05 2022-06-03 西安热工研究院有限公司 System and method for analyzing real-time equation of real-time database
CN115794900A (en) * 2022-11-10 2023-03-14 南京捷崎信息科技有限公司 Data processing method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106599036A (en) * 2016-11-07 2017-04-26 济中节能技术(苏州)有限公司 Server cluster-based parallel real-time database
CN106897458A (en) * 2017-03-10 2017-06-27 广州白云电器设备股份有限公司 A kind of storage and search method towards electromechanical equipment data
CN109669929A (en) * 2018-12-14 2019-04-23 江苏瑞中数据股份有限公司 Method for storing real-time data and system based on distributed parallel database
CN109947810A (en) * 2017-11-16 2019-06-28 上海明匠智能系统有限公司 Industrial equipment data real-time collecting system
CN112269835A (en) * 2020-11-10 2021-01-26 浪潮云信息技术股份公司 Method for asynchronously reading and processing batch data by distributed database

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8229945B2 (en) * 2008-03-20 2012-07-24 Schooner Information Technology, Inc. Scalable database management software on a cluster of nodes using a shared-distributed flash memory
US8788465B2 (en) * 2010-12-01 2014-07-22 International Business Machines Corporation Notification of configuration updates in a cluster system
CN103514261B (en) * 2013-08-13 2017-03-15 北京华电天益信息科技有限公司 A kind of asynchronous storage of data for being applied to industrial control system and access method
US11074265B2 (en) * 2018-04-02 2021-07-27 International Business Machines Corporation Expression data structure dataset processing

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106599036A (en) * 2016-11-07 2017-04-26 济中节能技术(苏州)有限公司 Server cluster-based parallel real-time database
CN106897458A (en) * 2017-03-10 2017-06-27 广州白云电器设备股份有限公司 A kind of storage and search method towards electromechanical equipment data
CN109947810A (en) * 2017-11-16 2019-06-28 上海明匠智能系统有限公司 Industrial equipment data real-time collecting system
CN109669929A (en) * 2018-12-14 2019-04-23 江苏瑞中数据股份有限公司 Method for storing real-time data and system based on distributed parallel database
CN112269835A (en) * 2020-11-10 2021-01-26 浪潮云信息技术股份公司 Method for asynchronously reading and processing batch data by distributed database

Also Published As

Publication number Publication date
CN113220716A (en) 2021-08-06

Similar Documents

Publication Publication Date Title
CN113220716B (en) Device and method for improving real-time database data processing performance
Zhang et al. GraphP: Reducing communication for PIM-based graph processing with efficient data partition
US9619430B2 (en) Active non-volatile memory post-processing
CN111077870A (en) Intelligent OPC data real-time acquisition and monitoring system and method based on stream calculation
Qian et al. Timestream: Reliable stream computation in the cloud
Rabl et al. Solving big data challenges for enterprise application performance management
CN104391903A (en) Distributed storage and parallel calculation-based power grid data quality detection method
KR20100070968A (en) Cluster data management system and method for data recovery using parallel processing in cluster data management system
CN110032558B (en) Data synchronization method, device and system and storage medium
CN108595605A (en) A kind of construction method of car networking platform database
CN103793442A (en) Spatial data processing method and system
CN112800058A (en) Method for realizing HBase secondary index
Lu et al. TridentKV: A read-Optimized LSM-tree based KV store via adaptive indexing and space-efficient partitioning
Andreolini et al. A software architecture for the analysis of large sets of data streams in cloud infrastructures
US10558391B2 (en) Data processing system and data processing method
CN112328569A (en) Construction method based on Flume distributed data collection architecture
CN108628897A (en) Operation management method based on fast data and big data Technical Architecture
CN103236938A (en) Method and system for user action collection based on cache memory and asynchronous processing technology
Ding et al. Commapreduce: An improvement of mapreduce with lightweight communication mechanisms
CN111666144A (en) Batch processing task execution method and system and machine room deployment system
Sang et al. A fast implementation of parallel discrete-event simulation on GPGPU
CN116185298A (en) Method for distributed storage of logs
Song Performance and energy optimization on TeraSort algorithm by task self-resizing
CN113641654A (en) Marketing handling rule engine method based on real-time event
CN112783620A (en) Optimization method for timing task scheduling, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant