CN105809577B - Power plant informatization data classification processing method based on rules and components - Google Patents

Power plant informatization data classification processing method based on rules and components Download PDF

Info

Publication number
CN105809577B
CN105809577B CN201610261317.XA CN201610261317A CN105809577B CN 105809577 B CN105809577 B CN 105809577B CN 201610261317 A CN201610261317 A CN 201610261317A CN 105809577 B CN105809577 B CN 105809577B
Authority
CN
China
Prior art keywords
data
power plant
processing
task
rules
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201610261317.XA
Other languages
Chinese (zh)
Other versions
CN105809577A (en
Inventor
包铁
韩璐
刘淑芬
姚志林
张欣佳
吴姚睿
彭君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jilin University
Original Assignee
Jilin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jilin University filed Critical Jilin University
Priority to CN201610261317.XA priority Critical patent/CN105809577B/en
Publication of CN105809577A publication Critical patent/CN105809577A/en
Application granted granted Critical
Publication of CN105809577B publication Critical patent/CN105809577B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/06Electricity, gas or water supply
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Abstract

The invention discloses a classification processing method of power plant informatization data based on rules and components, which comprises the following steps: step (1), establishing a classification model according to the characteristics of power plant informatization data; step (2) establishing or selecting a proper component from a component library based on the classification model and the application requirement; step (3) establishing or selecting a proper rule from a rule base based on the classification model and the application requirement; and (4) aiming at the data needing to be processed, establishing or selecting a proper task from a task library based on corresponding components and rules, providing specific parameters to start the task, and processing the informatization data of the power plant. The method fully considers the diversity and complexity of the power plant information data, collects, analyzes, processes, sends and stores the data according to application requirements based on components and rules, and can provide support for other applications based on the power information data.

Description

Power plant informatization data classification processing method based on rules and components
The technical field is as follows:
the invention relates to a method for processing data in the field of computers, in particular to a method for analyzing and processing informatization data of a power plant through rules and components.
Background art:
with the continuous deepening of the informatization of the power industry and the continuous development of the computer technology, the businesses and the production of a plurality of units and departments in the power industry are more and more dependent on computers and software, so that the working efficiency can be improved, and higher reliability and flexibility can be provided for the production operation and the business management of enterprises. However, due to different industry informatization degrees and complexity of working environments of the power industry, the problems that various types of data and various software systems coexist and the like exist in the power industry at present, analysis and processing of the informatization data become key problems in an industry informatization process, and a method and a framework with good operability and universality are provided for data processing of the power industry, and become important points of attention of industry experts and scholars.
The power plant is a basic supporting node of the power industry, the diversity and the complexity of the power plant information data are very typical, and the analysis and the processing of a plurality of power industry data are based on the power plant information data, so the power plant information data are used as research objects in the invention. Due to differences in power generation modes and informatization degrees, types and characteristics of data to be processed in each power plant are also various, for example, a thermal power plant, a hydraulic power plant, a wind power plant, a nuclear power plant, a biomass power plant and the like need to process different service data, and many power plants simultaneously operate an equipment operation control system, an information management system, a truck scale/rail scale metering system, a fuel test analysis system and the like, so that the source and storage formats of the data are also complicated. At present, a unified processing method and a framework aiming at the informationized data are lacked in a power plant, and various data are respectively processed by adopting various different tools, so that the invention provides a classification processing method of the informationized data of the power plant based on rules and components, and the informationized data of various complex and diverse power plants can be uniformly processed based on application requirements.
The invention content is as follows:
the invention mainly aims to provide a classification processing method of power plant informatization data based on rules and components, which fully considers the characteristics of diversity and complexity of power plant informatization data, establishes a classification model containing data characteristics, determines components and rules supporting three main stages of data processing, collection, analysis processing and sending and storage based on the classification model and application requirements, describes a typical processing method of each type of data through tasks, and maintains the components, the rules and the tasks in a resource library so as to facilitate the reuse of the resources. The invention provides a method with better flexibility and universality for the processing of the power plant information data and also provides support for other applications based on the power plant information data.
The invention aims to be realized by the following technical scheme:
a classification processing method of power plant information data based on rules and components comprises the following steps:
step (1), establishing a classification model according to the characteristics of power plant informatization data;
step (2) establishing or selecting a proper component from a component library based on the classification model and the application requirement;
step (3) establishing or selecting a proper rule from a rule base based on the classification model and the application requirement;
and (4) aiming at the data needing to be processed, establishing or selecting a proper task from a task library based on corresponding components and rules, providing specific parameters to start the task, and processing the informatization data of the power plant.
According to the above features, the classification model has a hierarchical structure, and can be established according to application requirements and data characteristics, each leaf node in the model corresponds to a type of data, and a non-leaf node describes a certain characteristic of the type of data.
According to the above features, the method for creating or selecting a suitable component in a component library comprises two main stages, a first stage of retrieving each node of the classification model and determining relevant characteristics of the data to be processed, and a second stage of creating or selecting a suitable component in the component library based on the retrieved data characteristics.
According to the above features, the method of creating or selecting suitable rules in the rule base comprises two main stages, a first stage of retrieving each node of the classification model and determining relevant characteristics of the data to be processed, and a second stage of creating or selecting suitable rules in the rule base based on the retrieved data characteristics and application requirements.
According to the above feature, the step (4) specifically includes the steps of:
step (4.1) based on the determined task and application requirements, providing specific parameters used in each stage of data acquisition, analysis processing, sending and storage;
and (4.2) creating a task execution space, creating each task instance by a task execution engine, completing the work of required component instantiation, rule loading and the like, and starting an actual data processing process.
Description of the drawings:
FIG. 1 is a general schematic diagram of a classification processing method for plant informatization data according to the present invention;
FIG. 2 is a diagram of the relationship between classification models, resource pools, and data processing tasks in the present invention.
The specific implementation mode is as follows:
the present invention will be described in further detail with reference to the accompanying drawings.
As shown in fig. 1, a general schematic diagram of the classification processing method for the plant informatization data provided by the invention is described, wherein solid arrows indicate that the informatization data sequentially passes through a collector, an analysis processor and a transmitter, dashed arrows indicate that a task execution engine schedules and controls rules of the collector, the analysis processor and the transmitter, rectangular arrows indicate that appropriate components, rules and tasks are selected and output from a component library, a rule library and a task library, and newly established or modified on the original basis are also stored in a resource library. The resource library is used for storing related resources of data processing and supporting reuse of components, rules and tasks. The classification processing method provided by the invention comprises the following specific implementation steps:
(1) establishing a classification model
By analyzing the characteristics of the plant information data and establishing a classification model with a hierarchical structure based on an application environment, as shown in fig. 2, each specific type of data corresponds to one leaf node in the model, and a non-leaf node represents a certain characteristic of the data, so that subsequent work can be carried out based on the type and the characteristic of the data. The power plant informatization data is various and complex, such as: each type of data has different sources, storage formats, contents, destinations and the like, the classification model is established according to the characteristics and application requirements of the data, and the non-leaf nodes encapsulate the characteristics required to be used in the data processing process. The characteristics of the plant information data are more, characteristics packaged by non-leaf nodes in a classification model need to be selected according to requirements, and the main characteristics relate to: the data can be original production and service data of the power plant or monitoring data of an information system of the power plant; the storage format of the data, namely the data can be stored in common files such as excel, word, txt and the like, and can also be stored in files which can be operated only by a specific tool; content of data — data may be text, numbers, or special models; data destination-similar to the source of the data, the processed data may need to be stored in a database, a file, etc., or may need to be sent out via a network according to a certain protocol format.
(2) Determining suitable components
Based on the characteristics of each type of data in the classification model, appropriate components need to be determined for the entire data processing process. Of course, if there is no suitable component in the component library, the original component can be modified according to the requirement, or some new components can be customized, and these components will also be saved in the component library. As shown in fig. 1 and 2, the overall data processing process is divided into three major phases, each of which may require corresponding component support. The first stage is data acquisition, and suitable components can be selected or created according to relevant characteristics such as the source of data in the classification model, for example: if the data comes from a table of a certain database, a corresponding database driving component is needed to support the acquisition of the data; the network snooping protocol component is required to support the collection of data if the data originates from the network interface snooping. The second phase is data analysis processing, if only simple statistics, association, calculation and other operations are performed on the data, the support of components may not be needed, but if more complex statistics analysis, data mining or some special processing is involved, the support of related components is also needed, such as: data is text content, and its tendency and sensitive words need to be analyzed, then a corresponding text analysis component is needed to support the analysis processing of the data. The third phase is data transmission and storage, which is similar to the first phase and also needs to select or create a suitable component according to the destination and other relevant characteristics of the data, such as: after data processing, the data needs to be stored in a table of a certain database, and a driving component of the corresponding database is needed to support data transmission.
(3) Determining appropriate rules
Depending on the application requirements, it may also be necessary to determine appropriate rules for the entire data processing process based on the classification model. The rules are encapsulated with specific business logic, can be selected from a rule base, can be created or modified on the original basis, and new rules are stored in the rule base. As shown in fig. 1 and 2, the three main stages of the data processing process all require appropriate rules for control. The first stage is data acquisition, and the created or selected acquisition rule packages the specific operation method and flow for data acquisition by using the components, and the acquisition rule and the corresponding components form an acquisition device to be responsible for data acquisition. The second stage is data processing analysis, the created and selected processing rules encapsulate the logic and specific operation method for data processing by using components, and the processing rules and corresponding components form an analysis processor for data analysis and processing. The third stage is data transmission storage, the created or selected transmission rule encapsulates the specific operation method and data format for data transmission storage by using the components, and the transmission rule and the corresponding components form a transmitter to be responsible for the data transmission storage work.
(4) Establishing data processing tasks
The power plant information data are various, and the collection, processing, sending and storage processes of each kind of data need to be clear, so that the correctness of the data processing work can be ensured. Here, a corresponding data processing task is established for each type of data, a suitable task can be selected from the task library as well, a new task can be created according to requirements or modified on the original basis, and the new task can be saved in the task library as well. The task is a complete processing method for packaging a type of data based on a classification model, namely components and rules determined in three main stages of data processing are explicitly recorded in the task, usually, one task describes a typical processing process of a type of power plant informatization data, and if the same or similar type of data needs to be processed subsequently, the corresponding task can be directly reused or modified on the basis of the original task.
(5) Initiating data processing tasks
Based on the tasks that have been determined, we can start a specific data acquisition process, i.e. start a data processing task. As shown in fig. 1 and 2, initiating a data processing task comprises two phases, the first phase being the provision of specific parameters, i.e. the provision of specific data processing parameters based on the task that has been determined. Such as in the data acquisition or transmission storage phase: the database driving component is adopted, and generally an IP address of a database operation server, a database monitoring port number, a database service name, a database login user name/password, a table name for storing data and the like are provided; by adopting the file component, a file path, a file name, a file extension name and the like are generally required to be provided; with network listen or transmit components, it is generally necessary to provide a listen protocol name, a listen port, a data sampling interval, etc. And in the data analysis processing stage: with some component of a data mining or data analysis algorithm, it is necessary to provide some condition parameters for the algorithm to run and to perform initial parameters. And in the second stage, based on specific parameters and corresponding tasks, a task execution space is created, each task instance is created by a task execution engine (including a rule engine), required components are instantiated, corresponding rules are loaded to control program execution, and the whole processes of data acquisition, analysis processing and transmission storage are completed.
An example of a thermal power plant is given below, and a brief description will be given of an embodiment of the present invention in conjunction with information data to be processed by the example. The method provided by the invention is adopted to process data according to the implementation steps given above:
(1) establishing a classification model
Table 1 example data classification model
Figure BDA0000972967320000061
Analyzing the characteristics of informationized data required to be processed by the thermal power plant, establishing a classification model by combining with actual application requirements, and selecting four data for explaining in order to simplify an example, as shown in table 1, because a root node has no practical significance, all levels of nodes below the root node and meanings thereof are listed, a first level node encapsulates the source characteristics of the data, a second level node encapsulates the destination characteristics of the data, a third level node encapsulates the processing characteristics of the data, leaf nodes correspond to four different types of data respectively, and each type of data has an internal unique identifier, thereby ensuring the accuracy in model retrieval.
(2) Determining suitable components
The components that need to be used throughout the data process are determined based on the classification model and the application requirements. In the data acquisition stage: the data source characteristics of the primary nodes of the retrieval and analysis model are mainly retrieved, and the four types of data sources are different, so that the data source characteristics are supported by an Oracle database driving component, an equipment control system interface component (software interface component), an Excel file component and a Word file component respectively. In the data analysis processing stage: the data processing characteristics of three-level nodes of the retrieval analysis model are mainly retrieved, the first three types of data only comprise simple data association and extraction operations and do not need component support, and the last type of data needs complex text information analysis and therefore needs the support of a text analysis component. In the data transmission and storage stage: the data destination characteristic of the secondary node of the analysis model is mainly retrieved, and all four kinds of information need to be stored in the MySQL database, so that the support of a MySQL database driving component is needed.
(3) Determining appropriate rules
The rules that need to be used throughout the data process are determined based on the classification model and the application requirements. In the data acquisition stage: the four types of data are acquired by using corresponding components, and by analyzing application requirements, specific data contents of how to use the components for data acquisition and acquisition are explicitly written in the rules. In the data analysis processing stage: by analyzing application requirements, the first two data require rules to control their association, here using the device ID as the association identifier, the third data require rules to control the extracted key information, and the fourth data require rules to control the operating mode of the component. In the data transmission and storage stage: all four types of data are stored in the database, and the database driving component needs to be controlled to write the data into the corresponding table in the rule.
(4) Establishing data processing tasks
Three data processing tasks are established here based on the components and rules determined for the four types of data in the previous steps. The first task contains the first two kinds of data, and the task records the components and rules determined in the three main stages of data acquisition, analysis processing and transmission storage. Similarly, the second and third tasks record data processing information of the third and fourth data, respectively.
(5) Initiating data processing tasks
Specific parameter information is provided for the data to be collected. In the data acquisition stage: the first data needs to provide parameters such as an Oracle database address, a service name, a port, a user name/password, a table name and the like, the second data needs to provide parameters such as an IP address, a user name/password, authority and the like according to specific requirements of a software interface, the third data is similar to the fourth data, and information such as an address, a path, a file name, a file extension name and the like of a host where a file is located needs to be provided. In the data analysis processing stage: the fourth data needs to provide parameters such as a data dictionary for text analysis, emotion analysis keywords and the like. In the data transmission and storage stage: all four kinds of data need to provide parameters such as MySQL data address, service name, port, table space name, table name, field name and the like. Based on these already provided parameter information, a specific data processing procedure can be started by starting a number of instantiated tasks.

Claims (2)

1. A classification processing method of power plant information data based on rules and components comprises the following steps:
the method comprises the following steps that (1) according to different characteristics of different power plant information data, a classification model which has a hierarchical structure and comprises leaf nodes and non-leaf nodes is established for the different power plant information data, so that the different power plant information data are uniformly organized in the classification model; the classification model is established according to application requirements and data characteristics, and the classification model comprises different types of power plant informatization data and characteristics corresponding to the different types of power plant informatization data; one leaf node in the classification model corresponds to one type of power plant information data, a non-leaf node describes a certain characteristic of the corresponding type of power plant information data, and the characteristic of the power plant information data comprises any characteristic of a source characteristic, a destination characteristic and a processing characteristic of the data;
searching model nodes of the classification model based on application requirements, determining relevant characteristics of data to be processed, and establishing or selecting proper different components respectively used for acquiring, processing, sending and storing the data from a component library based on the searched data characteristics;
step (3) retrieving model nodes of the classification model based on application requirements, determining relevant characteristics of data to be processed, and establishing or selecting proper different rules for respectively acquiring, processing, sending and storing the data from a rule base based on the retrieved data characteristics; the acquisition rule and the corresponding acquisition component form an acquisition device for being responsible for the acquisition work of the data, the processing rule and the corresponding processing component form an analysis processor for being responsible for the analysis processing work of the data, and the sending rule and the corresponding sending component form a sender for being responsible for the sending and storing work of the data;
and (4) establishing or selecting a proper task from a task library based on a corresponding collector, an analysis processor and a transmitter which are respectively composed of different components and rules for collecting, processing, transmitting and storing the data according to the data to be processed, providing a specific parameter starting task, and processing the power plant information data, wherein the task in the task library can be reused.
2. The classification processing method for the plant informatization data according to claim 1, characterized in that the step (4) specifically comprises the following steps:
step (4.1) based on the determined task and application requirements, providing specific parameters used in each stage of data acquisition, analysis processing, sending and storage;
and (4.2) creating a task execution space, creating each task instance by a task execution engine, completing the needed component instantiation and rule loading work, and starting an actual data processing process.
CN201610261317.XA 2016-04-25 2016-04-25 Power plant informatization data classification processing method based on rules and components Expired - Fee Related CN105809577B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610261317.XA CN105809577B (en) 2016-04-25 2016-04-25 Power plant informatization data classification processing method based on rules and components

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610261317.XA CN105809577B (en) 2016-04-25 2016-04-25 Power plant informatization data classification processing method based on rules and components

Publications (2)

Publication Number Publication Date
CN105809577A CN105809577A (en) 2016-07-27
CN105809577B true CN105809577B (en) 2020-04-07

Family

ID=56457572

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610261317.XA Expired - Fee Related CN105809577B (en) 2016-04-25 2016-04-25 Power plant informatization data classification processing method based on rules and components

Country Status (1)

Country Link
CN (1) CN105809577B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110377753B (en) * 2019-07-01 2022-10-21 吉林大学 Relation extraction method and device based on relation trigger word and GRU model

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103902671A (en) * 2014-03-19 2014-07-02 北京科技大学 Dynamic integration method and system of multi-source heterogeneous data
CN104573205A (en) * 2014-12-26 2015-04-29 成都飞机设计研究所 Test data processing method based on component technology

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105260835A (en) * 2015-10-13 2016-01-20 北京凯行同创科技有限公司 Method for modeling, analyzing, and self-optimizing multi-source business big data

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103902671A (en) * 2014-03-19 2014-07-02 北京科技大学 Dynamic integration method and system of multi-source heterogeneous data
CN104573205A (en) * 2014-12-26 2015-04-29 成都飞机设计研究所 Test data processing method based on component technology

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于业务规则的电信网络数据处理框架;包铁等;《计算机应用研究》;20061201(第12期);第287-288,332页 *

Also Published As

Publication number Publication date
CN105809577A (en) 2016-07-27

Similar Documents

Publication Publication Date Title
CN111159191B (en) Data processing method, device and interface
US8219518B2 (en) Method and apparatus for modelling data exchange in a data flow of an extract, transform, and load (ETL) process
Zhu et al. A framework-based approach to utility big data analytics
EP2779044A1 (en) System and method to provide management of test data at various lifecycle stages
CN115129736B (en) Rule event dynamic loading and updating method based on rule engine and related equipment
CN112347071B (en) Power distribution network cloud platform data fusion method and power distribution network cloud platform
EP2110781A1 (en) Method and system for automatic tracing of a computerized process using a relationship model
US9098497B1 (en) Methods and systems for building a search service application
US20230021259A1 (en) Methods and systems for building search service application
CN103744647A (en) Java workflow development system and method based on workflow GPD
CN111400288A (en) Data quality inspection method and system
CN111190814A (en) Software test case generation method and device, storage medium and terminal
CN111104441A (en) Data acquisition method and system
CN105809577B (en) Power plant informatization data classification processing method based on rules and components
CN107423035B (en) Product data management system in software development process
CN109951376B (en) Instant messaging software information acquisition method, device, system and storage medium
CN104462461A (en) Form null value check processing method and device
CN107357919A (en) User behaviors log inquiry system and method
CN115439015B (en) Local area power grid data management method, device and equipment based on data middleboxes
CN116150236A (en) Data synchronization method and device, electronic equipment and computer readable storage medium
US8856152B2 (en) Apparatus and method for visualizing data
CN115114297A (en) Data lightweight storage and search method and device, electronic equipment and storage medium
CN114547206A (en) Data synchronization method and data synchronization system
CN111104558A (en) Distributed multi-source data processing method and system
Novak et al. Proposed architecture for ETL workflow generator

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200407

Termination date: 20210425

CF01 Termination of patent right due to non-payment of annual fee