CN105488632A - Method and device for analyzing electric data based on dimensional model - Google Patents
Method and device for analyzing electric data based on dimensional model
- Publication number
- CN105488632A (application number CN201510923014.5A)
- Authority
- CN
- China
- Prior art keywords
- data
- dimension table
- overall target
- attribute
- fact
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0637—Strategic management or analysis, e.g. setting a goal or target of an organisation; Planning actions based on goals; Analysis or evaluation of effectiveness of goals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/06—Energy or water supply
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y04—INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
- Y04S—SYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
- Y04S10/00—Systems supporting electrical power generation, transmission or distribution
- Y04S10/50—Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications
Landscapes
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Engineering & Computer Science (AREA)
- Economics (AREA)
- Strategic Management (AREA)
- General Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Marketing (AREA)
- Entrepreneurship & Innovation (AREA)
- Educational Administration (AREA)
- Tourism & Hospitality (AREA)
- Physics & Mathematics (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Development Economics (AREA)
- Game Theory and Decision Science (AREA)
- Public Health (AREA)
- Water Supply & Treatment (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The present invention relates to a method and device for analyzing electric power data based on a dimensional model. The method includes: establishing a fact table from fused power data, where the fact table includes comprehensive indicators and the fact data of those indicators, and the comprehensive indicators cover substations, lines owned by the power supply company, lines owned by users, transformers owned by the power supply company, transformers owned by users, and metering boxes; using the fact table to establish, for the comprehensive indicators, a gradually changing dimension table and a fast-changing dimension table, where the gradually changing dimension table includes stable attribute data and gradually changing attribute data, and the fast-changing dimension table includes the identification number (ID) of the comprehensive indicator and fast-changing attribute data; the fast-changing attribute data is the time-varying fact data corresponding to the ID of the comprehensive indicator, and the gradually changing attribute data is the time-varying voltage level data and operating status data corresponding to that ID; and analyzing the power data based on the gradually changing dimension table and the fast-changing dimension table.
Description
Technical Field
The invention relates to the technical field of data processing, and in particular to a method and device for analyzing power data based on a dimensional model.
Background
At present, the State Grid Corporation of China is building its marketing, distribution, and dispatching integration project. The project uses shared marketing and distribution data to support services such as fault location, outage range location, real-time line loss statistics, and business expansion and installation applications. It uses the integration of marketing and distribution information to promote the convergence of marketing and distribution business, and it establishes customer-oriented, cross-departmental, cross-discipline collaborative workflows and service mechanisms for marketing and distribution, fully supporting the centralization of all 95598 services and improving the quality of power supply service.
As the integration project has progressed, the marketing business application system and the equipment (asset) operation and maintenance lean management system PMS2.0 have begun to take shape, and the collection of coordinates and attributes for marketing-side and production-side equipment is largely complete. There is still considerable demand for verification, governance, and timely maintenance of the spatial data, attribute data, and logical relationships of the existing station-line-transformer-customer hierarchy. Power companies urgently need a standardized, accurate, secure, and efficient data quality verification tool that ensures the integrity, completeness, accuracy, and timeliness of the integrated data, promotes the practical use of each business system, and supports the company's integrated marketing, distribution, and dispatching construction and the informatization of the smart grid.
Dimensions are an important way of organizing data in the data fusion of marketing and distribution systems. In current dimension design for this fusion, dimensions are generally independent of time, which hinders business staff in querying and analyzing historical data. During the fusion process, it is necessary to track how data dimensions change and to analyze and process the data accordingly.
In the marketing and distribution system database, the fact table is the basic table for inter-system data fusion indicators and stores the fact data of marketing and distribution business indicators. The dimension table is the entry point to the fact table: through it, the fact data in the fact table can be sliced and analyzed. In existing databases, dimensions are treated as time-independent attributes that can be used to classify data.
In the data fusion of marketing and distribution systems, some dimensions change over time. If the dimension table is independent of time and retains only the current attribute values, access to the initial data and the historical change data in the fact table is directly affected. This approach has the following drawbacks:
(1) The evaluation perspective is relatively narrow and not comprehensive enough.
(2) The evaluation method is strongly influenced by subjective judgment.
(3) Only the current data quality status is evaluated, with no comparison over time against historical data, so the state of data governance in the marketing and distribution systems cannot be analyzed from changes in data dimension attributes.
Summary of the Invention
To solve the problems of the prior art, the present invention proposes a method and device for analyzing power data based on a dimensional model.
To achieve the above object, the present invention provides a method for analyzing power data based on a dimensional model, including:
establishing a fact table from the fused power data according to the dimension attributes of the power data indicators, where the fact table includes comprehensive indicators and the fact data of the comprehensive indicators, and the comprehensive indicators cover substations, lines owned by the power supply company, lines owned by users, transformers owned by the power supply company, transformers owned by users, and metering boxes;
using the fact table to establish a gradually changing dimension table and a fast-changing dimension table corresponding to the comprehensive indicators, where the gradually changing dimension table includes stable attribute data and gradually changing attribute data; the fast-changing dimension table includes the identification number (ID) of the comprehensive indicator and fast-changing attribute data; the fast-changing attribute data is the time-varying fact data corresponding to the ID of the comprehensive indicator, and the gradually changing attribute data is the time-varying voltage level data and operating status data corresponding to that ID;
analyzing the power data based on the gradually changing dimension table and the fast-changing dimension table.
Preferably, the fact data of the comprehensive indicators includes verification progress indicator data, data integrity indicator data, data consistency indicator data, data duplication indicator data, and data irregularity indicator data.
Preferably, the stable attribute data includes the comprehensive indicator identifier and the comprehensive indicator name.
Preferably, the gradually changing attribute data is recorded through attribute columns or tuples and, combined with the stable attribute data, forms the gradually changing dimension table.
Preferably, the fast-changing attribute data is converted using preset bands: the value range in the operational data environment is partitioned, and the fast-changing attribute data corresponding to each partition is stored together, forming the fast-changing dimension table.
Preferably, the power data includes marketing data and power distribution data.
Correspondingly, to achieve the above object, the present invention also provides a device for analyzing power data based on a dimensional model, including:
a data quality fact table establishment unit, configured to establish a data quality fact table from the fused power data, where the fact table includes comprehensive indicators and the fact data of the comprehensive indicators, and the comprehensive indicators cover substations, lines owned by the power supply company, lines owned by users, transformers owned by the power supply company, transformers owned by users, and metering boxes;
a dimension table establishment unit, configured to use the fact table to establish a gradually changing dimension table and a fast-changing dimension table corresponding to the comprehensive indicators, where the gradually changing dimension table includes stable attribute data and gradually changing attribute data; the fast-changing dimension table includes the ID of the comprehensive indicator and fast-changing attribute data; the fast-changing attribute data is the time-varying fact data corresponding to the ID of the comprehensive indicator, and the gradually changing attribute data is the time-varying voltage level data and operating status data corresponding to that ID;
an analysis unit, configured to analyze the power data based on the gradually changing dimension table and the fast-changing dimension table.
The above technical solution has the following beneficial effects:
The technical solution effectively reduces the impact of dimension update operations on the database as a whole. Because the changing dimension tables retain the change history of attribute values, statistical data can be tracked through its whole lifecycle along the dimensions of substations, lines owned by the power supply company, lines owned by users, transformers owned by the power supply company, transformers owned by users, and metering boxes. This facilitates in-depth mining and analysis of the indicator data and improves the traceability and utilization of historical data in the data fusion verification system. In addition, this dimensional model design technique makes it possible to quickly identify weaknesses in data verification across different prefecture-level cities, helps business staff find where to focus verification work, and allows phased data verification plans to be adjusted promptly for weak points in data quality, thereby improving the efficiency of data verification and governance.
Brief Description of the Drawings
To explain the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below are only some embodiments of the present invention; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Figure 1 is a flowchart of the method for analyzing power data based on a dimensional model proposed by the present invention;
Figure 2 is a block diagram of the device for analyzing power data based on a dimensional model proposed by the present invention;
Figure 3 is an architecture diagram of the technical solution of this embodiment;
Figure 4 is a schematic diagram of the star schema of the database dimensional model in this embodiment;
Figure 5 is a schematic diagram of the substation gradually changing dimension table in this embodiment.
Detailed Description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by a person of ordinary skill in the art from these embodiments without creative effort fall within the scope of protection of the present invention.
Working principle of this technical solution: the dimension attributes of power data indicators are divided into three classes according to how they change over time. Class 1 does not change over time and is called a stable dimension; class 2 changes slowly over time and is called a gradually changing dimension; class 3 changes quickly over time and is called a fast-changing dimension. These three classes of dimension attributes call for different dimensional model design techniques. When designing the database's dimensional model, the majority of dimension attributes, which are independent of time, should be used to build stable dimension tables. At the same time, attributes that change over time should be handled with gradually changing and fast-changing dimensions that record the history of their changes, providing a comprehensive analysis platform for marketing and distribution data fusion based on these two kinds of variable dimensions.
The technical objectives and technical features of the project are:
(1) For gradually changing dimension data, when data in the dimension table changes, a new attribute column or tuple is added to the dimension table to record the changed tuple data and so preserve the change history. Data analysis and governance can then be based on the change history of the dimension attributes.
(2) For fast-changing dimension data, the power data can be converted using "preset bands": the value range of an attribute in the operational data environment is mapped to a small set of discrete values. Preset bands effectively reduce how often the dimension attribute changes and also make the related analysis operations easier to perform.
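The "preset band" conversion in item (2) can be sketched as follows. This is a minimal illustration, not the patent's implementation: the band boundaries and labels are hypothetical (the values of Table 1 are not available in this text), and `to_band` is a name introduced here only for the example.

```python
from bisect import bisect_right

# Hypothetical band boundaries (percent) and labels; the patent's own
# bands (its Table 1) are not reproduced in this text.
BAND_EDGES = [60.0, 80.0, 95.0]
BAND_LABELS = ["low", "medium", "high", "very high"]


def to_band(value: float) -> str:
    """Map a raw percentage in [0, 100] to one of a few discrete bands."""
    if not 0.0 <= value <= 100.0:
        raise ValueError(f"expected a percentage in [0, 100], got {value}")
    # bisect_right counts how many edges are <= value; that count is the
    # index of the band the value falls into.
    return BAND_LABELS[bisect_right(BAND_EDGES, value)]


# Raw indicator values change constantly, but their bands change rarely.
readings = [59.9, 60.0, 87.5, 99.2]
bands = [to_band(v) for v in readings]
```

Because many nearby raw values share one band, a dimension attribute stored as a band only changes when a reading crosses a boundary, which is the reduction in change frequency the text describes.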
Based on the above working principle, the present invention proposes a method for analyzing power data based on a dimensional model, as shown in Figure 1. The method includes:
Step S101): according to the dimension attributes of the power data indicators, establish a fact table from the fused power data, where the fact table includes comprehensive indicators and the fact data of the comprehensive indicators; the comprehensive indicators cover substations, lines owned by the power supply company, lines owned by users, transformers owned by the power supply company, transformers owned by users, and metering boxes; and the fact data of the comprehensive indicators includes verification progress indicator data, data integrity indicator data, data consistency indicator data, data duplication indicator data, and data irregularity indicator data.
Step S102): use the fact table to establish a gradually changing dimension table and a fast-changing dimension table corresponding to the comprehensive indicators, where the gradually changing dimension table includes stable attribute data and gradually changing attribute data; the fast-changing dimension table includes the ID of the comprehensive indicator and fast-changing attribute data; the fast-changing attribute data is the time-varying fact data corresponding to the ID of the comprehensive indicator, and the gradually changing attribute data is the time-varying voltage level data and operating status data corresponding to that ID;
Step S103): analyze the power data based on the gradually changing dimension table and the fast-changing dimension table.
Correspondingly, the present invention also proposes a device for analyzing power data based on a dimensional model, as shown in Figure 2. The device includes:
a data quality fact table establishment unit 201, configured to establish a data quality fact table from the fused power data according to the dimension attributes of the power data indicators, where the fact table includes comprehensive indicators and the fact data of the comprehensive indicators, and the comprehensive indicators cover substations, lines owned by the power supply company, lines owned by users, transformers owned by the power supply company, transformers owned by users, and metering boxes;
a dimension table establishment unit 202, configured to use the fact table to establish a gradually changing dimension table and a fast-changing dimension table corresponding to the comprehensive indicators, where the gradually changing dimension table includes stable attribute data and gradually changing attribute data; the fast-changing dimension table includes the ID of the comprehensive indicator and fast-changing attribute data; the fast-changing attribute data is the time-varying fact data corresponding to the ID of the comprehensive indicator, and the gradually changing attribute data is the time-varying voltage level data and operating status data corresponding to that ID;
an analysis unit 203, configured to analyze the power data based on the gradually changing dimension table and the fast-changing dimension table.
The technical solution is described in further detail below with reference to an embodiment.
This technical solution uses a database plus an online analytical processing (OLAP) system. The database integrates, stores, and manages the subject data of the marketing and distribution systems, and OLAP performs multidimensional analysis of that subject data. The database is built on two data sources: the existing marketing business application system database and the equipment (asset) operation and maintenance lean management system database, as shown in Figure 3.
The database dimensional model uses a star schema, as shown in Figure 4, comprising one fact table and six dimension tables. The fact table describes the subject of data quality. Its grain is, for each prefecture-level city, the comprehensive indicator covering substations, lines owned by the power supply company, lines owned by users, transformers owned by the power supply company, transformers owned by users, and metering boxes. Because the data quality of every class of equipment affects a city's comprehensive indicator, the indicator can be analyzed along six dimensions: substations, lines owned by the power supply company, lines owned by users, transformers owned by the power supply company, transformers owned by users, and metering boxes. The fact data reflecting the comprehensive indicator are: verification progress indicator data, data integrity indicator data, data consistency indicator data, data duplication indicator data, and data irregularity indicator data. In the dimensional model, the fact table and the dimension tables are linked through keys. The keys in the dimension tables are values such as the customer number, equipment code, or substation identifier. A new tuple has the same key as the old tuple. In a gradually changing dimension table, the key is a dimension value defined and assigned by the system to identify the tuples in the dimension table, and the corresponding key is used in the fact table to link the fact table to the gradually changing dimension table.
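The star schema described above can be sketched with Python's built-in `sqlite3` module. This is an illustrative sketch only: it shows one of the six dimension tables, and every table name, column name, and data value below is hypothetical rather than taken from the patent.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_substation (
    substation_key   INTEGER PRIMARY KEY,  -- system-assigned key
    substation_code  TEXT NOT NULL,        -- business identifier
    substation_name  TEXT NOT NULL,
    voltage_level    TEXT NOT NULL,
    operating_status TEXT NOT NULL
);
CREATE TABLE fact_data_quality (
    city           TEXT NOT NULL,
    substation_key INTEGER NOT NULL REFERENCES dim_substation(substation_key),
    integrity      REAL NOT NULL           -- one of the five indicator measures
);
""")
conn.executemany(
    "INSERT INTO dim_substation VALUES (?, ?, ?, ?, ?)",
    [
        (1, "SUB-001", "North", "35kV", "in service"),
        (2, "SUB-002", "South", "110kV", "in service"),
    ],
)
conn.executemany(
    "INSERT INTO fact_data_quality VALUES (?, ?, ?)",
    [("City A", 1, 90.0), ("City A", 2, 70.0), ("City B", 2, 80.0)],
)

# Slice the fact data through the dimension table: average integrity
# score per voltage level.
rows = conn.execute("""
    SELECT d.voltage_level, AVG(f.integrity)
    FROM fact_data_quality AS f
    JOIN dim_substation AS d USING (substation_key)
    GROUP BY d.voltage_level
    ORDER BY d.voltage_level
""").fetchall()
```

The fact table holds only keys and measures, so any descriptive attribute used for grouping or filtering comes from a dimension table reached through the key join.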
1. Gradually changing dimension scheme
In a gradually changing dimension table, when an attribute of a tuple changes, a tuple with the same key and the new attribute value can be added to the table, and the update history of the old data is retained. New tuples in the fact table use the same key that the historical data still uses, so the fact table does not need to be modified. In the gradually changing dimension table, each key represents a unique attribute profile established over a specific time span, so the table records the complete change history of the dimension attributes. Take the substation dimension table as an example. It contains three classes of attributes: stable dimension attributes, such as the substation identifier and substation name; gradually changing dimension attributes, such as voltage level and operating status; and fast-changing dimension attributes, such as verification progress, data integrity, data consistency, data duplication, and data irregularity. Removing the fast-changing attributes and combining the remaining attributes yields a gradually changing dimension table, the substation gradually changing dimension table, as shown in Figure 5. When a gradually changing attribute in this table changes, for example when the voltage level goes from 35 kV to 110 kV, a new dimension tuple is added to the table to reflect the new voltage level value. The substation identifier stays the same (the source text says "transformer identifier" here, apparently in error), indicating the same substation, while the different substation IDs represent the profile of that substation's dimension attributes in different periods. In data quality queries based on the attributes of the substation gradually changing dimension table, a constraint on the voltage level attribute makes it possible to distinguish voltage levels accurately and compare data quality against voltage level. A constraint on the substation identifier attribute retrieves all data for the same substation, which gives the historical change in that substation's data quality. The relationship between operating status and data quality can be analyzed in the same way.
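The 35 kV to 110 kV example can be sketched in a few lines. The sketch assumes one reading of the text: the substation identifier stays fixed while each attribute profile gets its own system-assigned substation ID, which is close to what data warehousing practice calls a Type 2 slowly changing dimension. The `valid_from` and `valid_to` fields, the class name, and the sample values are assumptions added for illustration; the patent does not name them.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class SubstationRow:
    substation_id: int           # system-assigned, one per attribute profile
    substation_code: str         # stable identifier of the physical substation
    name: str
    voltage_level: str
    operating_status: str
    valid_from: date             # assumed field, not named in the source
    valid_to: Optional[date] = None  # None marks the current profile


def apply_change(table: List[SubstationRow], code: str, when: date,
                 **changes) -> SubstationRow:
    """Close the current row for `code` and append a row carrying `changes`."""
    idx = next(i for i, r in enumerate(table)
               if r.substation_code == code and r.valid_to is None)
    current = table[idx]
    table[idx] = replace(current, valid_to=when)  # keep the old profile as history
    new_row = replace(
        current,
        substation_id=max(r.substation_id for r in table) + 1,
        valid_from=when,
        valid_to=None,
        **changes,
    )
    table.append(new_row)
    return new_row


table = [SubstationRow(1, "SUB-001", "North", "35kV", "in service", date(2014, 1, 1))]
apply_change(table, "SUB-001", date(2015, 6, 1), voltage_level="110kV")

# Filtering on the stable identifier returns the full history of one substation.
history = [(r.substation_id, r.voltage_level)
           for r in table if r.substation_code == "SUB-001"]
```

Filtering on `voltage_level` instead would select profiles by voltage level across all substations, which corresponds to the voltage level constraint described in the text.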
2、快变维度方案2. Fast-changing dimension scheme
将图4变电站维度表中变化频率较快的属性抽取出来可以组成一个独立的快变维度表,如图5所示,将其中的各属性以“波段”加以区分。将变电站快变维度表直接与数据质量事实表相连,其维度表中的“波段”如表1所示。这种设计方案的效果同渐变维度一样,也可做到在事实表中跟踪快变维度表中任一属性的变化情况,并且避免了由于属性的快速变化造成整个维度表的膨胀。通过快变维度表可以得到变电站核对进度、数据完整性、数据一致性、数据重复性及数据不规则性对数据质量产生的影响。Extracting the attributes with fast changing frequency from the substation dimension table in Figure 4 can form an independent fast-changing dimension table, as shown in Figure 5, and distinguish each attribute in it by "band". The substation fast-changing dimension table is directly connected with the data quality fact table, and the "band" in the dimension table is shown in Table 1. The effect of this design scheme is the same as that of gradually changing dimensions, and it can also track the change of any attribute in the fast-changing dimension table in the fact table, and avoid the expansion of the entire dimension table due to rapid changes in attributes. Through the fast-changing dimension table, the influence of substation verification progress, data integrity, data consistency, data repeatability and data irregularity on data quality can be obtained.
Table 1
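The banding idea can be sketched as follows. The band boundaries, labels and attribute names here are hypothetical placeholders chosen for illustration and are not the values of Table 1; the sketch only shows why a banded fast-changing dimension table stays at a fixed size however often the underlying measurements change.

```python
from bisect import bisect_right
from itertools import product

# Hypothetical band boundaries in percent; not the values of Table 1.
BAND_EDGES = [60, 80, 95]
BAND_LABELS = ["<60%", "60-80%", "80-95%", ">=95%"]

# Hypothetical names for the five fast-changing attributes in the text.
FAST_ATTRS = (
    "verification_progress",
    "integrity",
    "consistency",
    "duplication",
    "irregularity",
)


def to_band(value):
    """Map a raw percentage to its band label."""
    return BAND_LABELS[bisect_right(BAND_EDGES, value)]


# Pre-build the fast-changing dimension: one row per combination of bands.
# Its size is fixed by the band definitions, not by how often values change.
mini_dim = {
    combo: key
    for key, combo in enumerate(
        product(BAND_LABELS, repeat=len(FAST_ATTRS)), start=1
    )
}


def fast_dim_key(measurements):
    """Return the dimension key a fact row should reference."""
    combo = tuple(to_band(measurements[attr]) for attr in FAST_ATTRS)
    return mini_dim[combo]


# Two snapshots whose raw values differ but fall into the same bands
# reference the same dimension row, so no new dimension tuple is needed.
snapshot_1 = {"verification_progress": 62, "integrity": 97, "consistency": 85,
              "duplication": 99, "irregularity": 96}
snapshot_2 = {"verification_progress": 75, "integrity": 98, "consistency": 90,
              "duplication": 99, "irregularity": 95}
key_1 = fast_dim_key(snapshot_1)
key_2 = fast_dim_key(snapshot_2)
```

Each fact row stores only the key returned by `fast_dim_key`, so a change in any banded attribute shows up as a different key in the fact table rather than as a new row in the main substation dimension table.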
Multidimensional data quality analysis using OLAP technology is important for ensuring data quality, scheduling data governance work sensibly, and improving the efficiency of data fusion between the marketing and power distribution systems.
The specific embodiments described above further explain the purpose, technical solutions and beneficial effects of the present invention in detail. It should be understood that the above are only specific embodiments of the present invention and are not intended to limit its scope of protection. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510923014.5A CN105488632A (en) | 2015-12-14 | 2015-12-14 | Method and device for analyzing electric data based on dimensional model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510923014.5A CN105488632A (en) | 2015-12-14 | 2015-12-14 | Method and device for analyzing electric data based on dimensional model |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105488632A true CN105488632A (en) | 2016-04-13 |
Family
ID=55675601
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510923014.5A Pending CN105488632A (en) | 2015-12-14 | 2015-12-14 | Method and device for analyzing electric data based on dimensional model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105488632A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110851543A (en) * | 2019-11-08 | 2020-02-28 | 深圳市彬讯科技有限公司 | Data modeling method, device, equipment and storage medium |
CN112132705A (en) * | 2020-09-30 | 2020-12-25 | 国网智能科技股份有限公司 | Method and system for storing and reproducing panoramic data of transformer substation |
CN112559524A (en) * | 2020-12-14 | 2021-03-26 | 中国建设银行股份有限公司 | Index database establishing method and device and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101681368A (en) * | 2007-06-29 | 2010-03-24 | 国际商业机器公司 | Aggregation query processing |
CN104299102A (en) * | 2014-10-31 | 2015-01-21 | 国电南瑞科技股份有限公司 | Multidimensional data model modeling method of power grid regulation and control integration system |
CN104391948A (en) * | 2014-12-01 | 2015-03-04 | 广东电网有限责任公司清远供电局 | Data standardization construction method and system of data warehouse |
CN104766151A (en) * | 2014-12-29 | 2015-07-08 | 国家电网公司 | Quality management and control method for electricity transaction data warehouses and management and control system thereof |
Non-Patent Citations (2)
Title |
---|
封玲等: "数据仓库维度建模技术及其应用研究", 《南京大学学报(自然科学)》 * |
金志伟等: "基于数据仓库的变化维度的研究", 《河南教育学院学报(自然科学版)》 * |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20160413 |