CN104239447A - Power-grid big time series data storage method - Google Patents

Power-grid big time series data storage method Download PDF

Info

Publication number
CN104239447A
CN104239447A CN201410441649.7A CN201410441649A CN104239447A CN 104239447 A CN104239447 A CN 104239447A CN 201410441649 A CN201410441649 A CN 201410441649A CN 104239447 A CN104239447 A CN 104239447A
Authority
CN
China
Prior art keywords
measuring point
measuring object
measuring
label
data storage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410441649.7A
Other languages
Chinese (zh)
Inventor
王远
袁军
刘琛
胡健
张珂珩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CHINA REALTIME DATABASE Co Ltd
Original Assignee
CHINA REALTIME DATABASE Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CHINA REALTIME DATABASE Co Ltd filed Critical CHINA REALTIME DATABASE Co Ltd
Priority to CN201410441649.7A priority Critical patent/CN104239447A/en
Publication of CN104239447A publication Critical patent/CN104239447A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2246Trees, e.g. B+trees
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/221Column-oriented storage; Management thereof

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a power-grid big time series data storage method. The information of a batch of measurement points having location relevance to each other in service logic is re-described by selecting an open-source distributed column-oriented database HBase as a storage layer, and combining an SG-CIM model in power grid service; a reasonable index organization mode of a measurement point data storage table is designed, and the physical storage positions of the historical data of the batch of measurement points having location relevance to each other in service logic are adjacent to each other by use of the partitioning and load balancing functions of the HBase, so that the disk seek time can be reduced for querying the historical data of the batch of measurement points, and the query efficiency can be improved; instant query service can be provided for service application.

Description

The large date storage method of electrical network sequential
Technical field
The large data of time series that the present invention relates to location information sensitivity in a kind of electrical network business store, immediate inquiring method, belong to the storage of large data, distributing real-time data bank field.
Background technology
Along with intelligent, informationalized development, the large data management system of sequential, as the important foundation data platform of Large Scale Process industrial enterprise production information, faces increasing challenge.For electric system, be the immense pressure of ultra-large data processing on the one hand.During State Grid Corporation of China SG-ERP builds, magnanimity history/real time data releasing platform construction deepens constantly, its data scale is increasing, wherein measuring point (data collection point) scale is estimated to reach necessarily even more than one hundred million scale, and data storage capacity arrives more than PB byte.High-speed real-time process on the other hand.For WAMS system, need per second number of transactions to be processed can reach 10,000,000, and conventional relational database cannot tackle the challenge of high-speed real-time process like this at all, has higher requirement to the treatment scale of real-time data base, processing speed.
Tradition real-time data base is limited by its traditional software architecture, in data scale, processing power, parallel computation, load balancing, dynamically autonomy etc., cannot meet practical application request.In real-time data base field, introduce large data processing technique and solve the main method that the problems referred to above are the large data storages of current research sequential.But, in the business scenarios such as example power grid accident inverting, electrical network service alarm analysis and power transmission and transforming equipment on-line monitoring, the historical data of a collection of often measuring point that user is concerned about, and this batch of measuring point has position correlation in service logic, ideally these historical datas with a collection of measuring point of position correlation in physical store also should holding position adjacent, the seek time of disk could be reduced when inquiring about in above-mentioned business scenario application like this, improve inquiry velocity, for service application provides immediate inquiring service.Traditional real-time data base due to its architecture design, realize the reasons such as principle, do not accomplish that the historical data of a collection of measuring point service logic with position correlation is also that position is adjacent in physical store.
Through preliminary search, find no the patent entry relevant to content of the present invention temporarily.
Summary of the invention
In order to solve the problem, ensure that historical data service logic with a collection of measuring point of position correlation is that position is adjacent in physical store, for in electrical network service application, the query demand of this batch of measuring point provides immediate inquiring service, the invention provides the large date storage method of a kind of electrical network sequential, its main thought is: select to increase income distributed columnar database HBase as accumulation layer, in conjunction with SG-CIM model in electrical network business, description is re-started to a collection of measuring point information service logic with position correlation, by designing a kind of index organization's mode of measuring point data storage list, utilize subregion and the load-balancing function of HBase, the position of historical data in physical store making a collection of measuring point service logic with position correlation is adjacent.The present invention specifically comprises the steps:
(1) the Business Logic measuring point based on SG-CIM model describes
Based on SG-CIM model, description is re-started to a collection of measuring point service logic with position correlation, by the association between measuring object and measuring point, form the hierarchical relationship of measuring point; In this description, SG-CIM model class is similar to a tree falling to grow, and leaf node is measuring point, and the hierachy number between non-leaf nodes is increased according to actual needs by user or reduces; Father node from the root node of this tree to leaf node the path of process for describing measuring object; Leaf node is for representing the measurement item of this measuring object, and namely the combination of measuring object and measurement item is equivalent to the measuring point in traditional real-time data base; Article one, data record is made up of measuring object mark, timestamp, measured value, label, and label is described by one or more key-value pair.
(2) index organization of measuring point data storage list is set up
Selection increases income distributed columnar database HBase as accumulation layer, and index organization's mode of measuring point data storage list directly has influence on query performance.
The indexing model of batch query measuring point data storage list is designed to: measuring object mark+reference time+label, and wherein measuring object mark adopts based on SG-CIM model the redescribing of a collection of measuring point logic business with position correlation; Reference time selected is that to decide this reference time according to the data acquiring frequency of measuring object to be stored be whole hour/all day; Row are with the side-play amount of the timestamp of image data record relative to reference time.
The indexing model of section inquiry measuring point data storage list is designed to: timestamp+measuring object mark+label, wherein timestamp is the timestamp collecting every bar data record.
Index due to HBase table sorts by lexicographic order, then adopt in this programme based on SG-CIM model to the description of a collection of measuring point service logic with position correlation, the position of the index that the data of this batch of measuring point can be made to be recorded in measuring point data storage list is adjacent.
Because HBase carries out cutting for different region is to complete subregion with the scope of row to storage list, simultaneously in conjunction with the automatic load balancing of HBase self in units of region, the historgraphic data recording that this batch of measuring point can be made to collect also is adjacent on actual physical storage position, time like this for the batch query of this batch of measuring point and section inquiry, the seek time of disk can be reduced, for service application provides immediate inquiring service.
Further, the label in step (1) only can have a measurement item describing this measuring object, but can have the attribute description information of multiple non-measured item.
By adopting technique scheme, the inventive method can ensure that the historical data of a collection of measuring point service logic with position correlation is also adjacent in physical storage locations in the large data of electrical network sequential store, disk seek time can be reduced when like this historical data of this batch of measuring point being inquired about, improve search efficiency, for service application provides immediate inquiring service.
Accompanying drawing explanation
The description schematic diagram of Fig. 1 to be the embodiment of the present invention based on SG-CIM model to community be ammeter in service logic unit.
Fig. 2 is sortord and the subregion schematic diagram of embodiment of the present invention measuring point data storage list index.
Embodiment
Below in conjunction with drawings and Examples, the present invention is described in further detail.
The present embodiment is to be described the power information collection of certain community.Suppose that cell name is lvsejiayuan, this community have A, B, C ... Y totally 25, there are 1,2,3,4,5,6,7,8 totally 8 unit in Mei Dong building, every unit has 01,02 ..., 15 floors, each floor has 01,02 ..., 19,20 resident families.Suppose that there is one piece of ammeter (measuring object) in a resident resident family by forward active energy (direction=fd), reverse active energy (direction=bd), forward is gained merit sharp electric flux (direction=fd type=shark), forward is gained merit peak electric flux (direction=fd type=peak), forward is gained merit ordinary telegram energy (direction=fd type=shoulder), forward is gained merit paddy electric flux (direction=fd type=offpeak), oppositely meritorious sharp electric energy (direction=bd type=shark), oppositely meritorious peak electric flux (direction=bd type=peak), oppositely meritorious ordinary telegram energy (direction=bd type=shoulder), oppositely meritorious paddy electric flux (direction=bd type=offpeak) totally 10 measurement items.
(1) description is re-started to measuring object title
As shown in Figure 1, according to SG-CIM model, description is re-started to a collection of measuring point service logic with position correlation, the all ammeters referred in this community are gathered in example in this power information, service logic belongs to this community, in conjunction with service logic, all ammeters in this community are redescribed, the hierarchical relationship set up measuring object and measure between item, ammeter as certain family resident family is designated: community. building. and unit. ammeter is numbered, and measures the forward active energy that item is ammeter.According to the description of this programme, following form can be described as to the ammeter (measuring object) in community:
lvsejiayuan.A.unit1.0101
lvsejiayuan.A.unit1.0102
lvsejiayuan.A.unit1.0103
……
lvsejiayuan.Y.unit8.1517
lvsejiayuan.Y.unit8.1518
lvsejiayuan.Y.unit8.1519
lvsejiayuan.Y.unit8.1520
(2) history data store table index is set up
The sortord of measuring point data storage list index and subregion are as shown in Figure 2.Suppose to collect partial data sometime as follows:
lvsejiayuan.A.unit1.0101140220180223303direction=fd?type=shark
lvsejiayuan.A.unit1.010114022018021751direction=fd?type=peak
lvsejiayuan.A.unit1.0101140220180220858direction=fd?type=shoulder
lvsejiayuan.A.unit1.0101140220180228723direction=fd?type=offpeak
lvsejiayuan.A.unit1.0101140220180214444direction=fd
lvsejiayuan.A.unit1.0101140220180212166direction=bd?type=shark
lvsejiayuan.A.unit1.0101140220180230560direction=bd?type=peak
lvsejiayuan.A.unit1.010114022018022254direction=bd?type=shoulder
lvsejiayuan.A.unit1.0101140220180229230direction=bd?type=offpeak
lvsejiayuan.A.unit1.0101140220180227249direction=bd
……
A. the index that the above-mentioned data collected are concrete in batch query measuring point data storage list is for being respectively:
lvsejiayuan.A.unit1.01011402200000direction=bd
lvsejiayuan.A.unit1.01011402200000direction=bd?type=offpeak
lvsejiayuan.A.unit1.01011402200000direction=bd?type=peak
lvsejiayuan.A.unit1.01011402200000direction=bd?type=shark
lvsejiayuan.A.unit1.01011402200000direction=bd?type=shoulder
lvsejiayuan.A.unit1.01011402200000direction=fd
lvsejiayuan.A.unit1.01011402200000direction=fd?type=offpeak
lvsejiayuan.A.unit1.01011402200000direction=fd?type=peak
lvsejiayuan.A.unit1.01011402200000direction=fd?type=shark
lvsejiayuan.A.unit1.01011402200000direction=fd?type=shoulder
Under in batch query measuring point data storage list, every bar data record is stored in the row that this recording indexes is expert at, this is classified as the side-play amount of this data record relative to its index reference time.
B. the above-mentioned data collected concrete index in section inquiry measuring point data storage list is respectively:
1402201802lvsejiayuan.A.unit1.0101direction=bd
1402201802lvsejiayuan.A.unit1.0101direction=bdtype=offpeak
1402201802lvsejiayuan.A.unit1.0101direction=bdtype=peak
1402201802lvsejiayuan.A.unit1.0101direction=bdtype=shark
1402201802lvsejiayuan.A.unit1.0101direction=bdtype=shoulder
1402201802lvsejiayuan.A.unit1.0101direction=fd
1402201802lvsejiayuan.A.unit1.0101direction=fdtype=offpeak
1402201802lvsejiayuan.A.unit1.0101direction=fdtype=peak
1402201802lvsejiayuan.A.unit1.0101direction=fdtype=shark
1402201802lvsejiayuan.A.unit1.0101direction=fdtype=shoulder
Under in section inquiry measuring point data storage list, every bar data record storage can simply directly be stored in the row of the row at this recording indexes place, also the number that a field shows this row store data record can be increased in the index of this table, first Hash (hash) is carried out to the measuring object in every bar data record, then the line number delivery that new field in the index represents is stored, under finally this data record value being stored in row corresponding to gained modulus to every bar record.
Technical characteristic involved in above-mentioned embodiment, just can combine mutually as long as do not form conflict to each other.The invention is not restricted to above-described embodiment, all technical schemes adopting equivalent replacement or equivalence replacement to be formed all belong to the scope of protection of present invention.

Claims (2)

1. the large date storage method of electrical network sequential, is characterized in that, comprise the steps:
(1) the Business Logic measuring point based on SG-CIM model describes
Based on SG-CIM model, description is re-started to a collection of measuring point service logic with position correlation, by the association between measuring object and measuring point, form the hierarchical relationship of measuring point; In this description, SG-CIM model is tree, and wherein leaf node is measuring point, and the hierachy number between non-leaf nodes is increased according to actual needs by user or reduces; Father node from the root node of this tree to leaf node the path of process for describing measuring object; Leaf node is for representing the measurement item of this measuring object; Article one, data record is made up of measuring object mark, timestamp, measured value, label, and wherein label is described by key-value pair;
(2) index organization of measuring point data storage list is set up
Selection increases income distributed columnar database HBase as accumulation layer;
The indexing model of batch query measuring point data storage list is designed to: measuring object mark+reference time+label, and wherein measuring object mark adopts based on SG-CIM model the redescribing of a collection of measuring point logic business with position correlation; Reference time selected is that to decide this reference time according to the data acquiring frequency of measuring object to be stored be whole hour/all day; Row are with the side-play amount of the timestamp of image data record relative to reference time;
The indexing model of section inquiry measuring point data storage list is designed to: timestamp+measuring object mark+label, wherein timestamp is the timestamp collecting every bar data record.
2. method according to claim 1, is characterized in that the label in described step (1) is described by key-value pair, and this label only has a measurement item describing described measuring object.
CN201410441649.7A 2014-09-01 2014-09-01 Power-grid big time series data storage method Pending CN104239447A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410441649.7A CN104239447A (en) 2014-09-01 2014-09-01 Power-grid big time series data storage method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410441649.7A CN104239447A (en) 2014-09-01 2014-09-01 Power-grid big time series data storage method

Publications (1)

Publication Number Publication Date
CN104239447A true CN104239447A (en) 2014-12-24

Family

ID=52227506

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410441649.7A Pending CN104239447A (en) 2014-09-01 2014-09-01 Power-grid big time series data storage method

Country Status (1)

Country Link
CN (1) CN104239447A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105574204A (en) * 2016-01-08 2016-05-11 国网冀北电力有限公司 Searching method and system for distributed power grid regulation and operation data
CN106250414A (en) * 2016-07-22 2016-12-21 北京赛博智通信息技术有限责任公司 A kind of data-storage system based on measuring point object and method
CN106250381A (en) * 2015-06-04 2016-12-21 微软技术许可有限责任公司 The row sequence optimized for input/output in list data
CN109613312A (en) * 2018-12-18 2019-04-12 宁波三星智能电气有限公司 A kind of ammeter exchange method based on menu
CN112015733A (en) * 2020-08-04 2020-12-01 国家电网有限公司客户服务中心 Method for storing and rapidly inquiring mass data of electric power customer service operation and distribution service

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1283486A2 (en) * 2001-08-06 2003-02-12 Addamark Technologies, Inc. Storage of row-column data
CN101930481A (en) * 2009-06-22 2010-12-29 华东电网有限公司 Method used for generating CIM model describing power grid change in designated time slot and system thereof
CN102567495A (en) * 2011-12-22 2012-07-11 国网信息通信有限公司 Mass information storage system and implementation method
CN102855271A (en) * 2012-07-05 2013-01-02 中国电力科学研究院 Storage and traceable management method for multi-version power grid model
CN102890722A (en) * 2012-10-25 2013-01-23 国家电网公司 Indexing method applied to time sequence historical database
CN103116595A (en) * 2012-10-24 2013-05-22 中国电力科学研究院 Realizing method of supervisory control and data acquisition (SCADA) history data distribution-type storage facing power grid
WO2013165532A1 (en) * 2012-05-04 2013-11-07 Battelle Memorial Institute Method and system for managing power grid data
CN103390045A (en) * 2013-07-19 2013-11-13 北京京东尚科信息技术有限公司 Time sequence storage method and time sequence storage device for monitoring system
CN103488727A (en) * 2013-09-16 2014-01-01 河海大学 Two-dimensional time-series data storage and query method based on periodic logs
CN103559160A (en) * 2013-11-20 2014-02-05 国家电网公司 SG-CIM-standard-based construction method for semantic information interaction interface of intelligent power distribution system

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1283486A2 (en) * 2001-08-06 2003-02-12 Addamark Technologies, Inc. Storage of row-column data
CN101930481A (en) * 2009-06-22 2010-12-29 华东电网有限公司 Method used for generating CIM model describing power grid change in designated time slot and system thereof
CN102567495A (en) * 2011-12-22 2012-07-11 国网信息通信有限公司 Mass information storage system and implementation method
WO2013165532A1 (en) * 2012-05-04 2013-11-07 Battelle Memorial Institute Method and system for managing power grid data
CN102855271A (en) * 2012-07-05 2013-01-02 中国电力科学研究院 Storage and traceable management method for multi-version power grid model
CN103116595A (en) * 2012-10-24 2013-05-22 中国电力科学研究院 Realizing method of supervisory control and data acquisition (SCADA) history data distribution-type storage facing power grid
CN102890722A (en) * 2012-10-25 2013-01-23 国家电网公司 Indexing method applied to time sequence historical database
CN103390045A (en) * 2013-07-19 2013-11-13 北京京东尚科信息技术有限公司 Time sequence storage method and time sequence storage device for monitoring system
CN103488727A (en) * 2013-09-16 2014-01-01 河海大学 Two-dimensional time-series data storage and query method based on periodic logs
CN103559160A (en) * 2013-11-20 2014-02-05 国家电网公司 SG-CIM-standard-based construction method for semantic information interaction interface of intelligent power distribution system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王喜珍: ""基于树形结构的地震数据库系统设计方法"", 《地震学报》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106250381A (en) * 2015-06-04 2016-12-21 微软技术许可有限责任公司 The row sequence optimized for input/output in list data
CN106250381B (en) * 2015-06-04 2020-11-17 微软技术许可有限责任公司 System and method for determining column layout of tabular storage
US11514011B2 (en) 2015-06-04 2022-11-29 Microsoft Technology Licensing, Llc Column ordering for input/output optimization in tabular data
CN105574204A (en) * 2016-01-08 2016-05-11 国网冀北电力有限公司 Searching method and system for distributed power grid regulation and operation data
CN105574204B (en) * 2016-01-08 2020-07-24 国网冀北电力有限公司 Retrieval method and system for distributed power grid regulation and control operation data
CN106250414A (en) * 2016-07-22 2016-12-21 北京赛博智通信息技术有限责任公司 A kind of data-storage system based on measuring point object and method
CN106250414B (en) * 2016-07-22 2020-01-03 北京赛博智通信息技术有限责任公司 Data storage system and method based on measuring point object
CN109613312A (en) * 2018-12-18 2019-04-12 宁波三星智能电气有限公司 A kind of ammeter exchange method based on menu
CN112015733A (en) * 2020-08-04 2020-12-01 国家电网有限公司客户服务中心 Method for storing and rapidly inquiring mass data of electric power customer service operation and distribution service

Similar Documents

Publication Publication Date Title
CN105354641B (en) Goods picking path optimization method and goods picking path optimization device
CN104239447A (en) Power-grid big time series data storage method
CN108255981B (en) Method for storing and searching index minute freezing of timestamp sequence numbers of continuous time periods
Wang et al. Energy-efficient reverse skyline query processing over wireless sensor networks
CN104317800A (en) Hybrid storage system and method for mass intelligent power utilization data
CN106648446A (en) Time series data storage method and apparatus, and electronic device
CN109582667A (en) A kind of multiple database mixing storage method and system based on power regulation big data
CN104182435B (en) The information retrieval system and method marked based on shortage of data
CN107784098A (en) Real-time data warehouse platform
CN102567497B (en) Inquiring method of best matching with fuzzy trajectory problems
CN103500173A (en) Method for inquiring rail transit monitoring data
CN106528787A (en) Mass data multi-dimensional analysis-based query method and device
US10133779B2 (en) Query hint management for a database management system
CN105022763B (en) Realize the method and system of data query
CN109772753B (en) Power battery sorting system and method
CN105099798A (en) Online master station operation monitoring and evaluation method based on index system
CN104268298A (en) Method for creating database index and inquiring data
CN104090934A (en) Standard service platform distributed parallel computing database and retrieval method thereof
CN104346458A (en) Data storage method and device
CN104933042B (en) Database table optimization of collection technology based on big data quantity
CN105989140A (en) Data block processing method and equipment
CN102508866B (en) Digital addressing-based method for structured storage and rapid processing of command relation tree
CN103198157A (en) Compressed storage and processing method for telluric electricity field data
CN107818106B (en) Big data offline calculation data quality verification method and device
CN105574074A (en) Smart grid WAMS-based time-series big data storage method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20141224