CN110019343A - A kind of new energy meteorological data management method and system - Google Patents

A kind of new energy meteorological data management method and system Download PDF

Info

Publication number
CN110019343A
CN110019343A CN201711353102.1A CN201711353102A CN110019343A CN 110019343 A CN110019343 A CN 110019343A CN 201711353102 A CN201711353102 A CN 201711353102A CN 110019343 A CN110019343 A CN 110019343A
Authority
CN
China
Prior art keywords
meteorological data
data
node
new energy
mpp
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711353102.1A
Other languages
Chinese (zh)
Inventor
刘鹏
邓春宇
王晓蓉
吴紫微
车建峰
刘镇京
刘化社
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
China Electric Power Research Institute Co Ltd CEPRI
North China Electric Power University
Qingdao Power Supply Co of State Grid Shandong Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
China Electric Power Research Institute Co Ltd CEPRI
North China Electric Power University
Qingdao Power Supply Co of State Grid Shandong Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, China Electric Power Research Institute Co Ltd CEPRI, North China Electric Power University, Qingdao Power Supply Co of State Grid Shandong Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN201711353102.1A priority Critical patent/CN110019343A/en
Publication of CN110019343A publication Critical patent/CN110019343A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01WMETEOROLOGY
    • G01W1/00Meteorology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • G06F16/24532Query optimisation of parallel queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2471Distributed queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Environmental & Geological Engineering (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Environmental Sciences (AREA)
  • Atmospheric Sciences (AREA)
  • Ecology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides a kind of new energy meteorological data management method and system, comprising: obtains meteorological data;The meteorological data of the acquisition is stored on the node of massive parallel processing MPP;Establish the meteorological data query table for being directed to the node.Technical solution provided by the invention utilizes the distributed storage ability of MPP, ten PB grades of weather bureau's metric data of support sizes;Magnanimity timing big data can be handled with efficient storage and quickly;Hash storage and MPP parallelization processing capacity on Shared nothing framework based on the Hash in the library MPP, maximum have played intelligence index and column storage capacity in single node, have greatly accelerated the scanning speed to big table data.

Description

A kind of new energy meteorological data management method and system
Technical field
The invention belongs to new energy meteorological fields, big data field, and in particular to a kind of new energy meteorological data management side Method and system.
Background technique
New energy power prediction is to predict future time period new energy power swing rule in advance by founding mathematical models at present The technology of rule.Due to can not integrally hold the huge complicated data of weather forecast of the scale of construction, meteorological measuring, power prediction The Research Thinking of data and the operation data of other new energy stations, new energy power prediction is confined to for single wind-powered electricity generation more Prediction on the independent time section of field, according to physical significance theoretic in cognitive range, artificial carries out data information Segmentation and dimension-reduction treatment, have ignored the associate feature of wind energy resources and other elements in time scale and space scale, with Two-dimensional model carrys out the data correlation in descriptive analysis hyperspace, has limited to the cognition dimension to new energy power, so as to cause Precision of prediction room for promotion is very limited.
Numerical weather forecast is the most important input data of the new energy power prediction such as wind-powered electricity generation, photovoltaic, according to statistics, new energy There are about 60% in the error of prediction power to be originated from numerical weather forecast, expands the estimation range of numerical weather forecast and improves numerical value The precision of data of weather forecast is to realize the most effective means of new energy power prediction precision improvement.Currently, towards new energy function The spatial resolution of the mesoscale NWP of rate prediction is mostly 10km × 10km, and typical installed capacity is 5MW's Wind power plant occupied area is about 3km × 3km, and the occupied area of photovoltaic plant is smaller, and the demand to computing resource is with spatial discrimination The increase of rate exponentially increases.
For the meteorological data of 3km × 3km, object storage system is stored in a distributed manner at present, and meteorological data passes through pre- Following 3 days prediction data can be generated by surveying model, and since these data volumes are bigger, warehouse-in efficiency is relatively low, thus often by It abandons without storage, to cause the loss with break-up value data.
Summary of the invention
In view of the deficiencies of the prior art, the present invention proposes a kind of new energy meteorological data management method and system, including is situated between Continue the general frames of MPP+ time series databases, proposes that data store scalability scheme, provides for basic electric power weather prognosis Efficiently, accurately data foundation, effectively support power network safety operation.
A kind of new energy meteorological data management method, comprising:
Obtain meteorological data;
The meteorological data of the acquisition is stored on the node of massive parallel processing MPP;
Establish the meteorological data query table for being directed to the node.
Further, the meteorological data by the acquisition is stored on the node of massive parallel processing MPP, Include:
The meteorological data of the acquisition is ranked up according to the time;
Average packet is carried out to the data after sequence;
Data after grouping are stored in by the way of distributed storage on the node of MPP.
It is further, described to establish the meteorological data query table for being directed to the node, comprising:
Data in the node are subjected to region ordering, establish the meteorological data query table.
Further, the data by the node carry out region ordering, before establishing the meteorological data query table Further include:
If the node memory, in data, the meteorological data that will acquire establishes interim table.
Further, further includes: the inquiry of data is carried out using the meteorological data query table, comprising:
According to hash storage method, temporally field is inquired;And/or
It is inquired according to hash storage method by zone number.
Further, the meteorological data is PB grades of meteorological datas.
Further, the node uses Share-nothing framework.
A kind of new energy meteorological data management system, comprising:
Module is obtained, for obtaining meteorological data;
Memory module, for the meteorological data of the acquisition to be stored in the node of massive parallel processing MPP;
Module is established, for establishing the meteorological data query table for being directed to the node.
Further, the memory module, is used for,
The meteorological data of the acquisition is ranked up according to the time;
Average packet is carried out to the data after sequence;
Data after grouping are stored in by the way of distributed storage on the node of MPP.
Further, the module of establishing includes:
Sorting sub-module establishes the meteorological data query table for the data in the node to be carried out region ordering.
It is further, described to establish module further include:
Interim submodule, if the meteorological data that will acquire establishes interim table for the node memory in data.
Further, further includes: enquiry module, comprising:
Time inquiring submodule, for temporally field to be inquired according to hash storage method;And/or
Site polling submodule, for being inquired according to hash storage method by zone number.
Compared with the latest prior art, technical solution provided by the invention has the advantages that
1, technical solution provided by the invention utilizes the distributed storage ability of MPP, and meteorological data is stored, and builds Vertical meteorological data query table, the convenient inquiry to data later.
2, technical solution provided by the invention utilizes the distributed storage ability of MPP, and ten PB grades of weather bureau of support sizes measures Data can handle magnanimity timing big data with efficient storage and quickly.
3, technical solution provided by the invention is deposited based on the hash on the Shared nothing framework of the Hash in the library MPP Storage and MPP parallelization processing capacity, maximum have played intelligence index and column storage capacity in single node, have greatly accelerated pair The scanning speed of big table data, theoretically scanning speed=500DC × 65536 × m intelligently indexes filterability × n node.
4, the coarseness index that intelligently indexes of the technical solution provided by the invention based on MPP, significantly control meteorological data Volume expansions rate, and prevent with data volume promoted and cause index generate cost promoted, to greatly improve data Warehouse-in efficiency can achieve the loading velocity of 3.6G between the second.
Detailed description of the invention
Fig. 1 is flow chart of the present invention;
Fig. 2 is data memory node MPP configuration diagram;
Fig. 3 is data store organisation schematic diagram;
Fig. 4 is that memory node extends schematic diagram;
Fig. 5 is back end distribution schematic diagram;
Fig. 6 is using data copy schematic diagram.
Specific embodiment
The present invention is described in further details with reference to the accompanying drawing.For purpose, the technical solution for making the embodiment of the present invention Clearer with advantage, following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out It clearly and completely describes, it is clear that described embodiments are some of the embodiments of the present invention, instead of all the embodiments.Base Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts it is all its Its embodiment, shall fall within the protection scope of the present invention.
Embodiment 1, the present invention provides a kind of new energy meteorological data management methods, as shown in Figure 1, comprising:
Obtain meteorological data;
The meteorological data of the acquisition is stored on the node of massive parallel processing MPP;
Establish the meteorological data query table for being directed to the node.
Using the distributed storage ability of MPP, ten PB grades of weather bureau's metric data of support sizes can with efficient storage and quickly Handle magnanimity timing big data.
Embodiment 2, the present invention provides a kind of new energy meteorological data management systems, comprising:
Module is obtained, for obtaining meteorological data;
Memory module, for the meteorological data of the acquisition to be stored in the node of massive parallel processing MPP;
Module is established, for establishing the meteorological data query table for being directed to the node.
Further, the memory module, is used for,
The meteorological data of the acquisition is ranked up according to the time;
Average packet is carried out to the data after sequence;
Data after grouping are stored in by the way of distributed storage on the node of MPP.
Further, the module of establishing includes:
Sorting sub-module establishes the meteorological data query table for the data in the node to be carried out region ordering.
It is further, described to establish module further include:
Interim submodule, if the meteorological data that will acquire establishes interim table for the node memory in data.
Further, further includes: enquiry module, comprising:
Time inquiring submodule, for temporally field to be inquired according to hash storage method;And/or
Site polling submodule, for being inquired according to hash storage method by zone number.
Embodiment 3 is required, the technical side merged using MPP+ time series database according to big data quantity demand and scalability Case.
1, the statement of requirements
Meteorological data is typical temporal information, carries out information to electric power meteorological data and derives and predict have important work With.It acts not only as historical data long-term preservation, can also reflect variation tendency of the data in application environment.When reasonable State storage model can simplify the design of application system, reduce the redundancy of data storage, improve the execution efficiency of program and be System stability.
Weather information need to be stored to be described as follows:
Range: phase 20000, a somewhere collection point information, succeeding ranges are by further expansion;
Data generate frequency: generating 1 time within every 6 hours, generating 72 hours every time, (15 minutes intervals generate one group of data, altogether 288 acquisition moment) data;
Size of data: (72 hours, interval generated one group of number to the data that each node generates every time about 57K byte within 15 minutes According to totally 288 acquisition moment, each 200 byte of moment);
Data storage period: 10 years;
Data query mode: 1) a certain 1 year data in collection point (time series approach) are inquired;2) sometime point is inquired All data (when discontinuity surface mode) of 20000 points;
Storage requires: entry time is no more than 30 points;
Query requirement: any point extraction time is less than 20s, and profile extraction time time is less than 30s;
Scalability requirement: framework can dynamic linear expansion, future is applicable to national sampling.
2, Technical Architecture
Data memory node uses MPP framework, as shown in Figure 2:
Database node uses Share Nothing framework, parallel memorizing and processing data in MPP, without coupling between data, Cross-node affairs will not be generated.When needing to carry out global query, it is distributed to after each node obtains data and is combined by MPP management program The query result of node.
Each node is an independent database, and the storage of time profile data and time series data are substantially carried out in database Storage, data store organisation are as shown in Figure 3.
3, write performance
By taking the back end of somewhere as an example:
Time series data is additional every time to be written 20000 blob, about 50 blob of write-in per second, it is contemplated that control is in 7 minutes.
1 blob, size 1.15G is written in time profile data every time, and disk writing speed is about 30M/ seconds, it is contemplated that control System is in 1 minute.
The above performance is single node performance, and when multinode distributed data, performance can be promoted according to number of nodes, or main is utilized MPP stores the data of a logical collection, reaches PB grades of storages using multinode to the linear expansion of storage, each node.
4, reading performance
By taking the back end of somewhere as an example:
Time series data reads 1 annual data of 1 collection point, reads each a part of 12 blob every time, each Size is about 7MB, total 84M, it is contemplated that control is in 10 seconds.
Time profile data read every time 20000 collection points some when discontinuity surface total data, read 1 blob, Size is about 1.15G, and disk writing speed is about 40M/ seconds, it is contemplated that control is in 30 seconds.
5, scalability
Data volume single machine database can satisfy at present, if the following construction whole nation or global metadata, it is contemplated that MPP frame Structure, since data dependence is low, no coupling, MPP can obtain the performance of linear increase.
It when MPP increases node, needs to do fast resampling in MPP management node, uniformly to utilize the processing energy of each node Power, it is as shown in Figure 4 that memory node extends schematic diagram.
6, safety
High Availabitity mode can be used in each back end, ensures Information Security and operation duration, back end point Cloth schematic diagram is as shown in Figure 5.
The quantity of equipment can also be reduced, using data copy mode according to resource situation, by the way of data copy Schematic diagram is as shown in Figure 6.
The application is referring to method, the process of equipment (system) and computer program product according to the embodiment of the present application Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
Finally it should be noted that: the above embodiments are merely illustrative of the technical scheme of the present invention and are not intended to be limiting thereof, to the greatest extent Invention is explained in detail referring to above-described embodiment for pipe, it should be understood by those ordinary skilled in the art that: still It can be with modifications or equivalent substitutions are made to specific embodiments of the invention, and without departing from any of spirit and scope of the invention Modification or equivalent replacement, are intended to be within the scope of the claims of the invention.

Claims (12)

1. a kind of new energy meteorological data management method characterized by comprising
Obtain meteorological data;
The meteorological data of the acquisition is stored on the node of massive parallel processing MPP;
Establish the meteorological data query table for being directed to the node.
2. a kind of new energy meteorological data management method as described in claim 1, which is characterized in that described by the acquisition Meteorological data is stored on the node of massive parallel processing MPP, comprising:
The meteorological data of the acquisition is ranked up according to the time;
Average packet is carried out to the data after sequence;
Data after grouping are stored in by the way of distributed storage on the node of MPP.
3. a kind of new energy meteorological data management method as described in claim 1, which is characterized in that described to establish for described The meteorological data query table of node, comprising:
Data in the node are subjected to region ordering, establish the meteorological data query table.
4. a kind of new energy meteorological data management method as claimed in claim 3, which is characterized in that it is described will be in the node Data carry out region ordering, before establishing the meteorological data query table further include:
If the node memory, in data, the meteorological data that will acquire establishes interim table.
5. a kind of new energy meteorological data management method as described in claim 1, which is characterized in that further include: described in utilization The inquiry of meteorological data query table progress data, comprising:
According to hash storage method, temporally field is inquired;And/or
It is inquired according to hash storage method by zone number.
6. a kind of new energy meteorological data management method a method as claimed in any one of claims 1 to 5, which is characterized in that the meteorology number According to for PB grades of meteorological datas.
7. a kind of new energy meteorological data management method as described in claim 1, which is characterized in that the node uses Share-nothing framework.
8. a kind of new energy meteorological data management system characterized by comprising
Module is obtained, for obtaining meteorological data;
Memory module, for the meteorological data of the acquisition to be stored in the node of massive parallel processing MPP;
Module is established, for establishing the meteorological data query table for being directed to the node.
9. a kind of new energy meteorological data management system as claimed in claim 8, which is characterized in that the memory module is used In,
The meteorological data of the acquisition is ranked up according to the time;
Average packet is carried out to the data after sequence;
Data after grouping are stored in by the way of distributed storage on the node of MPP.
10. a kind of new energy meteorological data management system as claimed in claim 8, which is characterized in that described to establish module packet It includes:
Sorting sub-module establishes the meteorological data query table for the data in the node to be carried out region ordering.
11. a kind of new energy meteorological data management system as claimed in claim 10, which is characterized in that described to establish module also Include:
Interim submodule, if the meteorological data that will acquire establishes interim table for the node memory in data.
12. a kind of new energy meteorological data management system as claimed in claim 8, which is characterized in that further include: inquiry mould Block, comprising:
Time inquiring submodule, for temporally field to be inquired according to hash storage method;And/or
Site polling submodule, for being inquired according to hash storage method by zone number.
CN201711353102.1A 2017-12-15 2017-12-15 A kind of new energy meteorological data management method and system Pending CN110019343A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711353102.1A CN110019343A (en) 2017-12-15 2017-12-15 A kind of new energy meteorological data management method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711353102.1A CN110019343A (en) 2017-12-15 2017-12-15 A kind of new energy meteorological data management method and system

Publications (1)

Publication Number Publication Date
CN110019343A true CN110019343A (en) 2019-07-16

Family

ID=67186944

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711353102.1A Pending CN110019343A (en) 2017-12-15 2017-12-15 A kind of new energy meteorological data management method and system

Country Status (1)

Country Link
CN (1) CN110019343A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11386089B2 (en) 2020-01-13 2022-07-12 The Toronto-Dominion Bank Scan optimization of column oriented storage
CN116881241A (en) * 2023-09-06 2023-10-13 深圳市银河系科技有限公司 Safety management method and system applied to meteorological data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104182438A (en) * 2014-02-25 2014-12-03 无锡天脉聚源传媒科技有限公司 Message counting method and device
CN105045929A (en) * 2015-08-31 2015-11-11 国家电网公司 MPP architecture based distributed relational database

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104182438A (en) * 2014-02-25 2014-12-03 无锡天脉聚源传媒科技有限公司 Message counting method and device
CN105045929A (en) * 2015-08-31 2015-11-11 国家电网公司 MPP architecture based distributed relational database

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11386089B2 (en) 2020-01-13 2022-07-12 The Toronto-Dominion Bank Scan optimization of column oriented storage
CN116881241A (en) * 2023-09-06 2023-10-13 深圳市银河系科技有限公司 Safety management method and system applied to meteorological data
CN116881241B (en) * 2023-09-06 2023-11-07 深圳市银河系科技有限公司 Safety management method and system applied to meteorological data

Similar Documents

Publication Publication Date Title
US10229129B2 (en) Method and apparatus for managing time series database
Shyam et al. Apache spark a big data analytics platform for smart grid
CN106777093B (en) Skyline inquiry system based on space time sequence data flow application
Lamba et al. Uses of cluster computing techniques to perform big data analytics for smart grid automation system
CN109800898A (en) A kind of intelligence short-term load forecasting method and system
CN107194533B (en) Power distribution network full information model construction method and system
CN107798059B (en) NCO meteorological data structured storage method and device
CN111586091A (en) Edge computing gateway system for realizing computing power assembly
CN111178587A (en) Spark framework-based short-term power load rapid prediction method
CN109446230A (en) A kind of big data analysis system and method for photovoltaic power generation influence factor
Jach et al. Application of HADOOP to store and process big data gathered from an urban water distribution system
CN110019343A (en) A kind of new energy meteorological data management method and system
CN117372201A (en) Rapid construction method of intelligent water conservancy digital twin model applied to reservoir
Huang et al. Parallel map matching on massive vehicle gps data using mapreduce
Niu et al. Parallel grid-based density peak clustering of big trajectory data
CN112614207A (en) Contour line drawing method, device and equipment
CN111144629A (en) Method and system for predicting water inflow of hydroelectric power station
KR20170069396A (en) Very short range microscale weather forecast model for real-time forecasting
CN109816149A (en) A kind of wind power plant is contributed scene generating method and device at random
Colosi et al. Time series data management optimized for smart city policy decision
CN114123190A (en) Method and device for determining target region to which ammeter belongs, electronic equipment and storage medium
CN109217367B (en) Wind power generation prediction method, device and equipment
CN114021833A (en) Line loss prediction method, system, storage medium and computing device
Zou et al. An integrated disaster rapid cloud service platform using remote sensing data
CN114710481B (en) Flow ticket analysis method, device, equipment and storage medium based on big data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination