CN111625596B - Multi-source data synchronous sharing method and system for real-time new energy consumption scheduling - Google Patents

Multi-source data synchronous sharing method and system for real-time new energy consumption scheduling Download PDF

Info

Publication number
CN111625596B
CN111625596B CN202010406657.3A CN202010406657A CN111625596B CN 111625596 B CN111625596 B CN 111625596B CN 202010406657 A CN202010406657 A CN 202010406657A CN 111625596 B CN111625596 B CN 111625596B
Authority
CN
China
Prior art keywords
data
similarity
new energy
names
name
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010406657.3A
Other languages
Chinese (zh)
Other versions
CN111625596A (en
Inventor
姜狄
任一丹
白静洁
潘琦
张旭
武毅
林海峰
王刚
张凤麟
常志朋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Beijing Kedong Electric Power Control System Co Ltd
State Grid Liaoning Electric Power Co Ltd
State Grid Electric Power Research Institute
Original Assignee
State Grid Corp of China SGCC
Beijing Kedong Electric Power Control System Co Ltd
State Grid Liaoning Electric Power Co Ltd
State Grid Electric Power Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Beijing Kedong Electric Power Control System Co Ltd, State Grid Liaoning Electric Power Co Ltd, State Grid Electric Power Research Institute filed Critical State Grid Corp of China SGCC
Priority to CN202010406657.3A priority Critical patent/CN111625596B/en
Publication of CN111625596A publication Critical patent/CN111625596A/en
Application granted granted Critical
Publication of CN111625596B publication Critical patent/CN111625596B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/151Transformation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Economics (AREA)
  • Databases & Information Systems (AREA)
  • Evolutionary Biology (AREA)
  • Human Resources & Organizations (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a multi-source data synchronous sharing method for new energy real-time consumption scheduling, which comprises the steps of obtaining names of all data sources; calculating the similarity between names; merging data source data with name similarity larger than a threshold value; transmitting the combined data to a data warehouse for storage; and in response to triggering data sharing, invoking the data to be shared according to the metadata of the data warehouse. Corresponding systems are also disclosed. According to the invention, data fusion is carried out based on similarity names, and the fused data are stored in a data warehouse, so that integrated management and interactive sharing of multi-source heterogeneous new energy data in different power grid dispatching systems are realized.

Description

Multi-source data synchronous sharing method and system for real-time new energy consumption scheduling
Technical Field
The invention relates to a multi-source data synchronous sharing method and system for new energy real-time consumption scheduling, and belongs to the technical field of power system scheduling automation.
Background
The traditional thermal power generating unit is different, because of the characteristics of new energy sources such as wind power, photovoltaic and the like, the new energy source power generation is full of great uncertainty, and especially in the system construction of wind power, photovoltaic and thermal power AGC, all systems are mutually independent, the geographic positions of the new energy sources are scattered, related data interaction is difficult, comprehensive system advantages are difficult to develop, great difficulty is brought to the absorption and decision of the new energy sources, and a serious wind and light discarding phenomenon is caused.
Because the information data related to the new energy consumption decision is not uniform in design requirements at the initial stages of different system construction, the data generally have different formats and different service attributes, and the data presents the phenomenon that more structured data coexist with unstructured data; therefore, great difficulty is brought to synchronous sharing of data, and no effective multi-source synchronous sharing method for data exists.
Disclosure of Invention
The invention provides a multi-source data synchronous sharing method and system for new energy real-time consumption scheduling, which solve the problems disclosed in the background technology.
In order to solve the technical problems, the invention adopts the following technical scheme:
the multi-source data synchronous sharing method for real-time new energy consumption scheduling comprises,
acquiring names of all data sources;
calculating the similarity between names;
merging data source data with name similarity larger than a threshold value;
transmitting the combined data to a data warehouse for storage;
and in response to triggering data sharing, invoking the data to be shared according to the metadata of the data warehouse.
The similarity between names is calculated by the specific process,
decomposing each name into a word array according to a preset word stock;
similarity between names is calculated based on the segmentation.
In the process of calculating the similarity, a regularization method is adopted to convert special characters in the names, a plurality of continuous digital single words are connected as a whole, and fields representing the unique attributes of the equipment are subjected to feature transcoding.
If the name information is missing, the key field is used for filling in the name, and then similarity calculation is carried out.
And merging the data source data with the name similarity larger than the threshold value, removing redundancy, and sending the merged and redundancy-removed data to a data warehouse for storage.
The metadata of the data warehouse comprises a core layer and an extension layer; the core layer and the extension layer both contain metadata elements; the metadata elements of the core layer are features shared by the data, and the metadata elements of the extension layer are application features not shared by the data.
In the data warehouse, the increment data of the day is connected with the total data of the previous day in an all-out way.
The multi-source data synchronous sharing system for real-time new energy consumption scheduling comprises,
name acquisition module: acquiring names of all data sources;
similarity calculation module: calculating the similarity between names;
and a data merging module: merging data source data with name similarity larger than a threshold value;
and the sending storage module is used for: transmitting the combined data to a data warehouse for storage;
and (3) a calling module: and in response to triggering data sharing, invoking the data to be shared according to the metadata of the data warehouse.
A computer readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by a computing device, cause the computing device to perform a method of multi-source data synchronization sharing for new energy real-time consumption scheduling.
A computing device comprising one or more processors, memory, and one or more programs, wherein one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs comprising instructions for performing a multi-source data synchronization sharing method of new energy real-time consumption scheduling.
The invention has the beneficial effects that: according to the invention, data fusion is carried out based on similarity names, and the fused data are stored in a data warehouse, so that integrated management and interactive sharing of multi-source heterogeneous new energy data in different power grid dispatching systems are realized.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a diagram of a metadata structure;
fig. 3 is an incremental data synchronization flow.
Detailed Description
The invention is further described below with reference to the accompanying drawings. The following examples are only for more clearly illustrating the technical aspects of the present invention, and are not intended to limit the scope of the present invention.
As shown in fig. 1, the method for synchronously sharing multi-source data for real-time new energy consumption scheduling includes the following steps:
and step 1, acquiring the names of all the data sources, namely acquiring the names of corresponding devices in the new energy system.
And 2, calculating the similarity between the names.
The method comprises the following steps: and decomposing each name into a word segmentation array according to a preset word stock, and calculating the similarity between the names based on the word segmentation.
Wherein, the following improvement is made in the process of calculating the similarity:
1. special characters in the names are converted by adopting a regularization method, such as special symbols of "," \ "," # ", and the like in the normalized processing equipment names.
2. Connecting a plurality of continuous digital single words as a whole;
after word segmentation processing is carried out on the equipment names, a plurality of single word conditions appear, and continuous digital single words are connected and matched as a whole, so that the conditions of '500' or '330' and the like in the voltage class can be conveniently processed.
3. Fields that characterize the unique attributes of the device are feature transcoded to increase the importance and unique nature of the field, increasing the comparison weight when a similarity comparison is entered.
And (3) word stock construction: constructing a power grid structural feature, a paraphrasing word library, a transcoding word library and automatically expanding according to a comparison result; the specific program flow is as follows:
input: all device names
And (3) outputting: professional word stock of power equipment
Word segmentation: based on the Trie, completing feature structure word segmentation by a forward maximum matching algorithm; the specific program flow is as follows:
input: power equipment name
And (3) outputting: equipment name word segmentation array
The similarity calculation is carried out, and the specific program flow is as follows:
input: two power equipment word segmentation vectors T x =w 1 ,w 2 ,...,w n
And (3) outputting: similarity S
The names of the electric power equipment are usually composed of continuous characters, characters and numbers, the sequences of the characters and the numbers are different under the condition of lack of a spacer, the segmentation process is disagreement, the sequence of the segmented character strings is complex, and the identification of the names of the equipment is more complex than the processing of simple Chinese or English naming; in addition, because the system has no standardized management of geography and address names for a long time, multiple expression forms such as abbreviation records with local and personal habits aggravate the difficulty of identifying the names of the devices, for example, a ' Ma Zhuang 220 kV#4bus is recorded as a ' Ma Zhuang bus '. The above example results in the absence of keywords during the data sharing synchronization process.
If the name information is missing, the key field is used for filling in the name, and then similarity calculation is carried out. The Chinese word segmentation algorithm is mainly based on character string matching, and aiming at the missing keywords, character strings are scanned, if the substrings and words of the character strings are found to be identical, the matching is considered, and some heuristic rules, such as 'forward/reverse maximum matching', 'long word priority', and the like, are added.
Because the number of primary devices is large, the station to which the devices belong is set as the associated field, and data filtering is completed before word segmentation and similarity calculation, so that the efficiency is improved.
And 3, merging the data source data with the name similarity larger than the threshold value, and removing redundancy.
And step 4, sending the combined data to a data warehouse for storage.
Metadata is a logical structure of a data warehouse, which is data about data. Metadata may describe various data in the database in detail, be it data or instructions in the process. Metadata herein pertains to technical metadata, including data source information, descriptions of data transformations, definitions of objects and data structures within a data warehouse, and other specifications.
As shown in fig. 2, metadata of a data warehouse includes a core layer and an extension layer; the core layer and the extension layer both contain metadata elements; the metadata elements of the core layer are features shared by the data, the metadata of the core layer are used for describing the aggregate information of the power grid regulation data, and the most basic access information and the description of the features of the metadata when the metadata enter a data warehouse are recorded, so that the metadata has extremely strong applicability and compatibility; the metadata element of the extension layer is an application feature which is not shared by the data, and can be combined with the core layer to describe one type of metadata completely.
Taking a new energy real-time consumption scheduling auxiliary decision information sharing cloud platform based on metadata as an example, the metadata items mainly comprise power plant stations, regional (geographic) positions, unit types, scheduling relations, scheduling plan information, output information, electricity limiting reasons and time periods, deep scheduling types, compensation allocation types, spot transaction conditions and the like and contact information of related data. The extended metadata set of the data such as the scheduling decision of the new energy real-time consumption has to be extracted and generated aiming at the common meta information items. The core layer element table definition is shown in table 1.
Table 1 core layer element table definition
/>
/>
Whether the core layer or the extension layer, each data element may have a name and a value, the attribute definition of which should contain the attribute values shown in table 2.
Table 4 attribute definition of metadata elements
The metadata element may extend the attribute definition to extend additional attributes of the metadata element. The extended attribute item mainly describes the special information item of a certain type of information data and the directory structure information of the resource in the shared vocabulary data set, so that the structural relation among the multi-source information data is revealed, and the extended attribute definition of the metadata element is shown in table 3.
Table 5 extended attribute definition for metadata elements
/>
And the metadata expansion principle in the new energy real-time consumption scheduling decision system conforms to the meaning expressed by the table. According to the characteristics of real-time consumption scheduling service and new energy information sharing of the whole new energy network, when the shared metadata is determined, the complexity and the data weight of the new energy service are fully considered, and the data sharing, service handling and information query are fully satisfied. Meanwhile, considering the complexity of the new energy business data, the metadata entity can be defined as a composite entity, namely the existing metadata entity can be used as a component part of the new entity, and the metadata is allowed to represent the existing metadata element value field with the self text instead of the value field. The number of parameters can be increased, but the extended attribute has the same logic as before extension.
In the batch synchronization of the power grid regulation data, the data volume is larger and larger along with the rich scale of the power grid service, if the full-volume data is synchronized at a certain period, errors are easy to occur and the efficiency is low, and the full-volume synchronization of a service system is not scientific in the face of massive data every day. The method can be set to only synchronize the new changed data which changes each time when the strategy is selected, and then the new changed data is combined with the full data acquired in the last synchronization period.
As shown in fig. 3, in the data warehouse, the increment data of the current day and the full data of the previous day are all externally connected, the latest full data is reloaded, and under the condition of large-scale data volume, the efficiency is much higher than that of the traditional method of inserting and updating, so that the latest full version can be obtained through updating every day. Meanwhile, when the operation of physically deleting the data table exists in the service system table of the new energy real-time consumption schedule, the batch synchronization mode of the incremental data is selected under the condition that the data warehouse guarantees the advantage of the historical data, so that the latest data snapshot can be reserved permanently, and the stability, safety and accuracy of the service system are ensured.
And step 5, responding to triggering data sharing, and calling the data to be shared according to the metadata of the data warehouse.
According to the method, data fusion is carried out based on similarity names, and the fused data are stored in a data warehouse, so that integrated management and interactive sharing of multi-source heterogeneous new energy data in different power grid dispatching systems are realized.
The multi-source data synchronous sharing system for real-time new energy consumption scheduling comprises,
name acquisition module: acquiring names of all data sources;
similarity calculation module: calculating the similarity between names;
and a data merging module: merging data source data with name similarity larger than a threshold value;
and the sending storage module is used for: transmitting the combined data to a data warehouse for storage;
and (3) a calling module: and in response to triggering data sharing, invoking the data to be shared according to the metadata of the data warehouse.
A computer readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by a computing device, cause the computing device to perform a method of multi-source data synchronization sharing for new energy real-time consumption scheduling.
A computing device comprising one or more processors, memory, and one or more programs, wherein one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs comprising instructions for performing a multi-source data synchronization sharing method of new energy real-time consumption scheduling.
It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The foregoing is illustrative of the present invention and is not to be construed as limiting thereof, but rather as providing for the use of additional embodiments and advantages of all such modifications, equivalents, improvements and similar to the present invention are intended to be included within the scope of the present invention as defined by the appended claims.

Claims (8)

1. The multi-source data synchronous sharing method for real-time new energy consumption scheduling is characterized by comprising the following steps of: comprising the steps of (a) a step of,
acquiring names of all data sources;
decomposing each name into a word segmentation array according to a preset word stock, and calculating the similarity between the names based on word segmentation; wherein, the similarity is the weighted sum of semantic similarity and word sequence similarity;
in the process of calculating the similarity, converting special characters in the names by adopting a regularization method, connecting a plurality of continuous digital single words as a whole, performing feature transcoding on fields representing unique attributes of equipment, and increasing comparison proportion when similarity comparison is performed;
merging data source data with name similarity larger than a threshold value;
transmitting the combined data to a data warehouse for storage;
and in response to triggering data sharing, invoking the data to be shared according to the metadata of the data warehouse.
2. The multi-source data synchronous sharing method for new energy real-time consumption scheduling according to claim 1, wherein the method is characterized by comprising the following steps: if the name information is missing, the key field is used for filling in the name, and then similarity calculation is carried out.
3. The multi-source data synchronous sharing method for new energy real-time consumption scheduling according to claim 1, wherein the method is characterized by comprising the following steps: and merging the data source data with the name similarity larger than the threshold value, removing redundancy, and sending the merged and redundancy-removed data to a data warehouse for storage.
4. The multi-source data synchronous sharing method for new energy real-time consumption scheduling according to claim 1, wherein the method is characterized by comprising the following steps: the metadata of the data warehouse comprises a core layer and an extension layer; the core layer and the extension layer both contain metadata elements; the metadata elements of the core layer are features shared by the data, and the metadata elements of the extension layer are application features not shared by the data.
5. The multi-source data synchronous sharing method for new energy real-time consumption scheduling according to claim 1, wherein the method is characterized by comprising the following steps: in the data warehouse, the increment data of the day is connected with the total data of the previous day in an all-out way.
6. The multi-source data synchronous sharing system for real-time new energy consumption scheduling is characterized in that: comprising the steps of (a) a step of,
name acquisition module: acquiring names of all data sources;
similarity calculation module: decomposing each name into a word segmentation array according to a preset word stock, and calculating the similarity between the names based on word segmentation; wherein, the similarity is the weighted sum of semantic similarity and word sequence similarity;
in the process of calculating the similarity, converting special characters in the names by adopting a regularization method, connecting a plurality of continuous digital single words as a whole, performing feature transcoding on fields representing unique attributes of equipment, and increasing comparison proportion when similarity comparison is performed;
and a data merging module: merging data source data with name similarity larger than a threshold value;
and the sending storage module is used for: transmitting the combined data to a data warehouse for storage;
and (3) a calling module: and in response to triggering data sharing, invoking the data to be shared according to the metadata of the data warehouse.
7. A computer readable storage medium storing one or more programs, characterized by: the one or more programs include instructions, which when executed by a computing device, cause the computing device to perform any of the methods of claims 1-5.
8. A computing device, characterized by: comprising the steps of (a) a step of,
one or more processors, memory, and one or more programs, wherein one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs comprising instructions for performing any of the methods of claims 1-5.
CN202010406657.3A 2020-05-14 2020-05-14 Multi-source data synchronous sharing method and system for real-time new energy consumption scheduling Active CN111625596B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010406657.3A CN111625596B (en) 2020-05-14 2020-05-14 Multi-source data synchronous sharing method and system for real-time new energy consumption scheduling

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010406657.3A CN111625596B (en) 2020-05-14 2020-05-14 Multi-source data synchronous sharing method and system for real-time new energy consumption scheduling

Publications (2)

Publication Number Publication Date
CN111625596A CN111625596A (en) 2020-09-04
CN111625596B true CN111625596B (en) 2023-12-26

Family

ID=72271916

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010406657.3A Active CN111625596B (en) 2020-05-14 2020-05-14 Multi-source data synchronous sharing method and system for real-time new energy consumption scheduling

Country Status (1)

Country Link
CN (1) CN111625596B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112561499A (en) * 2021-02-01 2021-03-26 开封大学 New energy data management system based on low-carbon economy
CN113259470B (en) * 2021-06-03 2021-09-24 长视科技股份有限公司 Data synchronization method and data synchronization system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982168A (en) * 2012-12-12 2013-03-20 江苏省电力公司信息通信分公司 Metadata schema matching method based on XML (extensive markup language) document
CN107862047A (en) * 2017-11-08 2018-03-30 爱财科技有限公司 Natural person's data processing method and system based on multiple data sources
CN110457018A (en) * 2019-08-16 2019-11-15 浪潮云信息技术有限公司 A kind of data management system and its management method based on Hadoop
CN110866397A (en) * 2019-11-14 2020-03-06 国网冀北电力有限公司 Electric power equipment model feature matching method based on Terry Search Trie
US10599395B1 (en) * 2017-10-11 2020-03-24 Amperity, Inc. Dynamically merging database tables
CN111126055A (en) * 2019-10-28 2020-05-08 国电南瑞科技股份有限公司 Power grid equipment name matching method and system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10387389B2 (en) * 2014-09-30 2019-08-20 International Business Machines Corporation Data de-duplication
US10929400B2 (en) * 2016-10-26 2021-02-23 International Business Machines Corporation Self-optimizing value range synopsis in analytical databases

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982168A (en) * 2012-12-12 2013-03-20 江苏省电力公司信息通信分公司 Metadata schema matching method based on XML (extensive markup language) document
US10599395B1 (en) * 2017-10-11 2020-03-24 Amperity, Inc. Dynamically merging database tables
CN107862047A (en) * 2017-11-08 2018-03-30 爱财科技有限公司 Natural person's data processing method and system based on multiple data sources
CN110457018A (en) * 2019-08-16 2019-11-15 浪潮云信息技术有限公司 A kind of data management system and its management method based on Hadoop
CN111126055A (en) * 2019-10-28 2020-05-08 国电南瑞科技股份有限公司 Power grid equipment name matching method and system
CN110866397A (en) * 2019-11-14 2020-03-06 国网冀北电力有限公司 Electric power equipment model feature matching method based on Terry Search Trie

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于本体的数据信息融合模型的研究;陈飞;杨秋红;;安徽电子信息职业技术学院学报(02);全文 *

Also Published As

Publication number Publication date
CN111625596A (en) 2020-09-04

Similar Documents

Publication Publication Date Title
CN106777644B (en) Automatic generation method and device for power plant identification system code
CN101079024B (en) Special word list dynamic generation system and method
CN112527997B (en) Intelligent question-answering method and system based on power grid field scheduling scene knowledge graph
CN104035754A (en) XML (Extensible Markup Language)-based custom code generation method and generator
CN102426609A (en) Index generation method and index generation device based on MapReduce programming architecture
CN111625596B (en) Multi-source data synchronous sharing method and system for real-time new energy consumption scheduling
CN101799808A (en) Data processing method and system thereof
CN110674154A (en) Spark-based method for inserting, updating and deleting data in Hive
CN114077674A (en) Power grid dispatching knowledge graph data optimization method and system
CN103235811A (en) Data storage method and device
CN110851511A (en) Data synchronization method and device
Elizarov et al. Big Math Methods in Lobachevskii-DML Digital Library.
CN115687468A (en) System for processing data in distributed service by ETL process button
CN114186533A (en) Model training method and device, knowledge extraction method and device, equipment and medium
CN115757810A (en) Method for constructing standard ontology of knowledge graph
CN101916260A (en) Method for establishing semantic mapping between disaster body and relational database
CN103927176A (en) Method for generating program feature tree on basis of hierarchical topic model
CN113032371A (en) Database grammar analysis method and device and computer equipment
CN112214602A (en) Text classification method and device based on humor, electronic equipment and storage medium
CN117056564A (en) Power topology island detection method, device, equipment and storage medium
CN114116779A (en) Deep learning-based power grid regulation and control field information retrieval method, system and medium
CN103020162B (en) CC2000 model conversion imports the method for D5000 system
CN115937881A (en) Method for automatically identifying content of knowledge graph construction standard form
CN112214494B (en) Retrieval method and device
CN115033280A (en) Knowledge graph-based automatic generation method for requirement specification document and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant