CN112801817A - Electric energy quality data center construction method and system - Google Patents

Electric energy quality data center construction method and system Download PDF

Info

Publication number
CN112801817A
CN112801817A CN202110123467.5A CN202110123467A CN112801817A CN 112801817 A CN112801817 A CN 112801817A CN 202110123467 A CN202110123467 A CN 202110123467A CN 112801817 A CN112801817 A CN 112801817A
Authority
CN
China
Prior art keywords
data
standing book
book data
ledger
bhattacharyya
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110123467.5A
Other languages
Chinese (zh)
Other versions
CN112801817B (en
Inventor
徐思尧
李妍
程晨
张子瑛
彭明洋
周刚
陈晓科
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electric Power Research Institute of Guangdong Power Grid Co Ltd
Original Assignee
Electric Power Research Institute of Guangdong Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electric Power Research Institute of Guangdong Power Grid Co Ltd filed Critical Electric Power Research Institute of Guangdong Power Grid Co Ltd
Publication of CN112801817A publication Critical patent/CN112801817A/en
Application granted granted Critical
Publication of CN112801817B publication Critical patent/CN112801817B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0637Strategic management or analysis, e.g. setting a goal or target of an organisation; Planning actions based on goals; Analysis or evaluation of effectiveness of goals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06395Quality analysis or management
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • Economics (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Data Mining & Analysis (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Educational Administration (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Development Economics (AREA)
  • Marketing (AREA)
  • Health & Medical Sciences (AREA)
  • Game Theory and Decision Science (AREA)
  • Operations Research (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Water Supply & Treatment (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Remote Sensing (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a construction method and a system of a power quality data center, comprising the following steps: splitting the data of the system into ledger data and production data according to the attribute characteristics; and judging whether the standing book data in each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data and mount the production data in the integrated standing book data. The invention starts from the characteristics of a power grid system, provides a fusion algorithm which is designed according to the fusion framework and the ledger data characteristics of the industry according to the power grid commonality, improves the matching correctness of single words, can solve the realization of the fusion of ledgers with different lengths, and improves the fusion efficiency of ledger data.

Description

Electric energy quality data center construction method and system
Technical Field
The invention relates to the technical field of power systems, in particular to a method and a system for constructing a power quality data center, a computer terminal device and a readable storage medium.
Background
In recent years, with the rapid development of high-voltage direct-current transmission technology, distributed micro-grid and other technologies, the form of a power grid is changed greatly, the power quality mechanism of the power grid caused by the new technologies is more complex, and the new technologies extend to ultra-high voltage and distribution networks, so that the problem of power quality exists in each link of a modern power system.
In contrast, grid companies have installed very few power quality monitoring devices, and cover primarily 10kV and above buses. With the development of the power grid, the power grid production system constructed by devices such as the synchronous phasor measurement unit containing the power quality data enables the acquisition of the power quality data covering the whole power grid.
In view of the complexity and the immediacy of the power quality data, how to reasonably design and establish a power quality data center is of great importance to the whole monitoring system. Because the power quality industry is lack of uniform data formats and specifications all the time, a plurality of power companies and enterprises can produce monitoring equipment, the monitoring equipment and analysis tools produced by various manufacturers have characteristics, the monitoring data emphasis is different, and the data formats are more diverse and incompatible. This is very disadvantageous for information sharing and application integration between applications within the utility company and between utility companies.
Disclosure of Invention
The invention aims to provide a method for constructing an electric energy quality data center, which is characterized in that a fusion algorithm which is designed according to the fusion framework and the ledger data characteristics of the industry is provided according to the power grid commonality, so that the matching correctness of single words is improved, the realization of the ledger fusion with unequal lengths can be solved, and the fusion efficiency of ledger data is improved.
In order to achieve the above object, an embodiment of the present invention provides a method for constructing an electric energy quality data center, including:
splitting the data of the system into ledger data and production data according to the attribute characteristics;
judging whether the standing book data in each system are the same according to the Bhattacharyya distance and a DTW algorithm so as to integrate the standing book data;
and mounting the production data in the integrated standing book data.
In one embodiment, the system comprises a production management system, a scheduling automation system, a distribution network automation system, a metering automation system, a marketing system, a GIS system, a voltage system, and a power quality system.
In a certain embodiment, before determining whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm, the method further includes performing subset division on the standing book data according to a management unit.
In a certain embodiment, the determining whether the ledger data of each system is the same according to the Bhattacharyya distance and the DTW algorithm includes:
obtaining a single-word Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
according to the DTW algorithm, sequentially accumulating coefficients of all the points Bhattacharyya passing through, and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
and judging whether the two standing book data are the same or not, wherein the judgment formula is as follows:
Figure BDA0002922302290000021
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
The embodiment of the invention also provides a construction system of the electric energy quality data center, which is applied to the construction method of the electric energy quality data center in any embodiment. The method comprises the following steps:
the system data splitting module is used for splitting the system data into ledger data and production data according to the attribute characteristics;
the standing book data integration module is used for judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data;
and the production data mounting module is used for mounting the production data in the integrated standing book data.
In one embodiment, the system further comprises an account data subset dividing module, and the account data subset dividing module is used for performing subset division on the account data according to a management unit.
In one embodiment, the ledger data integration module includes:
the standing book data single-character similarity calculation unit is used for calculating a single-character Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
the standing book data accumulation distance calculation unit is used for sequentially accumulating coefficients of all the points Bhattacharyya passing through according to a DTW algorithm and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
the standing book data identity judging unit is used for judging whether the two standing book data are the same or not, and the judging formula is as follows:
Figure BDA0002922302290000031
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
The embodiment of the invention also provides computer terminal equipment which comprises one or more processors and a memory. A memory coupled to the processor for storing one or more programs; when executed by the one or more processors, cause the one or more processors to implement a method of building a power quality data center as in any of the embodiments described above.
The embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the method for constructing the power quality data center according to any of the above embodiments.
According to the electric energy quality data center construction method and the electric energy quality data center construction system, based on the characteristics of a power grid system, a fusion algorithm which is designed according to the fusion framework and the ledger data characteristics of the industry is provided according to the power grid commonalities, so that the matching correctness of single words is improved, the realization of ledger fusion with unequal lengths can be solved, and the ledger data fusion efficiency is improved.
Drawings
In order to more clearly illustrate the technical solution of the present invention, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flow chart of a method for constructing a power quality data center according to an embodiment of the present invention;
fig. 2 is a frame diagram of a power quality data center construction method according to an embodiment of the present invention;
fig. 3 is a schematic flow chart illustrating subset partitioning in a power quality data center construction method according to an embodiment of the present invention;
fig. 4 is a schematic diagram of a result of a full-spelling probability calculation in a power quality data center construction method according to an embodiment of the present invention;
fig. 5 is a schematic structural diagram of a computer terminal device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be understood that the step numbers used herein are for convenience of description only and are not intended as limitations on the order in which the steps are performed.
It is to be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in the specification of the present invention and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
The terms "comprises" and "comprising" indicate the presence of the described features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The term "and/or" refers to and includes any and all possible combinations of one or more of the associated listed items.
Referring to fig. 1, an embodiment of the present invention provides a method for constructing a power quality data center, including:
s10, splitting the data of the system into ledger data and production data according to the attribute characteristics;
s20, judging whether the standing book data in each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data;
and S30, mounting the production data in the integrated ledger data.
Referring to fig. 2, in the present embodiment, system data (a production management system, a scheduling automation system, a distribution network automation system, a metering automation system, a marketing system, a GIS system, a voltage system, and an electric energy quality system) is divided into ledger data and production data, where the ledger data is archive information representing an electric power object and is composed of a plurality of sub-attributes, and each attribute value is generally fixed and unchanged, for example, the ledger data of a transformer generally includes a transformer name, a model, a capacity, an id allocated to a system where the transformer is located, and the production data is variable, and the production data represents dynamic operation data of the electric power object and is generally related to time, for example: the transformer operation data comprises voltage, current, work, reactive power and the like of each time sequence, so that the production data is stored by virtue of the ledger data.
Finding out the commonalities of the standing book data of each system is the key to realize data integration. And (3) solving the similarity of the Chinese character full spelling of the account data by utilizing the Bhattacharyya distance, solving the account similarity by utilizing improved DTW (dynamic Time warping) dynamic Time normalization, and then judging the identity. And finally, mounting the production data in the integrated ledger data to complete the construction of the whole power quality data center. A fusion framework conforming to the industry is provided according to the power grid commonality, and the fusion algorithm designed according to the standing book data characteristics improves the matching correctness of single characters.
In one embodiment, the system comprises a production management system, a scheduling automation system, a distribution network automation system, a metering automation system, a marketing system, a GIS system, a voltage system, and a power quality system.
In a certain embodiment, before determining whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm, the method further includes performing subset division on the standing book data according to a management unit.
Referring to fig. 3, in the present embodiment, in the standing book integration implementation module, word set division is performed according to a management unit, so that standing book repetition is reduced and fusion efficiency is improved. Ledger names are the names of power system objects, but may be repeated from system to system, such as: the Zhongshan power supply bureau and the Zhuhai power supply bureau of the voltage monitoring system both find that the standing account name is the distribution transformer name of the private transformer of the local public security bureau. Therefore, by means of the management relation of the power grid company, the management unit to which the standing book name belongs is judged from top to bottom until the unit directly responsible for the standing book is found. Taking the integrated provincial system as an example, the next level is a power supply bureau, the next level of the power supply bureau is a sub-county bureau, and the next level of the sub-county bureau is a power supply station or a transformer substation, which are sequentially searched downwards. The processing is a classification process, and the multi-system ledger integration efficiency can be improved.
In a certain embodiment, the determining whether the ledger data of each system is the same according to the Bhattacharyya distance and the DTW algorithm includes:
obtaining a single-word Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
according to the DTW algorithm, sequentially accumulating coefficients of all the points Bhattacharyya passing through, and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
and judging whether the two standing book data are the same or not, wherein the judgment formula is as follows:
Figure BDA0002922302290000051
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
In this embodiment, the similarity of the full spelling of the Chinese characters is obtained according to the distance Bhattacharyya. The Bhattacharyya distance is used to measure the similarity of two discrete or continuous probabilities, which is defined as: in the same domain X, the babbitt distance of two discrete probability distributions p and q is defined as follows:
DB(p,q)=-ln(BC(p,q)) (1)
Figure BDA0002922302290000052
the Chinese character full spelling is converted into a probability histogram, the similarity of any two Chinese characters is obtained by using the formula (1) and the formula (2), the higher the similarity is, the closer BC is to 1, and otherwise, the closer BC is to 0.
Firstly, probability conversion is carried out on Chinese characters, and the process is as follows: for any purposeChinese character y, its complete spelling is sequence H, H ═ H1,h2..hr.,ht],hrIs the r-th letter of the full spelling of y, and t represents the full spelling length. Numbering according to the alphabet in sequence, and taking the numbering as a horizontal coordinate of the histogram; counting the total number a of phonetic letters of the Chinese character y and the number n of each letterrThe ratio P (h) of each letter is calculated according to the formula (3)i) And as the value of the histogram.
Figure BDA0002922302290000053
And then DTW (dynamic Time warping) dynamic Time integration is carried out to obtain the standing book similarity. The lengths of the names of the same objects in the systems are mostly different, and on the other hand, the standing book names of the power systems are named according to the power supply relation and can be considered to have time sequence, so that the DTW is suitable for solving the similarity of the standing book names.
Referring to FIG. 4, for any two ledger names QaAnd CgThe subscript indicates the number of Chinese characters, and a and g may be different. By finding the cumulative distance γ: starting from (0,0), search is performed to find Q using the Bhattacharyya distanceaAnd CgSimilarity of two Chinese characters, e.g. QaContains the Chinese character "Tang", and Cg contains the Chinese character "box", and the total spelling probability is respectively obtained by using a formula (3). Calculating Bhattacharyya coefficients of the Tang and the frame to be 0.67 through the formula (2), sequentially accumulating the Bhattacharyya coefficients passing through all points, obtaining an accumulated distance gamma after reaching the end points (a, g), solving the formula to be the formula (4),
equation (4) is a variation of the existing DTW, modified by: the Bhattacharyya coefficient B () is used for replacing a common Euclidean distance, the maximum value is obtained by the searching process, the traditional minimum value is not solved, the maximum value is determined by the Bhattacharyya coefficient characteristic, and the calculation formula is as follows:
γ(i,j)=B(qi,cj)+max{γ(i-1,j-1),γ(i-1,j),γ(i,j-1)} (4)
qirepresents QaThe ith Chinese character of (1), cjIs represented by CgThe jth Chinese character of (1), and B (q)i,cj) ThenRepresenting a computational Chinese character qiAnd cjThe Bhattacharyya coefficient of (a),
finally, when the weight end point (a, g) is reached, the cumulative distance γ (a, g) is obtained, and the formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
and finally, judging whether the two standing book data are the same through a formula (5), wherein the formula is as follows
Figure BDA0002922302290000061
r is the effective matching times of gamma (i, j), tau is a preset threshold, when the formula (5) is satisfied, the two standing book data are judged to be the same, and if the formula (5) is not satisfied, the two standing book data are judged to be different.
The embodiment of the invention also provides a construction system of the electric energy quality data center, which is applied to the construction method of the electric energy quality data center in any embodiment. The method comprises the following steps:
the system data splitting module is used for splitting the system data into ledger data and production data according to the attribute characteristics;
the standing book data integration module is used for judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data;
and the production data mounting module is used for mounting the production data in the integrated standing book data.
In one embodiment, the system further comprises an account data subset dividing module, and the account data subset dividing module is used for performing subset division on the account data according to a management unit.
In one embodiment, the ledger data integration module includes:
the standing book data single-character similarity calculation unit is used for calculating a single-character Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
the standing book data accumulation distance calculation unit is used for sequentially accumulating coefficients of all the points Bhattacharyya passing through according to a DTW algorithm and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
the standing book data identity judging unit is used for judging whether the two standing book data are the same or not, and the judging formula is as follows:
Figure BDA0002922302290000071
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
For specific limitations of the power quality data center construction system, reference may be made to the above limitations on the power quality data center construction method, which will not be described herein again. The modules in the construction system of the power quality data center can be wholly or partially realized by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
Referring to fig. 5, an embodiment of the invention provides a computer terminal device, which includes one or more processors and a memory. The memory is coupled to the processor and configured to store one or more programs that, when executed by the one or more processors, cause the one or more processors to implement the method of building a power quality data center as in any of the embodiments described above.
The processor is used for controlling the overall operation of the computer terminal equipment so as to complete all or part of the steps of the construction method of the power quality data center. The memory is used to store various types of data to support the operation at the computer terminal device, which data may include, for example, instructions for any application or method operating on the computer terminal device, as well as application-related data. The Memory may be implemented by any type of volatile or non-volatile Memory device or combination thereof, such as Static Random Access Memory (SRAM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Erasable Programmable Read-Only Memory (EPROM), Programmable Read-Only Memory (PROM), Read-Only Memory (ROM), magnetic Memory, flash Memory, magnetic disk, or optical disk.
In an exemplary embodiment, the computer terminal Device may be implemented by one or more Application Specific 1 integrated circuits (AS 1C), a Digital Signal Processor (DSP), a Digital Signal Processing Device (DSPD), a Programmable Logic Device (PLD), a Field Programmable Gate Array (FPGA), a controller, a microcontroller, a microprocessor or other electronic components, and is configured to perform the above-mentioned method for constructing the power quality data center, and achieve the technical effects consistent with the above-mentioned method.
In another exemplary embodiment, there is also provided a computer readable storage medium comprising program instructions which, when executed by a processor, implement the steps of the method of constructing a power quality data center in any of the above embodiments. For example, the computer readable storage medium may be the above-mentioned memory including program instructions, which are executable by the processor of the computer terminal device to perform the above-mentioned method for constructing the power quality data center, and achieve the technical effects consistent with the above-mentioned method.
According to the electric energy quality data center construction method and the electric energy quality data center construction system, based on the characteristics of a power grid system, a fusion algorithm which is designed according to the fusion framework and the ledger data characteristics of the industry is provided according to the power grid commonalities, so that the matching correctness of single words is improved, the realization of the fusion of ledgers with different lengths can be solved, and the fusion efficiency of ledger data is improved.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention.

Claims (9)

1. A construction method of a power quality data center is characterized by comprising the following steps:
splitting the data of the system into ledger data and production data according to the attribute characteristics;
judging whether the standing book data in each system are the same according to the Bhattacharyya distance and a DTW algorithm so as to integrate the standing book data;
and mounting the production data in the integrated standing book data.
2. The method of constructing a power quality data center according to claim 1,
the system comprises a production management system, a scheduling automation system, a distribution network automation system, a metering automation system, a marketing system, a GIS system, a voltage system and an electric energy quality system.
3. The method for constructing the power quality data center according to claim 1, wherein before judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm, the method further comprises the step of performing subset division on the standing book data according to a management unit.
4. The method for constructing the power quality data center according to claim 1, wherein the step of judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm comprises the steps of:
obtaining a single-word Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
according to the DTW algorithm, sequentially accumulating coefficients of all the points Bhattacharyya passing through, and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
and judging whether the two standing book data are the same or not, wherein the judgment formula is as follows:
Figure FDA0002922302280000011
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
5. A construction system of a power quality data center is characterized by comprising:
the system data splitting module is used for splitting the system data into ledger data and production data according to the attribute characteristics;
the standing book data integration module is used for judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data;
and the production data mounting module is used for mounting the production data in the integrated standing book data.
6. The system for constructing the power quality data center according to claim 5, further comprising a ledger data subset partitioning module, wherein the ledger data subset partitioning module is configured to perform subset partitioning on the ledger data according to a management unit.
7. The system for building a power quality data center according to claim 5, wherein the ledger data integration module comprises:
the standing book data single-character similarity calculation unit is used for calculating a single-character Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
the standing book data accumulation distance calculation unit is used for sequentially accumulating coefficients of all the points Bhattacharyya passing through according to a DTW algorithm and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
the standing book data identity judging unit is used for judging whether the two standing book data are the same or not, and the judging formula is as follows:
Figure FDA0002922302280000021
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
8. A computer terminal device, comprising:
one or more processors;
a memory coupled to the processor for storing one or more programs;
when executed by the one or more processors, cause the one or more processors to implement the method of constructing a power quality data center of any of claims 1 to 4.
9. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out a method of building a power quality data center according to any one of claims 1 to 4.
CN202110123467.5A 2020-12-29 2021-01-28 Electric energy quality data center construction method and system thereof Active CN112801817B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2020116106410 2020-12-29
CN202011610641 2020-12-29

Publications (2)

Publication Number Publication Date
CN112801817A true CN112801817A (en) 2021-05-14
CN112801817B CN112801817B (en) 2023-07-21

Family

ID=75812692

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110123467.5A Active CN112801817B (en) 2020-12-29 2021-01-28 Electric energy quality data center construction method and system thereof

Country Status (1)

Country Link
CN (1) CN112801817B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113610663A (en) * 2021-07-31 2021-11-05 云南电网有限责任公司信息中心 Power grid network frame consistency checking method based on fusion algorithm

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101996631A (en) * 2009-08-28 2011-03-30 国际商业机器公司 Method and device for aligning texts
CN109710647A (en) * 2018-12-29 2019-05-03 广东电网有限责任公司 A kind of power grid account data fusion method and device based on keyword search
CN109977188A (en) * 2019-03-28 2019-07-05 国网河南省电力公司经济技术研究院 A kind of multi-specialized data correlation fusion method of gradual power grid and device
CN111160868A (en) * 2019-12-31 2020-05-15 国网北京市电力公司 Unified distribution network frame topology construction method based on graph fusion technology

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101996631A (en) * 2009-08-28 2011-03-30 国际商业机器公司 Method and device for aligning texts
CN109710647A (en) * 2018-12-29 2019-05-03 广东电网有限责任公司 A kind of power grid account data fusion method and device based on keyword search
CN109977188A (en) * 2019-03-28 2019-07-05 国网河南省电力公司经济技术研究院 A kind of multi-specialized data correlation fusion method of gradual power grid and device
CN111160868A (en) * 2019-12-31 2020-05-15 国网北京市电力公司 Unified distribution network frame topology construction method based on graph fusion technology

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113610663A (en) * 2021-07-31 2021-11-05 云南电网有限责任公司信息中心 Power grid network frame consistency checking method based on fusion algorithm

Also Published As

Publication number Publication date
CN112801817B (en) 2023-07-21

Similar Documents

Publication Publication Date Title
CN111797210A (en) Information recommendation method, device and equipment based on user portrait and storage medium
CN108595523B (en) Equipment data retrieval model construction method and device and computer equipment
CN108733810B (en) Address data matching method and device
CN109066687A (en) A kind of electric power system tide calculation method, system and electronic equipment
CN110990390A (en) Data cooperative processing method and device, computer equipment and storage medium
CN105045927A (en) Automatic coding method and system for data of labor, materials and machines of construction project
CN112383044B (en) Power grid model comparison method and device based on hierarchical topological structure
CN113344450A (en) Low-voltage station area subscriber identification method, system, terminal equipment and storage medium
CN112686418A (en) Method and device for predicting performance timeliness
CN113987190A (en) Data quality check rule extraction method and system
CN112801817A (en) Electric energy quality data center construction method and system
CN106022590B (en) Voltage quality evaluation method and device for active power distribution network
CN111984673B (en) Fuzzy retrieval method and device for tree structure of power grid electric energy metering system
CN106127602B (en) Electricity stealing identification method and device based on reduction outlier algorithm
CN105512270B (en) Method and device for determining related objects
CN114385794A (en) Method, device, equipment and storage medium for generating enterprise knowledge graph
CN113778681B (en) Data processing method and device based on cloud computing and storage medium
CN115203281A (en) Information searching method and device, electronic equipment and storage medium
CN114065961A (en) Intelligent text knowledge management method and system
CN114970495A (en) Name disambiguation method and device, electronic equipment and storage medium
CN113742344A (en) Method and device for indexing power system data
CN113065354A (en) Method for identifying geographic position in corpus and related equipment thereof
CN102609510B (en) Chinese name data processing method and device
CN113987164A (en) Project studying and judging method and device based on domain event knowledge graph
CN112861368A (en) Power distribution network information model construction method and device and terminal equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant