CN112801817A - Electric energy quality data center construction method and system - Google Patents
Electric energy quality data center construction method and system Download PDFInfo
- Publication number
- CN112801817A CN112801817A CN202110123467.5A CN202110123467A CN112801817A CN 112801817 A CN112801817 A CN 112801817A CN 202110123467 A CN202110123467 A CN 202110123467A CN 112801817 A CN112801817 A CN 112801817A
- Authority
- CN
- China
- Prior art keywords
- data
- standing book
- book data
- ledger
- bhattacharyya
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000010276 construction Methods 0.000 title claims abstract description 21
- 238000004519 manufacturing process Methods 0.000 claims abstract description 27
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 24
- 238000000034 method Methods 0.000 claims description 27
- 238000004364 calculation method Methods 0.000 claims description 14
- 230000010354 integration Effects 0.000 claims description 11
- 230000001186 cumulative effect Effects 0.000 claims description 8
- 238000009826 distribution Methods 0.000 claims description 7
- 238000003860 storage Methods 0.000 claims description 5
- 238000000638 solvent extraction Methods 0.000 claims description 4
- 238000009825 accumulation Methods 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims description 3
- 230000004927 fusion Effects 0.000 abstract description 19
- 238000012544 monitoring process Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012806 monitoring device Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/06—Energy or water supply
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/29—Geographical information databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0637—Strategic management or analysis, e.g. setting a goal or target of an organisation; Planning actions based on goals; Analysis or evaluation of effectiveness of goals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0639—Performance analysis of employees; Performance analysis of enterprise or organisation operations
- G06Q10/06395—Quality analysis or management
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y04—INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
- Y04S—SYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
- Y04S10/00—Systems supporting electrical power generation, transmission or distribution
- Y04S10/50—Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Theoretical Computer Science (AREA)
- Economics (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Strategic Management (AREA)
- Data Mining & Analysis (AREA)
- Entrepreneurship & Innovation (AREA)
- Educational Administration (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Quality & Reliability (AREA)
- Development Economics (AREA)
- Marketing (AREA)
- Health & Medical Sciences (AREA)
- Game Theory and Decision Science (AREA)
- Operations Research (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Water Supply & Treatment (AREA)
- Primary Health Care (AREA)
- Public Health (AREA)
- Remote Sensing (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a construction method and a system of a power quality data center, comprising the following steps: splitting the data of the system into ledger data and production data according to the attribute characteristics; and judging whether the standing book data in each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data and mount the production data in the integrated standing book data. The invention starts from the characteristics of a power grid system, provides a fusion algorithm which is designed according to the fusion framework and the ledger data characteristics of the industry according to the power grid commonality, improves the matching correctness of single words, can solve the realization of the fusion of ledgers with different lengths, and improves the fusion efficiency of ledger data.
Description
Technical Field
The invention relates to the technical field of power systems, in particular to a method and a system for constructing a power quality data center, a computer terminal device and a readable storage medium.
Background
In recent years, with the rapid development of high-voltage direct-current transmission technology, distributed micro-grid and other technologies, the form of a power grid is changed greatly, the power quality mechanism of the power grid caused by the new technologies is more complex, and the new technologies extend to ultra-high voltage and distribution networks, so that the problem of power quality exists in each link of a modern power system.
In contrast, grid companies have installed very few power quality monitoring devices, and cover primarily 10kV and above buses. With the development of the power grid, the power grid production system constructed by devices such as the synchronous phasor measurement unit containing the power quality data enables the acquisition of the power quality data covering the whole power grid.
In view of the complexity and the immediacy of the power quality data, how to reasonably design and establish a power quality data center is of great importance to the whole monitoring system. Because the power quality industry is lack of uniform data formats and specifications all the time, a plurality of power companies and enterprises can produce monitoring equipment, the monitoring equipment and analysis tools produced by various manufacturers have characteristics, the monitoring data emphasis is different, and the data formats are more diverse and incompatible. This is very disadvantageous for information sharing and application integration between applications within the utility company and between utility companies.
Disclosure of Invention
The invention aims to provide a method for constructing an electric energy quality data center, which is characterized in that a fusion algorithm which is designed according to the fusion framework and the ledger data characteristics of the industry is provided according to the power grid commonality, so that the matching correctness of single words is improved, the realization of the ledger fusion with unequal lengths can be solved, and the fusion efficiency of ledger data is improved.
In order to achieve the above object, an embodiment of the present invention provides a method for constructing an electric energy quality data center, including:
splitting the data of the system into ledger data and production data according to the attribute characteristics;
judging whether the standing book data in each system are the same according to the Bhattacharyya distance and a DTW algorithm so as to integrate the standing book data;
and mounting the production data in the integrated standing book data.
In one embodiment, the system comprises a production management system, a scheduling automation system, a distribution network automation system, a metering automation system, a marketing system, a GIS system, a voltage system, and a power quality system.
In a certain embodiment, before determining whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm, the method further includes performing subset division on the standing book data according to a management unit.
In a certain embodiment, the determining whether the ledger data of each system is the same according to the Bhattacharyya distance and the DTW algorithm includes:
obtaining a single-word Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
according to the DTW algorithm, sequentially accumulating coefficients of all the points Bhattacharyya passing through, and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
and judging whether the two standing book data are the same or not, wherein the judgment formula is as follows:
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
The embodiment of the invention also provides a construction system of the electric energy quality data center, which is applied to the construction method of the electric energy quality data center in any embodiment. The method comprises the following steps:
the system data splitting module is used for splitting the system data into ledger data and production data according to the attribute characteristics;
the standing book data integration module is used for judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data;
and the production data mounting module is used for mounting the production data in the integrated standing book data.
In one embodiment, the system further comprises an account data subset dividing module, and the account data subset dividing module is used for performing subset division on the account data according to a management unit.
In one embodiment, the ledger data integration module includes:
the standing book data single-character similarity calculation unit is used for calculating a single-character Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
the standing book data accumulation distance calculation unit is used for sequentially accumulating coefficients of all the points Bhattacharyya passing through according to a DTW algorithm and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
the standing book data identity judging unit is used for judging whether the two standing book data are the same or not, and the judging formula is as follows:
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
The embodiment of the invention also provides computer terminal equipment which comprises one or more processors and a memory. A memory coupled to the processor for storing one or more programs; when executed by the one or more processors, cause the one or more processors to implement a method of building a power quality data center as in any of the embodiments described above.
The embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the method for constructing the power quality data center according to any of the above embodiments.
According to the electric energy quality data center construction method and the electric energy quality data center construction system, based on the characteristics of a power grid system, a fusion algorithm which is designed according to the fusion framework and the ledger data characteristics of the industry is provided according to the power grid commonalities, so that the matching correctness of single words is improved, the realization of ledger fusion with unequal lengths can be solved, and the ledger data fusion efficiency is improved.
Drawings
In order to more clearly illustrate the technical solution of the present invention, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flow chart of a method for constructing a power quality data center according to an embodiment of the present invention;
fig. 2 is a frame diagram of a power quality data center construction method according to an embodiment of the present invention;
fig. 3 is a schematic flow chart illustrating subset partitioning in a power quality data center construction method according to an embodiment of the present invention;
fig. 4 is a schematic diagram of a result of a full-spelling probability calculation in a power quality data center construction method according to an embodiment of the present invention;
fig. 5 is a schematic structural diagram of a computer terminal device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be understood that the step numbers used herein are for convenience of description only and are not intended as limitations on the order in which the steps are performed.
It is to be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in the specification of the present invention and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
The terms "comprises" and "comprising" indicate the presence of the described features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The term "and/or" refers to and includes any and all possible combinations of one or more of the associated listed items.
Referring to fig. 1, an embodiment of the present invention provides a method for constructing a power quality data center, including:
s10, splitting the data of the system into ledger data and production data according to the attribute characteristics;
s20, judging whether the standing book data in each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data;
and S30, mounting the production data in the integrated ledger data.
Referring to fig. 2, in the present embodiment, system data (a production management system, a scheduling automation system, a distribution network automation system, a metering automation system, a marketing system, a GIS system, a voltage system, and an electric energy quality system) is divided into ledger data and production data, where the ledger data is archive information representing an electric power object and is composed of a plurality of sub-attributes, and each attribute value is generally fixed and unchanged, for example, the ledger data of a transformer generally includes a transformer name, a model, a capacity, an id allocated to a system where the transformer is located, and the production data is variable, and the production data represents dynamic operation data of the electric power object and is generally related to time, for example: the transformer operation data comprises voltage, current, work, reactive power and the like of each time sequence, so that the production data is stored by virtue of the ledger data.
Finding out the commonalities of the standing book data of each system is the key to realize data integration. And (3) solving the similarity of the Chinese character full spelling of the account data by utilizing the Bhattacharyya distance, solving the account similarity by utilizing improved DTW (dynamic Time warping) dynamic Time normalization, and then judging the identity. And finally, mounting the production data in the integrated ledger data to complete the construction of the whole power quality data center. A fusion framework conforming to the industry is provided according to the power grid commonality, and the fusion algorithm designed according to the standing book data characteristics improves the matching correctness of single characters.
In one embodiment, the system comprises a production management system, a scheduling automation system, a distribution network automation system, a metering automation system, a marketing system, a GIS system, a voltage system, and a power quality system.
In a certain embodiment, before determining whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm, the method further includes performing subset division on the standing book data according to a management unit.
Referring to fig. 3, in the present embodiment, in the standing book integration implementation module, word set division is performed according to a management unit, so that standing book repetition is reduced and fusion efficiency is improved. Ledger names are the names of power system objects, but may be repeated from system to system, such as: the Zhongshan power supply bureau and the Zhuhai power supply bureau of the voltage monitoring system both find that the standing account name is the distribution transformer name of the private transformer of the local public security bureau. Therefore, by means of the management relation of the power grid company, the management unit to which the standing book name belongs is judged from top to bottom until the unit directly responsible for the standing book is found. Taking the integrated provincial system as an example, the next level is a power supply bureau, the next level of the power supply bureau is a sub-county bureau, and the next level of the sub-county bureau is a power supply station or a transformer substation, which are sequentially searched downwards. The processing is a classification process, and the multi-system ledger integration efficiency can be improved.
In a certain embodiment, the determining whether the ledger data of each system is the same according to the Bhattacharyya distance and the DTW algorithm includes:
obtaining a single-word Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
according to the DTW algorithm, sequentially accumulating coefficients of all the points Bhattacharyya passing through, and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
and judging whether the two standing book data are the same or not, wherein the judgment formula is as follows:
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
In this embodiment, the similarity of the full spelling of the Chinese characters is obtained according to the distance Bhattacharyya. The Bhattacharyya distance is used to measure the similarity of two discrete or continuous probabilities, which is defined as: in the same domain X, the babbitt distance of two discrete probability distributions p and q is defined as follows:
DB(p,q)=-ln(BC(p,q)) (1)
the Chinese character full spelling is converted into a probability histogram, the similarity of any two Chinese characters is obtained by using the formula (1) and the formula (2), the higher the similarity is, the closer BC is to 1, and otherwise, the closer BC is to 0.
Firstly, probability conversion is carried out on Chinese characters, and the process is as follows: for any purposeChinese character y, its complete spelling is sequence H, H ═ H1,h2..hr.,ht],hrIs the r-th letter of the full spelling of y, and t represents the full spelling length. Numbering according to the alphabet in sequence, and taking the numbering as a horizontal coordinate of the histogram; counting the total number a of phonetic letters of the Chinese character y and the number n of each letterrThe ratio P (h) of each letter is calculated according to the formula (3)i) And as the value of the histogram.
And then DTW (dynamic Time warping) dynamic Time integration is carried out to obtain the standing book similarity. The lengths of the names of the same objects in the systems are mostly different, and on the other hand, the standing book names of the power systems are named according to the power supply relation and can be considered to have time sequence, so that the DTW is suitable for solving the similarity of the standing book names.
Referring to FIG. 4, for any two ledger names QaAnd CgThe subscript indicates the number of Chinese characters, and a and g may be different. By finding the cumulative distance γ: starting from (0,0), search is performed to find Q using the Bhattacharyya distanceaAnd CgSimilarity of two Chinese characters, e.g. QaContains the Chinese character "Tang", and Cg contains the Chinese character "box", and the total spelling probability is respectively obtained by using a formula (3). Calculating Bhattacharyya coefficients of the Tang and the frame to be 0.67 through the formula (2), sequentially accumulating the Bhattacharyya coefficients passing through all points, obtaining an accumulated distance gamma after reaching the end points (a, g), solving the formula to be the formula (4),
equation (4) is a variation of the existing DTW, modified by: the Bhattacharyya coefficient B () is used for replacing a common Euclidean distance, the maximum value is obtained by the searching process, the traditional minimum value is not solved, the maximum value is determined by the Bhattacharyya coefficient characteristic, and the calculation formula is as follows:
γ(i,j)=B(qi,cj)+max{γ(i-1,j-1),γ(i-1,j),γ(i,j-1)} (4)
qirepresents QaThe ith Chinese character of (1), cjIs represented by CgThe jth Chinese character of (1), and B (q)i,cj) ThenRepresenting a computational Chinese character qiAnd cjThe Bhattacharyya coefficient of (a),
finally, when the weight end point (a, g) is reached, the cumulative distance γ (a, g) is obtained, and the formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
and finally, judging whether the two standing book data are the same through a formula (5), wherein the formula is as follows
r is the effective matching times of gamma (i, j), tau is a preset threshold, when the formula (5) is satisfied, the two standing book data are judged to be the same, and if the formula (5) is not satisfied, the two standing book data are judged to be different.
The embodiment of the invention also provides a construction system of the electric energy quality data center, which is applied to the construction method of the electric energy quality data center in any embodiment. The method comprises the following steps:
the system data splitting module is used for splitting the system data into ledger data and production data according to the attribute characteristics;
the standing book data integration module is used for judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data;
and the production data mounting module is used for mounting the production data in the integrated standing book data.
In one embodiment, the system further comprises an account data subset dividing module, and the account data subset dividing module is used for performing subset division on the account data according to a management unit.
In one embodiment, the ledger data integration module includes:
the standing book data single-character similarity calculation unit is used for calculating a single-character Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
the standing book data accumulation distance calculation unit is used for sequentially accumulating coefficients of all the points Bhattacharyya passing through according to a DTW algorithm and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
the standing book data identity judging unit is used for judging whether the two standing book data are the same or not, and the judging formula is as follows:
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
For specific limitations of the power quality data center construction system, reference may be made to the above limitations on the power quality data center construction method, which will not be described herein again. The modules in the construction system of the power quality data center can be wholly or partially realized by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
Referring to fig. 5, an embodiment of the invention provides a computer terminal device, which includes one or more processors and a memory. The memory is coupled to the processor and configured to store one or more programs that, when executed by the one or more processors, cause the one or more processors to implement the method of building a power quality data center as in any of the embodiments described above.
The processor is used for controlling the overall operation of the computer terminal equipment so as to complete all or part of the steps of the construction method of the power quality data center. The memory is used to store various types of data to support the operation at the computer terminal device, which data may include, for example, instructions for any application or method operating on the computer terminal device, as well as application-related data. The Memory may be implemented by any type of volatile or non-volatile Memory device or combination thereof, such as Static Random Access Memory (SRAM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Erasable Programmable Read-Only Memory (EPROM), Programmable Read-Only Memory (PROM), Read-Only Memory (ROM), magnetic Memory, flash Memory, magnetic disk, or optical disk.
In an exemplary embodiment, the computer terminal Device may be implemented by one or more Application Specific 1 integrated circuits (AS 1C), a Digital Signal Processor (DSP), a Digital Signal Processing Device (DSPD), a Programmable Logic Device (PLD), a Field Programmable Gate Array (FPGA), a controller, a microcontroller, a microprocessor or other electronic components, and is configured to perform the above-mentioned method for constructing the power quality data center, and achieve the technical effects consistent with the above-mentioned method.
In another exemplary embodiment, there is also provided a computer readable storage medium comprising program instructions which, when executed by a processor, implement the steps of the method of constructing a power quality data center in any of the above embodiments. For example, the computer readable storage medium may be the above-mentioned memory including program instructions, which are executable by the processor of the computer terminal device to perform the above-mentioned method for constructing the power quality data center, and achieve the technical effects consistent with the above-mentioned method.
According to the electric energy quality data center construction method and the electric energy quality data center construction system, based on the characteristics of a power grid system, a fusion algorithm which is designed according to the fusion framework and the ledger data characteristics of the industry is provided according to the power grid commonalities, so that the matching correctness of single words is improved, the realization of the fusion of ledgers with different lengths can be solved, and the fusion efficiency of ledger data is improved.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention.
Claims (9)
1. A construction method of a power quality data center is characterized by comprising the following steps:
splitting the data of the system into ledger data and production data according to the attribute characteristics;
judging whether the standing book data in each system are the same according to the Bhattacharyya distance and a DTW algorithm so as to integrate the standing book data;
and mounting the production data in the integrated standing book data.
2. The method of constructing a power quality data center according to claim 1,
the system comprises a production management system, a scheduling automation system, a distribution network automation system, a metering automation system, a marketing system, a GIS system, a voltage system and an electric energy quality system.
3. The method for constructing the power quality data center according to claim 1, wherein before judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm, the method further comprises the step of performing subset division on the standing book data according to a management unit.
4. The method for constructing the power quality data center according to claim 1, wherein the step of judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm comprises the steps of:
obtaining a single-word Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
according to the DTW algorithm, sequentially accumulating coefficients of all the points Bhattacharyya passing through, and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
and judging whether the two standing book data are the same or not, wherein the judgment formula is as follows:
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
5. A construction system of a power quality data center is characterized by comprising:
the system data splitting module is used for splitting the system data into ledger data and production data according to the attribute characteristics;
the standing book data integration module is used for judging whether the standing book data of each system are the same according to the Bhattacharyya distance and the DTW algorithm so as to integrate the standing book data;
and the production data mounting module is used for mounting the production data in the integrated standing book data.
6. The system for constructing the power quality data center according to claim 5, further comprising a ledger data subset partitioning module, wherein the ledger data subset partitioning module is configured to perform subset partitioning on the ledger data according to a management unit.
7. The system for building a power quality data center according to claim 5, wherein the ledger data integration module comprises:
the standing book data single-character similarity calculation unit is used for calculating a single-character Bhattacharyya coefficient in the standing book data according to the Bhattacharyya distance;
the standing book data accumulation distance calculation unit is used for sequentially accumulating coefficients of all the points Bhattacharyya passing through according to a DTW algorithm and traversing the standing book data Q in the standing book dataaAnd standing book data CgChinese characters can obtain the cumulative distance gamma (a, g), a represents the standing book QaThe number of words of the name of the middle standing book, g represents the Q of the standing bookgThe number of words of the middle standing account name; the calculation formula is as follows:
γ(a,g)=B(qa,cg)+max{γ(a-1,g-1),γ(a-1,g),γ(a,g-1)}
wherein, B (q)a,cg) Represents the ledger data QaChinese character qaAnd the ledger data CgChinese character cgThe Bhattacharyya coefficient of (a);
the standing book data identity judging unit is used for judging whether the two standing book data are the same or not, and the judging formula is as follows:
wherein r is the effective matching times of gamma (a, g), and tau is a preset threshold.
8. A computer terminal device, comprising:
one or more processors;
a memory coupled to the processor for storing one or more programs;
when executed by the one or more processors, cause the one or more processors to implement the method of constructing a power quality data center of any of claims 1 to 4.
9. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out a method of building a power quality data center according to any one of claims 1 to 4.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2020116106410 | 2020-12-29 | ||
CN202011610641 | 2020-12-29 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112801817A true CN112801817A (en) | 2021-05-14 |
CN112801817B CN112801817B (en) | 2023-07-21 |
Family
ID=75812692
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110123467.5A Active CN112801817B (en) | 2020-12-29 | 2021-01-28 | Electric energy quality data center construction method and system thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112801817B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113610663A (en) * | 2021-07-31 | 2021-11-05 | 云南电网有限责任公司信息中心 | Power grid network frame consistency checking method based on fusion algorithm |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101996631A (en) * | 2009-08-28 | 2011-03-30 | 国际商业机器公司 | Method and device for aligning texts |
CN109710647A (en) * | 2018-12-29 | 2019-05-03 | 广东电网有限责任公司 | A kind of power grid account data fusion method and device based on keyword search |
CN109977188A (en) * | 2019-03-28 | 2019-07-05 | 国网河南省电力公司经济技术研究院 | A kind of multi-specialized data correlation fusion method of gradual power grid and device |
CN111160868A (en) * | 2019-12-31 | 2020-05-15 | 国网北京市电力公司 | Unified distribution network frame topology construction method based on graph fusion technology |
-
2021
- 2021-01-28 CN CN202110123467.5A patent/CN112801817B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101996631A (en) * | 2009-08-28 | 2011-03-30 | 国际商业机器公司 | Method and device for aligning texts |
CN109710647A (en) * | 2018-12-29 | 2019-05-03 | 广东电网有限责任公司 | A kind of power grid account data fusion method and device based on keyword search |
CN109977188A (en) * | 2019-03-28 | 2019-07-05 | 国网河南省电力公司经济技术研究院 | A kind of multi-specialized data correlation fusion method of gradual power grid and device |
CN111160868A (en) * | 2019-12-31 | 2020-05-15 | 国网北京市电力公司 | Unified distribution network frame topology construction method based on graph fusion technology |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113610663A (en) * | 2021-07-31 | 2021-11-05 | 云南电网有限责任公司信息中心 | Power grid network frame consistency checking method based on fusion algorithm |
Also Published As
Publication number | Publication date |
---|---|
CN112801817B (en) | 2023-07-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111797210A (en) | Information recommendation method, device and equipment based on user portrait and storage medium | |
CN108595523B (en) | Equipment data retrieval model construction method and device and computer equipment | |
CN108733810B (en) | Address data matching method and device | |
CN109066687A (en) | A kind of electric power system tide calculation method, system and electronic equipment | |
CN110990390A (en) | Data cooperative processing method and device, computer equipment and storage medium | |
CN105045927A (en) | Automatic coding method and system for data of labor, materials and machines of construction project | |
CN112383044B (en) | Power grid model comparison method and device based on hierarchical topological structure | |
CN113344450A (en) | Low-voltage station area subscriber identification method, system, terminal equipment and storage medium | |
CN112686418A (en) | Method and device for predicting performance timeliness | |
CN113987190A (en) | Data quality check rule extraction method and system | |
CN112801817A (en) | Electric energy quality data center construction method and system | |
CN106022590B (en) | Voltage quality evaluation method and device for active power distribution network | |
CN111984673B (en) | Fuzzy retrieval method and device for tree structure of power grid electric energy metering system | |
CN106127602B (en) | Electricity stealing identification method and device based on reduction outlier algorithm | |
CN105512270B (en) | Method and device for determining related objects | |
CN114385794A (en) | Method, device, equipment and storage medium for generating enterprise knowledge graph | |
CN113778681B (en) | Data processing method and device based on cloud computing and storage medium | |
CN115203281A (en) | Information searching method and device, electronic equipment and storage medium | |
CN114065961A (en) | Intelligent text knowledge management method and system | |
CN114970495A (en) | Name disambiguation method and device, electronic equipment and storage medium | |
CN113742344A (en) | Method and device for indexing power system data | |
CN113065354A (en) | Method for identifying geographic position in corpus and related equipment thereof | |
CN102609510B (en) | Chinese name data processing method and device | |
CN113987164A (en) | Project studying and judging method and device based on domain event knowledge graph | |
CN112861368A (en) | Power distribution network information model construction method and device and terminal equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |