CN111882289B - Device and method for measuring and calculating project data auditing index interval - Google Patents

Device and method for measuring and calculating project data auditing index interval Download PDF

Info

Publication number
CN111882289B
CN111882289B CN202010618175.4A CN202010618175A CN111882289B CN 111882289 B CN111882289 B CN 111882289B CN 202010618175 A CN202010618175 A CN 202010618175A CN 111882289 B CN111882289 B CN 111882289B
Authority
CN
China
Prior art keywords
interval
auditing
data
standard
calculating
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010618175.4A
Other languages
Chinese (zh)
Other versions
CN111882289A (en
Inventor
王艳芹
王林峰
董帧
徐宁
王勇
王冬超
刘云
赵贤
郭振新
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
State Grid Hebei Electric Power Co Ltd
Economic and Technological Research Institute of State Grid Hebei Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
State Grid Hebei Electric Power Co Ltd
Economic and Technological Research Institute of State Grid Hebei Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, State Grid Hebei Electric Power Co Ltd, Economic and Technological Research Institute of State Grid Hebei Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN202010618175.4A priority Critical patent/CN111882289B/en
Publication of CN111882289A publication Critical patent/CN111882289A/en
Application granted granted Critical
Publication of CN111882289B publication Critical patent/CN111882289B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Tourism & Hospitality (AREA)
  • Health & Medical Sciences (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application discloses a device and a method for measuring and calculating project data auditing index intervals, and relates to the technical field of power grid technical transformation; the device comprises a basic data acquisition module, a data cleaning module, a characteristic acquisition module, a calculation auditing interval module, a judging module and a standard auditing interval determining module, wherein the characteristic acquisition module is used for acquiring characteristic attributes of auditing indexes; the calculation auditing interval module is used for measuring and calculating a standard auditing interval of auditing indexes under the characteristic attribute of the monomer engineering type; the judging module is used for judging whether the standard auditing interval is reasonable or not; the method comprises the steps of obtaining basic data, cleaning the data, obtaining characteristics, calculating an auditing interval, judging and determining a standard auditing interval and the like; the high accuracy of measuring and calculating the auditing interval is realized by the basic data acquisition module, the cleaning data module, the characteristic acquisition module, the auditing interval calculation module, the judgment module, the standard auditing interval determination module and the like.

Description

Device and method for measuring and calculating project data auditing index interval
Technical Field
The application relates to the technical field of power grid technical transformation, in particular to a device and a method for measuring and calculating project data auditing index intervals.
Background
In the construction management process of a technical improvement project of a power grid, the cost control is an important link, directly influences the economic benefit and the social benefit of the project and enterprises, and aims to continuously improve the engineering cost control level of the enterprises and develop technical improvement project cost analysis work in a normalized mode every year.
However, when the construction cost analysis work is carried out, a large amount of project data needs to be filled, the filling error, the filling omission and the like are easy to occur in the filling process, in order to ensure the normal operation of the construction cost analysis work, workers often need to spend a large amount of time to test the data filled in each project unit, the accuracy and the rationality of the data test are greatly dependent on the rationality of the data interval value setting, and the accuracy of the manual measurement and verification interval is insufficient.
Problems and considerations in the prior art:
how to solve the technical problem of insufficient accuracy of manual measurement and calculation of the auditing interval.
Disclosure of Invention
The application aims to solve the technical problem of providing a device and a method for measuring and calculating project data auditing index interval, which realize high accuracy of measuring and calculating auditing interval through a basic data acquisition module, a data cleaning module, a characteristic acquisition module, a auditing interval calculation module, a judgment module, a standard auditing interval determination module and the like.
In order to solve the technical problems, the application adopts the following technical scheme: the device for measuring and calculating the project data audit index interval comprises a basic data acquisition module, a cleaning data module, an acquisition characteristic module, a calculation audit interval module, a judgment module and a standard audit interval determination module, wherein the basic data acquisition module is used for acquiring production technical improvement cost analysis data and taking the production technical improvement cost analysis data as measured and calculated basic data; the cleaning data module is used for selecting the monomer engineering type to be measured and calculated and cleaning the basic data; the characteristic acquisition module is used for acquiring characteristic attributes of the auditing indexes; the calculation auditing interval module is used for measuring and calculating a standard auditing interval of auditing indexes under the characteristic attribute of the monomer engineering type; the judging module is used for judging whether the standard auditing interval is reasonable or not; and the standard auditing interval determining module is used for forming a standard auditing interval list according to the judging result.
The further technical proposal is that: the cleaning data module is also used for carrying out data cleaning on the basic data and analyzing based on judgment and dispersion; judging abnormal engineering data in the basic data; the dispersion analysis comprises the steps of calculating unit capacity cost and term cost proportion of the single engineering type, and eliminating abnormal values in basic data based on the Grabbs criterion.
The further technical proposal is that: the characteristic acquisition module is also used for extracting and acquiring characteristic attributes of the auditing indexes based on the information gain; extracting based on the information gain comprises obtaining project characteristic attributes in basic data, calculating and obtaining information gain sequencing of the characteristic attributes based on information entropy, eliminating the characteristic attributes smaller than a threshold value based on a set engineering experience threshold value, and obtaining the most effective characteristic attributes of the audit indexes.
The further technical proposal is that: the calculation auditing interval module is also used for measuring and calculating a standard auditing interval of auditing indexes under the attribute of the characteristics of the monomer engineering type based on the naive Bayes basic model; the standard auditing interval for measuring and calculating the auditing index under the characteristic attribute of the monomer engineering type comprises the steps of obtaining basic data after data cleaning, obtaining the characteristic attribute extracted based on the information gain, substituting the basic data and the characteristic attribute into a naive Bayesian basic model, and obtaining the standard auditing interval of the auditing index.
The further technical proposal is that: the standard auditing interval determining module is further used for obtaining the standard auditing interval as a final standard auditing interval if the standard auditing interval is judged to be reasonable; if the standard auditing interval is not reasonable, the standard auditing interval is adjusted, and the adjusted interval value is used as the final standard auditing interval.
The method for measuring and calculating the project data auditing index interval comprises the following steps:
acquiring basic data, acquiring production technology improvement cost analysis data and taking the data as basic data for measurement and calculation;
cleaning data, namely selecting a monomer engineering type to be measured and calculated, and cleaning the basic data;
acquiring characteristics and characteristic attributes of audit indexes;
calculating an audit interval, and measuring and calculating a standard audit interval of audit indexes under the characteristic attribute of the monomer engineering type;
judging whether a judgment standard auditing interval is reasonable or not;
and determining a standard auditing interval, and forming a standard auditing interval list according to the judging result.
The further technical proposal is that: in the step of cleaning the data, data cleaning is performed on the basic data based on judgment and dispersion analysis; judging abnormal engineering data in the basic data; the dispersion analysis comprises the steps of calculating the unit capacity cost and the term cost ratio of the monomer engineering type, and eliminating abnormal values in the basic data based on the Grabbs criterion.
The further technical proposal is that: in the step of acquiring the characteristics, extracting and acquiring characteristic attributes of the auditing indexes based on the information gain; the information gain based extraction comprises the following steps of obtaining item feature attributes in basic data; calculating and acquiring information gain sequencing of the characteristic attribute based on the information entropy; based on the set engineering experience threshold, eliminating the characteristic attribute smaller than the threshold; and obtaining the most effective characteristic attribute of the auditing index.
The further technical proposal is that: in the step of calculating the auditing interval, a standard auditing interval of auditing indexes under the attribute of the characteristics of the monomer engineering type is calculated based on a naive Bayesian basic model; the standard auditing interval for measuring and calculating auditing indexes under the attribute of the characteristics of the single engineering type comprises the following steps of obtaining basic data after data cleaning; acquiring characteristic attributes extracted based on information gain; substituting the basic data and the characteristic attribute into a naive Bayes basic model to obtain a standard auditing interval of the auditing index.
The further technical proposal is that: in the step of determining the standard auditing interval, if the standard auditing interval is judged to be reasonable, the standard auditing interval is obtained and is used as a final standard auditing interval; if the standard auditing interval is not reasonable, the standard auditing interval is adjusted, and the adjusted interval value is used as the final standard auditing interval.
The beneficial effects of adopting above-mentioned technical scheme to produce lie in:
the device for measuring and calculating the project data audit index interval comprises a basic data acquisition module, a cleaning data module, an acquisition characteristic module, a calculation audit interval module, a judgment module and a standard audit interval determination module, wherein the basic data acquisition module is used for acquiring production technical improvement cost analysis data and taking the production technical improvement cost analysis data as measured and calculated basic data; the cleaning data module is used for selecting the monomer engineering type to be measured and calculated and cleaning the basic data; the characteristic acquisition module is used for acquiring characteristic attributes of the auditing indexes; the calculation auditing interval module is used for measuring and calculating a standard auditing interval of auditing indexes under the characteristic attribute of the monomer engineering type; the judging module is used for judging whether the standard auditing interval is reasonable or not; and the standard auditing interval determining module is used for forming a standard auditing interval list according to the judging result. The high accuracy of measuring and calculating the auditing interval is realized by the basic data acquisition module, the cleaning data module, the characteristic acquisition module, the auditing interval calculation module, the judgment module, the standard auditing interval determination module and the like.
The method for measuring and calculating the project data auditing index interval comprises the following steps: acquiring basic data, acquiring production technology improvement cost analysis data and taking the data as basic data for measurement and calculation; cleaning data, namely selecting a monomer engineering type to be measured and calculated, and cleaning the basic data; acquiring characteristics and characteristic attributes of audit indexes; calculating an audit interval, and measuring and calculating a standard audit interval of audit indexes under the characteristic attribute of the monomer engineering type; judging whether a judgment standard auditing interval is reasonable or not; and determining a standard auditing interval, and forming a standard auditing interval list according to the judging result. The method realizes high accuracy in measuring and calculating the auditing interval through the steps of acquiring basic data, cleaning data, acquiring characteristics, calculating the auditing interval, judging and determining the standard auditing interval and the like.
See the description of the detailed description section.
Drawings
FIG. 1 is a schematic block diagram of embodiment 1 of the present application;
FIG. 2 is a flow chart of embodiment 2 of the present application;
fig. 3 is a flowchart of a naive bayes algorithm in embodiment 2 of the present application;
FIG. 4 is a screen shot of the results of a naive Bayesian classifier in example 2 of the present application;
fig. 5 is a schematic block diagram of embodiment 3 of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. The following description of at least one exemplary embodiment is merely exemplary in nature and is in no way intended to limit the application, its application, or uses. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present application, but the present application may be practiced in other ways other than those described herein, and persons skilled in the art will readily appreciate that the present application is not limited to the specific embodiments disclosed below.
Example 1:
as shown in FIG. 1, the application discloses a device for measuring and calculating project data auditing index intervals, which comprises a basic data acquisition module, a data cleaning module, a characteristic acquisition module, a auditing interval calculation module, a judgment module and a standard auditing interval determination module.
The basic data acquisition module is used for acquiring production technology improvement cost analysis data and taking the production technology improvement cost analysis data as basic data for measurement and calculation.
The cleaning data module is used for selecting the monomer engineering type to be measured and calculated and cleaning the basic data; performing data cleaning on the basic data based on judgment and dispersion analysis; judging abnormal engineering data in the basic data; the dispersion analysis comprises the steps of calculating unit capacity cost and term cost proportion of the single engineering type, and eliminating abnormal values in basic data based on the Grabbs criterion.
The characteristic acquisition module is used for acquiring characteristic attributes of the auditing indexes; extracting and obtaining characteristic attributes of the auditing indexes based on the information gain; extracting based on the information gain comprises obtaining project characteristic attributes in basic data, calculating and obtaining information gain sequencing of the characteristic attributes based on information entropy, eliminating the characteristic attributes smaller than a threshold value based on a set engineering experience threshold value, and obtaining the most effective characteristic attributes of the audit indexes.
The calculation auditing interval module is used for measuring and calculating a standard auditing interval of auditing indexes under the characteristic attribute of the monomer engineering type; measuring and calculating a standard auditing interval of auditing indexes under the attribute of the characteristics of the monomer engineering type based on a naive Bayes basic model; the standard auditing interval for measuring and calculating the auditing index under the characteristic attribute of the monomer engineering type comprises the steps of obtaining basic data after data cleaning, obtaining the characteristic attribute extracted based on the information gain, substituting the basic data and the characteristic attribute into a naive Bayesian basic model, and obtaining the standard auditing interval of the auditing index.
The judging module is used for judging whether the standard auditing interval is reasonable or not.
The standard auditing interval determining module is used for forming a standard auditing interval list according to the judging result; if the standard auditing interval is judged to be reasonable, the standard auditing interval is obtained and is used as a final standard auditing interval; if the standard auditing interval is not reasonable, the standard auditing interval is adjusted, and the adjusted interval value is used as the final standard auditing interval.
Description of the technical scheme in example 1:
as shown in fig. 1, includes:
the basic data acquisition module is used for acquiring the construction cost analysis data of the annual production technology improvement as the basic data of measurement and calculation.
And the cleaning data module is used for selecting the monomer engineering type to be measured and calculated and cleaning the basic data based on a specific data algorithm.
And the characteristic acquisition module is used for acquiring the characteristic attribute of the auditing index based on a specific data algorithm.
And the calculation auditing interval module is used for measuring and calculating the standard auditing interval of the auditing index under the characteristic attribute of the monomer engineering type based on a specific mathematical algorithm.
And the judging module is used for judging whether the standard auditing interval is reasonable or not by an expert.
And the standard auditing interval determining module is used for forming a final standard auditing interval list according to the expert judging result.
After the scheme is adopted, the power grid company summarizes years of data verification experience, a big data analysis method is utilized, the data index rule information of each item is deeply mined based on the technical improvement engineering data of the past year, the range of the data auditing interval is calculated, the cost of manually calculating the auditing interval is effectively reduced, the auditing efficiency and accuracy of the manufacturing cost analysis data are improved, and the working efficiency of staff is improved: according to basic data of analysis of the manufacturing cost of the technical improvement of production in the past, characteristic attributes affecting the auditing indexes and auditing intervals of the auditing indexes are measured and calculated, and compared with the conventional manual measurement and calculation, the working efficiency can be greatly improved, and the working efficiency of staff is improved. The reliability of the data auditing interval is improved: the standard auditing interval can be more accurately and effectively obtained based on the algorithm, the condition of negligence and error leakage does not exist, and the accuracy of the interval can be ensured compared with the traditional manual measurement and calculation.
Example 2:
as shown in fig. 2, the application discloses a method for measuring and calculating a project data auditing index interval, which comprises the following steps:
s1, acquiring basic data
And acquiring production technical improvement cost analysis data serving as basic data for measurement and calculation.
S2 cleaning data
And selecting a monomer engineering type to be measured and calculated, and cleaning the basic data.
In the step of S2 cleaning the data, the data cleaning of the base data is based on the decision and the dispersion analysis.
The judging step is to eliminate abnormal engineering data in the basic data.
The dispersion analysis step comprises the steps of calculating unit capacity cost and term cost proportion of the monomer engineering type, and eliminating abnormal values in the basic data based on the Grabbs criterion.
S3 acquisition of features
And obtaining the characteristic attribute of the auditing index.
In the step of acquiring the features in S3, feature attributes of the audit trail are extracted and acquired based on the information gain.
The gain extraction based on the information comprises the steps of,
and acquiring the characteristic attribute of the item in the basic data.
And calculating and acquiring the information gain ordering of the characteristic attributes based on the information entropy.
And eliminating the characteristic attribute smaller than the threshold value based on the set engineering experience threshold value.
And obtaining the most effective characteristic attribute of the auditing index.
S4, calculating an audit interval
And measuring and calculating a standard auditing interval of auditing indexes under the attribute of the characteristics of the monomer engineering type.
And S4, calculating a standard auditing interval of auditing indexes under the characteristic attribute of the monomer engineering type based on a naive Bayesian basic model.
The standard auditing interval for measuring and calculating the auditing index under the attribute of the characteristics of the monomer engineering type comprises the following steps,
and acquiring the basic data after data cleaning.
The feature attributes extracted based on the information gain are acquired.
Substituting the basic data and the characteristic attribute into a naive Bayes basic model to obtain a standard auditing interval of the auditing index.
S5 judgment
And judging whether the standard auditing interval is reasonable or not.
S6, determining a standard auditing interval
And forming a standard auditing interval list according to the judging result.
In the step of determining the standard auditing interval in S6, if the expert judges that the standard auditing interval is reasonable, the standard auditing interval is obtained and is used as a final standard auditing interval.
If the expert judges that the standard auditing interval is unreasonable, the standard auditing interval is adjusted, and the adjusted interval value is used as a final standard auditing interval.
Description of the technical scheme in example 2:
as shown in fig. 2, the method comprises the following steps:
step S1, acquiring analysis data of the manufacturing cost of the technical improvement of the production in the past year, and taking the analysis data as basic data of measurement and calculation.
Specifically, the technical improvement cost analysis data of the production technology is selected through multidimensional inquiry, the information such as the voltage level, the completion time, the engineering type, the equipment name, the equipment code, the rated capacity, the equipment type, the equipment unit price, the sub-item cost, the total investment, the project management and the construction unit of the production technology improvement project is extracted, and standard auditing interval measurement and calculation are carried out on the basis of the selected data.
And S2, selecting a monomer engineering type to be measured and calculated, and cleaning the basic data based on a specific data algorithm.
Specifically, for example, the selected monomer engineering type is to replace a main transformer, and abnormal engineering data is removed based on expert judgment and dispersion analysis.
And step S3, acquiring characteristic attributes of the auditing indexes based on a specific data algorithm.
Specifically, numerous characteristic attributes of production technical improvement project information are extracted, some attributes belong to redundancy attributes or noise attributes in the process of measuring and calculating a data auditing interval, information gain is introduced to perform characteristic selection, and the characteristic attribute which is most useful for sample classification is selected through calculation.
And S4, measuring and calculating a standard auditing interval of the auditing index under the characteristic attribute of the monomer engineering type based on a specific mathematical algorithm.
Specifically, after all data information of the monomer engineering is removed according to the Grabbs criterion, characteristic attributes are extracted by utilizing information gain, and standard auditing intervals of auditing indexes are calculated based on a naive Bayesian basic model.
And S5, judging whether the standard auditing interval is reasonable or not by the expert.
Specifically, the measurement accuracy can reach more than 90%. When the manual auditing is considered to be correct, the engineering is considered to be an abnormal engineering, and the reason for the abnormal engineering is found through analysis because the engineering has special actual conditions and fluctuation of the auditing interval. The expert considers the fluctuation to be within a reasonable range, and therefore needs to correct this audit interval.
And S6, forming a final standard auditing interval list according to the expert judging result.
Specifically, if the expert judges that the standard auditing interval is reasonable, the standard auditing interval is obtained and is used as a final standard auditing interval; if the expert judges that the standard auditing interval is unreasonable, the standard auditing interval is adjusted based on engineering experience, and the adjusted interval value is used as a final standard auditing interval.
Furthermore, in one specific instance, the data cleansing of the underlying data is based on expert decisions and dispersion analysis.
The expert determination includes rejecting the special engineering data from the base data:
specifically, in order to ensure the validity of the data and the universality of the data result, the expert judges that the data of special engineering content engineering such as equipment utilization engineering, equipment factory return engineering, remote special area engineering and disaster-containing reconstruction are removed first.
The dispersion analysis includes:
calculating the unit capacity cost and the term cost ratio of the monomer engineering type;
and eliminating abnormal values in the basic data based on a Grabbs criterion.
Specifically, the unit capacity of all single projects in one project type, namely single project and single kilometer project, and the manufacturing cost level is c 1 ,c 2 ,c 3 ......c n Taking the unit capacity manufacturing cost of 110kV and 50MVA transformer single engineering as an example, the single engineering data of all production technical improvements 110kV and 50MVA transformers in the range of the traditional national network are extracted for calculation and analysis, and the relevant data to be analyzed are shown in table 1, namely the single engineering details of the 110kV50MVA transformers:
table 1:110kV50MVA transformer monomer engineering table
Analysis of the data in Table 1, calculation to obtain the Grabbs critical value, the significant level α of 0.05, yields the unit cost of the abnormal monomer engineering, as shown in Table 2.
Table 2: single engineering list with abnormal unit capacity cost
Furthermore, in a specific case, the feature attribute of the audit trail is extracted based on information gain.
The extracting based on the information gain includes:
acquiring item feature attributes in the basic data:
based on information entropy calculation, obtaining information gain ordering of each characteristic attribute;
based on the set engineering experience threshold, eliminating the characteristic attribute smaller than the threshold;
and obtaining the most effective characteristic attribute of the audit indexes.
Specifically, the production engineering improvement project information is extracted, which includes various characteristic attributes such as voltage class, completion time, engineering type, equipment name, equipment code, rated capacity, equipment type, equipment unit price, sub-term cost, total investment, project management and construction unit, etc., the information entropy H (C) of each category is calculated, and in event f i The following conditional entropy H (C|f i ). And obtaining information gain between the classification and the feature.
IG(C,f i )=H(C)-H(C|f i ) (1)
Wherein,
for example: based on the selected test sample data, the gain measurement results of each attribute information are shown in table 3.
Table 3: attribute information gain measuring and calculating result table
Setting a threshold value T=0.4, namely engineering experience value, taking the information gain value larger than the threshold value as a classification characteristic attribute, eliminating the characteristic attribute smaller than the threshold value, and finally determining the characteristic attribute as voltage class, rated capacity and equipment unit price through characteristic selection.
In addition, in a specific instance, a standard auditing interval for measuring and calculating auditing indexes under the attribute of the features of the monomer engineering type is based on a naive Bayesian basic model;
the standard auditing interval for measuring and calculating the auditing index under the characteristic attribute of the monomer engineering type comprises the following steps:
acquiring the basic data after the data are cleaned;
acquiring the characteristic attribute extracted based on the information gain;
substituting the basic data and the characteristic attribute into a naive Bayesian basic model to obtain a standard auditing interval of the auditing index.
As shown in fig. 3, specifically, a naive bayes algorithm calculation using 110kV and 50MVA transformers as samples is shown.
As shown in fig. 4, the results were measured using a naive bayes classifier.
Therefore, the standard reference interval for auditing the unit capacity manufacturing level data of the single engineering of the 110kV and 50MVA transformers can be determined to be [42,60].
Other monomer engineering measurement and calculation are also the same, and are all within the protection scope of the scheme.
Furthermore, in one specific case, the method further comprises:
if the expert judges that the standard auditing interval is reasonable, the standard auditing interval is obtained and is used as a final standard auditing interval;
if the expert judges that the standard auditing interval is unreasonable, the standard auditing interval is adjusted based on engineering experience, and the adjusted interval value is used as a final standard auditing interval.
After the scheme is adopted, the power grid company summarizes years of data verification experience, a big data analysis method is utilized, the data index rule information of each item is deeply mined based on the technical improvement engineering data of the past year, the range of the data auditing interval is calculated, the cost of manually calculating the auditing interval is effectively reduced, the auditing efficiency and accuracy of the manufacturing cost analysis data are improved, and the working efficiency of staff is improved: according to basic data of analysis of the manufacturing cost of the technical improvement of production in the past, characteristic attributes affecting the auditing indexes and auditing intervals of the auditing indexes are measured and calculated, and compared with the conventional manual measurement and calculation, the working efficiency can be greatly improved, and the working efficiency of staff is improved. The reliability of the data auditing interval is improved: the standard auditing interval can be more accurately and effectively obtained based on the algorithm, the condition of negligence and error leakage does not exist, and the accuracy of the interval can be ensured compared with the traditional manual measurement and calculation.
Example 3:
as shown in FIG. 5, the application discloses a device for measuring and calculating project data auditing index intervals, which comprises a basic data acquisition module, a data cleaning module, a characteristic acquisition module, a auditing interval calculation module, a judgment module and a standard auditing interval determination module.
A processor 30, a memory 31 and a computer program 32 stored in the memory 31 and executable on the processor 30, the computer program 32 being the program modules in embodiment 1. The steps of embodiment 2 are implemented when the processor 30 executes the computer program 32.
Example 3 description of the technical scheme:
as shown in fig. 5, includes: a processor 30, a memory 31, and a computer program 32, such as an audit trail interval measuring program, stored in the memory 31 and executable on the processor 30. The steps of embodiment 2 are implemented by the processor 30 when executing the computer program 32.
Illustratively, the computer program 32 may be partitioned into one or more modules/units that are stored in the memory 31 and executed by the processor 30 to complete the present application. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions for describing the execution of the computer program 32 in the terminal device 3.
Acquiring analysis data of manufacturing cost of technical improvement of production in the past year as basic data of measurement and calculation;
selecting a monomer engineering type to be measured and calculated, and cleaning the data of the basic data based on a specific data algorithm;
based on a specific data algorithm, acquiring characteristic attributes of the auditing indexes;
based on a specific mathematical algorithm, measuring and calculating a standard auditing interval of auditing indexes under the characteristic attribute of the monomer engineering type;
judging whether the standard auditing interval is reasonable or not by an expert;
and forming a final standard auditing interval list according to the expert judging result.
The terminal device 3 may be a computing device such as a desktop computer, a notebook computer, a palm computer, or a cloud server. The terminal device 3 may include, but is not limited to, a processor 30, a memory 31. It will be appreciated by those skilled in the art that the terminal device 3 is merely an example and does not constitute a limitation of the terminal device 3, and may comprise more or less components than shown, or may combine certain components, or different components, e.g. the terminal device 3 may further comprise input and output devices, network access devices, buses, etc.
The processor 30 may be a central processing unit Central Processing Unit, a CPU, or other general purpose processor, a digital signal processor (Digital Signal Processor, DSP, application specific integrated circuit Application Specific Integrated Circuit, ASIC, off-the-shelf programmable gate array Field-Programmable Gate Array, FPGA or other programmable logic device, discrete gate or transistor logic, discrete hardware components, etc., or any conventional processor.
The memory 31 may be an internal storage unit of the terminal device 3, such as a hard disk or a memory of the terminal device 3. The memory 31 may be an external storage device of the terminal device 3, for example, a plug-in hard disk, a Smart Media Card, an SMC, a Secure Digital (Secure Digital), an SD Card, a Flash Card, or the like, which are provided on the terminal device 3. Further, the memory 31 may also include both an internal storage unit and an external storage device of the terminal device 3. The memory 31 is used for storing the computer program and other programs and data required for the calculation of the audit interval of the audit indexes. The memory 31 may also be used for temporarily storing data that has been output or is to be output.
In the foregoing embodiments, the descriptions of the embodiments are emphasized, and in part, not described or illustrated in one embodiment, reference may be made to the related descriptions of other embodiments.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
In the embodiments provided in the present application, it should be understood that the disclosed method may be implemented in other manners. For example, the above-described examples are merely illustrative, e.g., the division of the modules or units is merely a logical division, and there may be additional divisions in actual implementation, e.g., multiple units or components may be combined or integrated into another system, or some features may be omitted, or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection via interfaces, devices or units, which may be in electrical, mechanical or other forms.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present application may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated modules/units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the present application may implement all or part of the flow of the method of the above embodiment, or may be implemented by a computer program to instruct related hardware, where the computer program may be stored in a computer readable storage medium, and when the computer program is executed by a processor, the computer program may implement the steps of each of the method embodiments described above. Wherein the computer program comprises computer program code which may be in source code form, object code form, executable file or some intermediate form etc. The computer readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory ROM, a random access Memory RAM, random Access Memory, an electrical carrier signal, a telecommunication signal, a software distribution medium, and so forth. It should be noted that the computer readable medium may include content that is subject to appropriate increases and decreases as required by jurisdictions in which such content is subject to legislation and patent practice, such as in certain jurisdictions in which such content is not included as electrical carrier signals and telecommunication signals.
The above embodiments are only for illustrating the technical solution of the present application, and not for limiting the same; although the application has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present application, and are intended to be included in the scope of the present application.
After the application is operated for a period of time in a secret way, the feedback of field technicians is beneficial in that:
by the scheme, the cost of manual measurement and calculation of the auditing interval can be effectively reduced, and the auditing efficiency of the cost analysis data is improved.
The power grid company summarizes years of data verification experience, utilizes a big data analysis method, proposes to deeply mine various data index rule information based on the technical improvement engineering data of the past year, calculates and calculates the range of the data auditing interval, effectively reduces the cost of manually calculating and checking the interval, improves the auditing efficiency and accuracy of the manufacturing cost analysis data, and improves the working efficiency of staff.

Claims (8)

1. The utility model provides a device of project data audit index interval measurement and calculation which characterized in that: the method comprises a basic data acquisition module, a cleaning data module, a characteristic acquisition module, a calculation auditing interval module, a judging module and a standard auditing interval determining module, wherein the basic data acquisition module is used for acquiring production technical improvement cost analysis data and taking the production technical improvement cost analysis data as basic data for measurement and calculation; the cleaning data module is used for selecting the monomer engineering type to be measured and calculated and cleaning the basic data; the characteristic acquisition module is used for acquiring characteristic attributes of the auditing indexes; the calculation auditing interval module is used for measuring and calculating a standard auditing interval of auditing indexes under the characteristic attribute of the monomer engineering type; the judging module is used for judging whether the standard auditing interval is reasonable or not; the standard auditing interval determining module is used for forming a standard auditing interval list according to the judging result;
the calculation auditing interval module is also used for measuring and calculating a standard auditing interval of auditing indexes under the attribute of the characteristics of the monomer engineering type based on the naive Bayes basic model; the standard auditing interval for measuring and calculating auditing indexes under the characteristic attribute of the monomer engineering type comprises the steps of acquiring basic data after data cleaning, acquiring the characteristic attribute extracted based on information gain, substituting the basic data and the characteristic attribute into a naive Bayesian basic model, and acquiring the standard auditing interval of the auditing indexes;
the step of extracting the standard auditing interval obtained according to the naive Bayesian basic model comprises the following steps:
determining characteristic attributes and category attributes of a sample, wherein the characteristic attributes comprise voltage level, rated capacity and unit price of equipment, and the category attributes comprise unit capacity cost;
selecting a training sample according to a ten-fold cross validation method;
dividing the unit cost of the category attribute into five possible interval categories according to the data distribution, and calculating the probability P (C) of occurrence of the unit cost interval for each category k );
Calculating conditional probability P (X) i /C k );
Calculating a unit capacity cost interval P (C k )P(X i /C k )
In unit capacity cost intervals P (C k )P(X i /C k ) The largest item in (3) is taken as the category to which the sample belongs, namely the unit capacity cost checking interval.
2. The apparatus for project data audit trail interval measurement and calculation according to claim 1 wherein: the cleaning data module is also used for carrying out data cleaning on the basic data and analyzing based on judgment and dispersion; judging abnormal engineering data in the basic data; the dispersion analysis comprises the steps of calculating unit capacity cost and term cost proportion of the single engineering type, and eliminating abnormal values in basic data based on the Grabbs criterion.
3. The apparatus for project data audit trail interval measurement and calculation according to claim 1 wherein: the characteristic acquisition module is also used for extracting and acquiring characteristic attributes of the auditing indexes based on the information gain; extracting based on the information gain comprises obtaining project characteristic attributes in basic data, calculating and obtaining information gain sequencing of the characteristic attributes based on information entropy, eliminating the characteristic attributes smaller than a threshold value based on a set engineering experience threshold value, and obtaining the most effective characteristic attributes of the audit indexes.
4. The apparatus for project data audit trail interval measurement and calculation according to claim 1 wherein: the standard auditing interval determining module is further used for obtaining the standard auditing interval as a final standard auditing interval if the standard auditing interval is judged to be reasonable; if the standard auditing interval is not reasonable, the standard auditing interval is adjusted, and the adjusted interval value is used as the final standard auditing interval.
5. A method for measuring and calculating project data auditing index interval is characterized by comprising the following steps: the method comprises the following steps:
acquiring basic data, acquiring production technology improvement cost analysis data and taking the data as basic data for measurement and calculation;
cleaning data, namely selecting a monomer engineering type to be measured and calculated, and cleaning the basic data;
acquiring characteristics and characteristic attributes of audit indexes;
calculating an audit interval, and measuring and calculating a standard audit interval of audit indexes under the characteristic attribute of the monomer engineering type;
in the step of calculating the auditing interval, a standard auditing interval of auditing indexes under the attribute of the characteristics of the monomer engineering type is calculated based on a naive Bayesian basic model; the standard auditing interval for measuring and calculating auditing indexes under the attribute of the characteristics of the single engineering type comprises the following steps of obtaining basic data after data cleaning; acquiring characteristic attributes extracted based on information gain; substituting the basic data and the characteristic attribute into a naive Bayes basic model to obtain a standard auditing interval of an auditing index;
the step of extracting the standard auditing interval obtained according to the naive Bayesian basic model comprises the following steps:
determining characteristic attributes and category attributes of a sample, wherein the characteristic attributes comprise voltage level, rated capacity and unit price of equipment, and the category attributes comprise unit capacity cost;
selecting a training sample according to a ten-fold cross validation method;
dividing the unit cost of the category attribute into five possible interval categories according to the data distribution, and calculating the probability P (C) of occurrence of the unit cost interval for each category k );
Calculating conditional probability P (X) i /C k );
Calculating a unit capacity cost interval P (C k )P(X i /C k )
In unit capacity cost intervals P (C k )P(X i /C k ) The largest item in (a) is taken as the category to which the sample belongs, namely a unit capacity cost checking interval;
judging whether a judgment standard auditing interval is reasonable or not;
and determining a standard auditing interval, and forming a standard auditing interval list according to the judging result.
6. The method for measuring and calculating project data audit trail interval according to claim 5, characterized in that: in the step of cleaning the data, data cleaning is performed on the basic data based on judgment and dispersion analysis; judging abnormal engineering data in the basic data; the dispersion analysis comprises the steps of calculating the unit capacity cost and the term cost ratio of the monomer engineering type, and eliminating abnormal values in the basic data based on the Grabbs criterion.
7. The method for measuring and calculating project data audit trail interval according to claim 5, characterized in that: in the step of acquiring the characteristics, extracting and acquiring characteristic attributes of the auditing indexes based on the information gain; the information gain based extraction comprises the following steps of obtaining item feature attributes in basic data; calculating and acquiring information gain sequencing of the characteristic attribute based on the information entropy; based on the set engineering experience threshold, eliminating the characteristic attribute smaller than the threshold; and obtaining the most effective characteristic attribute of the auditing index.
8. The method for measuring and calculating project data audit trail interval according to claim 5, characterized in that: in the step of determining the standard auditing interval, if the standard auditing interval is judged to be reasonable, the standard auditing interval is obtained and is used as a final standard auditing interval; if the standard auditing interval is not reasonable, the standard auditing interval is adjusted, and the adjusted interval value is used as the final standard auditing interval.
CN202010618175.4A 2020-07-01 2020-07-01 Device and method for measuring and calculating project data auditing index interval Active CN111882289B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010618175.4A CN111882289B (en) 2020-07-01 2020-07-01 Device and method for measuring and calculating project data auditing index interval

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010618175.4A CN111882289B (en) 2020-07-01 2020-07-01 Device and method for measuring and calculating project data auditing index interval

Publications (2)

Publication Number Publication Date
CN111882289A CN111882289A (en) 2020-11-03
CN111882289B true CN111882289B (en) 2023-11-14

Family

ID=73157348

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010618175.4A Active CN111882289B (en) 2020-07-01 2020-07-01 Device and method for measuring and calculating project data auditing index interval

Country Status (1)

Country Link
CN (1) CN111882289B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113326254A (en) * 2021-06-18 2021-08-31 立信(重庆)数据科技股份有限公司 Research data cleaning method and system

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007188285A (en) * 2006-01-13 2007-07-26 Hitachi Ltd Method and system for setting threshold
CN104572391A (en) * 2013-10-16 2015-04-29 深圳市腾讯计算机系统有限公司 Monitoring alarm strategy collocation method and device and monitoring alarm method and device
CN107070941A (en) * 2017-05-05 2017-08-18 北京匡恩网络科技有限责任公司 The method and apparatus of abnormal traffic detection
CN107909472A (en) * 2017-12-08 2018-04-13 上海壹账通金融科技有限公司 Management data checking method, device, equipment and computer-readable recording medium
CN108809716A (en) * 2018-06-12 2018-11-13 郑州云海信息技术有限公司 A kind of method and apparatus of gathered data audit
CN109360109A (en) * 2018-09-26 2019-02-19 平安医疗健康管理股份有限公司 Intelligent Claims Review householder method and system based on big data analysis and processing
CN109740869A (en) * 2018-12-14 2019-05-10 平安国际融资租赁有限公司 Data checking method, device, computer equipment and storage medium
CN110046633A (en) * 2018-11-23 2019-07-23 阿里巴巴集团控股有限公司 A kind of data quality checking method and device
CN110059982A (en) * 2019-04-29 2019-07-26 国网经济技术研究院有限公司 Technological transformation project method for quantitatively evaluating and system are administered in a kind of lightning protection
CN110247911A (en) * 2019-06-14 2019-09-17 曹严清 A kind of Traffic anomaly detection method and system
CN110287081A (en) * 2019-06-21 2019-09-27 腾讯科技(成都)有限公司 A kind of service monitoring system and method
CN110705998A (en) * 2019-09-02 2020-01-17 深圳壹账通智能科技有限公司 Block chain based information auditing method and device, electronic equipment and storage medium
CN110837011A (en) * 2019-09-27 2020-02-25 深圳市火乐科技发展有限公司 Abnormal data processing method and device, intelligent projector and storage medium
CN110889646A (en) * 2019-12-06 2020-03-17 华北电力大学 Method for checking correctness of operating parameters of generator set adapting to working condition change
CN110889451A (en) * 2019-11-26 2020-03-17 Oppo广东移动通信有限公司 Event auditing method and device, terminal equipment and storage medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW500989B (en) * 2001-03-07 2002-09-01 Macronix Int Co Ltd Automatic auditing system and method for machine production parameters
US10185634B2 (en) * 2016-09-07 2019-01-22 Facebook, Inc. Detecting data loss and corruption of data processed by a data processing system of an online system during a specified time interval

Patent Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007188285A (en) * 2006-01-13 2007-07-26 Hitachi Ltd Method and system for setting threshold
CN104572391A (en) * 2013-10-16 2015-04-29 深圳市腾讯计算机系统有限公司 Monitoring alarm strategy collocation method and device and monitoring alarm method and device
CN107070941A (en) * 2017-05-05 2017-08-18 北京匡恩网络科技有限责任公司 The method and apparatus of abnormal traffic detection
CN107909472A (en) * 2017-12-08 2018-04-13 上海壹账通金融科技有限公司 Management data checking method, device, equipment and computer-readable recording medium
CN108809716A (en) * 2018-06-12 2018-11-13 郑州云海信息技术有限公司 A kind of method and apparatus of gathered data audit
CN109360109A (en) * 2018-09-26 2019-02-19 平安医疗健康管理股份有限公司 Intelligent Claims Review householder method and system based on big data analysis and processing
CN110046633A (en) * 2018-11-23 2019-07-23 阿里巴巴集团控股有限公司 A kind of data quality checking method and device
CN109740869A (en) * 2018-12-14 2019-05-10 平安国际融资租赁有限公司 Data checking method, device, computer equipment and storage medium
CN110059982A (en) * 2019-04-29 2019-07-26 国网经济技术研究院有限公司 Technological transformation project method for quantitatively evaluating and system are administered in a kind of lightning protection
CN110247911A (en) * 2019-06-14 2019-09-17 曹严清 A kind of Traffic anomaly detection method and system
CN110287081A (en) * 2019-06-21 2019-09-27 腾讯科技(成都)有限公司 A kind of service monitoring system and method
CN110705998A (en) * 2019-09-02 2020-01-17 深圳壹账通智能科技有限公司 Block chain based information auditing method and device, electronic equipment and storage medium
CN110837011A (en) * 2019-09-27 2020-02-25 深圳市火乐科技发展有限公司 Abnormal data processing method and device, intelligent projector and storage medium
CN110889451A (en) * 2019-11-26 2020-03-17 Oppo广东移动通信有限公司 Event auditing method and device, terminal equipment and storage medium
CN110889646A (en) * 2019-12-06 2020-03-17 华北电力大学 Method for checking correctness of operating parameters of generator set adapting to working condition change

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
对大熊猫数量调查方法中咬节区分机制的准确性评价;尹玉峰, 王昊, 陈艾, 刘国琪;生物多样性(第05期);正文全文 *
无线传感器网络中基于预测的时域数据融合技术;回春立;崔莉;;计算机工程与应用(第21期);正文全文 *
浅析工程造价的审核;施建伟;施建兴;祝胜标;徐朝阳;陈小江;;建筑施工(第08期);正文全文 *
许鑫.《基于文本特征计算的信息分析方法》.上海科学技术文献出版社,2015,79. *
黑马程序员.《Python数据分析与应用 从数据获取到可视化》.中国铁道出版社,2019,83. *

Also Published As

Publication number Publication date
CN111882289A (en) 2020-11-03

Similar Documents

Publication Publication Date Title
WO2021052031A1 (en) Statistical interquartile range-based commodity inventory risk early warning method and system, and computer readable storage medium
CN109409628B (en) Acquisition terminal manufacturer evaluation method based on metering big data clustering model
CN103247008A (en) Quality evaluation method of electricity statistical index data
CN111786385B (en) Power grid operation and maintenance scheme planning method, system and equipment
CN114462133A (en) Digital twin technology equipment product-based carbon footprint digital accounting method and system
CN114065223A (en) Multi-dimensional software security risk assessment method based on CVSS
CN110321912A (en) More metering anomalous event correlation analysis methods
CN112801315A (en) State diagnosis method and device for power secondary equipment and terminal
CN114757468A (en) Root cause analysis method for flow execution abnormity in flow mining
CN111882289B (en) Device and method for measuring and calculating project data auditing index interval
CN102567536A (en) Key performance target analyzing method based on data statistics
CN117035563B (en) Product quality safety risk monitoring method, device, monitoring system and medium
CN111242424A (en) Quality data processing method and device
CN115292969B (en) Equipment outfield reliability assessment method and system based on factory and repair data
CN110781959A (en) Power customer clustering method based on BIRCH algorithm and random forest algorithm
CN113780724B (en) Product quality batch stability quantitative evaluation criterion calculation method
CN110795308A (en) Server inspection method, device, equipment and storage medium
CN113806343B (en) Evaluation method and system for Internet of vehicles data quality
CN112614005B (en) Method and device for processing reworking state of enterprise
CN115907461A (en) Electric power engineering method based on mechanism derivation equation
CN115796665A (en) Multi-index carbon efficiency grading evaluation method and device for green energy power generation project
CN115951295A (en) Automatic identification method and system for daily clear power abnormity
CN114553473A (en) Abnormal login behavior detection system and method based on login IP and login time
CN113935819A (en) Method for extracting checking abnormal features
CN112329108A (en) Optimized anti-floating checking calculation method and system for subway station

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant