CN104216985B - Method and system for screening abnormal data - Google Patents
- Publication number
- CN104216985B CN201410446368.0A
- Authority
- CN
- China
- Prior art keywords
- service
- type
- acquisition system
- sample data
- data acquisition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/06—Energy or water supply
Abstract
An embodiment of the invention discloses a method for screening abnormal data, implemented over multiple sample data sets. The method includes: obtaining a first sample data set and its corresponding business types; setting a screening rule for each business type and, according to the rules set, obtaining the screened data of each business type; judging whether the screened data of each business type are present in the comparison data sets screened from the sample data sets other than the first sample data set; and, if so, determining that the screened data are abnormal data. The embodiment corrects the problem that any deviation in the sampling process causes large errors in the analysis results, can be applied to sampling across multiple subcategories, and reduces the error rate of the sampled results. It can also quickly and precisely locate all abnormal data in complex big data (the whole data population, not merely a sample set).
Description
Technical field
The present invention relates to the technical field of marketing inspection for power systems, and more particularly to a method and a system for screening abnormal data.
Background
Power system marketing inspection is the internal, specialized inspection and supervision of the improvement and execution of marketing rules, the standardization of marketing conduct, the quality of marketing work and the like, carried out in accordance with relevant policies, regulations and rules.
The existing routine marketing inspection system uses scientific sampling and an evaluation model based on statistical principles. Business data are first imported into statistical software, samples are then drawn by the sampling module of general-purpose statistical software, and finally the investigation results are imported into the statistical software for statistical inference. In this way, without collecting or analyzing all the data, a high-precision inference is made at low cost from a random sample. The drawbacks are that any deviation in the sampling process causes large errors in the analysis results, and that when random sampling is applied across multiple subcategories the error rate of the sampled results rises sharply.
Moreover, once the volume of business data grows sharply, finding abnormal data by sample investigation cannot find all the abnormal data and searches inefficiently; that is, abnormal data cannot be located quickly in complex big data.
Faced with rich and complex "big data", and unlike the "small data" era in which random sampling was used to obtain the most information from the least data, all of the data (or at least as much as possible) need to be collected and used, that is, "sample = population", so that the whole data population can be analyzed and mined in depth for higher accuracy.
Summary of the invention
The purpose of the embodiments of the invention is to provide a method and a system for screening abnormal data that correct the problem that any deviation in the sampling process causes large errors in the analysis results, that can be applied to sampling across multiple subcategories and reduce the error rate of the sampled results, and that can quickly locate all abnormal data in complex big data.
To solve the above technical problem, an embodiment of the invention provides a method for screening abnormal data, implemented over multiple sample data sets, the method including:
obtaining a first sample data set and the multiple business types corresponding to the first sample data set;
setting a screening rule for each business type corresponding to the obtained first sample data set and, according to the screening rules set, obtaining the screened data of each business type in the first sample data set;
judging whether the screened data of each business type in the obtained first sample data set are all present in the comparison data sets screened from the multiple sample data sets other than the first sample data set;
if so, determining that, for the same business type, the screened data present in the comparison data sets screened from the multiple sample data sets other than the first sample data set are abnormal data.
The specific steps of setting a screening rule for each business type corresponding to the obtained first sample data set and, according to the screening rules set, obtaining the screened data of each business type in the first sample data set include:
in the obtained first sample data set, setting, according to each business type corresponding to the first sample data set, the one or more screening attributes contained in the screening rule of each business type;
obtaining, according to the one or more screening attributes contained in the screening rule set for each business type, the screened data of each business type in the first sample data set; wherein the screened data are the data set formed by screening each business type in the first sample data set with the corresponding one or more screening attributes.
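The rule-setting and screening steps above can be sketched in Python. This is only an illustration under stated assumptions, not the patented implementation: records are assumed to be dictionaries, each screening attribute is modelled as a predicate, and a record is assumed to be selected when it satisfies at least one attribute of the rule for its business type. The names `RULES`, `screen`, `supply`, `reply_days` and `reading` are hypothetical; the 15-day and 30-day thresholds are taken from the worked example in the description, while the metering attribute is invented purely to show a second business type.

```python
from typing import Callable, Dict, List

Record = Dict[str, object]
Attribute = Callable[[Record], bool]

# Hypothetical rule table: business type -> screening attributes of its rule.
RULES: Dict[str, List[Attribute]] = {
    "business_expansion": [
        lambda r: r["supply"] == "single" and r["reply_days"] > 15,
        lambda r: r["supply"] == "dual" and r["reply_days"] > 30,
    ],
    # Invented attribute, used only to show a second business type.
    "metering": [lambda r: r["reading"] < 0],
}


def screen(records: List[Record],
           rules: Dict[str, List[Attribute]]) -> Dict[str, List[Record]]:
    """Return, per business type, the records selected by that type's rule."""
    screened: Dict[str, List[Record]] = {btype: [] for btype in rules}
    for record in records:
        attributes = rules.get(record["type"], [])
        # Assumption: one matching attribute is enough to select the record.
        if any(attribute(record) for attribute in attributes):
            screened[record["type"]].append(record)
    return screened


first_set = [
    {"id": 1, "type": "business_expansion", "supply": "single", "reply_days": 18},
    {"id": 2, "type": "business_expansion", "supply": "dual", "reply_days": 18},
    {"id": 3, "type": "metering", "reading": -4.0},
]
screened = screen(first_set, RULES)
```

Keeping the attributes in a table keyed by business type means a new rule can be added without touching the screening loop.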
The specific steps of judging whether the screened data of each business type in the obtained first sample data set are all present in the comparison data sets screened from the multiple sample data sets other than the first sample data set include:
obtaining the screening rule set for each business type in the first sample data set;
applying the screening rule obtained for each business type to each of the multiple sample data sets other than the first sample data set, to obtain the comparison data set of each business type;
judging whether the screened data of each business type obtained in the first sample data set are all contained in the comparison data set of the same business type.
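The comparison step can be sketched as follows. This is a minimal illustration under stated assumptions: records are dictionaries carrying a hypothetical shared `id`, so that the same business record can be recognized across systems, and a rule is a predicate on a record. The function names are illustrative, and treating an empty selection as "not abnormal" is a choice made here, not something the source specifies.

```python
from typing import Callable, Dict, Iterable, List, Set

Record = Dict[str, object]
Rule = Callable[[Record], bool]


def screen_ids(records: Iterable[Record], btype: str, rule: Rule) -> Set[object]:
    """Ids of the records of one business type that the rule selects."""
    return {r["id"] for r in records if r["type"] == btype and rule(r)}


def build_comparison_set(other_sets: List[List[Record]],
                         btype: str, rule: Rule) -> Set[object]:
    """Apply the same rule to every other data set and pool the selected ids."""
    pooled: Set[object] = set()
    for data_set in other_sets:
        pooled |= screen_ids(data_set, btype, rule)
    return pooled


def is_abnormal_set(first_set: List[Record], other_sets: List[List[Record]],
                    btype: str, rule: Rule) -> bool:
    """True when everything selected in the first set also appears in the
    comparison set of the same business type."""
    selected = screen_ids(first_set, btype, rule)
    comparison = build_comparison_set(other_sets, btype, rule)
    # Assumption: an empty selection is not reported as abnormal.
    return bool(selected) and selected <= comparison


def late(record: Record) -> bool:
    # Hypothetical rule: reply took more than 15 days.
    return record["days"] > 15


e1 = [{"id": 1, "type": "expansion", "days": 20},
      {"id": 2, "type": "expansion", "days": 5}]
e2 = [{"id": 1, "type": "expansion", "days": 20}]
e3 = [{"id": 3, "type": "expansion", "days": 40}]
confirmed = is_abnormal_set(e1, [e2, e3], "expansion", late)
unconfirmed = is_abnormal_set(e1, [e3], "expansion", late)
```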
The business types include business expansion and installation, electricity-use change, meter reading, billing and collection, metering, electricity-use inspection, customer service, and line-loss management.
An embodiment of the invention further provides a system for screening abnormal data, the system including an acquiring unit, a screening unit, a judging unit and a determining unit, wherein:
the acquiring unit is used to obtain a first sample data set and the multiple business types corresponding to the first sample data set;
the screening unit is used to set a screening rule for each business type corresponding to the obtained first sample data set and, according to the screening rules set, to obtain the screened data of each business type in the first sample data set;
the judging unit is used to judge whether the screened data of each business type in the obtained first sample data set are all present in the comparison data sets screened from the multiple sample data sets other than the first sample data set;
the determining unit is used to determine that, for the same business type, the screened data present in the comparison data sets screened from the multiple sample data sets other than the first sample data set are abnormal data.
The screening unit includes:
a setting module, used to set, in the obtained first sample data set and according to each business type corresponding to the first sample data set, the one or more screening attributes contained in the screening rule of each business type;
a screening module, used to obtain, according to the one or more screening attributes contained in the screening rule set for each business type, the screened data of each business type in the first sample data set; wherein the screened data are the data set formed by screening each business type in the first sample data set with the corresponding one or more screening attributes.
The judging unit includes:
a first acquisition module, used to obtain the screening rule set for each business type in the first sample data set;
a second acquisition module, used to apply the screening rule obtained for each business type to each of the multiple sample data sets other than the first sample data set, to obtain the comparison data set of each business type;
a judging module, used to judge whether the screened data of each business type obtained in the first sample data set are all contained in the comparison data set of the same business type.
The business types include business expansion and installation, electricity-use change, meter reading, billing and collection, metering, electricity-use inspection, customer service, and line-loss management.
Implementing the embodiments of the invention has the following beneficial effects:
In the embodiments of the invention, a screening rule is set for each business type within one sample data set, so that sampling is carried out across multiple subcategories at once and the error rate of the sampled results is reduced. The screened data obtained for each business type are compared with the data sets other than that sample data set, so that abnormal data are located quickly, correcting the problem that any deviation in the sampling process causes large errors in the analysis results. In addition, when faced with rich and complex "big data", all the data of the whole business (that is, "sample = population") can be screened for abnormal data according to the rules set, locating all abnormal data quickly and precisely and enabling in-depth analysis and mining of the whole data population for higher accuracy.
Brief description of the drawings
To explain the embodiments of the invention or the technical solutions of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the invention; other drawings that a person of ordinary skill in the art derives from them without creative effort still fall within the scope of the invention.
Fig. 1 is a flow chart of the method for screening abnormal data provided by an embodiment of the invention;
Fig. 2 is a structural diagram of the system for screening abnormal data provided by an embodiment of the invention.
Detailed description
To make the purpose, technical solution and advantages of the invention clearer, the invention is described in further detail below with reference to the drawings and embodiments. The specific embodiments described here merely explain the invention and do not limit it.
As shown in Fig. 1, an embodiment of the invention proposes a method for screening abnormal data, implemented over multiple sample data sets, the method including:
Step S101: obtain a first sample data set and the multiple business types corresponding to the first sample data set. The business types include, but are not limited to, business expansion and installation, electricity-use change, meter reading, billing and collection, metering, electricity-use inspection, customer service, and line-loss management.
Step S102: set a screening rule for each business type corresponding to the obtained first sample data set and, according to the screening rules set, obtain the screened data of each business type in the first sample data set.
Specifically, in the first sample data set, the one or more screening attributes contained in the screening rule of each business type are set according to each business type corresponding to the first sample data set;
the screened data of each business type in the first sample data set are then obtained according to the one or more screening attributes contained in the screening rule set for each business type; the screened data are the data set formed by screening each business type in the first sample data set with the corresponding one or more screening attributes.
Step S103: judge whether the screened data of each business type in the obtained first sample data set are all present in the comparison data sets screened from the multiple sample data sets other than the first sample data set; if so, perform the next step S104; if not, end.
Specifically, the screening rule set for each business type in the first sample data set is obtained;
the screening rule obtained for each business type is applied to each of the multiple sample data sets other than the first sample data set, to obtain the comparison data set of each business type;
it is then judged whether the screened data of each business type obtained in the first sample data set are all contained in the comparison data set of the same business type.
Step S104: determine that, for the same business type, the screened data present in the comparison data sets screened from the multiple sample data sets other than the first sample data set are abnormal data.
As an example, the first sample data set is the data set of the integrated marketing and distribution system, and the other sample data sets include, but are not limited to, the data sets of the marketing management system, the metering automation management system, the marketing decision support system and the customer service information system. Together these data sets form a data interaction warehouse. The first sample data set is divided into multiple business types, including business expansion and installation, electricity-use change, meter reading, billing and collection, metering, electricity-use inspection, customer service, line-loss management, and so on.
In the first sample data set, the one or more screening attributes contained in the screening rule of each business type are set according to each business type corresponding to the first sample data set, yielding the list of screened data for all business types.
Assume that the business data comprise multiple data sets E1 to En, where the first sample data set is E1. A screening rule (condition) R is built in E1; under rule R, the data in E1 are classified and screened to obtain the required abnormal business data set C.
Function expression: y = f(x), where f is the screening rule R, x is the attributes A1, A2, ..., An, and y is the abnormal business data set C. Rule R comprises the attributes A1, A2, ..., An; rule R may be the set of attributes A1, A2, ..., An, i.e. R ∈ {A1, A2, ..., An}. Building R is also a process of continually identifying its attributes A1, A2, ..., An and obtaining all the sets that satisfy A1, A2, ..., An.
Specifically, in the data of E1 whose business type is business expansion and installation, screening rule R1 is set to: power supply plan reply timeout. Rule R1 may then be the set of several attributes A1, A2, i.e. R1 ∈ {A1, A2, ...}. Under the condition "power supply plan reply timeout", the abnormal data for power supply plan timeout can be found from just two attributes: A1 = more than 15 working days, for a single-supply customer, from the date the customer's electricity application was accepted; A2 = more than 30 working days, for a dual-supply customer, from the date the customer's electricity application was accepted. The abnormal data for power supply plan timeout are thus screened out of the business expansion and installation data.
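Rule R1 can be sketched in Python as a check on elapsed working days. The source does not say how working days are counted; this sketch assumes Monday to Friday with no public holidays, counted from the day after acceptance up to and including the reply date. The names `LIMITS`, `working_days_between` and `reply_timed_out` are hypothetical.

```python
from datetime import date, timedelta

# Working-day limits from attributes A1 (single supply) and A2 (dual supply).
LIMITS = {"single": 15, "dual": 30}


def working_days_between(start: date, end: date) -> int:
    """Count Monday-to-Friday days after `start`, up to and including `end`.

    Assumption: public holidays are ignored.
    """
    days = 0
    current = start
    while current < end:
        current += timedelta(days=1)
        if current.weekday() < 5:  # 0 = Monday ... 4 = Friday
            days += 1
    return days


def reply_timed_out(supply: str, accepted: date, replied: date) -> bool:
    """True when the reply exceeded the limit for the customer's supply type."""
    return working_days_between(accepted, replied) > LIMITS[supply]
```

For instance, `reply_timed_out("single", date(2014, 9, 1), date(2014, 9, 23))` evaluates to `True`, because 16 working days have elapsed, while the same dates for a dual-supply customer stay within the 30-day limit.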
By analogy, with further screening rules such as R2: timeout in reviewing the customer's power-receiving project design documents, and R3: timeout in completing meter installation and power connection, the business expansion screened data set C1 can be determined in the business expansion and installation data.
Meanwhile, screening rules R1, R2 and R3 are applied to each of the other business data sets apart from the first sample data set E1 (for example E2 to En), to obtain the comparison data set Cm for the business expansion type in those data sets. The comparison data set is the large data set Cm formed from the other business data sets (for example E2 to En) after each is screened with rules R1, R2 and R3.
When the business expansion screened data set C1 of the first sample data set E1 is contained in the business expansion large data set Cm of the other business data sets (for example E2 to En), that is, C1 ⊆ Cm, C1 is determined to be the business expansion abnormal data set.
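The example can be written compactly in set notation. This is a sketch: treating each rule R1, R2, R3 as a predicate on records, and forming Cm as a union over E2 to En, are assumptions made for illustration, since the source states only that Cm is formed from the other data sets after screening with R1, R2 and R3.

```latex
\begin{align*}
C_1 &= \{\, x \in E_1 : R_1(x) \lor R_2(x) \lor R_3(x) \,\} \\
C_m &= \bigcup_{k=2}^{n} \{\, x \in E_k : R_1(x) \lor R_2(x) \lor R_3(x) \,\} \\
C_1 \subseteq C_m &\;\Longrightarrow\; C_1 \text{ is the abnormal data set for this business type}
\end{align*}
```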
Implementing the embodiments of the invention has the following beneficial effects:
In the embodiments of the invention, a screening rule is set for each business type within one sample data set, so that sampling is carried out across multiple subcategories at once and the error rate of the sampled results is reduced. The screened data obtained for each business type are compared with the data sets other than that sample data set, so that abnormal data are located quickly, correcting the problem that any deviation in the sampling process causes large errors in the analysis results. In addition, when faced with rich and complex "big data", all the data of the whole business (that is, "sample = population") can be screened for abnormal data according to the rules set, locating all abnormal data quickly and precisely and enabling in-depth analysis and mining of the whole data population for higher accuracy.
As shown in Fig. 2, an embodiment of the invention further proposes a system for screening abnormal data, the system including an acquiring unit 210, a screening unit 220, a judging unit 230 and a determining unit 240, wherein:
the acquiring unit 210 is used to obtain a first sample data set and the multiple business types corresponding to the first sample data set;
the screening unit 220 is used to set a screening rule for each business type corresponding to the obtained first sample data set and, according to the screening rules set, to obtain the screened data of each business type in the first sample data set;
the judging unit 230 is used to judge whether the screened data of each business type in the obtained first sample data set are all present in the comparison data sets screened from the multiple sample data sets other than the first sample data set;
the determining unit 240 is used to determine that, for the same business type, the screened data present in the comparison data sets screened from the multiple sample data sets other than the first sample data set are abnormal data.
The screening unit 220 includes:
a setting module 2201, used to set, in the obtained first sample data set and according to each business type corresponding to the first sample data set, the one or more screening attributes contained in the screening rule of each business type;
a screening module 2202, used to obtain, according to the one or more screening attributes contained in the screening rule set for each business type, the screened data of each business type in the first sample data set; wherein the screened data are the data set formed by screening each business type in the first sample data set with the corresponding one or more screening attributes.
The judging unit 230 includes:
a first acquisition module 2301, used to obtain the screening rule set for each business type in the first sample data set;
a second acquisition module 2302, used to apply the screening rule obtained for each business type to each of the multiple sample data sets other than the first sample data set, to obtain the comparison data set of each business type;
a judging module 2303, used to judge whether the screened data of each business type obtained in the first sample data set are all contained in the comparison data set of the same business type.
The business types include business expansion and installation, electricity-use change, meter reading, billing and collection, metering, electricity-use inspection, customer service, and line-loss management.
In the embodiments of the invention, the system for screening abnormal data first obtains, through the acquiring unit 210, the first sample data set and the multiple business types corresponding to it. The screening unit 220 sets a screening rule for each business type corresponding to the first sample data set and, according to the rules set, obtains the screened data of each business type in the first sample data set. The judging unit 230 judges whether the screened data of each business type in the first sample data set are all present in the comparison data sets screened from the multiple sample data sets other than the first sample data set. If so, the determining unit 240 determines that, for the same business type, the screened data present in those comparison data sets are abnormal data.
It is worth noting that, in the above system embodiment, the units are divided only according to functional logic; the division is not limited to the above as long as the corresponding functions can be realized. In addition, the specific names of the functional units serve only to distinguish them from one another and do not limit the scope of protection of the invention.
A person of ordinary skill in the art will understand that all or part of the steps of the method in the above embodiments can be completed by a program instructing the relevant hardware, and that the program can be stored in a computer-readable storage medium such as ROM/RAM, a magnetic disk or an optical disc.
The above are merely preferred embodiments of the invention and do not limit it; any modification, equivalent substitution or improvement made within the spirit and principles of the invention shall fall within its scope of protection.
Claims (6)
1. A method for screening abnormal data, characterized in that it is implemented over multiple sample data sets, the method including:
obtaining a first sample data set and the multiple business types corresponding to the first sample data set;
setting a screening rule for each business type corresponding to the obtained first sample data set and, according to the screening rules set, obtaining the screened data of each business type in the first sample data set;
judging whether the screened data of each business type in the obtained first sample data set are all present in the comparison data sets screened from the multiple sample data sets other than the first sample data set;
if so, determining that, for the same business type, the screened data present in the comparison data sets screened from the multiple sample data sets other than the first sample data set are abnormal data;
wherein the specific steps of setting a screening rule for each business type corresponding to the obtained first sample data set and, according to the screening rules set, obtaining the screened data of each business type in the first sample data set include:
in the obtained first sample data set, setting, according to each business type corresponding to the first sample data set, the one or more screening attributes contained in the screening rule of each business type;
obtaining, according to the one or more screening attributes contained in the screening rule set for each business type, the screened data of each business type in the first sample data set; wherein the screened data are the data set formed by screening each business type in the first sample data set with the corresponding one or more screening attributes.
2. The method of claim 1, characterized in that the specific steps of judging whether the screened data of each business type in the obtained first sample data set are all present in the comparison data sets screened from the multiple sample data sets other than the first sample data set include:
obtaining the screening rule set for each business type in the first sample data set; applying the screening rule obtained for each business type to each of the multiple sample data sets other than the first sample data set, to obtain the comparison data set of each business type;
judging whether the screened data of each business type obtained in the first sample data set are all contained in the comparison data set of the same business type.
3. The method of claim 1 or 2, characterized in that the service types include business process handling, electricity-use change, meter recording, checking and charging, metering, electricity-use inspection, customer service, and line-loss control.
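The method of claims 1 and 2 can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names `screen` and `find_abnormal`, the record fields (`id`, `service_type`, `kwh`, `open_days`), and the thresholds are all hypothetical. The claims do not say how "exists in the correction data set" is evaluated, so the sketch assumes records are matched by a shared `id` key.

```python
from typing import Callable, Dict, List

Record = Dict[str, object]
# Assumption: a screening rule maps each screening attribute to a predicate.
Rule = Dict[str, Callable[[object], bool]]


def screen(records: List[Record], service_type: str, rule: Rule) -> List[Record]:
    """Return the records of one service type that satisfy every screening attribute."""
    return [
        r for r in records
        if r.get("service_type") == service_type
        and all(attr in r and pred(r[attr]) for attr, pred in rule.items())
    ]


def find_abnormal(
    first_set: List[Record],
    other_sets: List[List[Record]],
    rules: Dict[str, Rule],
    key: str = "id",  # hypothetical matching key, not specified by the claims
) -> Dict[str, List[Record]]:
    """For each service type, flag screened data that also appears in the correction data set."""
    abnormal: Dict[str, List[Record]] = {}
    for service_type, rule in rules.items():
        # Screened data of this service type in the first sample data set.
        screened = screen(first_set, service_type, rule)
        # The same rule applied to every other sample data set gives the correction data set.
        correction_keys = {
            r[key] for s in other_sets for r in screen(s, service_type, rule)
        }
        abnormal[service_type] = [r for r in screened if r[key] in correction_keys]
    return abnormal


# Hypothetical sample data and rules, for illustration only.
first_set = [
    {"id": 1, "service_type": "metering", "kwh": 1200},
    {"id": 2, "service_type": "metering", "kwh": 50},
    {"id": 3, "service_type": "customer service", "open_days": 40},
]
other_sets = [
    [
        {"id": 1, "service_type": "metering", "kwh": 1500},
        {"id": 4, "service_type": "metering", "kwh": 2000},
    ],
    [{"id": 3, "service_type": "customer service", "open_days": 5}],
]
rules = {
    "metering": {"kwh": lambda v: v > 1000},
    "customer service": {"open_days": lambda v: v > 30},
}

result = find_abnormal(first_set, other_sets, rules)
```

With this data, record 1 is flagged for the metering type because it passes the same rule in both the first set and another sample set; record 3 passes the rule only in the first set and is not flagged.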
4. A system for screening abnormal data, characterized in that the system comprises an acquiring unit, a screening unit, a judging unit, and a determining unit; wherein
the acquiring unit is configured to acquire a first sample data set and the multiple service types corresponding to the first sample data set;
the screening unit is configured to set a screening rule for each service type corresponding to the acquired first sample data set and, according to the set screening rules, to obtain the screened data of each service type in the first sample data set;
the judging unit is configured to judge whether the screened data of each service type in the obtained first sample data set exists in the correction data set screened from the multiple sample data sets other than the first sample data set;
the determining unit is configured to determine that, for the same service type, the screened data present in the correction data set screened from the multiple sample data sets other than the first sample data set is abnormal data;
wherein the screening unit comprises:
a setup module, configured to set, in the acquired first sample data set and for each service type corresponding to the first sample data set, the one or more screening attributes contained in the screening rule of that service type;
a screening module, configured to obtain the screened data of each service type in the first sample data set according to the one or more screening attributes contained in the set screening rule of each service type; wherein the screened data is the set of data obtained when each service type in the first sample data set is screened by its corresponding one or more screening attributes.
5. The system of claim 4, characterized in that the judging unit comprises:
a first acquisition module, configured to obtain the screening rule set for each service type in the first sample data set;
a second acquisition module, configured to apply the obtained screening rule of each service type to each of the multiple sample data sets other than the first sample data set, to obtain the correction data set of each service type;
a judging module, configured to judge whether the screened data of each service type obtained from the first sample data set is contained in the correction data set of the same service type.
6. The system of claim 4 or 5, characterized in that the service types include business process handling, electricity-use change, meter recording, checking and charging, metering, electricity-use inspection, customer service, and line-loss control.
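The unit and module structure of claims 4 and 5 can be sketched as a set of small Python classes. This is only one possible decomposition under stated assumptions: the class and method names, the `id` and `service_type` record fields, the `kwh` attribute, and matching records by `id` are all hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set, Tuple

Record = Dict[str, object]
Rule = Dict[str, Callable[[object], bool]]  # screening attribute -> predicate (assumed form)


def _apply(records: List[Record], service_type: str, rule: Rule) -> List[Record]:
    """Keep records of one service type that satisfy every screening attribute."""
    return [
        r for r in records
        if r.get("service_type") == service_type
        and all(a in r and p(r[a]) for a, p in rule.items())
    ]


class AcquiringUnit:
    def acquire(self, first_set: List[Record]) -> Tuple[List[Record], List[str]]:
        # Returns the first sample data set and its service types (sorted for determinism).
        return first_set, sorted({str(r["service_type"]) for r in first_set})


@dataclass
class ScreeningUnit:
    rules: Dict[str, Rule] = field(default_factory=dict)

    def set_rule(self, service_type: str, rule: Rule) -> None:
        # Setup module: one rule, with one or more attributes, per service type.
        self.rules[service_type] = rule

    def screen(self, first_set: List[Record]) -> Dict[str, List[Record]]:
        # Screening module: screened data of each service type.
        return {t: _apply(first_set, t, rule) for t, rule in self.rules.items()}


class JudgingUnit:
    def correction_sets(
        self, rules: Dict[str, Rule], other_sets: List[List[Record]]
    ) -> Dict[str, Set[object]]:
        # First and second acquisition modules: reuse each rule on the other sample sets.
        return {
            t: {r["id"] for s in other_sets for r in _apply(s, t, rule)}
            for t, rule in rules.items()
        }

    def judge(
        self, screened: Dict[str, List[Record]], correction: Dict[str, Set[object]]
    ) -> Dict[str, List[bool]]:
        # Judging module: is each screened record in the correction set of its service type?
        return {
            t: [r["id"] in correction.get(t, set()) for r in rows]
            for t, rows in screened.items()
        }


class DeterminingUnit:
    def determine(
        self, screened: Dict[str, List[Record]], flags: Dict[str, List[bool]]
    ) -> Dict[str, List[Record]]:
        # Records judged present in the correction set are reported as abnormal data.
        return {
            t: [r for r, hit in zip(rows, flags[t]) if hit]
            for t, rows in screened.items()
        }


# Hypothetical data wired through the four units.
first_set = [
    {"id": 10, "service_type": "metering", "kwh": 900},
    {"id": 11, "service_type": "metering", "kwh": 100},
]
other_sets = [[{"id": 10, "service_type": "metering", "kwh": 950}]]

first, types = AcquiringUnit().acquire(first_set)
screening = ScreeningUnit()
for t in types:
    screening.set_rule(t, {"kwh": lambda v: v > 500})
screened = screening.screen(first)
judging = JudgingUnit()
correction = judging.correction_sets(screening.rules, other_sets)
flags = judging.judge(screened, correction)
abnormal = DeterminingUnit().determine(screened, flags)
```

Keeping the judging and determining steps in separate classes mirrors the claim wording; in practice they could be merged into a single filter.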
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410446368.0A CN104216985B (en) | 2014-09-04 | 2014-09-04 | A kind of method and system for screening abnormal data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104216985A CN104216985A (en) | 2014-12-17 |
CN104216985B CN104216985B (en) | 2017-09-01 |
Family
ID=52098475
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410446368.0A Active CN104216985B (en) | 2014-09-04 | 2014-09-04 | A kind of method and system for screening abnormal data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104216985B (en) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104809676B (en) * | 2015-05-11 | 2019-12-17 | 林辉 | Method and device for analyzing error type of answer |
CN106296399A (en) * | 2015-06-11 | 2017-01-04 | 交通银行股份有限公司 | The data processing method of business rule formulation and system |
CN106156791B (en) * | 2016-06-15 | 2021-03-30 | 北京京东尚科信息技术有限公司 | Business data classification method and device |
CN106251094B (en) * | 2016-08-26 | 2022-12-30 | 国网江西省电力公司电力科学研究院 | 10kV business expansion and installation work order transaction analysis device and analysis method |
CN109598525B (en) * | 2017-09-30 | 2023-01-17 | 北京国双科技有限公司 | Data processing method and device |
CN108363680B (en) * | 2018-01-04 | 2021-09-28 | 国网福建省电力有限公司 | Power consumer electricity utilization characteristic presentation method and user terminal |
CN108833188B (en) * | 2018-07-17 | 2021-12-28 | 顺丰科技有限公司 | Alarm information management method, device, equipment and storage medium |
CN110782550B (en) * | 2019-09-20 | 2021-08-31 | 腾讯科技(深圳)有限公司 | Data acquisition method, device and equipment |
CN113496440B (en) * | 2021-06-28 | 2023-12-12 | 国网上海市电力公司 | User abnormal electricity consumption detection method and system |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007257283A (en) * | 2006-03-23 | 2007-10-04 | Tdk Corp | Memory controller and flash memory system |
CN101557111A (en) * | 2009-05-20 | 2009-10-14 | 重庆市电力公司 | Intelligent acquired electricity consumption data screening processing system |
CN103514514A (en) * | 2013-09-23 | 2014-01-15 | 广州供电局有限公司 | On-line monitoring method for electricity marketing business data |
Non-Patent Citations (3)
Title |
---|
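Exploration and application of multi-system data analysis and comparison in marketing inspection work; 叶永乐 (Ye Yongle); 《电子世界》 (Electronics World); 2014-05-30; full text * |
Construction and application of a marketing inspection and monitoring system; 王海燕 (Wang Haiyan) et al.; 《电力信息化》 (Electric Power Informatization); 2012-03-15; full text * |
Architecture design and application of the Qinghai electric power marketing inspection and monitoring system; 李红梅 (Li Hongmei) et al.; 《电力信息化》 (Electric Power Informatization); 2011-12-15; full text * |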
Also Published As
Publication number | Publication date |
---|---|
CN104216985A (en) | 2014-12-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104216985B (en) | A kind of method and system for screening abnormal data | |
Veugelers et al. | Linking technology intelligence to open innovation | |
US8370181B2 (en) | System and method for supply chain data mining and analysis | |
Musciotto et al. | Detecting informative higher-order interactions in statistically validated hypergraphs | |
Estevão et al. | The doing business ranking and the GDP. A qualitative study | |
CN104572449A (en) | Automatic test method based on case library | |
US10332030B2 (en) | Multi-sensor data summarization | |
Lagerström et al. | Visualizing and measuring enterprise application architecture: an exploratory telecom case | |
CN102650996A (en) | Method and device for determining data mapping relationship between database tables | |
Nguyen et al. | Vasabi: Hierarchical user profiles for interactive visual user behaviour analytics | |
CN101894319A (en) | Tobacco enterprise data quality management system and method | |
Lagerström et al. | Visualizing and measuring enterprise architecture: an exploratory biopharma case | |
CN107146150A (en) | Auditing method, device, storage medium and the processor of the audit target | |
CN109934483A (en) | A kind of manufacturing quality information managing and control system and method towards quality in kind promotion | |
CN111178688A (en) | Self-service analysis method and system for power technology supervision data, storage medium and computer equipment | |
Zhang et al. | A survey on quality assurance techniques for big data applications | |
Meyer et al. | Understanding usage data-driven product planning: a systematic literature review | |
Zhang et al. | Scientific relatedness and intellectual base: a citation analysis of un-cited and highly-cited papers in the solar energy field | |
Dai | Designing an Accounting Information Management System Using Big Data and Cloud Technology | |
DE202016101711U1 (en) | Capacity planning tool, in particular an information technology infrastructure | |
CN109064211A (en) | Sales service data analysing method, device and server | |
US20150106499A1 (en) | Transforming cloud service measurements into anonymized extramural business rankings | |
Parekh et al. | Issues in bottleneck detection in multi-tier enterprise applications | |
CN108345541A (en) | A kind of program detecting method and system | |
Müller | Managing the open cathedral |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||