CN103559330B - Method and system for detecting data consistency - Google Patents

Method and system for detecting data consistency Download PDF

Info

Publication number
CN103559330B
CN103559330B CN201310471592.0A CN201310471592A CN103559330B CN 103559330 B CN103559330 B CN 103559330B CN 201310471592 A CN201310471592 A CN 201310471592A CN 103559330 B CN103559330 B CN 103559330B
Authority
CN
China
Prior art keywords
data
data set
evaluation index
compared
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310471592.0A
Other languages
Chinese (zh)
Other versions
CN103559330A (en
Inventor
李彪
李强
郑小汭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Huawei Technologies Co Ltd
Original Assignee
Shanghai Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Huawei Technologies Co Ltd filed Critical Shanghai Huawei Technologies Co Ltd
Priority to CN201310471592.0A priority Critical patent/CN103559330B/en
Publication of CN103559330A publication Critical patent/CN103559330A/en
Application granted granted Critical
Publication of CN103559330B publication Critical patent/CN103559330B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Machine Translation (AREA)

Abstract

The invention provides a method and a system for detecting data consistency. The method includes acquiring a data consistency comparison semantic database corresponding to objects to be compared; acquiring first data sets and second data sets which are about to be compared to one another and correspond to various assessment indexes in the data consistency comparison semantic database; processing the first data sets and the second data sets according to data processing algorithms corresponding to the various assessment indexes; acquiring processing results corresponding to the various assessment indexes; comparing the various processing results to corresponding preset threshold values; determining consistency detection results according to comparison results; determining consistency detection results of the objects to be compared. The first data sets and the second data sets correspond to the various assessment indexes. The consistency detection results correspond to the various assessment indexes. The method and the system have the advantage that the detection efficiency can be improved.

Description

Data consistency detection and system
Technical field
The present invention relates to technical field of data processing, more particularly to a kind of communication system data consistency detection and System.
Background technology
Emulation technology there has been in every field it is commonly used, become in complex system study indispensable one it is important Link, emulates the important branch that data analysiss are increasingly becoming in emulation field.In same research field, there is the imitative of terminal True instrument is selective, and in this case, user inevitably runs into simulation result concordance and confirms problem, for example, Confirm whether different emulators are consistent to the simulation result of same scheme, confirm same emulator Multi simulation running result whether one Cause, or confirm whether simulation result is consistent with measured result etc..
At present, the research both at home and abroad with regard to data consistency detection is also fewer, and data consistency detection scheme is all base In the comparison of coherence of binary character, that is, compare two files computer symbols whether complete in computer systems Cause, and for larger communication analogue system, because business model is very complicated, simulation result is can result in business model inconsistent The numerous and different emulator of factor due to realization mechanism itself it is not quite identical, even if using identical emulation Model, simulation result be also impossible to it is definitely identical, so, for larger communication analogue system, the emulation based on binary character Data consistency testing result always " inconsistent ", therefore, existing comparing scheme be not particularly suited for larger communication emulation The consistency detection of the emulation data in system.
At present for the consistency detection of the emulation data in larger communication analogue system, mainly domain expert is according to individual People's experience makes concordance judgement to emulating after data are analyzed, detection efficiency is than relatively low.
The content of the invention
A kind of data consistency detection is embodiments provided, to improve data consistency detection efficiency.
A first aspect of the present invention provides a kind of data consistency detection, including:
Obtain the data consistency corresponding with target to be compared and compare semantic base, the data consistency compares semantic base Include at least one evaluation index corresponding with the target to be compared;
Each evaluation index that acquisition is compared in semantic base with the data consistency is corresponding, the first number to be compared According to collection and the second data set;
According to the data processing algorithm pair corresponding with each evaluation index first number corresponding with each evaluation index Processed according to collection and the second data set, obtained the result corresponding with each evaluation index;
Each result is judged whether in preset range, determine according to judged result corresponding with each evaluation index Consistency detection result, when all evaluation indexes consistency detection result all for it is consistent when, determine the target to be compared It is consistent.
In the first possible implementation of first aspect, the number in first data set and the second data set During according to the data for being switching value property, the foundation data processing algorithm pair corresponding with each evaluation index is commented with each Estimate the first corresponding data set of index and the second data set is processed, obtain the process knot corresponding with each evaluation index Fruit includes:
The relative deviation of calculating first data set corresponding with each evaluation index and the second data set, obtains and each The corresponding relative standard deviation values of evaluation index.
In second possible implementation of first aspect, the number in first data set and the second data set During according to being the data counted with represented as histograms, the foundation data processing algorithm corresponding with each evaluation index Pair first data set corresponding with each evaluation index and the second data set are processed, and are obtained relative with each evaluation index The result answered includes:
The rectangular histogram correlation coefficient of calculating first data set corresponding with each evaluation index and the second data set, obtains The rectangular histogram correlation coefficient value corresponding with each evaluation index.
In the third possible implementation of first aspect, the number in first data set and the second data set During according to being ordered sequence, the foundation data processing algorithm pair corresponding with each evaluation index and each evaluation index phase Corresponding first data set and the second data set are processed, and obtaining the result corresponding with each evaluation index includes:
The correlation coefficient of calculating first data set corresponding with each evaluation index and the second data set, obtains and each The corresponding correlation coefficient value of evaluation index.
A second aspect of the present invention provides a kind of data consistency detecting system, including:
First acquisition module, the data consistency for obtaining corresponding with target to be compared compares semantic base, the number Include at least one evaluation index corresponding with the target to be compared according to comparison of coherence semantic base;
Second acquisition module, it is corresponding for obtaining each evaluation index compared with the data consistency in semantic base The first data set to be compared and the second data set;
Data processing module, refers to for the foundation data processing algorithm pair corresponding with each evaluation index with each assessment The first corresponding data set of mark and second data set are processed, and obtain the process knot corresponding with each evaluation index Really;
Judge module, for judging each result whether in preset range, determines and each according to judged result The corresponding consistency detection result of evaluation index, when the consistency detection result of all evaluation indexes is all consistent, it is determined that The target to be compared is consistent.
In the first possible implementation of second aspect, the data processing module includes:
First processing units, for being switching value property when the data in first data set and the second data set During data, the relative deviation of calculating first data set corresponding with each evaluation index and the second data set is obtained and each The corresponding relative standard deviation values of evaluation index.
In second possible implementation of second aspect, the data processing module includes:
Second processing unit, for being with represented as histograms when the data in first data set and the second data set During the data for being counted, the rectangular histogram of calculating first data set corresponding to each evaluation index and the second data set is related Coefficient, obtains the rectangular histogram correlation coefficient value corresponding with each evaluation index.
In the third possible implementation of second aspect, the data processing module includes:
3rd processing unit, for when the data in first data set and the second data set are ordered sequence, The correlation coefficient of calculating first data set corresponding with each evaluation index and the second data set, obtains and each evaluation index Corresponding correlation coefficient value.
A third aspect of the present invention provides a kind of data consistency data handling system, including:
At least one processor, and the memorizer coupled with least one processor;
At least one processor is configured to:
Obtain the data consistency corresponding with target to be compared and compare semantic base, the data consistency compares semantic base Include at least one evaluation index corresponding with the target to be compared;
Obtain corresponding the first number to be compared of each evaluation index compared with the data consistency in semantic base According to collection and the second data set;
According to the data processing algorithm pair corresponding with each evaluation index first number corresponding with each evaluation index Processed according to collection and the second data set, obtained the result corresponding with each evaluation index;
Each result is judged whether in preset range, determine according to judged result corresponding with each evaluation index Consistency detection result, when all evaluation indexes consistency detection result all for it is consistent when, determine the target to be compared It is consistent.
It is described to be configured to according to relative with each evaluation index in the first possible implementation of the third aspect The data processing algorithm pair answered first data set corresponding with each evaluation index and the second data set are processed, obtain with The processor of the corresponding result of each evaluation index is configured to:
When the data in first data set and the second data set are the data of switching value property, calculate and each The relative deviation of the first corresponding data set of evaluation index and the second data set, obtains the phase corresponding with each evaluation index To deviation.
It is described to be configured to according to relative with each evaluation index in second possible implementation of the third aspect The data processing algorithm pair answered first data set corresponding with each evaluation index and the second data set are processed, obtain with The processor of the corresponding result of each evaluation index is configured to:
When the data in first data set and the second data set are the data counted with represented as histograms, The rectangular histogram correlation coefficient of calculating first data set corresponding with each evaluation index and the second data set, acquisition is commented with each Estimate the corresponding rectangular histogram correlation coefficient value of index.
It is described to be configured to according to relative with each evaluation index in the third possible implementation of the third aspect The data processing algorithm pair answered first data set corresponding with each evaluation index and the second data set are processed, obtain with The processor of the corresponding result of each evaluation index is configured to:
When the data in first data set and the second data set are ordered sequence, calculate and each evaluation index Corresponding the first data set and the correlation coefficient of the second data set, obtain the correlation coefficient corresponding with each evaluation index Value.
A kind of data consistency detection provided in an embodiment of the present invention, it is corresponding with target to be compared by obtaining Data consistency compares semantic base, and it is corresponding to obtain each evaluation index compared with the data consistency in semantic base First data set to be compared and the second data set, according to the data processing algorithm pair corresponding with each evaluation index and each The first corresponding data set of evaluation index and the second data set are processed, and obtain the process corresponding with each evaluation index As a result, then each result is compared with corresponding predetermined threshold value, determines according to comparative result and refer to each assessment The corresponding consistency detection result of mark;
Due to evaluation index be with explicit semantic meaning, therefore, the data in the first data set and the second data set are all Data with explicit semantic meaning, and, when comparison of coherence is carried out, first to the first data set and the second data set at Reason, by judge result whether in preset range it is whether consistent to determine data.That is, the embodiment of the present application is carried For data consistency detection, data consistency detection is carried out based on semanteme, it is not necessary to which staff has association area Business specialty background can carry out data consistency detection, and then determine compatibility of goals testing result to be compared, improve Detection efficiency.
Description of the drawings
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing The accompanying drawing to be used needed for having technology description is briefly described, it should be apparent that, drawings in the following description are only this Some embodiments of invention, for those of ordinary skill in the art, on the premise of not paying creative work, can be with Other accompanying drawings are obtained according to these accompanying drawings.
Fig. 1 is a kind of flow chart of data consistency detection that the embodiment of the present application is provided;
Fig. 2 is the structural representation of the data consistency detecting system that the embodiment of the present application is provided;
Fig. 3 is the structural representation of another kind of data consistency detecting system that the embodiment of the present application is provided;
The structural representation of another data consistency detecting system that Fig. 4 is provided for the embodiment of the present application;
The structural representation of another data consistency detecting system that Fig. 5 is provided for the embodiment of the present application;
The structural representation of another data consistency detecting system that Fig. 6 is provided for the embodiment of the present application.
Term " first ", " second ", " the 3rd " " 4th " in specification and claims and above-mentioned accompanying drawing etc.(If Exist)It is the part for distinguishing similar, without being used to describe specific order or precedence.It should be appreciated that so using Data can exchange in the appropriate case, so that embodiments herein described herein can be with except illustrating here Order in addition is implemented.
Specific embodiment
In order that those skilled in the art can further appreciate that the present invention feature and technology contents, refer to below in connection with Detailed description of the invention and accompanying drawing, accompanying drawing only provide with reference to and explanation, not for limiting the present invention.
Refer to Fig. 1, a kind of flow chart of data consistency detection that Fig. 1 is provided for the embodiment of the present application, including:
Step S101:Obtain the data consistency corresponding with target to be compared and compare semantic base, the data consistency Relatively semantic base includes at least one evaluation index corresponding with the target to be compared;
The target to be compared is referred to and which data to carry out comparison of coherence to, and what concrete target to be compared is according to real Border situation determines.
For example, in a communications system, can be using " network coverage ability " as target to be compared, i.e., to network coverage ability Concordance be compared.
And network coverage ability is assessed typically from following index:Service coverage probability, cell capacity, access delay, gather around Plug probability etc., that is to say, that when target to be compared is network coverage ability, the data consistency compares commenting in semantic base Estimating index can at least include:" service coverage probability ", " cell capacity ", " access delay ", " congestion probability " this four.
And if by " the signal covering measures of speech business " as target to be compared, then, due to the letter of speech business Number covering measures typically pass through CDF(Cumulative Distribution Function, add up branch's function)Curve determination, Therefore, when target to be compared is the signal covering measures of speech business, the data consistency compares the assessment in semantic base Index then can for " CDF curves " this.
In the embodiment of the present application, the target to be compared and the evaluation index corresponding with the target to be compared can be with Perform in standard or empirical data from industry agreement or industry and extract.In the communications field, the target to be compared and with treat The corresponding evaluation index of comparison object can be performed in standard or empirical data from communication protocol or the communications industry and extracted.Example Such as, in the field of communications, " the signal covering measures of speech business are 85% to the coverage rate index request of signal, estimated bias thresholding No more than 3% ", based on this, can be using " the signal covering measures of speech business " as target to be compared, and speech business Signal covering measures are typically estimated by CDF curves, therefore, this is to be compared with " the signal covering measures of speech business " The corresponding evaluation index of target is just " CDF curves ".
Specifically, it is determined that after target to be compared, the evaluation index corresponding with the target to be compared can be by work It is manually entered as personnel;I.e. staff is relative with the target to be compared according to searching in industry agreement or industry execution standard The evaluation index answered, and input system.
The evaluation index corresponding with the target to be compared can also be obtained from the data base for prestoring, the number According to several targets to be compared and the evaluation index that is associated with each target to be compared of being stored with storehouse, it is determined that waiting to compare After compared with target, it is possible to obtain the corresponding evaluation index of the target to be compared with determined by automatically from data base;
Step S102:Each evaluation index that acquisition is compared in semantic base with the data consistency is corresponding, waits to compare Compared with the first data set and the second data set;
It is determined that after evaluation index, can compare from data consistency and extract in data base corresponding with each evaluation index Data to be compared, i.e. the first data set and the second data set, that is to say, that the first data set and the second data set are from difference File in extract for the occurrence of same evaluation index.
Illustrate, in a communications system, in order to realize certain function, often provide at least two schemes, Cong Zhongxuan Go out an optimal case, and optimal case typically by emulating to optional program, compare collected in simulation process A little singal reporting codes are selected with the presence or absence of larger error.Hypothesis will select an optimal case from option A and option b, Option A and option b can respectively be emulated by emulator, the emulation daily record for carrying out emulating acquisition to option A is LOGA, The emulation daily record for carrying out emulating acquisition to option b is LOGB.In the embodiment of the present application, for data consistency compares in data base Certain evaluation index, it is assumed that for " cutting off rate ", then, the first data set be from emulation daily record LOGA in extract cutting off rate Value, and the second data set is the cutting off rate value extracted from emulation daily record LOGB.
Step S103:It is corresponding with each evaluation index according to the data processing algorithm pair corresponding with each evaluation index The first data set and the second data set processed, obtain the result corresponding with each evaluation index;
In the embodiment of the present application, carrying out process to the first data set and the second data set includes:To the first data set and Two data sets are processed to obtain the deviation size between the first data set and the second data set, will the first data set and Deviation size between two data sets adjudicates index as the concordance of evaluation index.
Because the form of the data corresponding to different evaluation indexes is different, e.g., some evaluation indexes are with switching value Property, i.e., the evaluation index only has two states, e.g., for handover success rate this evaluation index, or switching into Work(, otherwise failure, only both states, therefore, the handover success rate as data with switching value property;And some assessments Index is ordered sequence, and such as data service throughput this evaluation index, reflection is that user transports along specified path When dynamic, user diverse geographic location data transmission capabilities Changing Pattern, therefore, for data service throughput, can adopt The data of collection [geographical position, user throughput] are to as data set corresponding with data service throughput;And some assessments refer to Mark needs to carry out data statisticss with represented as histograms, for example, for this evaluation index of user's access time delay, because time delay is one The mixed and disorderly data of heap, its rectangular histogram is the statistical distribution result of data.Therefore, to different data format in the embodiment of the present application Evaluation index, is processed the first data set and the second data set using different data processing algorithms.
Step S104:Each result is judged whether in default scope, determine according to comparative result and commented with each Estimate the corresponding consistency detection result of index, when the consistency detection result of all evaluation indexes is all consistent, determine institute It is consistent to state target to be compared.
Judge whether each result may refer in default scope, for corresponding with some evaluation index Result, judge the result whether in the preset range corresponding with the evaluation index, that is to say, that for Each evaluation index, all to there is a preset range;
For the result of each evaluation index, determine according to comparative result corresponding with each evaluation index Consistency detection result can include:
The result of evaluation index is judged whether in the preset range corresponding with the evaluation index, if, Illustrate that first data set corresponding with the evaluation index and the second data set are consistent, otherwise explanation and the evaluation index phase Corresponding first data set and the second data set are inconsistent.
For the result of each evaluation index, determine according to comparative result corresponding with each evaluation index Consistency detection result can also include:
The result of evaluation index is judged whether in the preset range corresponding with the evaluation index, if, Illustrate that first data set corresponding with the evaluation index and the second data set are inconsistent, otherwise explanation and the evaluation index Corresponding the first data set and the second data set is consistent, and certainly, corresponding with evaluation index in this embodiment is pre- If the preset range corresponding from same evaluation index in scope and previous embodiment is different.
Specifically when implementing, the preset range can rule of thumb set, it is also possible to be set according to user's request, here It is not specifically limited.
A kind of data consistency comparative approach that the embodiment of the present application is provided, its main thought is, based on domain knowledge, to return Receiving out has the evaluation index and data processing algorithm of explicit semantic meaning, for the degree of consistency of two groups of data of interpretation.Specially:It is logical Cross the acquisition data consistency corresponding with target to be compared and compare semantic base, and acquisition compares semantic with the data consistency Corresponding the first data set to be compared of each evaluation index in storehouse and the second data set, according to and each evaluation index phase Corresponding data processing algorithm pair first data set corresponding with each evaluation index and the second data set are processed, and are obtained The result corresponding with each evaluation index, is then compared each result with corresponding predetermined threshold value, according to Determine the consistency detection result corresponding with each evaluation index according to comparative result, and then determine compatibility of goals to be compared Testing result;
Due to evaluation index be with explicit semantic meaning, therefore, the data in the first data set and the second data set are all Data with explicit semantic meaning, and, when comparison of coherence is carried out, first to the first data set and the second data set at Reason, by judge result whether in preset range it is whether consistent to determine data.That is, the embodiment of the present application is carried For data consistency detection, data consistency detection is carried out based on semanteme, and by building semantic base auto-associating symbol Close the evaluation index that emulation is intended to, it is not necessary to which it is consistent that staff carries out data by having the business specialty background of association area Property detection, improve detection efficiency, at the same reduce data consistency detection difficulty.
In addition, the data consistency comparative approach that the embodiment of the present application is provided, base is relative to traditional " binary character one It is a kind of more loose comparison constraint rule for the comparison rule of cause "(Preset range is set), obtain between data set Deviation size, according to the deviation size between data consistency detection is carried out, consistent with the data analysiss purpose of reality.
And, feature is carried out to data set by data processing algorithm(That is deviation size)Extract, the feature extracted is anti- The statistical property of data in a period of time is reflected, has met system(And the data source of data set)Performance estimating method.
Further, the data consistency detection that the application is provided, is based on the data consistency for building semantic base Detection method, can allow emulation personnel and modeling personnel to work independently, and emulation personnel only need to the clear and definite Data Comparison of oneself to be needed Ask, the related evaluation index of system energy auto-associating, the concrete meaning without removing to understand each evaluation index reduces number According to analysis threshold and lift analysis efficiency;Meanwhile, data analyst can directly obtain the conclusion of oneself problem of interest, from Dynamic other uncorrelated evaluation indexes that shield are to analyzing the puzzlement of personnel.
In above-described embodiment, when the data in first data set and the second data set are the data of switching value property When, the foundation data processing algorithm pair corresponding with each evaluation index first data corresponding with each evaluation index Collection and the second data set are processed, and obtaining the result corresponding with each evaluation index can include:
The relative deviation of calculating first data set corresponding with each evaluation index and the second data set, obtains and each The corresponding relative standard deviation values of evaluation index.
That is, when the data in first data set and the second data set are the data of switching value property, Such as in the communications field, it is successfully accessed rate, cutting off rate, handover success rate etc. and belongs to the singal reporting code with switching value property, institute Using data processing algorithm be relative deviation computing, specifically can be according to formula(1)Calculate the first data set and the second data set Relative deviation:
P=|PB-PA|/PB(1)
Wherein, P is relative standard deviation values;PAFor the data in the first data set;PBFor the data in the second data set.And by Base-line data is usually present in comparing, with base-line data as denominator, i.e. PBFor base-line data, and concrete number to be compared Which data can be specified as base-line data by user according in.
For example, correspondence is successfully accessed rate, and P is the relative standard deviation values of the rate that is successfully accessed;PAFor the rate that is successfully accessed of option A, PB Rate is successfully accessed for option b.
In above-described embodiment, carried out with represented as histograms when the data in first data set and the second data set are During the data of statistics, the foundation data processing algorithm pair corresponding with each evaluation index is corresponding with each evaluation index The first data set and the second data set processed, obtaining the result corresponding with each evaluation index includes:
The rectangular histogram correlation coefficient of calculating first data set corresponding with each evaluation index and the second data set, obtains The rectangular histogram correlation coefficient value corresponding with each evaluation index.
That is, being counted with represented as histograms when the data in first data set and the second data set are Data when, the data processing algorithm applied be histogrammic diversity size, specifically, can by the first data set institute it is right Histogrammic absolute deviation corresponding to the rectangular histogram answered and the second data set carries out adding up as the first data set and the second number According to the deviation of collection, specifically can be according to formula(2)Calculate corresponding to the rectangular histogram and the second data set corresponding to the first data set Histogrammic absolute deviation cumulative and,
Wherein, Diff is histogrammic definitely inclined corresponding to the rectangular histogram and the second data set corresponding to the first data set Poor is cumulative with N represents the sample number counted in rectangular histogram;H1I () is represented i-th in the corresponding rectangular histogram of the first data set The property value of sample;H2I () represents the property value of i-th sample in the corresponding rectangular histogram of the second data set;
Wherein, the corresponding property value of i-th sample is in first data set in the corresponding rectangular histogram of the first data set I-th element value;The corresponding property value correspondence of i-th sample second data in the corresponding rectangular histogram of second data set I-th element value concentrated.
It is described when the data in first data set and the second data set are ordered sequence in above-described embodiment According to the data processing algorithm pair corresponding with each evaluation index first data set corresponding with each evaluation index and the Two data sets are processed, and obtaining the result corresponding with each evaluation index includes:
The correlation coefficient of calculating first data set corresponding with each evaluation index and the second data set, obtains and each The corresponding correlation coefficient value of evaluation index.
That is, when the data in first data set and the second data set are ordered sequence, being applied Data processing algorithm can be the correlation coefficient for calculating ordered sequence, will the first data set and the second data set correlation coefficient As the first data set and the deviation of the second data set, specifically, formula can be passed through(3)To formula(6)Calculate the first data Correlation coefficient between collection and the second data set:
Wherein, C12For the first data set and the covariance of the second data set;N is represented in the first data set and the second data set The number of element be all N;SiThe value of i-th element in the first data set is represented, S represents all elements in the first data set Average;DiThe value of i-th element in the second data set is represented, D represents the average of all elements in the second data set;C1For first The standard deviation of data set;C2For the standard deviation of the second data set;R is the correlation coefficient of the first data set and the second data set;
Below the embodiment of the present application is illustrated by taking the comparison of coherence of emulation data and measured data as an example:
In wireless network planning, it will usually based on CQT(Call Quality Test, call quality test calls)Number According to correcting to phantom, one of them important link is exactly to compare the concordance of CQT data and emulation data, it is assumed that treated Comparison object is the concordance of specific region existing network user's communication quality and simulation result, that is to say, that target to be compared is to use Family speech quality.Flow process is compared based on the data consistency of this programme as follows:
Step1:Obtain the data consistency corresponding with target to be compared and compare semantic base, the data consistency compares Semantic base includes at least one evaluation index corresponding with the target to be compared;Specifically:
It is determined that with target to be compared(That is user's communication quality)Data consistency compare semantic base, user's communication quality one As from following standard assessing:User is successfully accessed rate, user's access delay, speech business MOS score values, data service and gulps down The amount of telling, cutting off rate, handover success rate, hence, it can be determined that the data consistency compare evaluation index in semantic base can be with Including:User is successfully accessed rate, user's access delay, speech business MOS(Mean Opinion Score)Score value, data service Handling capacity, cutting off rate and handover success rate.
Step2:Each evaluation index that acquisition is compared in semantic base with the data consistency is corresponding, to be compared First data set and the second data set;Specifically:
The emulation of the evaluation index determined in extracting data step Step1 for obtaining with actual measurement in emulation data respectively Data and measured data;
Step3:It is corresponding with each evaluation index according to the data processing algorithm pair corresponding with each evaluation index First data set and the second data set are processed, and obtain the result corresponding with each evaluation index;Specifically:
For speech business MOS score values, data service throughput this two evaluation indexes, reaction is user along specified circuit When footpath is moved, in the Changing Pattern of the data transmission capabilities of diverse geographic location, its collection is that [geographical position, MOS divides to user Value] and [geographical position, user throughput] data pair, that is to say, that for speech business MOS score values, data traffic throughputs This two evaluation indexes are measured, what it was gathered is the data of specific location, therefore, speech business MOS score values and data service are gulped down Data corresponding to the amount of telling this two evaluation indexes are ordered into sequence, therefore, for speech business MOS score values, data service are gulped down The amount of telling this two evaluation indexes, the data processing algorithm applied can be the correlation coefficient for calculating ordered sequence, specifically can be with According to formula(3)To formula(6)Calculated.
For the rate that is successfully accessed, cutting off rate, this evaluation index with switching value property of handover success rate, can be direct The deviation of emulation data and measured data is embodied using the relative standard deviation values of initial data, specifically, can be according to formula(1) Calculating is successfully accessed the relative of the emulation data of rate and the relative deviation of measured data, the emulation data of cutting off rate and measured data Deviation, the emulation data of handover success rate and the relative deviation of measured data.
For user's access delay, because time delay is the mixed and disorderly data of a pile, its rectangular histogram is the statistical distribution knot of data Really, the data processing algorithm applied can be the correlation coefficient for calculating ordered sequence, specifically, can be according to formula(2)Meter Calculate the emulation data of user's access delay this evaluation index and the correlation coefficient of measured data.
Step4:Each result is judged whether in preset range, determine and each evaluation index according to judged result Corresponding consistency detection result.Specifically:
For different evaluation indexes, corresponding preset range is set, in this example, phase is provided with to different evaluation indexes The thresholding answered, specifically:
Speech business MOS score values, data service throughput, correlation coefficient is not less than 0.9.That is, speech business MOS When the emulation data of score value and the correlation coefficient of measured data are more than or equal to 0.9, the emulation data of speech business MOS score values and Measured data is consistent, and otherwise, the emulation data and measured data of speech business MOS score values are inconsistent;In the same manner, data When the emulation data of business throughput and the correlation coefficient of measured data are more than or equal to 0.9, the emulation of data service throughput Data are consistent with measured data, and otherwise, the emulation data of data service throughput and measured data are inconsistent;
Rate, cutting off rate, handover success rate are successfully accessed, relative standard deviation values are less than 1%.That is, being successfully accessed rate When the relative standard deviation values of emulation data and measured data are less than or equal to 1%, being successfully accessed the emulation data and measured data of rate is Consistent, otherwise, the emulation data and measured data for being successfully accessed rate are inconsistent;In the same manner, the emulation data and reality of cutting off rate When the relative standard deviation values for surveying data are less than or equal to 1%, emulation data and the measured data of cutting off rate be it is consistent, otherwise, call drop The emulation data of rate and measured data are inconsistent;The emulation data of handover success rate and the relative standard deviation values of measured data are less than Or during equal to 1%, emulation data and the measured data of handover success rate be it is consistent, otherwise, the emulation data of handover success rate and Measured data is inconsistent;
User's access delay, histogrammic correlation coefficient is not less than 0.9.That is, the emulation number of user's access delay When being more than or equal to 0.9 according to the rectangular histogram correlation coefficient with measured data, the emulation data of user's access delay and measured data It is consistent, otherwise, the emulation data of user's access delay and measured data are inconsistent.
When all evaluation indexes(I.e. user is successfully accessed rate, user's access delay, speech business MOS score values, data service Handling capacity, cutting off rate and handover success rate)Consistency detection result all for it is consistent when, determine the user's communication qualitative data It is consistent.
Corresponding with embodiment of the method, the embodiment of the present application also provides a kind of data consistency detecting system, the data The structural representation of consistency detection system is as shown in Fig. 2 can include:
First acquisition module 201, the second acquisition module 202, data processing module 203 and judge module 204;Wherein,
First acquisition module 201 is used for the acquisition data consistency corresponding with target to be compared and compares semantic base, described Data consistency compares semantic base and includes at least one evaluation index corresponding with the target to be compared;
Second acquisition module 202 is relative for each evaluation index that acquisition is compared in semantic base with the data consistency The first data set to be compared answered and the second data set;
Data processing module 203 is used to be assessed with each according to the data processing algorithm pair corresponding with each evaluation index The first corresponding data set of index and second data set are processed, and obtain the process corresponding with each evaluation index As a result;
Judge module 204 is used to judge each result whether in preset range, according to judged result determine with it is each The corresponding consistency detection result of individual evaluation index, when the consistency detection result of all evaluation indexes is all consistent, really The fixed target to be compared is consistent.
A kind of data consistency comparison system that the embodiment of the present application is provided, it is corresponding with target to be compared by obtaining Data consistency compares semantic base, and it is corresponding to obtain each evaluation index compared with the data consistency in semantic base First data set to be compared and the second data set, according to the data processing algorithm pair corresponding with each evaluation index and each The first corresponding data set of evaluation index and the second data set are processed, and obtain the process corresponding with each evaluation index As a result, then each result is compared with corresponding predetermined threshold value, determines according to comparative result and refer to each assessment The corresponding consistency detection result of mark, when the consistency detection result of all evaluation indexes is all consistent, it is determined that described treat Comparison object is consistent;
Due to evaluation index be with explicit semantic meaning, therefore, the data in the first data set and the second data set are all Data with explicit semantic meaning, and, when comparison of coherence is carried out, first to the first data set and the second data set at Reason, by judge result whether in preset range it is whether consistent to determine data.That is, the embodiment of the present application is carried For data consistency detection, data consistency detection is carried out based on semanteme, it is not necessary to which staff has association area Business specialty background can carry out data consistency detection, and then determine compatibility of goals testing result to be compared, improve Detection efficiency.
On the basis of embodiment illustrated in fig. 2, the structure of another kind of data consistency detecting system that the application is provided is shown It is intended to as shown in figure 3, the data processing module 203 can include:
First processing units 301, for being switching value when the data in first data set and the second data set During the data of matter, calculate the relative deviation of first data set corresponding with each evaluation index and the second data set, obtain and The corresponding relative standard deviation values of each evaluation index.
On the basis of embodiment illustrated in fig. 2, the structure of another data consistency detecting system that the application is provided is shown It is intended to as shown in figure 4, the data processing module 203 can include:
Second processing unit 401, for being with rectangular histogram when the data in first data set and the second data set During the data that form is counted, the rectangular histogram of calculating first data set corresponding with each evaluation index and the second data set Correlation coefficient, obtains the rectangular histogram correlation coefficient value corresponding with each evaluation index.
On the basis of embodiment illustrated in fig. 2, the structure of another data consistency detecting system that the application is provided is shown It is intended to as shown in figure 5, the data processing module 203 can include:
3rd processing unit 501, for being ordered sequence when the data in first data set and the second data set When, the correlation coefficient of calculating first data set corresponding with each evaluation index and the second data set is obtained and each assessment The corresponding correlation coefficient value of index.
In the embodiment of the present application, the first processing units 301, the processing unit 501 of second processing unit 401 and the 3rd Can be integrated in simultaneously in the data processing module 203, or, the first processing units 301, second processing unit 401 Realized by same functional module with the function of the 3rd processing unit 501, be not specifically limited here.
The structural representation of another data consistency detecting system that the embodiment of the present application is provided is as shown in fig. 6, can be with Including:
At least one processor 601, and the memorizer 602 coupled with least one processor;Wherein, it is described extremely A few processor 601 is configured to:
Obtain the data consistency corresponding with target to be compared and compare semantic base, the data consistency compares semantic base Include at least one evaluation index corresponding with the target to be compared;
Obtain corresponding the first number to be compared of each evaluation index compared with the data consistency in semantic base According to collection and the second data set;
According to the data processing algorithm pair corresponding with each evaluation index first number corresponding with each evaluation index Processed according to collection and the second data set, obtained the result corresponding with each evaluation index;
Each result is judged whether in preset range, determine according to judged result corresponding with each evaluation index Consistency detection result, when all evaluation indexes consistency detection result all for it is consistent when, determine the target to be compared It is consistent.
In above-described embodiment, it is described be configured to according to the data processing algorithm pair corresponding with each evaluation index with it is each The first corresponding data set of individual evaluation index and the second data set are processed, and obtain the place corresponding with each evaluation index The processor of reason result is configured to:
When the data in first data set and the second data set are the data of switching value property, calculate and each The relative deviation of the first corresponding data set of evaluation index and the second data set, obtains the phase corresponding with each evaluation index To deviation.
In above-described embodiment, it is described be configured to according to the data processing algorithm pair corresponding with each evaluation index with it is each The first corresponding data set of individual evaluation index and the second data set are processed, and obtain the place corresponding with each evaluation index The processor of reason result is configured to:
When the data in first data set and the second data set are the data counted with represented as histograms, The rectangular histogram correlation coefficient of calculating first data set corresponding with each evaluation index and the second data set, acquisition is commented with each Estimate the corresponding rectangular histogram correlation coefficient value of index.
In above-described embodiment, it is described be configured to according to the data processing algorithm pair corresponding with each evaluation index with it is each The first corresponding data set of individual evaluation index and the second data set are processed, and obtain the place corresponding with each evaluation index The processor of reason result is configured to:
When the data in first data set and the second data set are ordered sequence, calculate and each evaluation index Corresponding the first data set and the correlation coefficient of the second data set, obtain the correlation coefficient corresponding with each evaluation index Value.
Specifically, the method that the embodiments of the present invention are disclosed can apply in processor 601, in other words by processor 601 realize.A kind of possibly IC chip of processor 601, the disposal ability with signal.It is above-mentioned during realization Each step of method can be completed by the instruction of the integrated logic circuit of the hardware in processor 601 or software form.With In the method that the embodiment of the present invention is disclosed is performed, above-mentioned processor can be general processor, digital signal processor (DSP), special IC(ASIC), ready-made programmable gate array(FPGA)Or other PLDs, discrete gate Or transistor logic, discrete hardware components.Can realize or perform disclosed each method in the embodiment of the present invention, Step and logic diagram.General processor can be microprocessor or the processor can also be any conventional processor, Decoder etc..The step of method with reference to disclosed in the embodiment of the present invention, can be embodied directly in hardware processor and perform and complete, Or completed with the hardware in processor and software module combination execution.Software module may be located at random access memory ram, flash memory Flash Memory, read only memory ROM, programmable read only memory or electrically erasable programmable memory, depositor etc. In the ripe storage medium in this area.The storage medium is located at memorizer 602, and processor reads the information in memorizer 602, knot The step of closing its hardware and complete said method.
The description of the real-time mode by more than, those skilled in the art can be understood that the present invention can be with Realized with hardware, or firmware is realized, or combinations thereof mode is realizing.When implemented in software, can be by above-mentioned functions It is stored in computer-readable medium or is transmitted as one or more instructions on computer-readable medium or code.Meter Calculation machine computer-readable recording medium includes computer-readable storage medium and communication media, and wherein communication media includes being easy to from a place to another Any medium of individual place transmission computer program.Storage medium can be any usable medium that computer can be accessed.With As a example by this but it is not limited to:Computer-readable medium can include RAM, ROM, EEPROM, CD-ROM or other optical disc storages, disk Storage medium or other magnetic storage apparatus or can be used in carrying or store the expectation with instruction or data structure form Program code and can be by any other medium of computer access.The foregoing description of the disclosed embodiments, makes ability Domain professional and technical personnel can realize or using the present invention.Professional technique people to various modifications of these embodiments to this area Will be apparent for member, generic principles defined herein can be in the feelings without departing from the spirit or scope of the present invention Under condition, realize in other embodiments.Therefore, invention described above embodiment, is not constituted to present invention protection model The restriction enclosed.Any modification made within the spirit and principles in the present invention, equivalent and improvement etc., should be included in this Within the claims of invention.

Claims (12)

1. a kind of data consistency detection, it is characterised in that data consistency detection is carried out based on semanteme, including:
It is determined that after target to be compared, obtaining the data one corresponding with target to be compared automatically from the data base for prestoring Cause property compares semantic base, and the data consistency compares semantic base and includes that at least one is corresponding with the target to be compared Evaluation index;Several targets to be compared that are stored with the data base and the assessment being associated with each target to be compared Index;
Obtain the first corresponding, to be compared data set of each evaluation index compared with the data consistency in semantic base With the second data set;
According to the data processing algorithm pair corresponding with each evaluation index first data set corresponding with each evaluation index Processed with the second data set, obtained the result corresponding with each evaluation index;
Each result is judged whether in preset range, according to judged result one corresponding with each evaluation index is determined Cause property testing result, when the consistency detection result of all evaluation indexes is all consistent, determines that the target to be compared is one Cause.
2. method according to claim 1, it is characterised in that the data in first data set and the second data set When being the data of switching value property, the foundation data processing algorithm pair corresponding with each evaluation index is assessed with each The first corresponding data set of index and the second data set are processed, and obtain the result corresponding with each evaluation index Including:
The relative deviation of calculating first data set corresponding with each evaluation index and the second data set, obtains and each assessment The corresponding relative standard deviation values of index.
3. method according to claim 1, it is characterised in that the data in first data set and the second data set When being the data counted with represented as histograms, the foundation data processing algorithm pair corresponding with each evaluation index First data set and second data set corresponding with each evaluation index is processed, and is obtained corresponding with each evaluation index Result include:
Calculate the rectangular histogram correlation coefficient of first data set corresponding with each evaluation index and the second data set, acquisition with it is each The corresponding rectangular histogram correlation coefficient value of individual evaluation index.
4. method according to claim 1, it is characterised in that the data in first data set and the second data set When being ordered sequence, the foundation data processing algorithm pair corresponding with each evaluation index is relative with each evaluation index The first data set and the second data set answered is processed, and obtaining the result corresponding with each evaluation index includes:
The correlation coefficient of calculating first data set corresponding with each evaluation index and the second data set, obtains and each assessment The corresponding correlation coefficient value of index.
5. a kind of data consistency detecting system, it is characterised in that data consistency detection is carried out based on semanteme, including:
First acquisition module, for it is determined that after target to be compared, obtain automatically from the data base for prestoring with it is to be compared The corresponding data consistency of target compares semantic base, the data consistency compare semantic base include at least one with it is described The corresponding evaluation index of target to be compared;Several targets to be compared that are stored with the data base and wait to compare with each Compared with the evaluation index that target is associated;
Second acquisition module, for obtaining, each evaluation index compared with the data consistency in semantic base is corresponding to be treated The first data set relatively and the second data set;
Data processing module, for the foundation data processing algorithm pair corresponding with each evaluation index and each evaluation index phase Corresponding first data set and second data set are processed, and obtain the result corresponding with each evaluation index;
Judge module, for judging each result whether in preset range, determines and each assessment according to judged result The corresponding consistency detection result of index, when the consistency detection result of all evaluation indexes is all consistent, it is determined that described Target to be compared is consistent.
6. system according to claim 5, it is characterised in that the data processing module includes:
First processing units, for being the data of switching value property when the data in first data set and the second data set When, the relative deviation of calculating first data set corresponding with each evaluation index and the second data set is obtained and each assessment The corresponding relative standard deviation values of index.
7. system according to claim 5, it is characterised in that the data processing module includes:
Second processing unit, for being carried out with represented as histograms when the data in first data set and the second data set are During the data of statistics, the rectangular histogram phase relation of first data set and second data set corresponding with each evaluation index is calculated Number, obtains the rectangular histogram correlation coefficient value corresponding with each evaluation index.
8. system according to claim 5, it is characterised in that the data processing module includes:
3rd processing unit, for when the data in first data set and the second data set are ordered sequence, calculating First data set corresponding with each evaluation index and the correlation coefficient of the second data set, obtain relative with each evaluation index The correlation coefficient value answered.
9. a kind of data consistency detecting system, it is characterised in that data consistency detection is carried out based on semanteme, including:
At least one processor, and the memorizer coupled with least one processor;
At least one processor is configured to:
It is determined that after target to be compared, obtaining the data one corresponding with target to be compared automatically from the data base for prestoring Cause property compares semantic base, and the data consistency compares semantic base and includes that at least one is corresponding with the target to be compared Evaluation index;Several targets to be compared that are stored with the data base and the assessment being associated with each target to be compared Index;
Obtain corresponding the first data set to be compared of each evaluation index compared with the data consistency in semantic base With the second data set;
According to the data processing algorithm pair corresponding with each evaluation index first data set corresponding with each evaluation index Processed with the second data set, obtained the result corresponding with each evaluation index;
Each result is judged whether in preset range, according to judged result one corresponding with each evaluation index is determined Cause property testing result, when the consistency detection result of all evaluation indexes is all consistent, determines that the target to be compared is one Cause.
10. system according to claim 9, it is characterised in that described to be configured to according to relative with each evaluation index The data processing algorithm pair answered first data set corresponding with each evaluation index and the second data set are processed, obtain with The processor of the corresponding result of each evaluation index is configured to:
When the data in first data set and the second data set are the data of switching value property, calculate and each assessment The relative deviation of the first corresponding data set of index and the second data set, obtains corresponding with each evaluation index relative inclined Difference.
11. systems according to claim 9, it is characterised in that described to be configured to according to relative with each evaluation index The data processing algorithm pair answered first data set corresponding with each evaluation index and the second data set are processed, obtain with The processor of the corresponding result of each evaluation index is configured to:
When the data in first data set and the second data set are the data counted with represented as histograms, calculate First data set corresponding with each evaluation index and the rectangular histogram correlation coefficient of the second data set, acquisition refers to each assessment The corresponding rectangular histogram correlation coefficient value of mark.
12. systems according to claim 9, it is characterised in that described to be configured to according to relative with each evaluation index The data processing algorithm pair answered first data set corresponding with each evaluation index and the second data set are processed, obtain with The processor of the corresponding result of each evaluation index is configured to:
When the data in first data set and the second data set are ordered sequence, calculate relative with each evaluation index The first data set answered and the correlation coefficient of the second data set, obtain the correlation coefficient value corresponding with each evaluation index.
CN201310471592.0A 2013-10-10 2013-10-10 Method and system for detecting data consistency Active CN103559330B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310471592.0A CN103559330B (en) 2013-10-10 2013-10-10 Method and system for detecting data consistency

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310471592.0A CN103559330B (en) 2013-10-10 2013-10-10 Method and system for detecting data consistency

Publications (2)

Publication Number Publication Date
CN103559330A CN103559330A (en) 2014-02-05
CN103559330B true CN103559330B (en) 2017-04-12

Family

ID=50013576

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310471592.0A Active CN103559330B (en) 2013-10-10 2013-10-10 Method and system for detecting data consistency

Country Status (1)

Country Link
CN (1) CN103559330B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105786911B (en) * 2014-12-25 2019-08-16 阿里巴巴集团控股有限公司 Using the method for calibration and device of data
CN107065838B (en) * 2017-06-05 2018-04-20 广东顺德西安交通大学研究院 Industrial control system attack detection method with model response analysis is perceived based on instruction
CN109101509A (en) * 2017-06-20 2018-12-28 中兴通讯股份有限公司 Data accuracy detection method, device, server and computer storage medium
CN107807972B (en) * 2017-10-19 2020-12-22 北京科技大学 Test data consistency detection method
CN108038132A (en) * 2017-11-17 2018-05-15 上海数据交易中心有限公司 Data Quality Analysis method and device, storage medium, terminal
WO2020037555A1 (en) * 2018-08-22 2020-02-27 深圳市汇顶科技股份有限公司 Method, device, apparatus, and system for evaluating microphone array consistency

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1777127A (en) * 2005-12-08 2006-05-24 中国移动通信集团公司 Protocol conformance measuring device and method
US8490244B1 (en) * 2012-04-16 2013-07-23 International Business Machines Corporation Methodologies for automatic 3-D device structure synthesis from circuit layouts for device simulation

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100367722C (en) * 2004-12-10 2008-02-06 中兴通讯股份有限公司 System for testing unification of communication protocol
CN102136963B (en) * 2010-10-27 2013-06-05 华为软件技术有限公司 Method and system for checking data consistency
CN102904779B (en) * 2012-10-26 2015-07-22 广东电网公司电力科学研究院 Communication protocol consistency detection method and system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1777127A (en) * 2005-12-08 2006-05-24 中国移动通信集团公司 Protocol conformance measuring device and method
US8490244B1 (en) * 2012-04-16 2013-07-23 International Business Machines Corporation Methodologies for automatic 3-D device structure synthesis from circuit layouts for device simulation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
仿真结果与试验数据的一致性研究;汤淑春 等;《系统仿真学报》;19980831;第10卷(第4期);第37-41页 *

Also Published As

Publication number Publication date
CN103559330A (en) 2014-02-05

Similar Documents

Publication Publication Date Title
CN103559330B (en) Method and system for detecting data consistency
CN103136471B (en) A kind of malice Android application program detection method and system
US10009064B2 (en) Method for differentiating power distribution areas and phases by using voltage characteristics
CN103297267B (en) A kind of methods of risk assessment of network behavior and system
CN106789844B (en) Malicious user identification method and device
CN108776616A (en) A kind of method, block chain node and the system of determining block chain node trusted status
CN107241696A (en) Multipath effect discriminating conduct and method for estimating distance based on channel condition information
CN103294594A (en) Test based static analysis misinformation eliminating method
WO2023168812A1 (en) Optimization method and apparatus for search system, and storage medium and computer device
CN113300986B (en) Unmanned aerial vehicle image transmission signal and hotspot signal identification method, medium and computer equipment
CN110516713A (en) A kind of target group's recognition methods, device and equipment
CN114579972A (en) Vulnerability identification method and system for embedded development program
CN111770053B (en) Malicious program detection method based on improved clustering and self-similarity
CN104301170B (en) The mobile terminal application friendly evaluation method of feature based classification
CN112532645A (en) Internet of things equipment operation data monitoring method and system and electronic equipment
CN106488554A (en) A kind of fingerprint database method for building up and system
CN109697575A (en) Data processing method and system based on evaluation result
CN110414543A (en) A kind of method of discrimination, equipment and the computer storage medium of telephone number danger level
CN110458707B (en) Behavior evaluation method and device based on classification model and terminal equipment
CN111385342B (en) Internet of things industry identification method and device, electronic equipment and storage medium
CN111835541B (en) Method, device, equipment and system for detecting aging of flow identification model
CN110858984B (en) Method and device for determining target user
CN102981952A (en) Procedure performance analysis method based on target machine
CN112632364A (en) News propagation speed evaluation method and system
CN112700270A (en) Grading data processing method, device, equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant