CN106991080A - A kind of quantile of data determines method and device - Google Patents
A kind of quantile of data determines method and device Download PDFInfo
- Publication number
- CN106991080A CN106991080A CN201710235860.7A CN201710235860A CN106991080A CN 106991080 A CN106991080 A CN 106991080A CN 201710235860 A CN201710235860 A CN 201710235860A CN 106991080 A CN106991080 A CN 106991080A
- Authority
- CN
- China
- Prior art keywords
- quantile
- sequence
- value
- fitting
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
Abstract
Method and device is determined the embodiment of the invention discloses a kind of quantile of data, methods described includes:The training data chosen from target data is fitted, the corresponding probability density function p (x) of training data is obtained;Using the probability density function p (x), the corresponding fitting distribution function F (x) of training data and its inverse function F are calculated‑1(x), wherein, the fitting distribution function F (x) be nonlinear function;For presetting each quantile that quantile sequence P is included, the inverse function F is utilized‑1(x) the corresponding fitting quantile of the quantile, is calculated, and by the fitting quantile storage into fitting quantile sequence B;Obtain the target data sequence D of quantile to be calculated;For the target data sequence D, the fitting distribution function F (x), the inverse function F are utilized‑1(x) and the fitting quantile sequence B, determine that each quantile distinguishes corresponding current quantile.Using the embodiment of the present invention, the error for the quantile determined is reduced.
Description
Technical field
The present invention relates to technical field of data processing, more particularly to a kind of quantile of data determines method and device.
Background technology
In the current big data epoch, it is more and more the need for data are flowed into the further working process of row, to obtain
More useful information.Quantile is carried out to data stream and determines it is the important processing mode of one of which, in computer and finance
Had a wide range of applications Deng field.Quantile refers to the numerical value that the probability distribution scope of a stochastic variable is divided into several equal portions
Point, conventional has median, quartile, percentile etc.., can intuitively observed number by determining a series of its quantile
According to the cumulative distribution function of stream, and then analyze its statistical property.
Existing quantile determines method, it may not be necessary to which the probability Distribution Model to data flow is analyzed, but logical
Cross and extract Partial Elements representative in data flow and calculated, so that it is determined that going out the quantile of data flow.Due to right
The probability distribution of data flow without analyse, can determine quantile using linear function, for the data flow of distribution uniform, profit
Determine that error is smaller during quantile with linear function.
But in actual applications, heavytailed distribution situation i.e. pockety is often presented in data flow.In this case,
It is the equally distributed linear function of default data stream due to what is utilized, the quantile error determined can be caused larger, it is impossible to is accurate
Really reflect the statistical nature of data flow.
The content of the invention
The purpose of the embodiment of the present invention is that providing a kind of quantile of data determines method and device, is determined with reducing
Quantile error.
To reach above-mentioned purpose, method is determined the embodiment of the invention discloses a kind of quantile of data, method includes:Number
According to training process and fractional-dimension calculus process;
The data training process, including:
The training data chosen from target data is fitted, the corresponding probability density function p of training data is obtained
(x);
Using the probability density function p (x), the corresponding fitting distribution function F (x) of training data and its inverse function are calculated
F-1(x), wherein, the fitting distribution function F (x) be nonlinear function;
For presetting each quantile that quantile sequence P is included, the inverse function F is utilized-1(x) this point of position, is calculated
The corresponding fitting quantile of point, and by the fitting quantile storage into fitting quantile sequence B;
The fractional-dimension calculus process, including:
Obtain the target data sequence D of quantile to be calculated;
For the target data sequence D, the fitting distribution function F (x), the inverse function F are utilized-1(x) and institute
Fitting quantile sequence B is stated, determines that each quantile distinguishes corresponding current quantile.
Preferably, described utilize the probability density function p (x), the corresponding fitting distribution function F of training data is calculated
And its inverse function F (x)-1(x) the step of, including:
For the probability density function p (x), the p (x) is calculated from the negative infinite definite integral to variable x, obtains described
It is fitted distribution function F (x);
Calculate the inverse function F of the fitting distribution function F (x)-1(x)。
Preferably, described each quantile included for default quantile sequence P, utilizes the inverse function F-1(x),
The step of quantile is corresponding to be fitted quantile is calculated, including:
For presetting each quantile that quantile sequence is included, the value of the quantile is substituted into the inverse function,
The corresponding contrafunctional value of the quantile is calculated, the value of the corresponding fitting quantile of the quantile is used as.
Preferably, the step of target data sequence D of the acquisition quantile to be calculated, including:
1 data element d in target data is received, and d is stored to the target data sequence D that size is N, Zhi Daosuo
Target data sequence D is stated to be filled;
It is described to be directed to the target data sequence D, utilize the fitting distribution function F (x), the inverse function F-1(x) with
And the fitting quantile sequence B, the step of each quantile distinguishes corresponding current quantile is determined, including:
S1:Judge whether the first design variables C currency is more than 0, wherein, the initial value of the C is 0;
If more than 0, performing S2;
If equal to 0, the second design variables T initialization value is set to 0, and by default corresponding point of quantile sequence P
All quantiles in digit sequence Q are reset, and perform S3, wherein, P={ p0, p1..., pm..., pM, Q={ q0, q1...,
qm..., qM, m ∈ [0, M];
S2:For each element a in quantile sequence Q and the union X of data sequence D, if a is less than q0, then really
Determine the corresponding CDF of element aQValue be 0, perform S3;
If a is more than or equal to qM, it is determined that the corresponding CDF of element aQValue be 1, perform S3;
If a is more than or equal to q0And less than qM, then quantile q is searched from quantile sequence Qm, so that a is more than or waited
In qmAnd less than qm+1, wherein, m is more than or equal to 0 and less than or equal to (M-1);
Q is obtained from fitting quantile sequence BmCorresponding bm, and qm+1Corresponding bm+1, wherein, B={ b0, b1...,
bm..., bM, m ∈ [0, M];
Determine the corresponding CDF of element aQValue CDFQ(a)=F (bm+(bm+1-bm)*(a-qm)/(qm+1-qm)), perform
S3;
S3:Order by the data in data sequence D according to data value from small to large, is ranked up, after being sorted
Data sequence D1, S4 is performed, wherein, D1={ d1, d2..., dn..., dN, n ∈ [0, N];
S4:For quantile sequence Q and data sequence D1Union X1In each element b, if b is less than or equal to
d1, it is determined that the corresponding ECDF of element bD -Value be 0, perform S5;
If b is more than dN, it is determined that the corresponding ECDF of element bD -Value be 1, perform S5;
If b is more than d1And less than or equal to dN, then from data sequence D1Middle lookup dn, so that b is more than dnAnd be less than or wait
In dn+1;
Determine the corresponding ECDF of element bD -Value ECDFD -(b)=n/ (n+M), performs S5;
S5:For quantile sequence Q and data sequence D1Union X1In each element b, if b be less than d1, then really
Determine the corresponding ECDF of element bD +Value be 0, perform S6;
If b is more than or equal to dN, it is determined that the corresponding ECDF of element bD +Value be 1, perform S6;
If b is more than or equal to d1And less than dN, then using binary chop, from data sequence D1Middle lookup dn, so that b
More than or equal to dnAnd less than dn+1;
Determine the corresponding ECDF of element bD +Value ECDFD +(b)=n/ (n+M), performs S6;
S6:For quantile sequence Q and data sequence D1Union X1In each element b, calculate the element b's
ACDF-Value ACDF-(b)=(T*CDFQ(b)+N*ECDFD -(b))/(T+N), and the element b ACDF+Value ACDF+
(b)=(T*CDFQ(b)+N*ECDFD +(b))/(T+N), performs S7;
S7:For presetting each quantile p in quantile sequence Pm, calculate the quantile pmCorresponding qm -And qm +
Value, perform S8, wherein, qm -=max { b ∣ ACDF-(b)≤pm, b ∈ X1, qm +=max { b ∣ ACDF+(b)≥pm, b ∈ X1};
S8:For presetting each quantile p in quantile sequence Pm, judge quantile pmCorresponding qm -And qm +'s
Whether value is identical;
If identical, by qm -Value be defined as the quantile pmCorresponding current quantile qmValue;
If it is not the same, then by (k*qm -+(1-k)*qm +), it is defined as the quantile pmCorresponding current quantile qm's
Value, wherein, k=F-(ACDF+(qm +)-F-(pm))/F-(ACDF+(qm +)-ACDF-(qm -))。
Preferably, after step s8, methods described also includes:
S9:The currency of the second design variables T is added into N, and the currency of the first design variables C is added 1, is held
Row S10;
S10:Judge Jia 1 after the first design variables C currency whether the value for being the 3rd design variables cycle
Integral multiple;
If not being the integral multiple of cycle value, return and perform the target data sequence D for obtaining quantile to be calculated
The step of;
If the integral multiple of cycle value, the fitting quantile sequence is updated, the execution acquisition is returned to be calculated
The step of target data sequence D of quantile, wherein, each quantile that the fitting quantile sequence B after renewal is included
It is worth for (T*bm+M*qm)/(T+M)。
To reach above-mentioned purpose, the embodiment of the present invention puies forward the quantile determining device for disclosing a kind of data, and device includes:
Data training module and fractional-dimension calculus module;
The data training module, for being fitted the training data chosen from target data, obtains training number
According to corresponding probability density function p (x);Using the probability density function p (x), the corresponding fitting distribution of training data is calculated
Function F (x) and its inverse function F-1(x), wherein, the fitting distribution function F (x) be nonlinear function;For presetting quantile
Each quantile that sequence P is included, utilizes the inverse function F-1(x) the corresponding fitting quantile of the quantile, is calculated, and will
The fitting quantile storage is into fitting quantile sequence B;
The fractional-dimension calculus module, the target data sequence D for obtaining quantile to be calculated;For the number of targets
According to sequence D, the fitting distribution function F (x), the inverse function F are utilized-1(x) and it is described fitting quantile sequence B, it is determined that
Each quantile distinguishes corresponding current quantile.
Preferably, the data training module includes:
Function Fitting submodule, for being fitted to the training data chosen from target data, obtains training data
Corresponding probability density function p (x);
First calculating sub module, for utilizing the probability density function p (x), calculates the corresponding fitting point of training data
Cloth function F (x) and its inverse function F-1(x), wherein, the fitting distribution function F (x) be nonlinear function;
Second calculating sub module, for for presetting each quantile that quantile sequence P is included, utilizing the inverse letter
Number F-1(x) the corresponding fitting quantile of the quantile, is calculated, and fitting quantile sequence B is arrived into the fitting quantile storage
In;
The fractional-dimension calculus module, including:
Target data sequence obtains submodule, the target data sequence D for obtaining quantile to be calculated;
Quantile determination sub-module, for for the target data sequence D, using the fitting distribution function F (x),
The inverse function F-1(x) and the fitting quantile sequence B, determine that each quantile distinguishes corresponding current quantile.
Preferably, first calculating sub module, specifically for:
For the probability density function p (x), the p (x) is calculated from the negative infinite definite integral to variable x, obtains described
It is fitted distribution function F (x);
Calculate the inverse function F of the fitting distribution function F (x)-1(x)。
Preferably, second calculating sub module, specifically for:
For presetting each quantile that quantile sequence is included, the value of the quantile is substituted into the inverse function,
The corresponding contrafunctional value of the quantile is calculated, the value of the corresponding fitting quantile of the quantile is used as.
Preferably, the target data sequence obtains submodule, specifically for:
1 data element d in target data is received, and d is stored to the target data sequence D that size is N, Zhi Daosuo
Target data sequence D is stated to be filled;
The quantile determination sub-module, specifically for:
S1:Judge whether the first design variables C currency is more than 0, wherein, the initial value of the C is 0;
If more than 0, performing S2;
If equal to 0, the second design variables T initialization value is set to 0, and by default corresponding point of quantile sequence P
All quantiles in digit sequence Q are reset, and perform S3, wherein, P={ p0, p1..., pm..., pM, Q={ q0, q1...,
qm..., qM, m ∈ [0, M];
S2:For each element a in quantile sequence Q and the union X of data sequence D, if a is less than q0, then really
Determine the corresponding CDF of element aQValue be 0, perform S3;
If a is more than or equal to qM, it is determined that the corresponding CDF of element aQValue be 1, perform S3;
If a is more than or equal to q0And less than qM, then quantile q is searched from quantile sequence Qm, so that a is more than or waited
In qmAnd less than qm+1, wherein, m is more than or equal to 0 and less than or equal to (M-1);
Q is obtained from fitting quantile sequence BmCorresponding bm, and qm+1Corresponding bm+1, wherein, B={ b0, b1...,
bm..., bM, m ∈ [0, M];
Determine the corresponding CDF of element aQValue CDFQ(a)=F (bm+(bm+1-bm)*(a-qm)/(qm+1-qm)), perform
S3;
S3:Order by the data in data sequence D according to data value from small to large, is ranked up, after being sorted
Data sequence D1, S4 is performed, wherein, D1={ d1, d2..., dn..., dN, n ∈ [0, N];
S4:For quantile sequence Q and data sequence D1Union X1In each element b, if b is less than or equal to
d1, it is determined that the corresponding ECDF of element bD -Value be 0, perform S5;
If b is more than dN, it is determined that the corresponding ECDF of element bD -Value be 1, perform S5;
If b is more than d1And less than or equal to dN, then from data sequence D1Middle lookup dn, so that b is more than dnAnd be less than or wait
In dn+1;
Determine the corresponding ECDF of element bD -Value ECDFD -(b)=n/ (n+M), performs S5;
S5:For quantile sequence Q and data sequence D1Union X1In each element b, if b be less than d1, then really
Determine the corresponding ECDF of element bD +Value be 0, perform S6;
If b is more than or equal to dN, it is determined that the corresponding ECDF of element bD +Value be 1, perform S6;
If b is more than or equal to d1And less than dN, then using binary chop, from data sequence D1Middle lookup dn, so that b
More than or equal to dnAnd less than dn+1;
Determine the corresponding ECDF of element bD +Value ECDFD +(b)=n/ (n+M), performs S6;
S6:For quantile sequence Q and data sequence D1Union X1In each element b, calculate the element b's
ACDF-Value ACDF-(b)=(T*CDFQ(b)+N*ECDFD -(b))/(T+N), and the element b ACDF+Value ACDF+
(b)=(T*CDFQ(b)+N*ECDFD +(b))/(T+N), performs S7;
S7:For presetting each quantile p in quantile sequence Pm, calculate the quantile pmCorresponding qm -And qm +
Value, perform S8, wherein, qm -=max { b ∣ ACDF-(b)≤pm, b ∈ X1, qm +=max { b ∣ ACDF+(b)≥pm, b ∈ X1};
S8:For presetting each quantile p in quantile sequence Pm, judge quantile pmCorresponding qm -And qm +'s
Whether value is identical;
If identical, by qm -Value be defined as the quantile pmCorresponding current quantile qmValue;
If it is not the same, then by (k*qm -+(1-k)*qm +), it is defined as the quantile pmCorresponding current quantile qm's
Value, wherein, k=F-(ACDF+(qm +)-F-(pm))/F-(ACDF+(qm +)-ACDF-(qm -))。
Preferably, described device also includes:
Plus computing module, for the currency of the second design variables T to be added into N, and by the first design variables C's
Currency adds 1;
Judge module, for judging whether the currency of the first design variables C after Jia 1 is the 3rd design variables
The integral multiple of cycle value;In the case where the judge module judged result is no, triggering target data sequence obtains submodule
Block;
Update module, in the case of being in the judge module judged result, updates the fitting quantile sequence
Row, and target data sequence acquisition submodule is triggered, wherein, each point of position that the fitting quantile sequence B after renewal is included
Several values is (T*bm+M*qm)/(T+M)。
As seen from the above technical solutions, method and dress are determined the embodiments of the invention provide a kind of quantile of data
Put, including:Data training process and fractional-dimension calculus process;The data training process, including:To being chosen from target data
Training data be fitted, obtain the corresponding probability density function p (x) of training data;Utilize the probability density function p
(x) the corresponding fitting distribution function F (x) of training data and its inverse function F, are calculated-1(x), wherein, the fitting distribution function F
(x) it is nonlinear function;For presetting each quantile that quantile sequence P is included, the inverse function F is utilized-1(x), count
The corresponding fitting quantile of the quantile is calculated, and by the fitting quantile storage into fitting quantile sequence B;Described point of position
Number estimation procedure, including:Obtain the target data sequence D of quantile to be calculated;For the target data sequence D, institute is utilized
State fitting distribution function F (x), the inverse function F-1(x) and the fitting quantile sequence B, each quantile point is determined
Not corresponding current quantile.
It can be seen that, because fitting distribution function F (x) is nonlinear function, thus it is distribution that heavytailed distribution, which is presented, in data flow
In the case of uneven, it is possible to use embody the fitting distribution function F (x) of data flow non-uniform Distribution, rather than default data
Equally distributed linear function is flowed, with reference to the inverse function F calculated-1(x) and it is described fitting quantile sequence B go determine point position
Number, so as to reduce the error for the quantile determined.
Certainly, any product or method for implementing the present invention it is not absolutely required to while reaching all the above excellent
Point.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing
There is the accompanying drawing used required in technology description to be briefly described, it should be apparent that, drawings in the following description are only this
Some embodiments of invention, for those of ordinary skill in the art, on the premise of not paying creative work, can be with
Other accompanying drawings are obtained according to these accompanying drawings.
Fig. 1 determines a kind of schematic flow sheet of method for the quantile of data provided in an embodiment of the present invention;
Fig. 2 be embodiment illustrated in fig. 1 in, a kind of step S105 idiographic flow schematic diagram;
Fig. 3 is a kind of structural representation of the quantile determining device of data provided in an embodiment of the present invention;
Fig. 4 is another structural representation of the quantile determining device of data provided in an embodiment of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete
Site preparation is described, it is clear that described embodiment is only a part of embodiment of the invention, rather than whole embodiments.It is based on
Embodiment in the present invention, it is every other that those of ordinary skill in the art are obtained under the premise of creative work is not made
Embodiment, belongs to the scope of protection of the invention.
Method, which is described in detail, to be determined to a kind of quantile of data provided in an embodiment of the present invention first below.
Referring to Fig. 1, Fig. 1 determines a kind of schematic flow sheet of method for the quantile of data provided in an embodiment of the present invention,
It can include:Data training process and fractional-dimension calculus process;
The data training process, may include steps of:
S101, is fitted to the training data chosen from target data, obtains the corresponding probability density of training data
Function p (x);
Specifically, target data can be target data stream, the high-speed data-flow for example constantly reached.
Wherein it is possible to choose a part of data as training data from target data stream, probability density is obtained by fitting
Function p (x).Data flow is different, then the probability density function fitted is different.For example, assuming that data flow Normal Distribution
On the premise of, it is respectively μ and σ to calculate and obtain the average and standard deviation of training data, then
Wherein it is possible to be realized using prior art:The average and standard deviation of training data are calculated, the embodiment of the present invention is herein
It is not repeated.
In actual applications, it can be realized using prior art:The training data chosen from target data is intended
Close, obtain the corresponding probability density function p (x) of training data.
S102, using the probability density function p (x), calculate the corresponding fitting distribution function F (x) of training data and its
Inverse function F-1(x), wherein, the fitting distribution function F (x) be nonlinear function;
Specifically, described utilize the probability density function p (x), the corresponding fitting distribution function F of training data is calculated
And its inverse function F (x)-1(x) the step of, it can include:For the probability density function p (x), the p (x) is calculated from negative nothing
The poor definite integral for arriving variable x, obtains the fitting distribution function F (x);Calculate the inverse function F of the fitting distribution function F (x)-1
(x)。
From the foregoing,It is public that the probability density function p (x) that fitting is obtained brings the integration into
Formula, you can calculating obtains fitting distribution function F (x), found a function to F (x) inverse and then obtains inverse function F-1(x)。
S103, for presetting each quantile that quantile sequence P is included, utilizes the inverse function F-1(x), calculating should
The corresponding fitting quantile of quantile, and by the fitting quantile storage into fitting quantile sequence B;
Specifically, described each quantile included for default quantile sequence P, utilizes the inverse function F-1(x),
The step of quantile is corresponding to be fitted quantile is calculated, can be included:For presetting each point that quantile sequence is included
Site, the inverse function is substituted into by the value of the quantile, is calculated the corresponding contrafunctional value of the quantile, is used as this point of position
The value of the corresponding fitting quantile of point.
For example, P={ p0, p1..., pm..., pM, p0、p1、…、pMIt is the quantile that quantile sequence P is included, wherein, m
∈ [0, M].Each quantile p that P is includedmBring inverse function F into respectively-1(x) F, is calculated-1(pm) value, be used as pmCorrespondence
Fitting quantile value, by value storage to fitting quantile sequence B={ b0, b1..., bm..., bMIn, i.e. bm=F-1
(pm), wherein, b0、b1、…、bMIt is to be fitted the fitting quantile that quantile sequence B is included.
The fractional-dimension calculus process, may include steps of:
S104, obtains the target data sequence D of quantile to be calculated;
Specifically, the step of target data sequence D of the acquisition quantile to be calculated, can include:Receive number of targets
1 data element d in, and d is stored to the target data sequence D that size is N, until the target data sequence D quilt
Fill up.
In actual applications, a data element d can be received from target data stream, it is N's to be stored to size
In data sequence D, then judge whether D is filled:
If D is not filled, continuation receives another data element e from target data stream, is then stored into D, then
Judge whether D is filled ... and so circulate, untill data sequence D is filled, next step S105 is performed afterwards;
If D is already filled up, next step S105 is directly performed.
S105, for the target data sequence D, utilizes the fitting distribution function F (x), the inverse function F-1(x) with
And the fitting quantile sequence B, determine that each quantile distinguishes corresponding current quantile.
Specifically, referring to Fig. 2, Fig. 2 be embodiment illustrated in fig. 1 in, a kind of step S105 idiographic flow schematic diagram.It is described
Step 105, for the target data sequence D, the fitting distribution function F (x), the inverse function F are utilized-1(x) and institute
Fitting quantile sequence B is stated, determines that each quantile distinguishes corresponding current quantile, may comprise steps of:
S105A:Judge whether the first design variables C currency is more than 0;Wherein, the initial value of the C is 0;If being more than
0, perform S105B;If equal to 0, performing S105C;
S105B:For each element a in quantile sequence Q and the union X of data sequence D, if a is less than q0, then
Determine the corresponding CDF of element aQValue be 0, perform S105D;
If a is more than or equal to qM, it is determined that the corresponding CDF of element aQValue be 1, perform S105D;
If a is more than or equal to q0And less than qM, then quantile q is searched from quantile sequence Qm, so that a is more than or waited
In qmAnd less than qm+1, wherein, m is more than or equal to 0 and less than or equal to (M-1);
Q is obtained from fitting quantile sequence BmCorresponding bm, and qm+1Corresponding bm+1, wherein, B={ b0, b1...,
bm..., bM, m ∈ [0, M];
Determine the corresponding CDF of element aQValue CDFQ(a)=F (bm+(bm+1-bm)*(a-qm)/(qm+1-qm)), perform
S105D;
S105C:Second design variables T initialization value is set to 0, and by the corresponding quantiles of default quantile sequence P
All quantiles in sequence Q are reset, and perform S105D, wherein, P={ p0, p1..., pm..., pM, Q={ q0, q1...,
qm..., qM, m ∈ [0, M];
S105D:Order by the data in data sequence D according to data value from small to large, is ranked up, and obtains after sequence
Data sequence D1, S105E is performed, wherein, D1={ d1, d2..., dn..., dN, n ∈ [0, N];
S105E:For quantile sequence Q and data sequence D1Union X1In each element b, if b is less than or waited
In d1, it is determined that the corresponding ECDF of element bD -Value be 0, perform S105F;
If b is more than dN, it is determined that the corresponding ECDF of element bD -Value be 1, perform S105F;
If b is more than d1And less than or equal to dN, then from data sequence D1Middle lookup dn, so that b is more than dnAnd be less than or wait
In dn+1;
Determine the corresponding ECDF of element bD -Value ECDFD -(b)=n/ (n+M), performs S105F;
S105F:For quantile sequence Q and data sequence D1Union X1In each element b, if b be less than d1,
Then determine the corresponding ECDF of element bD +Value be 0, perform S105G;
If b is more than or equal to dN, it is determined that the corresponding ECDF of element bD +Value be 1, perform S105G;
If b is more than or equal to d1And less than dN, then using binary chop, from data sequence D1Middle lookup dn, so that b
More than or equal to dnAnd less than dn+1;
Determine the corresponding ECDF of element bD +Value ECDFD +(b)=n/ (n+M), performs S105G;
S105G:For quantile sequence Q and data sequence D1Union X1In each element b, calculate the element b
ACDF-Value ACDF-(b)=(T*CDFQ(b)+N*ECDFD -(b))/(T+N), and the element b ACDF+Value ACDF+(b)=(T*CDFQ(b)+N*ECDFD +(b))/(T+N), performs S105H;
S105H:For presetting each quantile p in quantile sequence Pm, calculate the quantile pmCorresponding qm -
And qm +Value, perform S105I, wherein, qm -=max { b ∣ ACDF-(b)≤pm, b ∈ X1, qm +=max { b ∣ ACDF+(b)≥pm, b
∈X1};
S105I:For presetting each quantile p in quantile sequence Pm, judge quantile pmCorresponding qm -And qm +Value it is whether identical;
If identical, by qm -Value be defined as the quantile pmCorresponding current quantile qmValue;
If it is not the same, then by (k*qm -+(1-k)*qm +), it is defined as the quantile pmCorresponding current quantile qm's
Value, wherein, k=F-(ACDF+(qm +)-F-(pm))/F-(ACDF+(qm +)-ACDF-(qm -))。
It should be noted that when the currency for judging the C in step S105A is equal to 0, step S105B can be skipped, turned
And step S105C is performed, then perform S105D.When going to step S105G, ACDF is calculated-And ACDF (b)+(b) value
In formula, now CDFQ(b) value is also equal to 0.
Also, in step S105D, data sequence D is sorted from small to large, the data sequence D after sequence1=
{d1, d2..., dn..., dN, you can know d1Value it is minimum, d2Take second place, increase successively, until dNValue it is maximum.
In addition, in step S105F, can be realized using prior art:Using binary chop, from data sequence D1In look into
Look for dn, so that b is more than or equal to dnAnd less than dn+1。
Specifically, after step S105I, can also comprise the following steps:
The currency of the second design variables T is added into N, and the currency of the first design variables C is added 1;
Judge Jia 1 after the first design variables C currency whether the integer for the value for being the 3rd design variables cycle
Times;
If not being the integral multiple of cycle value, return and perform the target data sequence D for obtaining quantile to be calculated
The step of;
If the integral multiple of cycle value, the fitting quantile sequence is updated, the execution acquisition is returned to be calculated
The step of target data sequence D of quantile, wherein, each quantile that the fitting quantile sequence B after renewal is included
It is worth for (T*bm+M*qm)/(T+M)。
Wherein, in actual applications, design variables cycle value can be set to 50 or 100.
It can be seen that, because fitting distribution function F (x) is nonlinear function, thus it is distribution that heavytailed distribution, which is presented, in data flow
In the case of uneven, it is possible to use embody the fitting distribution function F (x) of data flow non-uniform Distribution, rather than default data
Equally distributed linear function is flowed, with reference to the inverse function F calculated-1(x) and it is described fitting quantile sequence B go determine point position
Number, so as to reduce the error for the quantile determined.
Referring to Fig. 3, Fig. 3 is a kind of structural representation of the quantile determining device of data provided in an embodiment of the present invention,
Corresponding with the flow shown in Fig. 1, the determining device can include:Data training module 301 and fractional-dimension calculus module 302;
The data training module 301,
For being fitted to the training data chosen from target data, the corresponding probability density letter of training data is obtained
Number p (x);Using the probability density function p (x), the corresponding fitting distribution function F (x) of training data and its inverse function are calculated
F-1(x), wherein, the fitting distribution function F (x) be nonlinear function;For presetting each point that quantile sequence P is included
Site, utilizes the inverse function F-1(x) the corresponding fitting quantile of the quantile, is calculated, and the fitting quantile is stored
Into fitting quantile sequence B;
The fractional-dimension calculus module 302, the target data sequence D for obtaining quantile to be calculated;For the mesh
Data sequence D is marked, the fitting distribution function F (x), the inverse function F is utilized-1(x) and it is described fitting quantile sequence B,
Determine that each quantile distinguishes corresponding current quantile.
Referring to Fig. 4, Fig. 4 is another structural representation of the quantile determining device of data provided in an embodiment of the present invention
Figure, the determining device can include:Data training module 400 and fractional-dimension calculus module 410;
The data training module 400, can include:
Function Fitting submodule 401, for being fitted the training data chosen from target data, obtains training number
According to corresponding probability density function p (x);
First calculating sub module 402, for utilizing the probability density function p (x), calculates the corresponding fitting of training data
Distribution function F (x) and its inverse function F-1(x), wherein, the fitting distribution function F (x) be nonlinear function;
Specifically, first calculating sub module 402, specifically can be used for:
For the probability density function p (x), the p (x) is calculated from the negative infinite definite integral to variable x, obtains described
It is fitted distribution function F (x);
Calculate the inverse function F of the fitting distribution function F (x)-1(x)。
Second calculating sub module 403, for for presetting each quantile for including of quantile sequence P, using described
Inverse function F-1(x) the corresponding fitting quantile of the quantile, is calculated, and fitting quantile sequence is arrived into the fitting quantile storage
Arrange in B;
Specifically, second calculating sub module 403, specifically can be used for:
For presetting each quantile that quantile sequence is included, the value of the quantile is substituted into the inverse function,
The corresponding contrafunctional value of the quantile is calculated, the value of the corresponding fitting quantile of the quantile is used as.
The fractional-dimension calculus module 410, can include:
Target data sequence obtains submodule 411, the target data sequence D for obtaining quantile to be calculated;
Quantile determination sub-module 412, for for the target data sequence D, utilizing the fitting distribution function F
(x), the inverse function F-1(x) and the fitting quantile sequence B, determine that each quantile distinguishes corresponding current point of position
Number.
Specifically, the target data sequence obtains submodule 411, specifically it can be used for:
1 data element d in target data is received, and d is stored to the target data sequence D that size is N, Zhi Daosuo
Target data sequence D is stated to be filled;
The quantile determination sub-module 412, specifically can be used for:
S1:Judge whether the first design variables C currency is more than 0, wherein, the initial value of the C is 0;
If more than 0, performing S2;
If equal to 0, the second design variables T initialization value is set to 0, and by default corresponding point of quantile sequence P
All quantiles in digit sequence Q are reset, and perform S3, wherein, P={ p0, p1..., pm..., pM, Q={ q0, q1...,
qm..., qM, m ∈ [0, M];
S2:For each element a in quantile sequence Q and the union X of data sequence D, if a is less than q0, then really
Determine the corresponding CDF of element aQValue be 0, perform S3;
If a is more than or equal to qM, it is determined that the corresponding CDF of element aQValue be 1, perform S3;
If a is more than or equal to q0And less than qM, then quantile q is searched from quantile sequence Qm, so that a is more than or waited
In qmAnd less than qm+1, wherein, m is more than or equal to 0 and less than or equal to (M-1);
Q is obtained from fitting quantile sequence BmCorresponding bm, and qm+1Corresponding bm+1, wherein, B={ b0, b1...,
bm..., bM, m ∈ [0, M];
Determine the corresponding CDF of element aQValue CDFQ(a)=F (bm+(bm+1-bm)*(a-qm)/(qm+1-qm)), perform
S3;
S3:Order by the data in data sequence D according to data value from small to large, is ranked up, after being sorted
Data sequence D1, S4 is performed, wherein, D1={ d1, d2..., dn..., dN, n ∈ [0, N];
S4:For quantile sequence Q and data sequence D1Union X1In each element b, if b is less than or equal to
d1, it is determined that the corresponding ECDF of element bD -Value be 0, perform S5;
If b is more than dN, it is determined that the corresponding ECDF of element bD -Value be 1, perform S5;
If b is more than d1And less than or equal to dN, then from data sequence D1Middle lookup dn, so that b is more than dnAnd be less than or wait
In dn+1;
Determine the corresponding ECDF of element bD -Value ECDFD -(b)=n/ (n+M), performs S5;
S5:For quantile sequence Q and data sequence D1Union X1In each element b, if b be less than d1, then really
Determine the corresponding ECDF of element bD +Value be 0, perform S6;
If b is more than or equal to dN, it is determined that the corresponding ECDF of element bD +Value be 1, perform S6;
If b is more than or equal to d1And less than dN, then using binary chop, from data sequence D1Middle lookup dn, so that b
More than or equal to dnAnd less than dn+1;
Determine the corresponding ECDF of element bD +Value ECDFD +(b)=n/ (n+M), performs S6;
S6:For quantile sequence Q and data sequence D1Union X1In each element b, calculate the element b's
ACDF-Value ACDF-(b)=(T*CDFQ(b)+N*ECDFD -(b))/(T+N), and the element b ACDF+Value ACDF+
(b)=(T*CDFQ(b)+N*ECDFD +(b))/(T+N), performs S7;
S7:For presetting each quantile p in quantile sequence Pm, calculate the quantile pmCorresponding qm -And qm +
Value, perform S8, wherein, qm -=max { b ∣ ACDF-(b)≤pm, b ∈ X1, qm +=max { b ∣ ACDF+(b)≥pm, b ∈ X1};
S8:For presetting each quantile p in quantile sequence Pm, judge quantile pmCorresponding qm -And qm +'s
Whether value is identical;
If identical, by qm -Value be defined as the quantile pmCorresponding current quantile qmValue;
If it is not the same, then by (k*qm -+(1-k)*qm +), it is defined as the quantile pmCorresponding current quantile qm's
Value, wherein, k=F-(ACDF+(qm +)-F-(pm))/F-(ACDF+(qm +)-ACDF-(qm -))。
Specifically, described device can also include:
Plus computing module, for the currency of the second design variables T to be added into N, and by the first design variables C's
Currency adds 1;
Judge module, for judging whether the currency of the first design variables C after Jia 1 is the 3rd design variables
The integral multiple of cycle value;In the case where the judge module judged result is no, triggering target data sequence obtains submodule
Block 304;
Update module, in the case of being in the judge module judged result, updates the fitting quantile sequence
Row, and target data sequence acquisition submodule 304 is triggered, wherein, each point that the fitting quantile sequence B after renewal is included
The value of digit is (T*bm+M*qm)/(T+M)。
It can be seen that, because fitting distribution function F (x) is nonlinear function, thus it is distribution that heavytailed distribution, which is presented, in data flow
In the case of uneven, it is possible to use embody the fitting distribution function F (x) of data flow non-uniform Distribution, rather than default data
Equally distributed linear function is flowed, with reference to the inverse function F calculated-1(x) and it is described fitting quantile sequence B go determine point position
Number, so as to reduce the error for the quantile determined.
It should be noted that herein, such as first and second or the like relational terms are used merely to a reality
Body or operation make a distinction with another entity or operation, and not necessarily require or imply these entities or deposited between operating
In any this actual relation or order.Moreover, term " comprising ", "comprising" or its any other variant are intended to
Nonexcludability is included, so that process, method, article or equipment including a series of key elements not only will including those
Element, but also other key elements including being not expressly set out, or also include being this process, method, article or equipment
Intrinsic key element.In the absence of more restrictions, the key element limited by sentence "including a ...", it is not excluded that
Also there is other identical element in process, method, article or equipment including the key element.
Each embodiment in this specification is described by the way of related, identical similar portion between each embodiment
Divide mutually referring to what each embodiment was stressed is the difference with other embodiment.It is real especially for device
Apply for example, because it is substantially similar to embodiment of the method, so description is fairly simple, related part is referring to embodiment of the method
Part explanation.
Can one of ordinary skill in the art will appreciate that realizing that all or part of step in above method embodiment is
To instruct the hardware of correlation to complete by program, described program can be stored in computer read/write memory medium,
The storage medium designated herein obtained, such as:ROM/RAM, magnetic disc, CD etc..
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the scope of the present invention.It is all
Any modification, equivalent substitution and improvements made within the spirit and principles in the present invention etc., are all contained in protection scope of the present invention
It is interior.
Claims (10)
1. a kind of quantile of data determines method, it is characterised in that methods described includes:Data training process and quantile are estimated
Meter process;
The data training process, including:
The training data chosen from target data is fitted, the corresponding probability density function p (x) of training data is obtained;
Using the probability density function p (x), the corresponding fitting distribution function F (x) of training data and its inverse function F are calculated-1
(x), wherein, the fitting distribution function F (x) be nonlinear function;
For presetting each quantile that quantile sequence P is included, the inverse function F is utilized-1(x) quantile pair, is calculated
The fitting quantile answered, and by the fitting quantile storage into fitting quantile sequence B;
The fractional-dimension calculus process, including:
Obtain the target data sequence D of quantile to be calculated;
For the target data sequence D, the fitting distribution function F (x), the inverse function F are utilized-1(x) and it is described intend
Quantile sequence B is closed, determines that each quantile distinguishes corresponding current quantile.
2. according to the method described in claim 1, it is characterised in that described to utilize the probability density function p (x), calculate instruction
Practice the corresponding fitting distribution function F (x) of data and its inverse function F-1(x) the step of, including:
For the probability density function p (x), the p (x) is calculated from the negative infinite definite integral to variable x, the fitting is obtained
Distribution function F (x);
Calculate the inverse function F of the fitting distribution function F (x)-1(x)。
3. according to the method described in claim 1, it is characterised in that each included for default quantile sequence P
Quantile, utilizes the inverse function F-1(x) the step of quantile is corresponding to be fitted quantile, is calculated, including:
For presetting each quantile that quantile sequence is included, the value of the quantile is substituted into the inverse function, calculated
The corresponding contrafunctional value of the quantile, is used as the value of the corresponding fitting quantile of the quantile.
4. method according to claim 2, it is characterised in that the target data sequence D of the acquisition quantile to be calculated
The step of, including:
1 data element d in target data is received, and d is stored to the target data sequence D that size is N, until the mesh
Mark data sequence D is filled;
It is described to be directed to the target data sequence D, utilize the fitting distribution function F (x), the inverse function F-1(x) and institute
Fitting quantile sequence B is stated, the step of each quantile distinguishes corresponding current quantile is determined, including:
S1:Judge whether the first design variables C currency is more than 0, wherein, the initial value of the C is 0;
If more than 0, performing S2;
If equal to 0, the second design variables T initialization value is set to 0, and by the corresponding quantiles of default quantile sequence P
All quantiles in sequence Q are reset, and perform S3, wherein, P={ p0, p1..., pm..., pM, Q={ q0, q1..., qm...,
qM, m ∈ [0, M];
S2:For each element a in quantile sequence Q and the union X of data sequence D, if a is less than q0, it is determined that this yuan
The corresponding CDF of plain aQValue be 0, perform S3;
If a is more than or equal to qM, it is determined that the corresponding CDF of element aQValue be 1, perform S3;
If a is more than or equal to q0And less than qM, then quantile q is searched from quantile sequence Qm, so that a is more than or equal to qmAnd
Less than qm+1, wherein, m is more than or equal to 0 and less than or equal to (M-1);
Q is obtained from fitting quantile sequence BmCorresponding bm, and qm+1Corresponding bm+1, wherein, B={ b0, b1..., bm...,
bM, m ∈ [0, M];
Determine the corresponding CDF of element aQValue CDFQ(a)=F (bm+(bm+1-bm)*(a-qm)/(qm+1-qm)), perform S3;
S3:Order by the data in data sequence D according to data value from small to large, is ranked up, the data after being sorted
Sequence D1, S4 is performed, wherein, D1={ d1, d2..., dn..., dN, n ∈ [0, N];
S4:For quantile sequence Q and data sequence D1Union X1In each element b, if b be less than or equal to d1, then
Determine the corresponding ECDF of element bD -Value be 0, perform S5;
If b is more than dN, it is determined that the corresponding ECDF of element bD -Value be 1, perform S5;
If b is more than d1And less than or equal to dN, then from data sequence D1Middle lookup dn, so that b is more than dnAnd be less than or equal to
dn+1;
Determine the corresponding ECDF of element bD -Value ECDFD -(b)=n/ (n+M), performs S5;
S5:For quantile sequence Q and data sequence D1Union X1In each element b, if b be less than d1, it is determined that should
The corresponding ECDF of element bD +Value be 0, perform S6;
If b is more than or equal to dN, it is determined that the corresponding ECDF of element bD +Value be 1, perform S6;
If b is more than or equal to d1And less than dN, then using binary chop, from data sequence D1Middle lookup dn, so that b be more than or
Equal to dnAnd less than dn+1;
Determine the corresponding ECDF of element bD +Value ECDFD +(b)=n/ (n+M), performs S6;
S6:For quantile sequence Q and data sequence D1Union X1In each element b, calculate the ACDF of the element b-
Value ACDF-(b)=(T*CDFQ(b)+N*ECDFD -(b))/(T+N), and the element b ACDF+Value ACDF+(b)=
(T*CDFQ(b)+N*ECDFD +(b))/(T+N), performs S7;
S7:For presetting each quantile p in quantile sequence Pm, calculate the quantile pmCorresponding qm -And qm +'s
Value, performs S8, wherein, qm -=max { b ∣ ACDF-(b)≤pm, b ∈ X1, qm +=max { b ∣ ACDF+(b)≥pm, b ∈ X1};
S8:For presetting each quantile p in quantile sequence Pm, judge quantile pmCorresponding qm -And qm +Value be
It is no identical;
If identical, by qm -Value be defined as the quantile pmCorresponding current quantile qmValue;
If it is not the same, then by (k*qm -+(1-k)*qm +), it is defined as the quantile pmCorresponding current quantile qmValue,
Wherein, k=F-(ACDF+(qm +)-F-(pm))/F-(ACDF+(qm +)-ACDF-(qm -))。
5. method according to claim 4, it is characterised in that after step s8, methods described also includes:
S9:The currency of the second design variables T is added into N, and the currency of the first design variables C is added 1, is performed
S10;
S10:Judge Jia 1 after the first design variables C currency whether the integer for the value for being the 3rd design variables cycle
Times;
If not being the integral multiple of cycle value, the step for performing the target data sequence D for obtaining quantile to be calculated is returned
Suddenly;
If the integral multiple of cycle value, the fitting quantile sequence is updated, returns and performs to be calculated point of position of the acquisition
The step of several target data sequence D, wherein, the value for each quantile that the fitting quantile sequence B after renewal is included is
(T*bm+M*qm)/(T+M)。
6. the quantile determining device of a kind of data, it is characterised in that described device includes:Data training module and quantile are estimated
Count module;
The data training module, for being fitted to the training data chosen from target data, obtains training data pair
The probability density function p (x) answered;Using the probability density function p (x), the corresponding fitting distribution function F of training data is calculated
And its inverse function F (x)-1(x), wherein, the fitting distribution function F (x) be nonlinear function;For presetting quantile sequence P
Comprising each quantile, utilize the inverse function F-1(x) the corresponding fitting quantile of the quantile, is calculated, and will be described
Quantile storage is fitted into fitting quantile sequence B;
The fractional-dimension calculus module, the target data sequence D for obtaining quantile to be calculated;For the target data sequence
D is arranged, the fitting distribution function F (x), the inverse function F is utilized-1(x) and it is described fitting quantile sequence B, determine each
Individual quantile distinguishes corresponding current quantile.
7. device according to claim 6, it is characterised in that the data training module includes:
Function Fitting submodule, for being fitted to the training data chosen from target data, obtains training data correspondence
Probability density function p (x);
First calculating sub module, for utilizing the probability density function p (x), calculates the corresponding fitting distribution letter of training data
Number F (x) and its inverse function F-1(x), wherein, the fitting distribution function F (x) be nonlinear function;
Second calculating sub module, for for presetting each quantile that quantile sequence P is included, utilizing the inverse function F-1
(x) the corresponding fitting quantile of the quantile, is calculated, and by the fitting quantile storage into fitting quantile sequence B;
The fractional-dimension calculus module, including:
Target data sequence obtains submodule, the target data sequence D for obtaining quantile to be calculated;
Quantile determination sub-module, for for the target data sequence D, utilizes the fitting distribution function F (x), described
Inverse function F-1(x) and the fitting quantile sequence B, determine that each quantile distinguishes corresponding current quantile.
8. device according to claim 7, it is characterised in that first calculating sub module, specifically for:
For the probability density function p (x), the p (x) is calculated from the negative infinite definite integral to variable x, the fitting is obtained
Distribution function F (x);
Calculate the inverse function F of the fitting distribution function F (x)-1(x)。
9. device according to claim 7, it is characterised in that second calculating sub module, specifically for:
For presetting each quantile that quantile sequence is included, the value of the quantile is substituted into the inverse function, calculated
The corresponding contrafunctional value of the quantile, is used as the value of the corresponding fitting quantile of the quantile.
10. device according to claim 8, it is characterised in that the target data sequence obtains submodule, specific to use
In:
1 data element d in target data is received, and d is stored to the target data sequence D that size is N, until the mesh
Mark data sequence D is filled;
The quantile determination sub-module, specifically for:
S1:Judge whether the first design variables C currency is more than 0, wherein, the initial value of the C is 0;
If more than 0, performing S2;
If equal to 0, the second design variables T initialization value is set to 0, and by the corresponding quantiles of default quantile sequence P
All quantiles in sequence Q are reset, and perform S3, wherein, P={ p0, p1..., pm..., pM, Q={ q0, q1..., qm...,
qM, m ∈ [0, M];
S2:For each element a in quantile sequence Q and the union X of data sequence D, if a is less than q0, it is determined that this yuan
The corresponding CDF of plain aQValue be 0, perform S3;
If a is more than or equal to qM, it is determined that the corresponding CDF of element aQValue be 1, perform S3;
If a is more than or equal to q0And less than qM, then quantile q is searched from quantile sequence Qm, so that a is more than or equal to qmAnd
Less than qm+1, wherein, m is more than or equal to 0 and less than or equal to (M-1);
Q is obtained from fitting quantile sequence BmCorresponding bm, and qm+1Corresponding bm+1, wherein, B={ b0, b1..., bm...,
bM, m ∈ [0, M];
Determine the corresponding CDF of element aQValue CDFQ(a)=F (bm+(bm+1-bm)*(a-qm)/(qm+1-qm)), perform S3;
S3:Order by the data in data sequence D according to data value from small to large, is ranked up, the data after being sorted
Sequence D1, S4 is performed, wherein, D1={ d1, d2..., dn..., dN, n ∈ [0, N];
S4:For quantile sequence Q and data sequence D1Union X1In each element b, if b be less than or equal to d1, then
Determine the corresponding ECDF of element bD -Value be 0, perform S5;
If b is more than dN, it is determined that the corresponding ECDF of element bD -Value be 1, perform S5;
If b is more than d1And less than or equal to dN, then from data sequence D1Middle lookup dn, so that b is more than dnAnd be less than or equal to
dn+1;
Determine the corresponding ECDF of element bD -Value ECDFD -(b)=n/ (n+M), performs S5;
S5:For quantile sequence Q and data sequence D1Union X1In each element b, if b be less than d1, it is determined that should
The corresponding ECDF of element bD +Value be 0, perform S6;
If b is more than or equal to dN, it is determined that the corresponding ECDF of element bD +Value be 1, perform S6;
If b is more than or equal to d1And less than dN, then using binary chop, from data sequence D1Middle lookup dn, so that b be more than or
Equal to dnAnd less than dn+1;
Determine the corresponding ECDF of element bD +Value ECDFD +(b)=n/ (n+M), performs S6;
S6:For quantile sequence Q and data sequence D1Union X1In each element b, calculate the ACDF of the element b-
Value ACDF-(b)=(T*CDFQ(b)+N*ECDFD -(b))/(T+N), and the element b ACDF+Value ACDF+(b)=
(T*CDFQ(b)+N*ECDFD +(b))/(T+N), performs S7;
S7:For presetting each quantile p in quantile sequence Pm, calculate the quantile pmCorresponding qm -And qm +'s
Value, performs S8, wherein, qm -=max { b ∣ ACDF-(b)≤pm, b ∈ X1, qm +=max { b ∣ ACDF+(b)≥pm, b ∈ X1};
S8:For presetting each quantile p in quantile sequence Pm, judge quantile pmCorresponding qm -And qm +Value be
It is no identical;
If identical, by qm -Value be defined as the quantile pmCorresponding current quantile qmValue;
If it is not the same, then by (k*qm -+(1-k)*qm +), it is defined as the quantile pmCorresponding current quantile qmValue,
Wherein, k=F-(ACDF+(qm +)-F-(pm))/F-(ACDF+(qm +)-ACDF-(qm -))。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710235860.7A CN106991080A (en) | 2017-04-12 | 2017-04-12 | A kind of quantile of data determines method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710235860.7A CN106991080A (en) | 2017-04-12 | 2017-04-12 | A kind of quantile of data determines method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106991080A true CN106991080A (en) | 2017-07-28 |
Family
ID=59414996
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710235860.7A Pending CN106991080A (en) | 2017-04-12 | 2017-04-12 | A kind of quantile of data determines method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106991080A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108090139A (en) * | 2017-11-30 | 2018-05-29 | 北京邮电大学 | A kind of document retrieval method and device |
CN110969197A (en) * | 2019-11-22 | 2020-04-07 | 上海交通大学 | Quantile prediction method for wind power generation based on instance migration |
CN111989897A (en) * | 2018-04-10 | 2020-11-24 | 奈特朗茨公司 | Measurement indicators for computer networks |
CN113806691A (en) * | 2021-09-29 | 2021-12-17 | 河南星环众志信息科技有限公司 | Method and device for acquiring quantile and storage medium |
CN115017215A (en) * | 2022-08-05 | 2022-09-06 | 阿里巴巴(中国)有限公司 | Data processing method and computing device |
-
2017
- 2017-04-12 CN CN201710235860.7A patent/CN106991080A/en active Pending
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108090139A (en) * | 2017-11-30 | 2018-05-29 | 北京邮电大学 | A kind of document retrieval method and device |
CN108090139B (en) * | 2017-11-30 | 2021-10-01 | 北京邮电大学 | File retrieval method and device |
CN111989897A (en) * | 2018-04-10 | 2020-11-24 | 奈特朗茨公司 | Measurement indicators for computer networks |
CN111989897B (en) * | 2018-04-10 | 2023-09-01 | 瞻博网络公司 | Measuring index of computer network |
CN110969197A (en) * | 2019-11-22 | 2020-04-07 | 上海交通大学 | Quantile prediction method for wind power generation based on instance migration |
CN110969197B (en) * | 2019-11-22 | 2022-01-04 | 上海交通大学 | Quantile prediction method for wind power generation based on instance migration |
CN113806691A (en) * | 2021-09-29 | 2021-12-17 | 河南星环众志信息科技有限公司 | Method and device for acquiring quantile and storage medium |
CN113806691B (en) * | 2021-09-29 | 2024-03-15 | 河南星环众志信息科技有限公司 | Quantile acquisition method, quantile acquisition equipment and storage medium |
CN115017215A (en) * | 2022-08-05 | 2022-09-06 | 阿里巴巴(中国)有限公司 | Data processing method and computing device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106991080A (en) | A kind of quantile of data determines method and device | |
CN108683530B (en) | Data analysis method and device for multi-dimensional data and storage medium | |
CN110022531A (en) | A kind of localization difference privacy municipal refuse data report and privacy calculation method | |
CN103530431A (en) | Data processing method and device for webpage clicking amount statistics | |
CN111898765A (en) | Feature binning method, device, equipment and readable storage medium | |
CN116596095B (en) | Training method and device of carbon emission prediction model based on machine learning | |
CN110059294A (en) | Non-linear Granger Causality detection method based on core recurrence maximum cross-correlation entropy algorithm | |
CN108805211A (en) | IN service type cognitive method based on machine learning | |
CN110535686A (en) | Anomalous event treating method and apparatus | |
Sadiq et al. | Developing environmental indices using fuzzy numbers ordered weighted averaging (FN-OWA) operators | |
CN114679335B (en) | Power monitoring system network security risk assessment training method, assessment method and equipment | |
CN111104339A (en) | Software interface element detection method and system based on multi-granularity learning, computer equipment and storage medium | |
CN110968835A (en) | Approximate quantile calculation method and device | |
Hušková et al. | Detection of structural changes in regression | |
Klochkov et al. | Localizing multivariate caviar | |
CN111291792B (en) | Flow data type integrated classification method and device based on double evolution | |
CN113360218A (en) | Service scheme selection method, device, equipment and storage medium | |
Kim et al. | Formational bounds of link prediction in collaboration networks | |
WO2017062026A1 (en) | Generating cohorts using automated weighting and multi-level ranking | |
CN113407537A (en) | Data processing method and device and electronic equipment | |
CN114722242A (en) | Binary counting type summarization method and device based on graph data stream and computer equipment | |
CN110610479A (en) | Object scoring method and device | |
CN113362342B (en) | Image segmentation method and related device thereof | |
CN116821885B (en) | Data acquisition method, device, computer equipment and storage medium | |
Kandilakis et al. | On the existence of solutions for random differential inclusions in a Banach space |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170728 |