CN105718756A - Real-time computing method and system for nonrandom informational probability in continuous game data streams - Google Patents

Real-time computing method and system for nonrandom informational probability in continuous game data streams Download PDF

Info

Publication number
CN105718756A
CN105718756A CN201610236364.9A CN201610236364A CN105718756A CN 105718756 A CN105718756 A CN 105718756A CN 201610236364 A CN201610236364 A CN 201610236364A CN 105718756 A CN105718756 A CN 105718756A
Authority
CN
China
Prior art keywords
nonrandom
tau
data stream
module
exchange hand
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610236364.9A
Other languages
Chinese (zh)
Other versions
CN105718756B (en
Inventor
沈天瑞
王�琦
涂世涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Leiton Capital Management Co Ltd
Original Assignee
Shanghai Leiton Capital Management Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Leiton Capital Management Co Ltd filed Critical Shanghai Leiton Capital Management Co Ltd
Priority to CN201610236364.9A priority Critical patent/CN105718756B/en
Publication of CN105718756A publication Critical patent/CN105718756A/en
Application granted granted Critical
Publication of CN105718756B publication Critical patent/CN105718756B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16ZINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS, NOT OTHERWISE PROVIDED FOR
    • G16Z99/00Subject matter not provided for in other main groups of this subclass

Landscapes

  • Complex Calculations (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a real-time computing method and system for nonrandom informational probability in continuous game data streams. The real-time computing method comprises the steps of maintaining a current trading volume data bucket by virtue of a series of input time series data, simultaneously maintaining a queue with a length equal to the number n of sampled data buckets, adding VtauB and VtauS computed in the current trading volume data bucket into the head of the current queue whenever the current trading volume data bucket is full, and computing the nonrandom informational probability according to n latest VtauB and VtauS by virtue of a sliding window analogue method. According to the real-time computing method, the online rolling real-time computation can be realized, furthermore, the stability and the adaptivity are relatively good, obstacles for estimating unobservable parameters are eliminated, and defect of inaccuracy of a time dimension algorithm during real-time trading and particularly high-frequency trading is overcome.

Description

The real-time computing technique of nonrandom informational probability and system in Continuous Game data stream
Technical field
The invention belongs to field of computer technology, particularly relate to a kind of data analysing method and system.
Background technology
Continuous Game, such as auction, Ask-Bid System, real-time deal etc., the data stream formed, there is the factor such as information asymmetry, irrational decision-making in the main body owing to participating in game, is mingled with the sticgastuc deicision information of a large amount of main body in data stream.Stochastic Decision-making probability in this characteristic data flow is calculated, or informational probability nonrandom in this characteristic data flow is calculated, the effective information in betting data time series can be better profited from, thus helping to solve many practical problems, such as the undulatory property of prediction markets, prevention market risk etc..
But, current conventional nonrandom informational probability evaluation method is based on what the statistical method that some unobservable parameters carry out maximum likelihood method carried out, inevitably need parameter estimation is carried out artificial hypothesis, and the result of parameter estimation is inconvenient to upgrade in time, thus the instant situation of nonrandom informational probability in this characteristic data flow can not be reflected.Therefore, find a kind of on-line data analysis method and system, the real-time data stream formed for Continuous Game implements a series of objective technical finesses, thus obtaining the evaluation of nonrandom informational probability in this characteristic data flow, is technical problem currently in the urgent need to address.
Summary of the invention
In view of this, it is an object of the invention to provide the real-time computing technique of nonrandom informational probability in a kind of Continuous Game data stream and system, nonrandom informational probability in the Continuous Game time series data calculating different time granularity in real time, the undulatory property of timely prediction markets can be rolled online.
The real-time computing technique of nonrandom informational probability in a kind of Continuous Game data stream, comprises the following steps:
(1) data stream Continuous Game formed is sampled by constant duration, and obtaining sampling time interval is T0Time series data stream, be designated as: { T1,P1,V1}、{T2,P2,V2}…{Ti,Pi,Vi..., wherein, TiFor ith sample moment, PiFor the End-price in ith sample moment, ViFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;
(2) from described time series data stream, extract activity data stream, be designated as: { V1,V2,…,Vi... }, wherein, the T between ith sample moment and the i-th-1 sampling instant0Total volume V in intervaliIt is regarded as the V that can splitiPart unit exchange hand;
(3) by described activity data stream according to etc. exchange hand interval carry out being polymerized and splitting, and be sequentially filled in n data bucket, the maximum exchange hand summation that each data bucket holds is V;
(4) whether full described data bucket is detected in real time, i.e. in described data bucket, whether the summation of activity data has reached V, if it is, carry out step (5);If it is not, then return to step (1);
(5) (I) and (II) calculates the exchange hand V bought in each described data bucket according to the following formula respectivelyτ BWith the exchange hand V soldτ SValue, and by (Vτ B,Vτ S) it is filled into sampling buffer area as a paired sampled value;
V τ B = Σ i = t ( τ - 1 ) + 1 t ( τ ) V i · S ( P i - P i - 1 A T R ) - - - ( I )
V τ S = Σ i = t ( τ - 1 ) + 1 t ( τ ) V i · [ 1 - S ( P i - P i - 1 A T R ) ] = V - V τ B - - - ( I I )
Wherein, t (τ) is last sampling instant, V in τ described data bucketiFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;Pi、Pi-1The respectively End-price of ith sample moment and the i-th-1 sampling instant, ATR is the average true wave amplitude (AverageTrueRange) of described time series data stream, and V is the maximum exchange hand summation held in each described data bucket;S is Sigmoid function, and its definition is as follows:B is the parameter of control function smoothness;
(6) whether full detect in real time described sampling buffer area, namely detect the value of the number m of paired sampled value in described sampling buffer area, if m is < n, return to step (1) or (5);If m=n, then carry out step (7);
(7) according to the following formula (III) calculates nonrandom informational probability ρ output in real time in the data stream that presently described Continuous Game is formed:
&rho; &ap; &Sigma; &tau; = 1 n | V &tau; S - V &tau; B | n V - - - ( I I I ) .
In the present invention, the data stream that described Continuous Game is formed is obtained from metadata provider by application programming interface (ApplicationProgrammingInterface, API).
In the present invention, described interval T0Value be preferably 1 minute, 10 seconds or 30 seconds.
In the present invention, the value of the number n of described data bucket is preferably 50.
In the present invention, ATR is true wave amplitude (TrueRange, TR) rolling average, and TR=Max{ (H-L), (H-PC), (PC-L) }, wherein, the ceiling price of H and L respectively present sample time period and lowest price, PC is the End-price of previous sampling time section.When calculating, the number n of described data bucket can be considered sliding window number, then, in n described data bucket, there is Y value T in time series data corresponding to activity data0Sampling time interval, then ATR can value be the rolling average of the true wave amplitude (TrueRange, TR) of Y.
Meanwhile, present invention also offers the real time computation system of nonrandom informational probability in a kind of Continuous Game data stream, including:
(1) sampling module: the data stream for Continuous Game is formed is sampled by constant duration, and obtaining sampling time interval is T0Time series data stream, be designated as: { T1,P1,V1}、{T2,P2,V2}…{Ti,Pi,Vi..., wherein, TiFor ith sample moment, PiFor the End-price in ith sample moment, ViFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;
(2) data extraction module: for extracting activity data stream from the described time series data stream exported by described sampling module, be designated as: { V1,V2,…,Vi... }, wherein, the T between ith sample moment and the i-th-1 sampling instant0Total volume V in intervaliIt is regarded as the V that can splitiPart unit exchange hand;
(3) data concentrating module: for by the described activity data stream exported by described data extraction module according to etc. exchange hand interval carry out being polymerized and splitting, and it being sequentially filled in n data bucket, the maximum exchange hand summation that each data bucket holds is V;Whether full it is simultaneously used for detecting described data bucket in real time, i.e. in described data bucket, whether the summation of activity data has reached V, if it is, proceed to exchange hand classification of type computing module;If it is not, then return to described sampling module;
(4) exchange hand classified counting module: for calculating by the exchange hand V bought in each described data bucket of described data concentrating module outputτ BWith the exchange hand V soldτ SValue, wherein, the exchange hand V boughtτ BWith the exchange hand V soldτ SValue respectively according to the following formula (I) and (II) be calculated:
V &tau; B = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; S ( P i - P i - 1 A T R ) - - - ( I )
V &tau; S = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; &lsqb; 1 - S ( P i - P i - 1 A T R ) &rsqb; = V - V &tau; B - - - ( I I )
Wherein, t (τ) is last sampling instant, V in τ described data bucketiFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;Pi、Pi-1The respectively End-price of ith sample moment and the i-th-1 sampling instant, ATR is the average true wave amplitude (AverageTrueRange) of described time series data stream, and V is the maximum exchange hand summation held in each described data bucket;S is Sigmoid function, and its definition is as follows:B is the parameter of control function smoothness;
(5) sampling cache module: for the exchange hand V that will buy in each described data bucket exported by described exchange hand classified counting moduleτ BWith the exchange hand V soldτ SValue be filled into sampling buffer area as a paired sampled value, and it is whether full for detecting described sampling buffer area in real time, namely the value of the number m of paired sampled value in described sampling buffer area is detected, if m is < n, return to described sampling module or described exchange hand classified counting module;If m=n, then proceed to nonrandom informational probability estimation module;
(6) nonrandom informational probability estimation module: for calculating nonrandom informational probability ρ in the data stream that presently described Continuous Game is formed according to the data of described sampling cache module output, described nonrandom informational probability ρ (III) according to the following formula is calculated:
&rho; &ap; &Sigma; &tau; = 1 n | V &tau; S - V &tau; B | n V - - - ( I I I ) ;
(7) data outputting module: for the value of calculated for described nonrandom informational probability estimation module described nonrandom informational probability is exported.
In the present invention, the data stream that described Continuous Game is formed is obtained from metadata provider by application programming interface (ApplicationProgrammingInterface, API).
In the present invention, described interval T0Value be preferably 1 minute, 10 seconds or 30 seconds.
In the present invention, the value of the number n of described data bucket is preferably 50.
In the present invention, ATR is true wave amplitude (TrueRange, TR) rolling average, and TR=Max{ (H-L), (H-PC), (PC-L) }, wherein, the ceiling price of H and L respectively present sample time period and lowest price, PC is the End-price of previous sampling time section.When calculating, the number n of described data bucket can be considered sliding window number, then, in n described data bucket, there is Y value T in time series data corresponding to activity data0Sampling time interval, then ATR can value be the rolling average of the true wave amplitude (TrueRange, TR) of Y.
In the present invention, described data outputting module is digital output module, exports result with the form of numerical value;Can also be the combination of digital output module and images outputting module, export result in a graphical form.
In the present invention, described data outputting module can be docked with other system, so that other system can call the data processed result of described data outputting module output.
In the present invention, for the deficiency in being constituted with the time for dimension research exchange hand in prior art, adopt the statistical method being dimension with exchange hand, calculate nonrandom informational probability in the data stream that Continuous Game is formed based on the exchange hand of different time granularity and knock-down price, can be used for the prediction of price wave motion to the financial market such as stock, futures.
Specifically, the present invention is that a series of time series data first passing through input safeguards a current activity data bucket, safeguard a long queue for sampled data bucket number n simultaneously, whenever current activity data bucket has been expired time, can by calculated V in current activity data bucketτ BAnd Vτ SJoin in current queue heads, adopt the method that sliding window is similar here, take n nearest Vτ BAnd Vτ SCalculate nonrandom informational probability.In the present invention, for Vτ BAnd Vτ SCalculating introduces ATR index absolute drift is normalized, there is better stability and adaptivity;Further, in the present invention, being calculated as of nonrandom informational probability ρ calculates online in real time, it is not limited to certain time period of analysis of history.Adopt object oriented designing pattern to be encapsulated in the real time computation system of nonrandom informational probability by the calculating of ρ when realizing, by constantly to the incoming online time series data of system, system can according to the nonrandom informational probability ρ value of the situation real-time update of current exchange hand.
Compared with prior art, the present invention has following useful technique effect:
(1) eliminate the obstacle that unobservable parameter is estimated, overcome the inaccurate defect that time dimension algorithm is concluded the business at present deal especially high frequency.
(2)Vτ BAnd Vτ SPartitioning algorithm in creativeness introduce ATR index absolute drift be normalized, there is better stability and adaptivity.
(3) achieve the calculating of nonrandom informational probability in the Continuous Game time series data for different grain size (day, time, grade), and real-time calculating can be rolled online.
Accompanying drawing explanation
Fig. 1 is microcosmic Trading Model figure.
Fig. 2 is that the present invention carries out the flow chart of calculating in real time for nonrandom informational probability in Continuous Game data stream.
Fig. 3 be the present invention a specific embodiment in sampling time interval be the time series data stream of 1 minute.
Fig. 4 be the present invention a specific embodiment in informational probability nonrandom in the time series data stream of Fig. 3 carried out to the result of in real time calculating.
Fig. 5 be the present invention a specific embodiment in for the real-time result of calculation of nonrandom informational probability of Fig. 4 in the time interval of subsequently 2 hours with the dependency of price wave motion.
Detailed description of the invention
Hereinafter, in conjunction with specific embodiments, technical scheme is described in further detail.Following example are carried out under premised on technical solution of the present invention, give detailed embodiment and process, but protection scope of the present invention is not limited to following embodiment.
For nonrandom informational probability in Continuous Game data stream, illustrated by the microcosmic Trading Model built as shown in Figure 1, wherein, α is that message event produces probability, μ be in the know (namely, nonrandom information) the trading instruction speed of dealer, ε is the trading instruction speed of non-(that is, the random information) dealer that knows the inside story.Thus, the probability P IN (that is, nonrandom Transaction Information probability) of Informed Trade information can be expressed as follows:
P I N = &alpha; &mu; &alpha; &mu; + 2 &epsiv;
As in figure 2 it is shown, in a specific embodiment of the present invention, the real-time computing technique of nonrandom informational probability in a kind of Continuous Game data stream, comprise the following steps:
Step S1: the data stream (can be obtained from metadata provider by application programming interface API) that Continuous Game is formed is sampled by constant duration, and obtaining sampling time interval is T0Time series data stream, be designated as: { T1,P1,V1}、{T2,P2,V2}…{Ti,Pi,Vi..., wherein, TiFor ith sample moment, PiFor the End-price in ith sample moment, ViFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval, T2-T1=Ti-Ti-1=T0
Step S2: extract activity data stream from described time series data stream, be designated as: { V1,V2,…,Vi... }, wherein, the T between ith sample moment and the i-th-1 sampling instant0Total volume V in intervaliIt is regarded as the V that can splitiPart unit exchange hand;
Step S3: by described activity data stream according to etc. exchange hand interval carry out being polymerized and splitting, and be sequentially filled in n data bucket, the maximum exchange hand summation that each data bucket holds is V;
Step S4: whether detection current data bucket is full in real time, i.e. in current data bucket, whether the summation of activity data has reached V, if it is, carry out step S5;If it is not, then return to step S1;
Step S5: (I) and (II) calculates the exchange hand V bought in current data bucket according to the following formula respectivelyτ BWith the exchange hand V soldτ SValue;
V &tau; B = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; S ( P i - P i - 1 A T R ) - - - ( I )
V &tau; S = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; &lsqb; 1 - S ( P i - P i - 1 A T R ) &rsqb; = V - V &tau; B - - - ( I I )
Wherein,
T (τ) is last sampling instant in τ described data bucket,
ViFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;
Pi、Pi-1The respectively End-price of ith sample moment and the i-th-1 sampling instant;
ATR is the average true wave amplitude (AverageTrueRange) of described time series data stream, value is true wave amplitude (TrueRange, TR) rolling average, and TR=Max{ (H-L), (H-PC), (PC-L) }, wherein, the ceiling price of H and L respectively present sample time period and lowest price, PC is the End-price of previous sampling time section;
V is the maximum exchange hand summation held in each described data bucket;
S is Sigmoid function, and its definition is as follows:Wherein, b is the parameter of control function smoothness, and b can be set to 1;
Step S6: by (Vτ B,Vτ S) it is filled into sampling buffer area as a paired sampled value;
Step S7: whether full detect in real time described sampling buffer area, namely detects the value of the number m of paired sampled value in described sampling buffer area, if m is < n, returns to step S1 or S5;If m=n, then carry out step S8;
Step S8: (III) calculates nonrandom informational probability ρ output in real time in the data stream that presently described Continuous Game is formed according to the following formula:
&rho; &ap; &Sigma; &tau; = 1 n | V &tau; S - V &tau; B | n V - - - ( I I I ) .
For the ease of understanding and illustrating, give one example the exchange hand V illustrating to buy at thisτ BWith the exchange hand V soldτ SThe calculating process of value:
Step S1 obtains the data stream that Continuous Game is formed, interval T by application programming interface (ApplicationProgrammingInterface, API) from metadata provider0Value 1 minute;Activity data stream { V in step S21,V2,…,Vi... }, it is assumed that for { 5,2,4,1,7,13,5 ... };In step S3 etc. exchange hand interval V be 10, owing to each exchange hand is ViData regard as and split into ViIndividual exchange hand is the data of 1, then when certain activity data is filled in current data bucket to make this data bucket exchange hand reach 10, it is necessary to according to exchange hand, these data are split into two parts, a part fills up current data bucket, and another part the next data bucket to be added such as then, therefore, activity data stream after polymerization and fractionation is { (5,2,3), (1,1,7,1), (10), (2,5 ...) ... };By (5,2,3), (1,1,7,1), (10), (2,5 ...) ... it is sequentially filled in n data bucket, step S5 calculates each data bucket such as (5,2,3) respectively, for another example (1,1,7,1) accounting of the exchange hand bought in the data of totally 10 unit exchange hands and the exchange hand sold.For time series data, the granularity of data can split, and such as daily the data at interval can be split as Hour, or be polymerized to and monthly count.According to the security bargain data minute counted, (5,2,3) are the data of 3 minutes, totally 10 unit exchange hands for modal, for each data and per minute, exist out, high and low, receive four prices, Pi、Pi-1Respectively " receipts " price of ith sample moment and the i-th-1 sampling instant, ATR is the rolling average of true wave amplitude (TrueRange, TR), for instance, value is the average of 250 minutes true wave amplitude TR,Meet Sigmoid function
Correspondingly, in a specific embodiment of the present invention, the real time computation system of nonrandom informational probability in a kind of Continuous Game data stream, including:
(1) sampling module: the data stream (can be obtained from metadata provider by application programming interface API) for Continuous Game is formed is sampled by constant duration, and obtaining sampling time interval is T0Time series data stream, be designated as: { T1,P1,V1}、{T2,P2,V2}…{Ti,Pi,Vi..., wherein, TiFor ith sample moment, PiFor the End-price in ith sample moment, ViFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;
(2) data extraction module: for extracting activity data stream from the described time series data stream exported by described sampling module, be designated as: { V1,V2,…,Vi... }, wherein, the T between ith sample moment and the i-th-1 sampling instant0Total volume V in intervaliIt is regarded as the V that can splitiPart unit exchange hand;
(3) data concentrating module: for by the described activity data stream exported by described data extraction module according to etc. exchange hand interval carry out being polymerized and splitting, and it being sequentially filled in n data bucket, the maximum exchange hand summation that each data bucket holds is V;Whether full it is simultaneously used for detecting described data bucket in real time, i.e. in described data bucket, whether the summation of activity data has reached V, if it is, proceed to exchange hand classification of type computing module;If it is not, then return to described sampling module;
(4) exchange hand classified counting module: for calculating by the exchange hand V bought in each described data bucket of described data concentrating module outputτ BWith the exchange hand V soldτ SValue, wherein, the exchange hand V boughtτ BWith the exchange hand V soldτ SValue respectively according to the following formula (I) and (II) be calculated:
V &tau; B = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; S ( P i - P i - 1 A T R ) - - - ( I )
V &tau; S = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; &lsqb; 1 - S ( P i - P i - 1 A T R ) &rsqb; = V - V &tau; B - - - ( I I )
Wherein,
T (τ) is last sampling instant in τ described data bucket,
ViFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;
Pi、Pi-1The respectively End-price of ith sample moment and the i-th-1 sampling instant;
ATR is the average true wave amplitude (AverageTrueRange) of described time series data stream, value is true wave amplitude (TrueRange, TR) rolling average, and TR=Max{ (H-L), (H-PC), (PC-L) }, wherein, the ceiling price of H and L respectively present sample time period and lowest price, PC is the End-price of previous sampling time section;
V is the maximum exchange hand summation held in each described data bucket;
S is Sigmoid function, and its definition is as follows:Wherein, b is the parameter of control function smoothness, and b can be set to 1;
(5) sampling cache module: for the exchange hand V that will buy in each described data bucket exported by described exchange hand classified counting moduleτ BWith the exchange hand V soldτ SValue be filled into sampling buffer area as a paired sampled value, and it is whether full for detecting described sampling buffer area in real time, namely the value of the number m of paired sampled value in described sampling buffer area is detected, if m is < n, return to described sampling module or described exchange hand classified counting module;If m=n, then proceed to nonrandom informational probability estimation module;
(6) nonrandom informational probability estimation module: for calculating nonrandom informational probability ρ in the data stream that presently described Continuous Game is formed according to the data of described sampling cache module output, described nonrandom informational probability ρ (III) according to the following formula is calculated:
&rho; &ap; &Sigma; &tau; = 1 n | V &tau; S - V &tau; B | n V - - - ( I I I ) ;
(7) data outputting module: for the value of calculated for described nonrandom informational probability estimation module described nonrandom informational probability is exported.Data outputting module can be digital output module, exports result with the form of numerical value;Can also be the combination of digital output module and images outputting module, export result in a graphical form.Data outputting module can be docked with other system, so that other system can call the data processed result of this system data output module output.
In the process implemented, by the real time computation system of nonrandom informational probability, time series data in the on-line continuous betting data stream of input is processed in real time, when current activity data bucket is full time, calculate V according to all of exchange hand in current activity data bucket and conclusion of the business price differentialτ BAnd Vτ S, then update the activity data bucket queue calculating ρ;When current activity data bucket queue fills up sampling buffer, calculate nonrandom informational probability ρ value according to current queue immediately.The time produced due to ρ value is unfixed, so needing joining day stamp conveniently to display as index in the process of Data Integration.
Utilize above-mentioned real time computation system, the value of nonrandom informational probability under current time can be obtained by above-mentioned real-time computing technique, and take following methods to calculate the dependency of itself and price fluctuation further:
(1) calculate the extreme difference from the initial certain time concluded price in interval of current time or standard deviation, be designated as price wave motion VarPrice;
(2) calculate the covariance Cov (ρ, VarPrice) of the value of nonrandom informational probability and price wave motion in this time interval, be in this time interval nonrandom informational probability and the dependency of price wave motion.
Above-mentioned time interval is set in 1.5 to 2.5 hours, and the value of nonrandom informational probability is the most notable with the dependency of the price wave motion of corresponding time interval.
In one specific embodiment of Fig. 3, Fig. 4, Fig. 5 respectively present invention sampling time interval be 1 minute time series data stream, nonrandom informational probability in this time series data stream is carried out to the real-time result of calculating, the dependency of this nonrandom informational probability and price wave motion in the time interval of 120 minutes.Be can be seen that by Fig. 4 and Fig. 5, nonrandom informational probability and price wave motion significant correlation.As can be seen here, the method that the value of the nonrandom informational probability that said method calculates may be used for judging Continuous Game market fluctuation.
Additionally, set the number n of activity data bucket filling up sampling buffer and sliding window size value is 50, set the maximum V of the summation of activity data in each data bucket as VD/ 200, and set the sampling time interval T obtaining time series data stream respectively0It it is 1 minute, 10 seconds, 30 seconds, utilize above-mentioned real time computation system by the value of the nonrandom informational probability of above-mentioned real-time computing technique acquisition same day, result shows: the value performance of nonrandom informational probability is similar, and all shows the strong correlation with price wave motion.
Additionally, set the number n of activity data bucket filling up sampling buffer and sliding window size value is 50, set the sampling time interval T obtaining time series data stream0It is 1 minute, and sets the maximum V of the summation of activity data in each data bucket respectively as VD/ 200, VD/ 20, VD/ 5 (is wherein VDThe per day exchange hand on past 50 days), utilize above-mentioned real time computation system by the value of the nonrandom informational probability of above-mentioned real-time computing technique acquisition same day, result shows: the value performance of nonrandom informational probability is similar, and all shows the strong correlation with price wave motion.
As can be seen here, the present invention not only can be implemented in line and rolls calculating in real time, but also there is better stability and adaptivity, and eliminate the obstacle that unobservable parameter is estimated, overcome the inaccurate defect that time dimension algorithm is concluded the business at present deal especially high frequency.
It should be understood by those skilled in the art that the embodiments of the invention shown in foregoing description are only used as citing for the present invention is described, and should not be taken as limiting the scope of the invention.
As can be seen here, the purpose of the present invention completely and is effectively achieved.The function of the present invention and structural principle are shown in an embodiment and are illustrated, when without departing substantially from described principle, embodiment can do any amendment.So, present invention comprises all variant embodiment based on claim spirit and right.

Claims (8)

1. the real-time computing technique of nonrandom informational probability in a Continuous Game data stream, it is characterised in that comprise the following steps:
(1) data stream Continuous Game formed is sampled by constant duration, and obtaining sampling time interval is T0Time series data stream, be designated as: { T1,P1,V1}、{T2,P2,V2}…{Ti,Pi,Vi..., wherein, TiFor ith sample moment, PiFor the End-price in ith sample moment, ViFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;
(2) from described time series data stream, extract activity data stream, be designated as: { V1,V2,…,Vi... }, wherein, the T between ith sample moment and the i-th-1 sampling instant0Total volume V in intervaliIt is regarded as the V that can splitiPart unit exchange hand;
(3) by described activity data stream according to etc. exchange hand interval carry out being polymerized and splitting, and be sequentially filled in n data bucket, the maximum exchange hand summation that each data bucket holds is V;
(4) whether full described data bucket is detected in real time, i.e. in described data bucket, whether the summation of activity data has reached V, if it is, carry out step (5);If it is not, then return to step (1);
(5) (I) and (II) calculates the exchange hand V bought in each described data bucket according to the following formula respectivelyτ BWith the exchange hand V soldτ SValue, and by (Vτ B,Vτ S) it is filled into sampling buffer area as a paired sampled value;
V &tau; B = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; S ( P i - P i - 1 A T R ) - - - ( I )
V &tau; S = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; &lsqb; 1 - S ( P i - P i - 1 A T R ) &rsqb; = V - V &tau; B - - - ( I I )
Wherein, t (τ) is last sampling instant, V in τ described data bucketiFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;Pi、Pi-1The respectively End-price of ith sample moment and the i-th-1 sampling instant, ATR is the average true wave amplitude of described time series data stream, and V is the maximum exchange hand summation held in each described data bucket;S is Sigmoid function, and its definition is as follows:B is the parameter of control function smoothness;
(6) whether full detect in real time described sampling buffer area, namely detect the value of the number m of paired sampled value in described sampling buffer area, if m is < n, return to step (1) or (5);If m=n, then carry out step (7);
(7) according to the following formula (III) calculates nonrandom informational probability ρ output in real time in the data stream that presently described Continuous Game is formed:
&rho; &ap; &Sigma; &tau; = 1 n | V &tau; S - V &tau; B | n V - - - ( I I I ) .
2. the real-time computing technique of nonrandom informational probability in Continuous Game data stream as claimed in claim 1, it is characterised in that described interval T0Value be 1 minute, 10 seconds or 30 seconds.
3. the real-time computing technique of nonrandom informational probability in Continuous Game data stream as claimed in claim 1, it is characterised in that the value of the number n of described data bucket is 50.
4. the real time computation system of nonrandom informational probability in a Continuous Game data stream, it is characterised in that including:
(1) sampling module: the data stream for Continuous Game is formed is sampled by constant duration, and obtaining sampling time interval is T0Time series data stream, be designated as: { T1,P1,V1}、{T2,P2,V2}…{Ti,Pi,Vi..., wherein, TiFor ith sample moment, PiFor the End-price in ith sample moment, ViFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;
(2) data extraction module: for extracting activity data stream from the described time series data stream exported by described sampling module, be designated as: { V1,V2,…,Vi... }, wherein, the T between ith sample moment and the i-th-1 sampling instant0Total volume V in intervaliIt is regarded as the V that can splitiPart unit exchange hand;
(3) data concentrating module: for by the described activity data stream exported by described data extraction module according to etc. exchange hand interval carry out being polymerized and splitting, and it being sequentially filled in n data bucket, the maximum exchange hand summation that each data bucket holds is V;Whether full it is simultaneously used for detecting described data bucket in real time, i.e. in described data bucket, whether the summation of activity data has reached V, if it is, proceed to exchange hand classification of type computing module;If it is not, then return to described sampling module;
(4) exchange hand classified counting module: for calculating by the exchange hand V bought in each described data bucket of described data concentrating module outputτ BWith the exchange hand V soldτ SValue, wherein, the exchange hand V boughtτ BWith the exchange hand V soldτ SValue respectively according to the following formula (I) and (II) be calculated:
V &tau; B = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; S ( P i - P i - 1 A T R ) - - - ( I )
V &tau; S = &Sigma; i = t ( &tau; - 1 ) + 1 t ( &tau; ) V i &CenterDot; &lsqb; 1 - S ( P i - P i - 1 A T R ) &rsqb; = V - V &tau; B - - - ( I I )
Wherein, t (τ) is last sampling instant, V in τ described data bucketiFor the T between ith sample moment and the i-th-1 sampling instant0Total volume in interval;Pi、Pi-1The respectively End-price of ith sample moment and the i-th-1 sampling instant, ATR is the average true wave amplitude of described time series data stream, and V is the maximum exchange hand summation held in each described data bucket;S is Sigmoid function, and its definition is as follows:B is the parameter of control function smoothness;
(5) sampling cache module: for the exchange hand V that will buy in each described data bucket exported by described exchange hand classified counting moduleτ BWith the exchange hand V soldτ SValue be filled into sampling buffer area as a paired sampled value, and it is whether full for detecting described sampling buffer area in real time, namely the value of the number m of paired sampled value in described sampling buffer area is detected, if m is < n, return to described sampling module or described exchange hand classified counting module;If m=n, then proceed to nonrandom informational probability estimation module;
(6) nonrandom informational probability estimation module: for calculating nonrandom informational probability ρ in the data stream that presently described Continuous Game is formed according to the data of described sampling cache module output, described nonrandom informational probability ρ (III) according to the following formula is calculated:
&rho; &ap; &Sigma; &tau; = 1 n | V &tau; S - V &tau; B | n V - - - ( I I I ) ;
(7) data outputting module: for the value of calculated for described nonrandom informational probability estimation module described nonrandom informational probability is exported.
5. the real time computation system of nonrandom informational probability in Continuous Game data stream as claimed in claim 4, it is characterised in that described interval T0Value be 1 minute, 10 seconds or 30 seconds.
6. the real time computation system of nonrandom informational probability in Continuous Game data stream as claimed in claim 4, it is characterised in that the value of the number n of described data bucket is 50.
7. the real time computation system of nonrandom informational probability in Continuous Game data stream as claimed in claim 4, it is characterised in that described data outputting module is digital output module.
8. the real time computation system of nonrandom informational probability in Continuous Game data stream as claimed in claim 4, it is characterised in that described data outputting module is the combination of digital output module and images outputting module.
CN201610236364.9A 2016-04-15 2016-04-15 The real-time computing technique and system of nonrandom informational probability in Continuous Game data flow Active CN105718756B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610236364.9A CN105718756B (en) 2016-04-15 2016-04-15 The real-time computing technique and system of nonrandom informational probability in Continuous Game data flow

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610236364.9A CN105718756B (en) 2016-04-15 2016-04-15 The real-time computing technique and system of nonrandom informational probability in Continuous Game data flow

Publications (2)

Publication Number Publication Date
CN105718756A true CN105718756A (en) 2016-06-29
CN105718756B CN105718756B (en) 2018-03-27

Family

ID=56160310

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610236364.9A Active CN105718756B (en) 2016-04-15 2016-04-15 The real-time computing technique and system of nonrandom informational probability in Continuous Game data flow

Country Status (1)

Country Link
CN (1) CN105718756B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030231548A1 (en) * 2002-06-13 2003-12-18 Exxonmobil Upstream Research Company Method of calculating a throw volume for quantitative fault analysis
JP2004318273A (en) * 2003-04-11 2004-11-11 Canon System Solutions Inc Data analysis device and its control method, and computer program, and computer readable storage medium
CN103455943A (en) * 2013-09-02 2013-12-18 深圳市国泰安信息技术有限公司 Stock or stock investment portfolio volatility prediction method and device
CN104574137A (en) * 2014-12-12 2015-04-29 深圳祥云信息科技有限公司 Method for generating expected probability distribution data of business data
CN104750830A (en) * 2015-04-01 2015-07-01 东南大学 Cycle mining method of time series data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030231548A1 (en) * 2002-06-13 2003-12-18 Exxonmobil Upstream Research Company Method of calculating a throw volume for quantitative fault analysis
JP2004318273A (en) * 2003-04-11 2004-11-11 Canon System Solutions Inc Data analysis device and its control method, and computer program, and computer readable storage medium
CN103455943A (en) * 2013-09-02 2013-12-18 深圳市国泰安信息技术有限公司 Stock or stock investment portfolio volatility prediction method and device
CN104574137A (en) * 2014-12-12 2015-04-29 深圳祥云信息科技有限公司 Method for generating expected probability distribution data of business data
CN104750830A (en) * 2015-04-01 2015-07-01 东南大学 Cycle mining method of time series data

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
陈自强: "大规模机器学习在算法交易中的应用与研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 *
黄超: "基于特征分析的金融时间序列挖掘若干关键问题研究", 《中国优秀博硕士学位论文全文数据库 (博士)信息科技辑》 *

Also Published As

Publication number Publication date
CN105718756B (en) 2018-03-27

Similar Documents

Publication Publication Date Title
CN110009174B (en) Risk recognition model training method and device and server
Nakano et al. Generalized exponential moving average (EMA) model with particle filtering and anomaly detection
Yao et al. A case study on using neural networks to perform technical forecasting of forex
CN108665159A (en) A kind of methods of risk assessment, device, terminal device and storage medium
Roberts When bigger is better: A critique of the Herfindahl-Hirschman Index's use to evaluate mergers in network industries
JP2004500646A (en) Competitive bidding revenue simulation method and system
De Rossi Maximum likelihood estimation of the Cox–Ingersoll–Ross model using particle filters
CN103903249B (en) Image matching system and method
CN110443350B (en) Model quality detection method, device, terminal and medium based on data analysis
CN109767225B (en) Network payment fraud detection method based on self-learning sliding time window
CN109166034A (en) A kind of Risk Forecast Method and system
CN106408325A (en) User consumption behavior prediction analysis method based on user payment information and system
CN109360022A (en) A kind of market Sales Volume of Commodity prediction technique, device and equipment based on data mining
Zimmer et al. Influenza forecasting framework based on Gaussian Processes
CN114841526A (en) Detection method of high-risk user, computing device and readable storage medium
Vink et al. Likelihood ratio method for the interpretation of iphone health app data in digital forensics
CN108647714A (en) Acquisition methods, terminal device and the medium of negative label weight
US20200193486A1 (en) System and method for determining bid vector transformed predictive click-through rate
Lu et al. Application of grey relational analysis for evaluating road traffic safety measures: advanced driver assistance systems against infrastructure redesign
Chaudhuri et al. Ranking and calibrating click-attributed purchases in performance display advertising
CN105718756A (en) Real-time computing method and system for nonrandom informational probability in continuous game data streams
Njegovan A leading indicator approach to predicting short‐term shifts in demand for business travel by air to and from the UK
CN110866672A (en) Data processing method, device, terminal and medium
Yang et al. Dynamically optimizing display advertising profits under diverse budget settings
McKenzie et al. Nonlinearity and smoothing in venture capital performance data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant