CN106909799A - A kind of Application of Data Mining theoretical based on new random fractal - Google Patents

A kind of Application of Data Mining theoretical based on new random fractal Download PDF

Info

Publication number
CN106909799A
CN106909799A CN201710151985.1A CN201710151985A CN106909799A CN 106909799 A CN106909799 A CN 106909799A CN 201710151985 A CN201710151985 A CN 201710151985A CN 106909799 A CN106909799 A CN 106909799A
Authority
CN
China
Prior art keywords
fractal
time series
self
similarity
nlari
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710151985.1A
Other languages
Chinese (zh)
Inventor
何宗路
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201710151985.1A priority Critical patent/CN106909799A/en
Publication of CN106909799A publication Critical patent/CN106909799A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16ZINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS, NOT OTHERWISE PROVIDED FOR
    • G16Z99/00Subject matter not provided for in other main groups of this subclass

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of Application of Data Mining theoretical based on new random fractal, by addition polymerization time series being amplifying observation yardstick changes point shape slope index parameter and wave amplitude index parameters of NLARI, identification Long Memory Properties, self-similarity, have concurrently Long Memory Properties and self-similarity difference point shape level minimized aggregation degree time series generating process and dynamic characteristic.The present invention can simultaneously recognition time sequence varying level fractal generation mechanism and dynamic characteristic.The self similarity sequence of present invention identification minimized aggregation yardstick provides scientific and standard for dynamic data sampling, compression, feature extraction;Identification data generting machanism of the invention, annotate translate point formed because, adjust memory span, using or avoid the fractal structure from producing various uses;Although the approach that the present invention sets up new fractal theory is extremely complex, the fractal method that the theory is provided is abnormal simple.

Description

A kind of Application of Data Mining theoretical based on new random fractal
Technical field
It is dynamic in big data the invention belongs to nonlinear kinetics, fractal behavior and time series modeling and analysis theories State data mining applied technical field, more particularly to a kind of Application of Data Mining theoretical based on new random fractal.
Background technology
The fractal behavior of recognition time sequence is most one of problem of challenge in Dynamic Data Mining.It is typical random Fractal behavior includes statistical self-similarity, power law and Long Memory Properties (i.e. time-length interrelation), and these behaviors occur certainly extensively So, the complication system such as medical science, ecology, water conservancy, engineering, network, economy and finance.It was found that self similarity sequence, growth or shortening length The methods such as degree of correlation bring various uses.For example, self similarity is in many fields such as network traffics, stock market's dynamic, physiological signals There is important application.Stock yield Long Memory Properties mean that volatility has a kind of persistence or long-rang dependence, to assets The effect of pricing model has potential material impact, thus growth dividend yield Long Memory Properties have major economic value.
Fractal date is excavated according to the meaning of digging utilization data set fractal dimension to data set, is existed in fractal dimension at present Characteristic attribute selection, cluster, correlation rule, classification and prediction etc. be on direction, network data excavation, finance data analysis, There is certain application in the fields such as reason information excavating.Fractal date faces lot of challenges according to digging technology, such as how to judge data Collection with fractal characteristic, how quickly to calculate data set fractal dimension, how on computers simulated implementation, how to explain number Practical significance according to collection fractal dimension etc..These problems are mainly due to fractal dimension itself, including 1) fractal dimension as general Suitable complexity scaling law is introduced into, but it is not a qualified demarcation rule, and not can determine that a Fractal Pattern.Point Shape dimension has many definition modes, such as Hausdorff dimension, information dimension, correlation dimension, similar dimension, capacity dimension, multiple Divide shape spectrum, filling dimension, Divider dimension, Lyapunov exponent, group's dimension, quality dimension, differential dimension, Brig dimension Number, fuzzy dimension, Generalized Dimension etc..The fractal dimension estimate of same target can because computational methods are different different, phase Same fractal dimension can correspond to different Fractal Patterns.2) estimation that is difficult to of fractal dimension leads to not quickly calculate data set Fractal dimension.Obtained by statistics or approximate method mostly, for example, calculate the most frequently used Hausdorff dimension, typically led to Box-counting dimension is crossed to estimate its upper bound and estimate its lower bound by local dimension.3) classical point shape process Such as discrete fractal Brown motion, it can not iterative make it difficult to realize simulation on computers.4) fractal dimension with point Between shape behavior, either parse relation or intuitive relationship is all unclear, hinder fractal method and participate in Accurate Model, as right The fine description of Fractal Phenomenon as petroleum reservoir crack is also still the worldwide difficulty that petroleum geology circle fails to be fully solved Topic.The time series models that fractal dimension is incorporated into classics attempt to realize Accurate Model, but bring new problem again, for example, pass through The ARFIMA of allusion quotation points of shape process model building needs to calculate very big sample inverse matrix.5) fractal dimension is no clear and definite as scaling law Physical significance, so the origin cause of formation on fractal behavior and its correlated phenomena cannot be provided.Why emerging cannot for example explain Market generally existing Long Memory Properties, and phenomenon of the international meat market in the absence of significant Long Memory Properties as the U.S..
The generting machanism of recognition time sequence is the highest objective of Dynamic Data Mining.It is expected to take off using data genaration mechanism Show the formation of behavioral characteristics and controlling mechanism and following data are inferred and predicted.But existing dynamic data The generating process of data set is not provided with fractal date according to method for digging.Many models such as Time Series AR MA and ARCH models, skill Art such as obscurity model building, neutral net, genetic algorithm, mathematical optimization and self-organizing method, be used to extract useful letter in dynamic data Breath, generally obtains a result, and is unable to explanation results, less provides the generating process of data set.On the other hand, behavioral characteristics and point Shape behavior is closely related with observation yardstick, and too small observation yardstick can influence intactly to reflect data genaration mechanism, excessive sight Examining yardstick can be because the time span of sample be excessive so that cannot be collected into reflection systematic sample number enough, or cause money Source wastes and because timeliness loses data value.Therefore the identification behavioral characteristics of reflection complication system and fractal behavior enough The generating process of minimum observation yardstick will produce substantial worth.Such as Hydrologic scaling is just put into 21 century hydrology basis The advanced subject of research.From the point of view of current academic research or patent, there is no and started with derived mould from system based on physics principle Type is used for the methods and techniques of non-linear kinetic characteristic and random fractal behavior.In recent years, existed by newton second law of motion One class is restrained oneself recover the application of regulating system and then by discretization at random, is derived a quasi-nonlinear autoregression and is integrated (NLARI) Process.This kind of system for recovering regulation with self-discipline is also referred to as Stochastic Elasticity system universally present in nature, ecology, medical science, work In many real systems such as journey, economy and society.NLARI processes can be by specific as follows:
Allow Yt=Xtt, equation (1) can be rewritten as
There
Whereinω represents the expected value of external disturbance, and σ is represented The standard deviation of external disturbance, εtIt is the Gaussian white noise of standard variance σ, α is resistance coefficient, and β is to recover force coefficient, κ1It is in resistance Time lag in power, κ2It is the time lag in restoring force.There is ε when σ=0 or to all time tt=0, equation (2) is one Individual deterministic system, with respect to the stability and bifurcation that restoring force coefficient gamma controls the system:As κ2=1, it be one gradually Zero fixed point of nearly stabilization is in 0 < γ < 1, an asymptotically stable two cycles ring One unstable Two cycles ring Notice that stability here is locally rather than Existence of Global Stable.When γ=0, NLARI mistakes Journey deteriorates to a linear autoregression and integrates ARI (2,1) process.The present invention will be based on NLARI models dynamic characteristic and Statistical property develops a kind of new fractal theory, so as to derive difference point shape level and its control and the life of recognition time sequence Into the method for mechanism.
In sum, also include novel data Mining stream, fractal date according to digging in addition to traditional technology in the prior art Pick, on-line analysis mining, empirical mode decomposition, contact discovery, trend analysis, variance analysis etc., are calculated by statistics and mathematics The method such as method such as obscurity model building, neutral net, genetic algorithm, optimization, self-organizing method, in attribute reduction, classification, cluster, association Rule, sequence pattern, predict, the point analysis that peels off, spatial data analysis etc. have certain application on direction.Many aspects are also stopped Stay in the improvement to traditional static method, face problems.Dynamic Data Mining technology only provides result, it is impossible to explain knot Really, the generating process of data set is not provided.Fractal date is faced danger or disaster in estimation fractal dimension, modeling point shape process, in meter according to excavation surface Simulation is realized on calculation machine, it is impossible to disclose the difficulty point formed because of, controlling mechanism and practical significance.And between most fractal dimensions Relation, the relation of fractal dimension and fractal behavior, the relation of fractal dimension and dynamic mode it is all indefinite.These problems influence Fractal date is according to excavation.In order to thoroughly solve fractal date according to Mining Problems, the present invention will be new based on one of the foundation of NLARI models Random fractal it is theoretical;Prove the interior raw structural property of non-linear kinetic characteristic and fractal behavior respectively as same complication system The response property disturbed with system external portion;Point shape row of the self-similarity power law of identification data collection simultaneously and Long Memory Properties is provided It is, stabilization fixed point stabilization and the method for the non-linear kinetic characteristic and their control parameter of unstable periodic ring;There is provided The method that the behavioral characteristics of reflection complication system and the minimum of fractal behavior observe yardstick and generating process enough.
The content of the invention
It is an object of the invention to provide a kind of Application of Data Mining theoretical based on new random fractal, it is intended to solve Certainly too small and excessive observation yardstick not only influences completely to reflect data genaration mechanism that can also bring sample number few, resource is unrestrained Take, data lose value because of timeliness;In conventional time series model extraction dynamic data, it is impossible to explanation results, it is impossible to which number is provided According to the problem of generating process.
The present invention is achieved in that a kind of Application of Data Mining theoretical based on new random fractal, the base In the theoretical Application of Data Mining of new random fractal being amplifying observation yardstick by addition polymerization time series changes NLARI Point shape slope index parameter and wave amplitude index parameters, identification Long Memory Properties, self-similarity, have Long Memory Properties and self-similarity concurrently Difference point shape level minimized aggregation degree time series generating process and dynamic characteristic.
Further, the time sequence of different point shape levels and dynamic characteristic is recognized by the concentration class of control time sequence Column-generation process;Specifically include:
Step one, data absolute value downsizing treatment, is designated as X=(Xt:T=1 ..., T);
Step 2, least square method regression straight line is calculated using XUseAnd
ΔYt=Yt-Yt-1, it is rightMake Least Square Method and obtain parameter ValuationNote Y=(Y '10..., Y '1t..., Y '1T-1) ',s11And s22Represent respectively MatrixThe first row first row element and the element of the second row secondary series;
Step 3, calculates θ1Confidential intervalWhereinIt is that t is distributed in confidence levelIt is critical It is worth and returns the statistic without hypothesis γ=0If θ1Confidential interval be comprised in interval In (- 1,1) and if returning and being rejected without hypothesis γ=0, receives opposition hypothesis γ > 0, then evidence supports data from NLARI Process, performs step 4, otherwise to j=j+1 (initial value j=1), calculates j reunion collection time serieses Xj, it is designated as X=Xj, perform step Rapid two;If to that can not continue to be aggregated, output result X is that a non-NLARI process or one have γ to circulation time sequence =0 degeneration ARI (2,1) process, exits analysis;
Step 4, remembers j1=j, allowsFractal Identification is performed, j is obtained2Meet again and collect sequenceDivide shape It is (δ to spend1, k, δ2, k), it is designated asOr without a point shape
Step 5 is rightWithStep 2 is performed respectively obtains θ1Confidential intervalθ2Confidence It is intervalAnd the confidential interval of γWherein If θ1, θ2, the confidential interval of γ is comprised in interval (- 1,1), (0,4), (0,1), then to support that X carrys out self-stabilization motionless for evidence NLARI processes on point domain;If θ1, θ2, the confidential interval of γ is comprised in intervalIt is interior, then demonstrate,prove According to supporting X from the NLARI processes in stable period annulus;If θ1, θ2, the confidential interval of γ is comprised in intervalInterior, then evidence supports X from the NLARI processes in unstable cycle annulus;Otherwise X comes from and faces NLARI processes on dividing value;Fractal sequences of the output with these behavioral characteristicsWith without fractal sequences Including model parameter as conclusion.
Further, the minimized aggregation degree Time Series Method of different long memory levels is recognized, including:
1) one is selected on the occasion of descending series δ1, kIf, initial value k=j=1 and X1=X;
2) calculate jth and meet again and collect time series Xj
3) X is calculatedjSample autocorrelation coefficient ρnAs n=1 ..., N and LM (Xj);If LM (Xj) < δ1, k, then perform Step 4), step 2 otherwise is performed to j=j+1), when being recycled to time series and can not continue to be aggregated, output result is minimum poly- The Long Memory Properties time series X of intensityj-11, k-1) and model parameter X therej1,0) mean former time seriesWithout length Memorability;
If 4) k < K, j=1 and k=k+1 is made, performs step 2), otherwise export the long memory of conclusion minimized aggregation degree Property time series Xj1, K) and model parameter.
Further, the minimized aggregation degree Time Series Method of different self similarity levels is recognized, including:
A, selected one on the occasion of descending series δ2, kIf, initial value k=j=1 and X1=X;
B, calculating jth are met again and collect time series Xj
C, calculating XjSample likelihood ratio rH (i, im)And SShm(Xj) as m=1 ..., M, h=1 ..., H and i=1 ..., n; If SShm(Xj) < δ2, kSet up as given m=1 ..., M and h=1 ..., H then performs step D, otherwise to j=j+1 Step B is performed, when being recycled to time series and can not continue to be aggregated, the self-similarity time sequence of output result minimized aggregation degree Row Xj-12, k-1) and model parameter X therej2,0) mean former time seriesThere is no self-similarity;
If D, k < K, j=1 and k=k+1 is made, perform step B, otherwise export the self similarity of conclusion minimized aggregation degree Property time series Xj2, K) and model parameter.
Further, the different points of minimized aggregation degree Time Series Methods of shape level are recognized, including:
A, selected two on the occasion of descending series δ1, kAnd δ2, kIf, initial value k=j=1 and X1=X;
B, calculating jth are met again and collect time series Xj
C, calculating XjSample autocorrelation coefficient ρnAs n=1 ..., N and LM (Xj).If LM (Xj) < δ1, k, then perform Step d, otherwise to j=j+1 perform step b, when time series can not continue to be aggregated, output result minimized aggregation degree point Shape time series Xj-11, k-1, δ2, k-1) and model parameter, (δ1,0, δ2,0) it is former sequenceWithout fractal structure;
D, calculating XjSample likelihood ratio rH (i, im)As i=1 ..., n and SShm(Xj) as m=1 ..., M and h= 1 ..., H;If SShm(Xj) < δ2, kAs h=1 ..., H and m=1 ..., M set up, then step e is performed, otherwise to j=j+1 Perform step b;
If e, k < K, j=1 and k=k+1 is made, perform step b, otherwise export when dividing shape of conclusion minimized aggregation degree Between sequence Xj1, K, δ2, K) and model parameter;By limiting δ1, kAnd δ2, kThe value different long memories of identification and self similarity level when Between sequence generation process.
Further, new random fractal is theoretical, including:
(1) as the slope index η of fractal parameter1=ω/α and wave amplitude index η2=σ/β describes random bullet based on a class The NLARI processes of sexual system:
WhereinεtFor white Noise, g (x) is the recovery force function for meeting condition g (- x)=- g (x) and xg (x) < 0, and ω is the expected value of external disturbance, σ It is the standard deviation of external disturbance, α is resistance coefficient, and β is to recover force coefficient, κ1It is the time lag on resistance, κ2It is to recover Time lag in power;
(2) j meets again and collects time seriesWhereinFully add Big concentration class j will also result in one it is relatively large | η1| and a relatively small η2So as to produce long memory and self similarity behavior;
(3) auto-covariance rhLikelihood ratio rH (i, im)=rh(Xi)/rh(Xim), if time series is self similarity, It can tend to a horizontal linear as given h=1 ..., H, m=1 ..., M with the increase of i;
(4) the horizontal index of long memoryWith the horizontal index of self similarity
Further, in step 4, fractal method is recognized, including:
One) calculate j and meet again and collect time series Xj, its auto-correlation coefficient ρnAs n=1 ..., N and likelihood ratio rH (i, im)Make It is h=1 ..., H, m=1 ..., M and i=1 ..., n;
Two) concentration class j is increased until meeting long memory level conditions LM (Xj) < δ1, kWith self similarity level conditions SShm (Xj) < δ2, kFor all h=1 ..., H and m=1 ..., wherein M, δ1, kAnd δ2, kIt it is two on the occasion of decreasing sequence of numbers;
Three) by changing δ1, kAnd δ2, kValue control long memory level and self similarity level.
Further, identification dynamic characteristic method includes:
θ based on t distributions1, θ2, the confidential interval calibrating of γ and return the statistic without hypothesis γ=0 opposition hypothesis γ > 0Calibrating, if
Evidence supports that data have the stable fixed point structure of NLARI, if
Evidence supports that data have ring stable period of NLARIStructure, if
Evidence supports that data have the unstable periodic ring of NLARIStructure.
Further, calibration method, including:Use X=XjCalculate least square method regression straight linePass throughTo Δ Yt1ΔYt-12g(Yt-1)+εtMake Least Square Method Δ Y theret=Yt-Yt-1, obtain Parameter estimation
NLAR1 points of shape process disclosed by the invention can show very similar length with ARFIMA points of shape process of classics Memorability (Data Comparison refers to the Fig. 1 in specific embodiment).But ARFIMA points of shape process can not disclose fractal dimension With the physical significance and the origin cause of formation and governing factor of display Long Memory Properties of Long Memory Properties.In contrast to this, The present invention gives The controlling mechanism of the fractal behavior of NLARI processes and clear and definite physical significance:The equal line slope index η of fluctuation1=ω/α controls The long-range degree of correlation of system, long range dependent be system be subjected to external disturbance horizontally relative to internal drag coefficient intensity compared with A kind of response characteristic of system when big;Wave amplitude index η2=σ/β determines whether system has self-similarity, and self-similarity is to be System is subjected to a kind of response characteristic of system when external disturbance changes smaller relative to inside recovery force coefficient;The equal line slope of fluctuation Its exterior is depended on to internal relative strength of effect with wave amplitude, but increasing observation yardstick can make the absolute value of Slope metric Be incremented by makes wave amplitude index successively decrease simultaneously, so as to cause Long Memory Properties and self-similarity respectively.In other words, as long as system is present Self-discipline recovers adjusting force, fractal behavior necessarily occurs on sufficiently large observation yardstick.Based on these properties, the present invention is provided simultaneously Recognize the fractal behavior and generting machanism and dynamic characteristic of varying level;The minimum observation yardstick of identification be assemble yardstick from phase Like sequence, so that for dynamic data sampling, compression, feature extraction provide scientific and standard;Identification data of the invention generates machine Make, annotate translate point formed because, adjust memory span, using or avoid the fractal structure from producing various uses;Although the present invention sets up The approach of new fractal theory is extremely complex, but the fractal method that the theory is provided is abnormal simple.
Brief description of the drawings
Fig. 1 is that display NLARI processes provided in an embodiment of the present invention can show and be very similar to classical ARFIMA point One instance graph of shape process Long Memory Properties.
Fig. 2 is the fractal parameter i.e. slope index η of NLARI processes provided in an embodiment of the present invention1With wave amplitude index η2With length Memory sexual intercourse.
Fig. 3 is the wave amplitude index η of NLARI processes provided in an embodiment of the present invention2With self-similarity relation.
Fig. 4 is provided in an embodiment of the present invention to increase slope index by addition polymerization time series | η1| and reduction wave amplitude refers to Number η2Fractal Identification schematic diagram.
Fig. 5 is the generating process of recognition time sequence provided in an embodiment of the present invention and the schematic flow sheet of behavioral characteristics.
Fig. 6 is the minimized aggregation degree time series generating process of the different long memory levels of identification provided in an embodiment of the present invention Schematic flow sheet.
Fig. 7 is the minimized aggregation degree time series generating process of the different self similarity levels of identification provided in an embodiment of the present invention Schematic flow sheet.
Fig. 8 is the minimized aggregation degree time series generating process of the different point shape levels of identification provided in an embodiment of the present invention Schematic flow sheet.
Fig. 9 is a reality of fractal parameter provided in an embodiment of the present invention and point shape horizontal relationship of heart time sequence Example.
Figure 10 is the minimized aggregation degree time series life of the different point shape levels of present invention identification provided in an embodiment of the present invention Into a demonstration example of process.
Figure 11 is that the Application of Data Mining principle theoretical based on new random fractal provided in an embodiment of the present invention is shown It is intended to.
Specific embodiment
In order to make the purpose , technical scheme and advantage of the present invention be clearer, with reference to embodiments, to the present invention It is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not used to Limit the present invention.
The Application of Data Mining theoretical based on new random fractal provided in an embodiment of the present invention, it is described based on new The theoretical Application of Data Mining of random fractal is amplifying observation yardstick by addition polymerization time series changes point shape of NLARI Slope index parameter and wave amplitude index parameters, recognize Long Memory Properties, self-similarity, have the difference of Long Memory Properties and self-similarity concurrently Divide the time series generating process and dynamic characteristic of the minimized aggregation degree of shape level.
Below in conjunction with the accompanying drawings and embodiment the present invention is described in detail.
New random fractal of the present invention is theoretical, including:
(1) as the slope index η of fractal parameter1=ω/α and wave amplitude index η2=σ/β describes random bullet based on a class The NLARI processes of sexual system:
WhereinεtFor white Noise, g (x) is the recovery force function for meeting condition g (- x)=- g (x) and xg (x) < 0, and ω is the expected value of external disturbance, σ It is the standard deviation of external disturbance, α is resistance coefficient, and β is to recover force coefficient, κ1It is the time lag on resistance, κ2It is to recover Time lag in power;
(2) j meets again and collects time seriesWhereinFully add Big concentration class j will also result in one it is relatively large | η1| and a relatively small η2So as to produce long memory and self similarity behavior;
(3) auto-covariance rhLikelihood ratio rH (i, im)=rh(Xi)/rh(Xim), if time series is self similarity, It can tend to a horizontal linear as given h=1 ..., H, m=1 ... with the increase of i, M,
(4) the horizontal index of long memoryWith the horizontal index of self similarity
Identification fractal method of the present invention, including:
(1) calculate j and meet again and collect time series Xj, its auto-correlation coefficient ρnAs n=1 ..., N and likelihood ratio rH (i, im) As h=1 ..., H, m=1 ..., M and i=1 ..., n;
(2) concentration class j is increased until meeting long memory level conditions LM (Xj) < δ1, kWith self similarity level conditions SShm (Xj) < δ2, kFor all h=1 ..., H and m=1 ..., wherein M, δ1, kAnd δ2, kIt it is two on the occasion of decreasing sequence of numbers;
(3) by changing δ1, kAnd δ2, kValue control long memory level and self similarity level.
Described identification dynamic characteristic method, including:θ based on t distributions1, θ2, the confidential interval calibrating of γ and return nothing The statistic of hypothesis γ=0 opposition hypothesis γ > 0Calibrating, if there
Evidence supports that data have the stable fixed point structure of NLARI, if
Evidence supports that data have ring stable period of NLARIStructure, if
Evidence supports that data have the unstable periodic ring of NLARIStructure.
Described calibration method, it is characterised in that use X=XiCalculate least square method regression straight lineIt is logical CrossTo Δ Yt1ΔYt-12g(Yt-1)+εtMake Least Square Method Δ Y theret=Yt-Yt-1, obtain Parameter estimation
Application principle of the invention is further described below in conjunction with the accompanying drawings.
The present invention is intended to provide a kind of theoretical Application of Data Mining of new random fractal, it be based on such as drag and Property:
One, describes the General N LARI processes of a class Stochastic Elasticity system:As shown in Figure 1;
Equation (3) can be rewritten as
Y theret=Xtt=X0- (ω/α) t and
Wherein εtIt is white noise, g (x) meets condition g (- x)=- g (x) and xg (x) < 0 to recover force function, and other are each Item, parameter and physical meaning are identical with equation (1).NLARI processes, the parameter field of equation (4) is
As g (x)=- x exp (- x2) and κ2=1, the parameter field of the NLARI processes on stabilization fixed point domain is
Stably the parameter field of the NLARI processes in two cycles annulus is
The parameter field of the NLARI processes in unstable two cycles annulus is
Above-mentioned property is used for the generting machanism of identification data and dynamic characteristic.Present inventor is shown by simulated experiment Above-mentioned property is simultaneously not limited to the concrete structure of restoring force function g (x), i.e., the recovery force function of other form also has similar Dynamic characteristic.It is first noted that NLARI processes have a fractal structure, such as it can one classics of simulation very well The Long Memory Properties that show of ARFIMA points of shape process (as shown in figure 1, g (x)=- x/ (1+x there2))。
Two, fractal parameters:
Property 1 imports slope index in NLARI processes (3)With wave amplitude indexAs fractal parameter.
It is able to demonstrate that μt=E (Xt|X0, X-1)=X0+ (ω/α) t still set up, so η1Represent the oblique of fluctuation average line Rate.Sample standard deviation sd and η1Perfect positive correlation (as shown in the 2c in Fig. 2, correlation coefficient r=1 is used as g (- x)=- x), with ripple Width index η2Perfect positive correlation (works as η1=0, as shown in the 2f in Fig. 2, r=1 is used as g (x)=- xexp (- x2)) or strong positive (work as η in pass1≠ 0, as shown in the 2i in Fig. 2, r=0.945 is used as g (x)=- x (1+x4)-1).Because sd has weighed sample fluctuation Size, so η2It is referred to as wave amplitude index.
Three, Long Memory Properties:
If the sample autocorrelation coefficient of property 2As n= 1 ..., N slowly decline with lag order n increases with less than exponential decay rate, show XtThere are Long Memory Properties.
The Long Memory Properties of the NLARI processes (3) of property 3 are attributed to an absolute value for relatively large slope index | η1|, or The relatively large wave amplitude index η of person one2Work as η1=0, or one relatively large | η1| it is relatively small with one | η2|。
Because overall auto-correlation coefficient is unknown, it is considered to the average value of a large amount of analog sample auto-correlation coefficients for repeating (still use ρnTo represent) as approximate, such simulation ρ of overall auto-covariancenBe used to find the control of Long Memory Properties Mechanism.It is able to verify that (i) is with slope index when relative restoring force coefficient gamma is increased | η1| increase, ρnWith lag order n Increase its decline degree slow down (as shown in the 2a in Fig. 2), | η1| and ρ70(the last auto-correlation coefficient, its absolute value Reflect degree of correlation) perfect positive correlation (as shown in the 2b in Fig. 2), it means that relatively large | η1| Long Memory Properties will be caused Occur as η2≠0;(ii) η is worked as1When=0, with slope index η2Increase, ρnWith its decline degree of the increase of lag order n Slow down (just as shown in the 2d in Fig. 2), | η1| and ρ70Perfect positive correlation (as shown in the 2e in Fig. 2), it is meant that relatively large η2 Long Memory Properties will be caused to occur;(iii) however work as η1When ≠ 0, with slope index η2Increase, ρnWith the increase of lag order n It declines degree and increases (just as shown in the 2g in Fig. 2), η2And ρ70Negatively correlated (as shown in the 2h in Fig. 2), it is meant that work as η1≠ 0, increase η2Memory span can be reduced.When relative restoring force coefficient gamma is fixed, with | η1| increase and η2Reduction, ρn Slow down with the increase of lag order n its decline degree (as shown in the 2j in Fig. 2, γ=0.7) there.This explanation is relative Big | η1| or a relatively large η2Work as η1=0, or one relatively large | η1| with a relatively small η2Can cause The Long Memory Properties of time series.The discovery explains emerging market has longer memory because its Endogenous system than mature market More weak, i.e. α and β are smaller, and they cause | η1| and η2Value it is bigger than mature market.Crossing small sample can block Long Memory Properties (as shown in the 2k in Fig. 2).
Four, self-similarities:
Because the statistical self-similarity on Distribution Significance is with the presence or absence of still absence of proof, so the present invention considers second order from phase Like property, it is related to the concentration class (or observation yardstick) of time series.Such as monthly price index (refers to the flat fare in units of the moon Lattice change) and the concentration class of annual price index (referring to the variation of the average price in units of year) be respectively Month And Year.
Property 4 allows X=(Xh:H=1 ..., T) represent former time series andRepresent j weights Aggregation sequenceIf there is the self tuning side that a constant δ causes lag order h for arbitrary integer m Difference rhMeet condition rh(Xm)=mδrh(X), then claiming X has second order self-similarity.
Here the component of δ allows for negative, does not also require that X must be stable.Due to overall auto-covariance be it is unknown, (r is still used here by a large amount of the average of analog sample auto-covariance for repeatinghRepresent) as the approximate of overall auto-covariance.Make It is self-similarity sequence X, there is rh(X)=irh(Xi) and rh(Xm)=irh(Xim), then
Set up.Claim rH (i, im)It is the likelihood ratio,It is the average m likelihoods ratio.Obviously there is following property:
If the X of property 5 is self similarity sequence, then for m=2 ..., M and the h=1 ... that give, H, the likelihood ratio rH (i, im)A horizontal linear, and average likelihood ratio r are shown with the growth of concentration class ihmObey power law m
The self-similarity of the NLARI processes (3) of property 6 gives the credit to a relatively small wave amplitude index η2
Present invention demonstrates working as η2Value it is small to a certain extent, NLARI processes (3) will show self-similarity.Such as with η2Value drop to 0.1, Self-similar Ratio Algorithm from 1.3As the increase of i is gradually become by the oblique line that dips down One horizontal linear (3a in Fig. 3), Self-similar Ratio Algorithm r5 (i, 2i)Increase with i is intended to a level by downward-sloping curve Straight line (3b in Fig. 3);Work as η2When=0.025, as m=2 ..., 20, Self-similar Ratio Algorithm r5 (i, im)As the increase of i shows one Bar horizontal linear (3c in Fig. 3);Average likelihood ratio r5mObey power law m-3.022(3d in Fig. 3).
The time series that the abundant addition polymerization NLARI processes (3) of property 7 produce, that is, allow concentration class (or observation yardstick) j abundant Greatly, an absolute value for relatively large slope index can simultaneously be obtained | η1| and a relatively small wave amplitude index η2, they will Assemble index sequence is caused to produce Long Memory Properties and self-similarity.
With the increase of lag order j, assemble index sequence XjSlope index absolute value | η1(Xj) | linearly Increase wave amplitude index η (as shown in the 4a in Fig. 4)2(Xj) exponentially reduce (as shown in the 4b in Fig. 4).Both will be respectively Cause the appearance of Long Memory Properties and self-similarity, such as concentration class j is increased to 100, auto-correlation coefficient ρ from 5nTo delayed rank Number n curve from quickly fall to hardly decline, until turn into a horizontal linear, it is shown that Long Memory Properties very high are (such as Shown in 4c in Fig. 4);Self-similar Ratio Algorithm sd(i, 2i)Curve to i progressively becomes horizontal linear by decline curve, it is shown that very high Self-similarity (as shown in the 4d in Fig. 4).It can be seen that the fractal behavior (Long Memory Properties and self-similarity) of NLARI can be by addition polymerization Its time series and present.
Property 8 claimsIt is the horizontal index of long memory n there0It is respectively just delayed all the time with N Exponent number.ClaimIt is the horizontal index of self similarity.
Initial lag exponent number n0Differ and be set to 1, for example n0=2.Usual 0≤LM (Xj)≤1 and 0≤SShm(Xj)≤1.Divide shape Horizontal index is smaller to mean that point shape level is higher.
Application principle of the invention is further described with reference to specific embodiment.
The theoretical excavation dynamic data method of new random fractal provided in an embodiment of the present invention, with g (x)=- x (1+x2 )-1, κ12=1 and εtIt is Gaussian white noise i.i.d.N (0, σ2) as a example by, illustrate statistics Fractal Theory Applications disclosed by the invention It is made up of four parts in Application of Data Mining, is realized by following specific steps respectively:
The generating process and behavioral characteristics of Part I recognition time sequence:
The downsizing of step 1 data absolute value is processed, and is designated as X=(Xt:T=1 ..., T).
Step 2 calculates least square method regression straight line using XUseAnd
ΔYt=Yt-Yt-1, it is rightMake Least Square Method and obtain parameter Valuation Note Y=(Y '10..., Y '1t..., Y '1T-1) ', Y1t=(Δ Yt,-Yt(1+Yt 2)-1), s11And s22Difference table Show matrixThe first row first row element and the element (in such as Fig. 5 501) of the second row secondary series.
Step 3 calculates θ1Confidential intervalWhereinIt is that t is distributed in confidence levelIt is critical It is worth and returns the statistic without hypothesis γ=0If θ1Confidential interval be comprised in interval In (- 1,1) and if returning and being rejected without hypothesis γ=0, receives opposition hypothesis γ > 0, then evidence supports data from NLARI Process, performs step 4 (in such as Fig. 5 502), otherwise to j=j+1 (initial value j=1), calculates j reunion collection time serieses Xj(such as In Fig. 5 503), be designated as X=Xj, perform step 2.If circulation time sequence is to can not continue to be aggregated, " X is output result One non-NLARI process or degeneration ARI (2, the 1) process with γ=0) " exit analysis (as shown in Fig. 5 504).
Step 4 remembers j1=j, allowsPart II Fractal Identification is performed, j is obtained2Meet again and collect sequenceShape degree is divided to be (δ1, k, δ2, k), it is designated asOr without a point shape(as shown in Fig. 5 505).
Step 5 pairWithStep 2 is performed respectively obtains θ1Confidential intervalθ2Put Letter is intervalAnd the confidential interval of γWherein If θ1, θ2, the confidential interval of γ is comprised in interval(- 1,1), (0,4), in (0,1), then to support that X carrys out self-stabilization motionless for evidence NLARI processes on point domain;If θ1, θ2, the confidential interval of γ is comprised in interval (- 1,1),It is interior, then demonstrate,prove According to supporting X from the NLARI processes in stable period annulus;If θ1, θ2, the confidential interval of γ is comprised in interval (- 1,1), (0 ,+∞),Interior, then evidence supports X from the NLARI processes in unstable cycle annulus;Otherwise X comes from critical value On NLARI processes (as shown in Fig. 5 506).Fractal sequences of the output with these behavioral characteristicsWith Without fractal sequencesIncluding model parameter as conclusion.
Part II recognizes the minimized aggregation degree time series of different long memory levels
Selected one of step 1 is on the occasion of descending series δ1, kIf, initial value k=j=1 and X1=X the 601 of 6 (in such as Fig. 6).
Step 2 calculates jth and meets again and collects time series Xj(as shown in Fig. 6 602).
Step 3 calculates XjSample autocorrelation coefficient ρnAs n=1 ..., N and LM (Xj).If LM (Xj) < δ1, k, then Step 4 (as shown in Fig. 6 603) is performed, step 2 otherwise is performed to j=j+1, quilt can not be continued when time series is recycled to Output result " the Long Memory Properties time series X of minimized aggregation degree during aggregationj-11, k-1) and model parameter X therej1,0) meaning Taste former time seriesWithout Long Memory Properties " (such as 604 institutes in Fig. 6 not).
If step 4 k < K, j=1 and k=k+1 is made, perform step 2 (as shown in Fig. 6 605), otherwise exported Conclusion " the Long Memory Properties time series X of minimized aggregation degreej1, K) and model parameter " (in such as Fig. 6 606).
Part III recognizes the minimized aggregation degree time series of different self similarity levels
Selected one of step 1 is on the occasion of descending series δ2, kIf, initial value k=j=1 and X1=X the 701 of 7 (in such as Fig. 7).
Step 2 calculates jth and meets again and collects time series Xj(as shown in Fig. 7 702).
Step 3 calculates XjSample likelihood ratio rH (i, im)And SShm(Xj) as m=1 ..., M, h=1 ..., H and i= 1 ..., n.If SShm(Xj) < δ2, kSet up as m=1 ..., M and the h=1 ... for giving, H (as shown in Fig. 7 703), Step 4 is then performed, step 2 otherwise is performed to j=j+1, when being recycled to time series and can not continue to be aggregated, output result " the self-similarity time series X of minimized aggregation degreej-12, k-1) and model parameter X therej2,0) mean former time seriesThere is no self-similarity " (as shown in Fig. 7 704).
If step 4 k < K, j=1 and k=k+1 is made, perform step 2 (as shown in Fig. 7 705), otherwise exported Conclusion " the self-similarity time series X of minimized aggregation degreej2, K) and model parameter " (in such as Fig. 7 707).
Different points of minimized aggregation degree time serieses of shape level of Part IV identification
Selected two of step 1 is on the occasion of descending series δ1, kAnd δ2, kIf, initial value k=j=1 and X1=X is (in Fig. 8 801)。
Step 2 calculates jth and meets again and collects time series Xj(as shown in Fig. 8 802).
Step 3 calculates XjSample autocorrelation coefficient ρnAs n=1 ..., N and LM (Xj).If LM (Xj) < δ1, k, then Step 4 (as shown in Fig. 8 803) is performed, step 2 otherwise is performed to j=j+1, when time series can not continue to be aggregated, Output result " the fractal varying-time dimension X of minimized aggregation degreej-11, k-1, δ2, k-1) and model parameter, there (δ1,0, δ2,0) meaning Taste former sequenceWithout fractal structure " (as shown in Fig. 8 804).
Step 4 calculates XjSample likelihood ratio rH (i, im)As i=1 ..., n and SShm(Xj) as m=1 ..., M and h= 1 ..., H.If SShm(Xj) < δ2, kAs h=1 ..., H and m=1 ..., M set up, then step 5 is performed, otherwise to j=j+1 Perform step 2 (as shown in Fig. 8 805).
If step 5 k < K, j=1 and k=k+1 is made, perform step 2 (in such as Fig. 8 806), otherwise export conclusion " the fractal varying-time dimension X of minimized aggregation degreej1, K, δ2, K) and model parameter " (in such as Fig. 8 807).
By limiting δ1, kAnd δ2, kValue can recognize the time sequence of different long memories and self similarity level using the above method Column-generation process.
Application principle of the invention is further described with reference to specific embodiment.
Fig. 9 shows that the present invention (is not in the mood for popular name for, 34 years old age, male, 52200 samples for a heart time sequence This value) an example.Because sample falls short of, therefore data are not by addition polymerization, i.e. j1=1.Original heartbeat data is by logarithm The downsizing treatment of conversion be used to estimate the parameter of NLARI processes, obtain Least Square Method value κ12=1, based on these estimations The θ of value1, θ2, the confidential interval of γ is in the theoretical parameter domain (- 1,1) for stablizing fixed point, (0,4), in (0,1).Return as calibrating Statistic without hypothesis γ=0 is γn=85, due to P (γn> 11.9) < 1%, so evidence supports opposition hypothesis γ > 0. These results support the NLARI processes that heart time sequence is come on self-stabilization fixed point domain.The heart time sequence further by 29 time serieses are divided into, each time series has 1800 sample values, each section of Long Memory Properties for showing varying level (as shown in the 9a in Fig. 9), with | η1| and η2Increase, sample autocorrelation coefficient ρnTo the decline journey of the curve of lag order n Degree is slower, it was confirmed that bigger | η1| or η2The memory level (9b in Fig. 9) more grown can be caused.At less m (such as m=2) Place, sd(i, mi)Show the horizontal linear (as shown in the 9c in Fig. 9) of rough approximation, its average sdmMeet a power function m1.02(as shown in the 9d in Fig. 9), but rH (i, mi)And η2 (i, mi)And in the sd of larger m(i, mi)Lie in less than level of approximation Straight line, because the heart time sequence has a relatively large η2Value 0.3140.
Figure 10 shows a demonstration example of present invention identification minimized aggregation degree point shape process.Consider that data result from a stabilization The NLARI processes in fixed point domain, there j1=1, θ0=-4.475 × 10-7, θ1=0.5027, θ2=0.0794, σ= 0.0249, η1=-8.999 × 10-7, η2=0.3136, γ=0.0264, T=1.8 × 107, N=70, n=50, m=20.Obtain Obtain in point horizontal index of shapeWithMinimized aggregation degree j2=100 point shape, Auto-correlation coefficient ρnAs the increase of lag order n shows almost one level Straight line, it is shown that height Long Memory Properties (as shown in the 10a in Figure 10), the likelihood ratioLevel is almost to i Straight line, the average likelihood ratio submits to a power law m-0.97, show X100There is the self-similarity (as shown in the 10b in Figure 10) of height.
Presently preferred embodiments of the present invention is the foregoing is only, is not intended to limit the invention, it is all in essence of the invention Any modification, equivalent and improvement made within god and principle etc., is created including based on the spirit and principles in the present invention An application for the smallest dimension time series generating process for meeting varying level point shape and dynamic characteristic requirement is made, all should be wrapped It is contained within protection scope of the present invention.

Claims (9)

1. a kind of Application of Data Mining theoretical based on new random fractal, it is characterised in that described
Based on the theoretical Application of Data Mining of new random fractal by addition polymerization time series being amplifying observation yardstick changes Become point shape slope index parameter and the wave amplitude index parameters of NLARI, recognize the Long Memory Properties of varying level, self-similarity, have concurrently Point shape and the minimized aggregation degree time series generating process of dynamic characteristic of Long Memory Properties and self-similarity.
2. as claimed in claim 1 based on the Application of Data Mining that new random fractal is theoretical, it is characterised in that pass through The concentration class of control time sequence recognizes the time series generating process of different point shape levels and dynamic characteristic;Specific bag Include:
Step one, data absolute value downsizing treatment, is designated as X=(Xt:T=1 ..., T);
Step 2, least square method regression straight line is calculated using XUseAnd
ΔYt=Yt-Yt-1, it is rightMake Least Square Method and obtain parameter estimationNote Y=(Y '10..., Y '1t... Y '1T-1) ',s11And s22Difference representing matrixThe first row first row element and the element of the second row secondary series;
Step 3, calculates θ1Confidential intervalWhereinIt is that t is distributed in confidence levelCritical value with And return the statistic without hypothesis γ=0If θ1Confidential interval be comprised in it is interval (- 1, 1) interior, θ2Confidential intervalWith the confidential interval of γBe contained in interval (0, ∞), with And if returning and being rejected without hypothesis γ=0, receive opposition hypothesis γ > 0, then evidence supports that data come from NLARI processes, performs Step 4, otherwise to j=j+1 (initial value j=1), calculates j reunion collection time serieses Xj, it is designated as X=Xj, perform step 2;If To that can not continue to be aggregated, output result X is the degeneration that a non-NLARI process or have γ=0 to circulation time sequence ARI (2,1) process, exits analysis;
Step 4, remembers j1=j, allowsFractal Identification is performed, j is obtained2Meet again and collect sequencePoint shape degree is (δ1, k, δ2, k), it is designated asOr without a point shape
Step 5 is rightWithStep 2 is performed respectively obtains θ1Confidential intervalθ2Confidential intervalAnd the confidential interval of γWherein If θ1, θ2, the confidential interval of γ is comprised in interval (- 1,1), (0,4), (0,1), then to support that X carrys out self-stabilization motionless for evidence NLARI processes on point domain;If θ1, θ2, the confidential interval of γ is comprised in interval (- 1,1),It is interior, then demonstrate,prove According to supporting X from the NLARI processes in stable period annulus;If θ1, θ2, the confidential interval of γ is comprised in interval (- 1,1), (0 ,+∞),Interior, then evidence supports X from the NLARI processes in unstable cycle annulus;Otherwise X comes from critical value On NLARI processes;Fractal sequences of the output with these behavioral characteristicsWith without fractal sequencesIncluding mould Shape parameter is used as conclusion.
3. as claimed in claim 1 based on the Application of Data Mining that new random fractal is theoretical, it is characterised in that identification The minimized aggregation degree Time Series Method of different long memory levels, including:
1) one is selected on the occasion of descending series δ1, kIf, initial value k=j=1 and X1=X;
2) calculate jth and meet again and collect time series Xj
3) X is calculatedjSample autocorrelation coefficient ρnAs n=1 ..., N and LM (Xj);If LM (Xj) < δ1, k, then step is performed 4) step 2 otherwise, is performed to j=j+1), the output result minimized aggregation degree when being recycled to time series and can not continue to be aggregated Long Memory Properties time series Xj-11, k-1) and model parameter X therej1,0) mean former time seriesWithout long memory Property;
If 4) k < K, j=1 and k=k+1 is made, performs step 2), when otherwise exporting the Long Memory Properties of conclusion minimized aggregation degree Between sequence Xj1, k) and model parameter.
4. as claimed in claim 1 based on the Application of Data Mining that new random fractal is theoretical, it is characterised in that identification The minimized aggregation degree Time Series Method of different self similarity levels, including:
A, selected one on the occasion of descending series δ2, kIf, initial value k=j=1 and X1=X;
B, calculating jth are met again and collect time series Xj
C, calculating XjSample likelihood ratio rH (i, im)And SShm(Xj) as m=1 ..., M, h=1 ..., H and i=1 ..., n;If SShm(Xj) < δ2, kSet up as given m=1 ..., M and h=1 ..., H then performs step D, otherwise performs step to j=j+1 Rapid B, when being recycled to time series and can not continue to be aggregated, the self-similarity time series X of output result minimized aggregation degreej-1Z, k-1) and model parameter X therej2,0) mean former time seriesThere is no self-similarity;
If D, k < K, j=1 and k=k+1 is made, step B is performed, when otherwise exporting the self-similarity of conclusion minimized aggregation degree Between sequence XjZ, K) and model parameter.
5. as claimed in claim 1 based on the Application of Data Mining that new random fractal is theoretical, it is characterised in that identification Have the minimized aggregation degree Time Series Method of the difference point shape level of Long Memory Properties and self-similarity concurrently, including:
A, selected two on the occasion of descending series δ1, kAnd δ2, kIf, initial value k=j=1 and X1=X;
B, calculating jth are met again and collect time series Xj
C, calculating XjSample autocorrelation coefficient ρnAs n=1 ..., N and LM (Xi);If LM (Xj) < δ1, k, then step is performed D, otherwise performs step b to j=j+1, when time series can not continue to be aggregated, when dividing shape of output result minimized aggregation degree Between sequence Xj-11, k-1, δ2, k-1) and model parameter, (δ1,0, δ2,0) it is former sequenceWithout fractal structure;
D, calculating XjSample likelihood ratio rH (i, im)As i=1 ..., n and SShm(Xj) as m=1 ..., M and h=1 ..., H; If SShm(Xj) < δ2, kAs h=1 ..., H and m=1 ..., M set up, then perform step e, otherwise perform step to j=j+1 b;
If e, k < K, j=1 and k=k+1 is made, perform step b, otherwise export the time fractal sequence of conclusion minimized aggregation degree Row Xj1, k, δ2, k) and model parameter;By limiting δ1, kAnd δ2, kThe time sequence of the different long memories of value identification and self similarity level Column-generation process.
6. as claimed in claim 1 based on the Application of Data Mining that new random fractal is theoretical, it is characterised in that new Random fractal is theoretical, including:
(1) as the slope index η of fractal parameter1=ω/α and wave amplitude index η2=σ/β describes Stochastic Elasticity system based on a class The NLARI processes of system:
X t = θ 0 + ( 1 + θ 1 ) X t - 1 - θ 1 X t - 2 + θ 2 g ( X t - κ 2 - μ t - κ 2 ) + v t
WhereinεtIt is white noise,
G (x) is the recovery force function for meeting condition g (- x)=- g (x) and xg (x) < 0, and ω is the expected value of external disturbance, and σ is The standard deviation of external disturbance, α is resistance coefficient, and β is to recover force coefficient, κ1It is the time lag on resistance, κ2It is in restoring force On time lag;
(2) j meets again and collects time seriesWhereinFully
Increase concentration class j will also result in one it is relatively large | η1| and a relatively small η2So as to produce long memory and from phase Like behavior;
(3) auto-covariance rhLikelihood ratio rH (i, im)=rh(Xi)/rh(Xim), if time series is self similarity,
It can tend to a horizontal linear as given h=1 ..., H, m=1 ..., M with the increase of i;
(4) the horizontal index of long memoryWith the horizontal index of self similarity SS h m ( X j ) ≡ 1 - min i = 1 , ... , n ( r h ( i , i m ) ) / max i = 1 , ... , n ( r h ( i , i m ) ) .
7. as claimed in claim 2 based on the Application of Data Mining that new random fractal is theoretical, it is characterised in that step In four, fractal method is recognized, including:
One) calculate j and meet again and collect time series Xj, its auto-correlation coefficient ρnAs n=1 ..., N and likelihood ratio rH (i, im)As h =1 ..., H, m=1 ..., M and i=1 ..., n;
Two) concentration class j is increased until meeting long memory level conditions LM (Xj) < δ1, kWith self similarity level conditions SShm(Xj) < δ2, kFor all h=1 ..., H and m=1 ..., wherein M, δ1, kAnd δ2, kIt it is two on the occasion of decreasing sequence of numbers;
Three) by changing δ1, kAnd δ2, kValue control long memory level and self similarity level.
8. as claimed in claim 1 based on the Application of Data Mining that new random fractal is theoretical, it is characterised in that identification Dynamic characteristic method includes:
θ based on t distributions1, θ2, the confidential interval calibrating of γ and return the statistic without hypothesis γ=0 opposition hypothesis γ > 0Calibrating, if
Evidence supports that data have the stable fixed point structure of NLARI, if
Evidence supports that data have ring stable period of NLARIStructure, if
Evidence supports that data have the unstable periodic ring of NLARIStructure.
9. as claimed in claim 8 based on the Application of Data Mining that new random fractal is theoretical, it is characterised in that calibrating Method, including:Use X=XjCalculate least square method regression straight linePass throughTo Δ Yt1Δ Yt-12g(Yt-1)+εtMake Least Square Method Δ Y theret=Yt-Yt-1, obtain parameter estimation
CN201710151985.1A 2017-03-15 2017-03-15 A kind of Application of Data Mining theoretical based on new random fractal Pending CN106909799A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710151985.1A CN106909799A (en) 2017-03-15 2017-03-15 A kind of Application of Data Mining theoretical based on new random fractal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710151985.1A CN106909799A (en) 2017-03-15 2017-03-15 A kind of Application of Data Mining theoretical based on new random fractal

Publications (1)

Publication Number Publication Date
CN106909799A true CN106909799A (en) 2017-06-30

Family

ID=59187087

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710151985.1A Pending CN106909799A (en) 2017-03-15 2017-03-15 A kind of Application of Data Mining theoretical based on new random fractal

Country Status (1)

Country Link
CN (1) CN106909799A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107644327A (en) * 2017-10-25 2018-01-30 西华大学 A kind of data processing method in project management system based on cloud computing

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107644327A (en) * 2017-10-25 2018-01-30 西华大学 A kind of data processing method in project management system based on cloud computing

Similar Documents

Publication Publication Date Title
Zhang et al. Modeling and simulating of reservoir operation using the artificial neural network, support vector regression, deep learning algorithm
Wei Picture fuzzy cross-entropy for multiple attribute decision making problems
CN104951425B (en) A kind of cloud service performance self-adapting type of action system of selection based on deep learning
Noori et al. RETRACTED: Results uncertainty of solid waste generation forecasting by hybrid of wavelet transform-ANFIS and wavelet transform-neural network
CN109960759A (en) Recommender system clicking rate prediction technique based on deep neural network
Adeleke et al. Prediction of municipal solid waste generation: an investigation of the effect of clustering techniques and parameters on ANFIS model performance
CN107562784A (en) Short text classification method based on ResLCNN models
CN107169454A (en) A kind of facial image age estimation method, device and its terminal device
CN107203810A (en) A kind of precipitation Forecasting Methodology based on depth network
CN103577876B (en) Based on feedforward neural network credible with insincere user identification method
Wei et al. Improving prediction accuracy of river discharge time series using a Wavelet-NAR artificial neural network
CN105931116A (en) Automated credit scoring system and method based on depth learning mechanism
Liu Some models for time series of counts
CN103324954A (en) Image classification method based on tree structure and system using same
Defourny et al. Scenario trees and policy selection for multistage stochastic programming using machine learning
CN110009108A (en) A kind of completely new quantum transfinites learning machine
CN108073978A (en) A kind of constructive method of the ultra-deep learning model of artificial intelligence
Chantasut et al. Predictive mining of rainfall predictions using artificial neural networks for Chao Phraya River
Liu et al. Cloud-vae: Variational autoencoder with concepts embedded
Chang et al. Intelligent fault diagnosis scheme via multi-module supervised-learning network with essential features capture-regulation strategy
CN104680023B (en) Oil pumper parameter optimization method based on multiobjective decision-making
CN106909799A (en) A kind of Application of Data Mining theoretical based on new random fractal
Wang et al. Knowledge discovery through self-organizing maps: Data visualization and query processing
Sarakon et al. Robust noise for human activity recognition using convolutional neural network
CN108073979A (en) A kind of ultra-deep study of importing artificial intelligence knows method for distinguishing for image

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170630

RJ01 Rejection of invention patent application after publication