CN108090138A - The monitoring method and system of a kind of data warehouse - Google Patents

The monitoring method and system of a kind of data warehouse Download PDF

Info

Publication number
CN108090138A
CN108090138A CN201711230833.7A CN201711230833A CN108090138A CN 108090138 A CN108090138 A CN 108090138A CN 201711230833 A CN201711230833 A CN 201711230833A CN 108090138 A CN108090138 A CN 108090138A
Authority
CN
China
Prior art keywords
data
increment
tables
time series
time period
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711230833.7A
Other languages
Chinese (zh)
Inventor
王勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lianjia Beijing Technology Co Ltd
Original Assignee
Lianjia Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lianjia Beijing Technology Co Ltd filed Critical Lianjia Beijing Technology Co Ltd
Priority to CN201711230833.7A priority Critical patent/CN108090138A/en
Publication of CN108090138A publication Critical patent/CN108090138A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides a kind of monitoring method of data warehouse, including:S1 obtains any data table in data warehouse, and integrating moving average model according to autoregression predicts data increment of the tables of data in the fixed time period of prediction day, and the data increment is the increase item number of data in tables of data;S2, calculates real data increment of the prediction day in fixed time period, and the real data increment with the data increment predicted is compared, monitors whether the tables of data carries out early warning.The present invention is compared by the way that the actual increment of the data gathered from data warehouse to be integrated to the increment of moving average model prediction with autoregression, early warning is carried out in actual increment and excessive prediction increment deviation, and realize and comprehensive monitoring is carried out to the data volume of each tables of data in data warehouse, it can be with the stability of effective guarantee data warehouse.

Description

The monitoring method and system of a kind of data warehouse
Technical field
The present invention relates to database processing technical field, a kind of monitoring method more particularly, to data warehouse and it is System.
Background technology
One data warehouse corresponds to multiple business datum sources.With business deepen constantly, it is necessary to the data analyzed not yet It is disconnected to increase, correspondingly, data warehouse task is various.There are substantial amounts of newly-increased data to be stored in data warehouse daily, if daily Occur exception in task processes, the quality of data can be influenced, and the data in the next time backward may be generated It influences.Therefore exception of the data in processing procedure is found in time, is handled.Data warehouse is very important.
A kind of method of data warehouse data monitoring is provided in the prior art, by performing timing monitor task and correlation A series of configurations can be achieved to be monitored the data cases of data warehouse newer table daily, can also realize because of business number According to source change and will output exception early warning, can find the various problems in program emerged in operation in time.
A kind of database monitoring method is additionally provided in the prior art, according to the information of each database, is obtained and each data The corresponding monitoring configuration file of information in storehouse;According to acquisition monitoring configuration file corresponding with the information of each database, utilize The monitoring programme corresponding with the information of each database write in advance, is monitored corresponding database.
But the prior art business routine data amount is not changed and may output exception early warning and to data The data volume of each tables of data in warehouse carries out comprehensive monitoring, can not effective guarantee data warehouse stability.
The content of the invention
The present invention provides a kind of monitoring for the data warehouse for overcoming the above problem or solving the above problems at least partly Method and system.
On the one hand, the present invention provides a kind of monitoring method of data warehouse, including:
S1 obtains any data table in data warehouse, and integrating moving average model according to autoregression predicts the data Data increment of the table in the fixed time period of prediction day, the data increment are the increase item number of data in tables of data;
S2 calculates real data increment of the prediction day in fixed time period, by the real data increment and in advance The data increment surveyed is compared, and monitors whether the tables of data carries out early warning.
Preferably, include before step S1:
Database name and data table name in the data warehouse, search any data in the data warehouse Table.
Preferably, the autoregression integration moving average model includes autoregression model, moving average model and autoregression Moving average model, the integration moving average model of autoregression described in step S1 are established by following steps:
The data increment of the fixed time period in described prediction continuous several weeks a few days ago is formed into time series, when will be described Between series processing be stationary time series;
According to the partial autocorrelation function and auto-correlation function of the stationary time series, establish and the stationary time series Corresponding autoregression integrates moving average model.
Preferably, it is described to include the time Series Processing for the specific steps of stationary time series:
It is nonstationary time series according to the time series, one is carried out to each data increment in the time series Secondary calculus of differences obtains new time series;
Judge whether the new time series is stationary time series, if it is not, previous step is then returned, until new Time series is stationary time series.
Preferably, the partial autocorrelation function and auto-correlation function according to the stationary time series, establish with it is described The corresponding autoregression integration moving average model of stationary time series specifically includes:
If the partial autocorrelation function is with truncation and auto-correlation function is with hangover property, establish and the stationary time The corresponding autoregression model of sequence;
If partial autocorrelation function is with hangover property and auto-correlation function is with truncation, establish and the stationary time series Corresponding moving average model;
If partial autocorrelation function and auto-correlation function are respectively provided with hangover, foundation is corresponding with the stationary time series certainly Regressive averaging model;
Wherein, the determination methods of the hangover property are function by exponential damping or into sine wave, the truncation Determination methods are that functional value is zero.
Preferably, fixed time period is obtained by following steps described in step S1:
Multiple special time periods were chosen in one week, gather the tables of data each specific time within continuous several weeks Data increment in section calculates the variance of the data increment in each special time period;
Compare the variance of data increment in all special time periods, using the special time period of variance minimum as described in Fixed time period.
Preferably, real data increment is obtained by following steps described in step S2:
Record the write time in each tables of data per data in the data warehouse;It is calculated by the said write time The data increment of tables of data in the fixed time period.
Preferably, the real data increment for calculating each prediction day in fixed time period specifically includes:
By the said write time, the primary data total amount of the tables of data when fixed time period starts is gathered, with And at the end of the fixed time period tables of data end total amount of data;
The end total amount of data is subtracted into the primary data total amount, obtains the tables of data in the fixed time period Data increment.
Preferably, the real data increment with the data increment predicted is compared described in step S2, supervised Controlling the tables of data and whether carrying out the specific steps of early warning includes:
The ratio of the real data increment and the prediction data increment is subtracted 1 as deviation, if the deviation Absolute value be more than 10%, to the tables of data carry out early warning.
On the other hand, the present invention provides a kind of monitoring system of data warehouse, including:
For obtaining any data table in data warehouse, sliding average mould is integrated according to autoregression for incremental forecasting module Type predicts data increment of the tables of data in the fixed time period of prediction day, and the data increment is data in tables of data Increase item number;
Increment comparing module, for calculating real data increment of the prediction day in fixed time period, by the reality Border data increment is compared with the data increment predicted, monitors whether the tables of data carries out early warning.
The monitoring method and system of a kind of data warehouse provided by the invention, pass through the data that will be gathered from data warehouse The increment of actual increment and autoregression integration moving average model prediction be compared, in actual increment with predicting increment deviation Early warning is carried out when excessive, and realizes and comprehensive monitoring is carried out to the data volume of each tables of data in data warehouse, it can With the stability of effective guarantee data warehouse.
Description of the drawings
Fig. 1 is the flow chart of the monitoring method of the data warehouse of one embodiment of the invention;
Fig. 2 is the structure diagram of the monitoring system of the data warehouse of one embodiment of the invention.
Specific embodiment
With reference to the accompanying drawings and examples, the specific embodiment of the present invention is described in further detail.Implement below Example is not limited to the scope of the present invention for illustrating the present invention.
It is the flow chart of the monitoring method of the data warehouse of one embodiment of the invention referring to Fig. 1, including:S1 obtains number According to any data table in warehouse, fixed time period of the moving average model prediction data table in prediction day is integrated according to autoregression Interior data increment, data increment are the increase item number of data in tables of data;S2 calculates reality of the prediction day in fixed time period Real data increment is compared border data increment with the data increment predicted, whether monitoring data table carries out early warning.
Specifically, autoregression integration moving average model is a kind of common time series predicting model, can be pre- by inciting somebody to action The data sequence that survey object is formed over time is considered as a random time series, then again with certain mathematical model Carry out approximate description this time series, this model be identified it is errorless after, so that it may from the past value of time series and value is come now Predict future value, the time series data predicted.The embodiment of the present invention by obtaining any one tables of data in data warehouse, And according to autoregression integrate moving average model come the data in prediction data table prediction day some fixed time period in Data increment obtains the prediction data increment of prediction day.Here prediction refers to any one or more futures to be predicted day Date, data increment then refer to the increase item number of the data in tables of data in fixed time period.
Total amount of data of the above-mentioned tables of data before and after the fixed time period of prediction day is detected, and the tables of data is calculated and exists The real data increment in the fixed time period of day is predicted, by real data increment and autoregression integration moving average model prediction Obtained prediction data increment is compared, and when deviation between the two is more than default difference, which is carried out pre- It is alert, it is whether abnormal by the data volume variation in any one tables of data in monitoring data warehouse, to realize to entire data bins The monitoring in storehouse.Wherein, alarm mode includes sending mail or SMS to relevant staff, but is not limited to This.
The embodiment of the present invention is flat by the way that the actual increment of the data gathered from data warehouse and autoregression integration are slided The increment of equal model prediction is compared, and carries out early warning in actual increment and excessive prediction increment deviation, and realizes logarithm Comprehensive monitoring is carried out according to the data volume of each tables of data in warehouse, it can be with the stability of effective guarantee data warehouse.
Based on above-described embodiment, as a kind of optional embodiment, database name and tables of data in data warehouse , any data table in searching data warehouse.
Specifically, the data that data warehouse is a subject-oriented, integrated, metastable, reflecting history changes Set.Data warehouse is a data source active set to multiple isomeries towards analytic type data processing for supporting decision-making Into.Mass data is store in the database of data warehouse, is on the basis that original scattered database data is extracted, cleared up On process, summarize and arrange by system, by different tables of data by these data classification ensembles in data warehouse In.Due to the table name of any data table in the database be all it is unique existing, database name and tables of data can be passed through Name, unique tables of data is found from data warehouse, any one tables of data in data warehouse can be found accordingly, and it is carried out Monitoring.
Based on above-described embodiment, as a kind of optional embodiment, autoregression integration moving average model includes autoregression Model, moving average model and autoregressive moving-average model, autoregression integration moving average model passes through following in step S1 Step is established:The data increment for predicting fixed time period in continuous several weeks a few days ago is formed into time series, by time Series Processing For stationary time series;According to the partial autocorrelation function and auto-correlation function of stationary time series, foundation and stationary time series Corresponding autoregression integrates moving average model.
Specifically, there are a fixed time period, which can be weekly the working day of the week, Can be specific certain day 7 points to 8 points of evening etc. weekly for each week it is fixed existing for a period.Obtain prediction Data increment of the tables of data in this fixed time period in continuous several weeks a few days ago, above-mentioned data increment have it is multiple, by institute Some data increments chronologically form a time series, the processing of calculus of differences are carried out to the time series, after being handled Stationary time series.According to the auto-correlation function and partial autocorrelation function of the stationary time series, can establish steady with this The corresponding autoregression integration moving average model of time series, i.e. autoregression model, moving average model and autoregression is slided flat One kind in this three class model of equal model.Wherein, auto-correlation function is the phase relation ordered series of numbers of stochastic variable and its lagged variable, is used To investigate the correlation intensity of stochastic variable and its lagged variable, in embodiments of the present invention, stochastic variable refers to form time sequence Each data increment of row, and lagged variable refers to from sequential than the variable of current variable hysteresis, for example, NtRefer to data In warehouse in certain tables of data yesterday data increment, and Nt-1 then represents the data increment of the day before yesterday, then it is stagnant to be referred to as monovalence by Nt-1 Variable afterwards.It should be noted that partial autocorrelation function is another method for describing random process structure feature.
Based on above-described embodiment, as a kind of optional embodiment, calculus of differences is carried out to time series, when obtaining steady Between the specific steps of sequence include:It is nonstationary time series according to time series, to each data increment in time series First difference computing is carried out, obtains new time series;Judge whether new time series is stationary time series, if it is not, Previous step is then returned to, until new time series is stationary time series.
Specifically, judge whether the time series that above-mentioned all data increments are chronologically formed is stationary time series, So-called stationary time series refers to that time series data is stable, without trend and periodic time series, the i.e. time series In the averages of all time series datas possess the amplitude of constant on a timeline, and the variance of time series data tends to be same on a timeline All statistical natures of one stationary value, the i.e. time series are all the constants on the time.When above-mentioned data increment chronologically When the time series of composition is nonstationary time series, calculus of differences, fortune are carried out to each data increment in time series A new time series is obtained after calculation.Judge whether new time series is stationary time series, if it is not, then continuing Calculus of differences is carried out to the data in the time series, new time series is generated again, until newly-generated time series is Until stabilization time sequence.
It should be noted that the time series for establishing autoregression integration moving average model must be stable, non-stationary Time series can not capture rule, such as the stock certificate data for usually being influenced and being fluctuated by policy and news, due to the number of share of stock According to being non-stable, therefore moving average model can not be integrated with autoregression to predict following stock certificate data.
Based on above-described embodiment, as a kind of optional embodiment, according to the partial autocorrelation function of stationary time series and Auto-correlation function is established autoregression integration moving average model corresponding with stationary time series and is specifically included:If partial autocorrelation Function is with truncation and auto-correlation function is with hangover property, establishes autoregression model corresponding with stationary time series;If partially Auto-correlation function is with hangover property and auto-correlation function is with truncation, establishes sliding average mould corresponding with stationary time series Type;If partial autocorrelation function and auto-correlation function are respectively provided with hangover, establish autoregression corresponding with stationary time series and slide Averaging model;The determination methods of hangover property press exponential damping or into sine wave for function, and the determination methods of truncation are letter Numerical value is zero.
Specifically, the stationary time series obtained according to above-described embodiment, Observable are corresponding with the stationary time series Auto-correlation function and partial autocorrelation function establish corresponding autoregression integration moving average model, i.e. autoregression model, slip is flat One kind in equal model and autoregressive moving-average model this three class model.
When partial autocorrelation function with truncation and auto-correlation function with hangover property when, establish and stationary time series pair The autoregression model answered, autoregression model are with the model for itself doing regression variable, that is, utilize the random change at early period at several moment The linear combination of amount is a kind of common shape in time series come the linear regression model (LRM) of certain moment stochastic variable after describing Formula can establish p rank autoregression model AR (p) by acquiring the hysteresis number p of the time series data used in prediction model in itself;When Partial autocorrelation function is with hangover property and auto-correlation function is with truncation, establishes sliding average corresponding with stationary time series Model, moving average model are a kind of according to time series data, item by item elapse, and are put down when calculating the sequence comprising certain item number successively Average to reflect the model of long-term trend, the hysteresis number q of the prediction error used in prediction model can be represented by acquiring to build Vertical q rank moving average model MA (q);If partial autocorrelation function and auto-correlation function are respectively provided with hangover, foundation and stationary time The corresponding autoregressive moving-average model of sequence, autoregressive moving-average model are the important methods of search time sequence, by certainly Regression model is formed with being mixed based on moving average model, it is necessary to acquire the time series data for representing and being used in prediction model in itself Hysteresis number p and represent the hysteresis number q of the prediction error used in prediction model, to establish autoregressive moving-average model ARMA (p, q).
It should be noted that for partial autocorrelation function and auto-correlation function, the determination methods of hangover property are function By exponential damping or into sine wave, and it is zero that the determination methods of truncation, which are functional value,.
Based on above-described embodiment, as a kind of optional embodiment, fixed time period is obtained by following steps in step S1 It takes:Multiple special time periods were chosen in one week, data of the gathered data table within continuous several weeks in each special time period increase Amount calculates the variance of the data increment in each special time period;Compare the variance of data increment in whole special time periods, it will The special time period of variance minimum is as fixed time period.
Specifically, since the content that is stored in each tables of data is different, weekly in different period to each tables of data The influence of middle data increment is also different.For example, when the sales volume of beer described in some tables of data, due to working day with The difference of nonworkdays, more people can select to drink to loosen at weekend, therefore sales volume of the beer at weekend has apparent increasing Add, working day, daily data increment then had apparent difference compared with weekend in tables of data at this time, continuous more data in the daytime Increment then has big difference, and has periodic variation.Some fixed time period is selected, and should in continuous several weeks a few days ago according to predicting , it is necessary to choose the data in fixed time period when data increment in fixed time period establishes autoregression integration moving average model Increment more they tends to stabilization, i.e., variance is smaller.
Therefore, multiple representative specific times that may be impacted to data increment were first chosen from one week Section, for example, can using working day or weekend daily as a special time period, can also by 9 points to 10 points of every morning, 7 points to 9 points of 2 points to 5 points of every afternoon or every night etc. daily a period of time as a special time period, for choosing Fixed each special time period, the data increment in gathered data table special time period all within continuous several weeks, and calculate The variance of these data increments.For example, 9 points of working day weekly, weekend and every morning are chosen respectively to 12 points as three Special time period, and the data increment in continuous surrounding in three special time periods is obtained respectively, acquire work weekly in surrounding Make the variance of daily data increment in day as first variance, the variance of daily data increment is second variance in weekend weekly, 9 points of variances to 12 point data increments of every morning are poor for third party, are respectively compared first variance, second variance and third party The size of difference, if finding, first variance is more than second variance, and first variance is poor less than third party, it is known that and third party's difference is minimum, Then the data increment in 9 points to 12 points this special time periods of every morning more they tends to stable state, therefore, selects every morning 9 points to 12 points are used as fixed time period, and the autoregression integration moving average model established with data increment in the period is predicted Data increment it is more accurate and more representative.
Based on above-described embodiment, as a kind of optional embodiment, real data increment passes through following steps in step S2 It obtains:Record the write time in each tables of data per data in data warehouse;Fixed time period is calculated by the write time The data increment of interior tables of data.
Specifically, when data write the tables of data in data warehouse, the write time per data is recorded, so as to basis The write time of data judges the total amount of data in data warehouse in any one tables of data in each period, calculates accordingly solid The data increment of any one tables of data in section of fixing time.
Based on above-described embodiment, as a kind of optional embodiment, actual number of the prediction day in fixed time period is calculated It is specifically included according to increment:By the write time, gather the primary data total amount of tables of data when fixed time period starts and fix The end total amount of data of tables of data at the end of period;It will terminate total amount of data and subtract primary data total amount, obtain the set time The data increment of tables of data in section.
Specifically, write time during tables of data is write according to data, period data when starting can be fixed The total amount of all data is as primary data total amount in table, and by the tables of data at the end of fixed time period all data it is total Amount is as end total amount of data, it is known that data increment at this time is terminates the difference between total amount of data and primary data total amount, i.e., It is the total amount increased or decreased from the data in tables of data in fixed time period start to finish this period.
Based on above-described embodiment, as a kind of optional embodiment, by real data increment and the number of prediction in step S2 It is compared according to increment, the specific steps whether monitoring data table carries out early warning include:By real data increment and the number of prediction Subtract 1 as deviation according to the ratio of increment, if the absolute value of deviation is more than 10%, early warning is carried out to tables of data.
Specifically, the data increment real data increment measured and autoregression integration moving average model predicted It is compared, calculates the size of the deviation of the two, know whether the tables of data should carry out early warning.By real data increment with The ratio of the data increment of prediction subtracts 1, you can the size of the deviation of the two is obtained, when the absolute value of deviation is more than threshold value When 10%, it is known that the deviation between real data increment and prediction data increment is excessive, then carries out early warning, Xiang Xiang to the tables of data The staff of pass sends warning information.It should be noted that when threshold value is too small, since there are multiple data in data warehouse Table, and each tables of data has different degrees of data volume update daily, threshold value is set too small, may cause data warehouse frequency The phenomenon that numerous early warning;And when threshold value is excessive, there may be the data in tables of data to generate missing and omit but do not carry out early warning Therefore measure, sets the threshold to 10%, can more effectively ensure the stability of database.
The optimum modeling period of any data table establishes autoregression integration and slides in selected data of embodiment of the present invention warehouse Dynamic averaging model, and the actual increment of the data gathered from data warehouse and autoregression integration moving average model are predicted Increment is compared, and carries out early warning in actual increment and excessive prediction increment deviation, realizes to each in data warehouse The data volume of tables of data all carries out comprehensive monitoring, can more effectively prevention data missing omit, being capable of effective guarantee number According to the stability in warehouse.
It is the structure diagram of the monitoring system of the data warehouse of one embodiment of the invention referring to Fig. 1, including:Increment Prediction module obtains any data table in data warehouse, and moving average model prediction data table is integrated pre- according to autoregression The data increment in the fixed time period of day is surveyed, data increment is the increase item number of data in tables of data;Increment comparing module is used In calculating real data increment of the prediction day in fixed time period, real data increment is compared with the data increment predicted Right, whether monitoring data table carries out early warning.
Specifically, in embodiments of the present invention, incremental forecasting module predicts the fixed time period predicted in day for acquisition The data increment of interior tables of data, and real data of the increment contrast module on the day of obtaining prediction day in fixed time period increases Amount, and real data increment is compared with the data increment predicted, judge whether deviation between the two belongs to normal model It encloses, if it is not, then carrying out early warning to the tables of data, effective prevention data missing is come with this and is omitted.It should be noted that data The implementation steps above method embodiment of the specific monitoring and early warning of any data table progress has elaborated in warehouse, herein not It is again to repeat more.
The embodiment of the present invention is by the present invention by by the actual increment of the data gathered from data warehouse and autoregression The increment of integration moving average model prediction is compared, and early warning is carried out in actual increment and excessive prediction increment deviation, and It realizes and comprehensive monitoring is carried out to the data volume of each tables of data in data warehouse, it can be with effective guarantee data warehouse Stability.
Finally, the present processes are only preferable embodiment, are not intended to limit the scope of the present invention.It is all Within the spirit and principles in the present invention, any modifications, equivalent replacements and improvements are made should be included in the protection of the present invention Within the scope of.

Claims (10)

1. a kind of monitoring method of data warehouse, which is characterized in that including:
S1 obtains any data table in data warehouse, and integrating moving average model according to autoregression predicts that the tables of data exists Predict the data increment in the fixed time period of day, the data increment is the increase item number of data in tables of data;
S2 calculates real data increment of the prediction day in fixed time period, by the real data increment and prediction The data increment is compared, and monitors whether the tables of data carries out early warning.
2. the monitoring method of data warehouse according to claim 1, which is characterized in that include before step S1:
Database name and data table name in the data warehouse, search any data table in the data warehouse.
3. the monitoring method of data warehouse according to claim 1, the autoregression integration moving average model is included certainly Regression model, moving average model and autoregressive moving-average model, which is characterized in that the integration of autoregression described in step S1 is slided Dynamic averaging model is established by following steps:
The data increment of the fixed time period in described prediction continuous several weeks a few days ago is formed into time series, by the time sequence Column processing is stationary time series;
According to the partial autocorrelation function and auto-correlation function of the stationary time series, establish corresponding with the stationary time series Autoregression integration moving average model.
4. the monitoring method of data warehouse according to claim 3, which is characterized in that described by the time Series Processing Include for the specific steps of stationary time series:
It is nonstationary time series according to the time series, first difference is carried out to each data increment in the time series Partite transport is calculated, and obtains new time series;
Judge whether the new time series is stationary time series, if it is not, previous step is then returned to, until the new time Sequence is stationary time series.
5. the monitoring method of data warehouse according to claim 3, which is characterized in that described according to the stationary time sequence The partial autocorrelation function and auto-correlation function of row establish autoregression integration sliding average mould corresponding with the stationary time series Type specifically includes:
If the partial autocorrelation function is with truncation and auto-correlation function is with hangover property, establish and the stationary time series Corresponding autoregression model;
If partial autocorrelation function is with hangover property and auto-correlation function is with truncation, establish corresponding with the stationary time series Moving average model;
If partial autocorrelation function and auto-correlation function are respectively provided with hangover, autoregression corresponding with the stationary time series is established Moving average model;
Wherein, the determination methods of the hangover property are function by exponential damping or into sine wave, the judgement of the truncation Method is that functional value is zero.
6. the monitoring method of data warehouse according to claim 1, which is characterized in that fixed time period described in step S1 It is obtained by following steps:
Multiple special time periods were chosen in one week, gather the tables of data within continuous several weeks in each special time period Data increment, calculate the variance of the data increment in each special time period;
Compare the variance of data increment in all special time periods, using the special time period of variance minimum as the fixation Period.
7. the monitoring method of data warehouse according to claim 1, which is characterized in that real data described in step S2 increases Amount is obtained by following steps:
Record the write time in each tables of data per data in the data warehouse;By described in the calculating of said write time The data increment of tables of data in fixed time period.
8. the monitoring method of data warehouse according to claim 7, which is characterized in that described to calculate each prediction day Real data increment in fixed time period specifically includes:
By the said write time, the primary data total amount of the tables of data when fixed time period starts, Yi Jisuo are gathered State the end total amount of data of the tables of data at the end of fixed time period;
The end total amount of data is subtracted into the primary data total amount, obtains the number of the tables of data in the fixed time period According to increment.
9. the monitoring method of data warehouse according to claim 1, which is characterized in that by the reality described in step S2 Data increment is compared with the data increment predicted, monitors the specific steps the bag whether tables of data carries out early warning It includes:
The ratio of the real data increment and the prediction data increment is subtracted 1 as deviation, if the deviation is exhausted 10% is more than to value, early warning is carried out to the tables of data.
10. a kind of monitoring system of data warehouse, which is characterized in that including:
Incremental forecasting module obtains any data table in data warehouse, and moving average model prediction institute is integrated according to autoregression Data increment of the tables of data in the fixed time period of prediction day is stated, the data increment is the increase item of data in tables of data Number;
Increment comparing module, for calculating real data increment of the prediction day in fixed time period, by the actual number It is compared according to increment with the data increment predicted, monitors whether the tables of data carries out early warning.
CN201711230833.7A 2017-11-29 2017-11-29 The monitoring method and system of a kind of data warehouse Pending CN108090138A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711230833.7A CN108090138A (en) 2017-11-29 2017-11-29 The monitoring method and system of a kind of data warehouse

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711230833.7A CN108090138A (en) 2017-11-29 2017-11-29 The monitoring method and system of a kind of data warehouse

Publications (1)

Publication Number Publication Date
CN108090138A true CN108090138A (en) 2018-05-29

Family

ID=62173442

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711230833.7A Pending CN108090138A (en) 2017-11-29 2017-11-29 The monitoring method and system of a kind of data warehouse

Country Status (1)

Country Link
CN (1) CN108090138A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109543993A (en) * 2018-11-20 2019-03-29 青海黄河上游水电开发有限责任公司光伏产业技术分公司 Analyze method, computer storage medium and the computer equipment of photovoltaic plant
CN109684300A (en) * 2018-11-20 2019-04-26 成都四方伟业软件股份有限公司 One kind being based on visual big data warehouse design method and system
CN110083638A (en) * 2019-04-24 2019-08-02 广东联合电子服务股份有限公司 A kind of regular base construction method of delay and data retention analysis method
CN110134680A (en) * 2019-04-04 2019-08-16 平安科技(深圳)有限公司 Space monitoring method, device, computer equipment and storage medium
CN110210959A (en) * 2019-06-10 2019-09-06 广发证券股份有限公司 Analysis method, device and the storage medium of financial data
CN110851325A (en) * 2019-11-08 2020-02-28 深圳市彬讯科技有限公司 Method, device and equipment for monitoring data warehouse based on Hive table
CN110888775A (en) * 2019-11-08 2020-03-17 深圳市彬讯科技有限公司 Method, device and equipment for monitoring data warehouse by data balance
CN113407422A (en) * 2021-08-20 2021-09-17 太平金融科技服务(上海)有限公司深圳分公司 Data abnormity alarm processing method and device, computer equipment and storage medium
CN113821409A (en) * 2021-09-23 2021-12-21 中国建设银行股份有限公司 Method, device, storage medium and equipment for monitoring data transmission
CN114322634A (en) * 2021-12-29 2022-04-12 博锐尚格科技股份有限公司 Data screening method and device for refrigerating system strategy model
CN113821409B (en) * 2021-09-23 2024-06-04 中国建设银行股份有限公司 Method, device, storage medium and equipment for monitoring data transmission

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104301895A (en) * 2014-09-28 2015-01-21 北京邮电大学 Double-layer trigger intrusion detection method based on flow prediction
CN104581749A (en) * 2013-10-11 2015-04-29 北京亿阳信通科技有限公司 A method and device for predicting service amount of data service of mobile network
CN105095056A (en) * 2015-08-14 2015-11-25 焦点科技股份有限公司 Method for monitoring data in data warehouse
CN106157163A (en) * 2016-07-27 2016-11-23 河南工业大学 A kind of grain yield short term prediction method and device
CN106844180A (en) * 2017-02-07 2017-06-13 山东浪潮云服务信息科技有限公司 A kind of monitoring and controlling forecast method of OpenStack platforms computing resource

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104581749A (en) * 2013-10-11 2015-04-29 北京亿阳信通科技有限公司 A method and device for predicting service amount of data service of mobile network
CN104301895A (en) * 2014-09-28 2015-01-21 北京邮电大学 Double-layer trigger intrusion detection method based on flow prediction
CN105095056A (en) * 2015-08-14 2015-11-25 焦点科技股份有限公司 Method for monitoring data in data warehouse
CN106157163A (en) * 2016-07-27 2016-11-23 河南工业大学 A kind of grain yield short term prediction method and device
CN106844180A (en) * 2017-02-07 2017-06-13 山东浪潮云服务信息科技有限公司 A kind of monitoring and controlling forecast method of OpenStack platforms computing resource

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109543993A (en) * 2018-11-20 2019-03-29 青海黄河上游水电开发有限责任公司光伏产业技术分公司 Analyze method, computer storage medium and the computer equipment of photovoltaic plant
CN109684300A (en) * 2018-11-20 2019-04-26 成都四方伟业软件股份有限公司 One kind being based on visual big data warehouse design method and system
CN109543993B (en) * 2018-11-20 2023-04-18 青海黄河上游水电开发有限责任公司光伏产业技术分公司 Method for analyzing photovoltaic power station, computer storage medium and computer device
CN110134680A (en) * 2019-04-04 2019-08-16 平安科技(深圳)有限公司 Space monitoring method, device, computer equipment and storage medium
CN110083638A (en) * 2019-04-24 2019-08-02 广东联合电子服务股份有限公司 A kind of regular base construction method of delay and data retention analysis method
CN110210959A (en) * 2019-06-10 2019-09-06 广发证券股份有限公司 Analysis method, device and the storage medium of financial data
CN110888775A (en) * 2019-11-08 2020-03-17 深圳市彬讯科技有限公司 Method, device and equipment for monitoring data warehouse by data balance
CN110851325A (en) * 2019-11-08 2020-02-28 深圳市彬讯科技有限公司 Method, device and equipment for monitoring data warehouse based on Hive table
CN110851325B (en) * 2019-11-08 2024-03-15 土巴兔集团股份有限公司 Method, device and equipment for monitoring data warehouse based on Hive table
CN110888775B (en) * 2019-11-08 2024-04-09 土巴兔集团股份有限公司 Method, device and equipment for monitoring data warehouse by utilizing data balance
CN113407422A (en) * 2021-08-20 2021-09-17 太平金融科技服务(上海)有限公司深圳分公司 Data abnormity alarm processing method and device, computer equipment and storage medium
CN113407422B (en) * 2021-08-20 2021-11-09 太平金融科技服务(上海)有限公司深圳分公司 Data abnormity alarm processing method and device, computer equipment and storage medium
CN113821409A (en) * 2021-09-23 2021-12-21 中国建设银行股份有限公司 Method, device, storage medium and equipment for monitoring data transmission
CN113821409B (en) * 2021-09-23 2024-06-04 中国建设银行股份有限公司 Method, device, storage medium and equipment for monitoring data transmission
CN114322634A (en) * 2021-12-29 2022-04-12 博锐尚格科技股份有限公司 Data screening method and device for refrigerating system strategy model

Similar Documents

Publication Publication Date Title
CN108090138A (en) The monitoring method and system of a kind of data warehouse
US7788127B1 (en) Forecast model quality index for computer storage capacity planning
Segnon et al. Modeling and forecasting the volatility of carbon dioxide emission allowance prices: A review and comparison of modern volatility models
Raggi et al. Long memory and nonlinearities in realized volatility: a Markov switching approach
Marcellino A linear benchmark for forecasting GDP growth and inflation?
Mancini et al. Spot volatility estimation using delta sequences
Berger et al. Nowcasting the output gap
US20100010869A1 (en) Demand curve analysis method for predicting forecast error
CN106649832B (en) Estimation method and device based on missing data
US20150235133A1 (en) Data concentration prediction device, data concentration prediction method, and recording medium recording program thereof
Marczak et al. Outlier detection in structural time series models: The indicator saturation approach
US20190303863A1 (en) Database modification for improved on-shelf availability determination
Bergman et al. A Bayesian approach to demand forecasting for new equipment programs
CN110909306B (en) Business abnormality detection method and device, electronic equipment and storage equipment
CN113347057B (en) Abnormal data detection method and device, electronic equipment and storage medium
Clements et al. Data revisions and real-time forecasting
CN112150205A (en) Price prediction method and device and electronic equipment
CN112070284A (en) Screening method, device, equipment and storage medium for component prediction
Bisi et al. Dynamic learning, pricing, and ordering by a censored newsvendor
Huseby et al. Discrete event simulation methods applied to advanced importance measures of repairable components in multistate network flow systems
Bjørnland et al. Forecasting inflation with an uncertain output gap
CN107590244B (en) Method and device for identifying offline activity scene of mobile equipment
US7783509B1 (en) Determining that a change has occured in response to detecting a burst of activity
CN110992189A (en) Resource data estimation method, resource data estimation device, computer equipment and storage medium
Thury et al. Forecasting industrial production using structural time series models

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180529