CN104199942A - Hadoop platform time series data incremental computation method and system - Google Patents

Hadoop platform time series data incremental computation method and system Download PDF

Info

Publication number
CN104199942A
CN104199942A CN201410456262.9A CN201410456262A CN104199942A CN 104199942 A CN104199942 A CN 104199942A CN 201410456262 A CN201410456262 A CN 201410456262A CN 104199942 A CN104199942 A CN 104199942A
Authority
CN
China
Prior art keywords
time series
series data
computing
sub
historical
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410456262.9A
Other languages
Chinese (zh)
Other versions
CN104199942B (en
Inventor
孙广中
王丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Science and Technology of China USTC
Original Assignee
University of Science and Technology of China USTC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology of China USTC filed Critical University of Science and Technology of China USTC
Priority to CN201410456262.9A priority Critical patent/CN104199942B/en
Publication of CN104199942A publication Critical patent/CN104199942A/en
Application granted granted Critical
Publication of CN104199942B publication Critical patent/CN104199942B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a Hadoop platform time series data incremental computation method and system. The method includes the steps that when a time series data incremental computation task is started, the historical computational state of time series data is obtained from a cache server; incremental computation is carried out by means of a segmented time series data incremental computation method containing a SubCp sub-operation and a ReduceCp sub-operation according to the historical computational state, wherein the SubCp sub-operation is used for self-definition of segmented time series data and storage of intermediate results, the ReduceCp sub-operation is carried out in an operation merging stage and used for merging operation on computed results of the segmented time series data according to the self-defined operation, and the computational state of the SubCp sub-operation and the computational state of the ReduceCp sub-operation are maintained through the cache server. By the adoption of the method and system, plenty of unnecessary repetitive computation can be saved through incremental computation, and therefore data processing efficiency is improved.

Description

A kind of Hadoop platform time series data incremental calculation method and system
Technical field
The present invention relates to field of computer technology, relate in particular to a kind of Hadoop platform time series data incremental calculation method.
Background technology
Along with the develop rapidly of current Internet technology, the widespread use of information acquiring technology etc. produces and has accumulated the various data that exist with time series form of magnanimity in many science-cum-industry fields such as telecommunications, meteorology, geology, electric power, finance.Traditional time Series Processing method is generally to select the relevant mathematics computational tools such as Matlab to carry out, but in the time that the problem scale of processing becomes large, problem often allows people insufferable computing time.
Current, along with large data processing is taken seriously gradually, some companies, research institution have also started the research of this respect, and related work mainly concentrates on Hadoop and increases income on Distributed Computing Platform.Hadoop is as a Distributed Architecture, can distributed operation mass data, there are a lot of advantages processing in mass data, such as thering is the features such as high fault tolerance, high scalability, high reliability.
At present, to time series data, processing does not provide good support to Hadoop platform, and fewer to the incremental computations correlative study of time series data, needs double counting while causing time series data newly-increased, thereby reduces the efficiency of data processing.
Summary of the invention
The object of this invention is to provide a kind of Hadoop platform time series data incremental calculation method and system, can save a large amount of unnecessary double countings by incremental computations, thereby improved the efficiency of data processing.
The object of the invention is to be achieved through the following technical solutions:
A kind of Hadoop platform time series data incremental calculation method, the method comprises:
In the time starting time series data incremental computations task, from caching server, obtain the historical computing mode of this time series data;
The subsection timing sequence data increment computing method that comprise SubCp and the sub-computing of ReduceCP according to described historical computing mode utilization are carried out incremental computations;
Wherein, the sub-computing of SubCp is for carrying out self-defining sub-computing and intermediate result is preserved subsection timing sequence data respectively; The sub-computing of ReduceCP is computing merging phase, the result of calculation merger operation according to self-defining operation to subsection timing sequence data, and the computing mode of described SubCp and the sub-computing of ReduceCP is safeguarded by caching server.
A kind of Hadoop platform time series data incremental computations system, this system comprises:
Time series data incremental processing module TSI in the time starting time series data incremental computations task, obtains the historical computing mode of this time series data from caching server; The subsection timing sequence data increment computing method that comprise SubCp and the sub-computing of ReduceCP according to described historical computing mode utilization are carried out incremental computations; Wherein, the sub-computing of SubCp is for carrying out self-defining sub-computing and intermediate result is preserved subsection timing sequence data respectively; The sub-computing of ReduceCP is computing merging phase, the result of calculation merger operation according to self-defining operation to subsection timing sequence data, and the computing mode of described SubCp and the sub-computing of ReduceCP is safeguarded by caching server;
Caching server, for preserving the historical computing mode of time series data.
As seen from the above technical solution provided by the invention, by the historical computing mode of caching server buffer memory time series data, in the time starting incremental computations, according to the historical computing mode getting, directly carry out the calculating of incremental data, multiplexing historical result of calculation fast again, has avoided unnecessary double counting, thereby has improved the efficiency of data processing.
Brief description of the drawings
In order to be illustrated more clearly in the technical scheme of the embodiment of the present invention, below the accompanying drawing of required use during embodiment is described is briefly described, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, do not paying under the prerequisite of creative work, can also obtain other accompanying drawings according to these accompanying drawings.
The process flow diagram of a kind of Hadoop platform time series data incremental calculation method that Fig. 1 provides for the embodiment of the present invention one;
The schematic diagram of a kind of time series data fragmentation scheme that Fig. 2 provides for the embodiment of the present invention one;
The schematic diagram of a kind of subsection timing sequence data increment computing method that Fig. 3 provides for the embodiment of the present invention one;
The schematic diagram of the moving window incremental calculation method of a kind of stationary window width with state that Fig. 4 provides for the embodiment of the present invention one;
The schematic diagram of a kind of incremental calculation method with the fixing monotone increasing window of the starting point of state that Fig. 5 provides for the embodiment of the present invention one;
The schematic diagram of a kind of Hadoop platform time series data incremental computations system that Fig. 6 provides for the embodiment of the present invention two;
The schematic diagram that the existing Hadoop platform that Fig. 7 provides for the embodiment of the present invention two and incremental computations system are mutually integrated.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is clearly and completely described, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiment.Based on embodiments of the invention, those of ordinary skill in the art, not making the every other embodiment obtaining under creative work prerequisite, belong to protection scope of the present invention.
Embodiment mono-
The process flow diagram of a kind of Hadoop platform time series data incremental calculation method that Fig. 1 provides for the embodiment of the present invention one.As shown in Figure 1, the method mainly comprises the steps:
Step 11, in the time starting time series data incremental computations task, from caching server, obtain the historical computing mode of this time series data.
Wherein, described time series data, is divided into multiple segmentations continuous time series data taking section sometime as unit, and the time series data computing in each unit interval section is a sub-computing; And time series data after segmentation need to meet the one semigroup character.
Described time series data incremental computations task indicates newly-increased subsection timing sequence data.
Step 12, the subsection timing sequence data increment computing method that comprise SubCp and the sub-computing of ReduceCP according to described historical computing mode utilization are carried out incremental computations.
Wherein, the sub-computing of SubCp is for carrying out self-defining sub-computing and intermediate result is preserved subsection timing sequence data respectively; The sub-computing of ReduceCP is computing merging phase, the result of calculation merger operation according to self-defining operation to subsection timing sequence data, and the computing mode of described SubCp and the sub-computing of ReduceCP is safeguarded by caching server.
Further, described subsection timing sequence data increment computing method comprise:
Moving window incremental calculation method with the stationary window width of state: the historical computing mode of the time series data that described state representation caching server is safeguarded, described window width is fixing represents that the time period number comprising fixes; If the width of window is fixed as n, and the 1st time series data to n time period has completed and has calculated and deposit in described caching server, in the time having n+1 newly-increased time series data to arrive, according to the historical computing mode of this time series data in caching server, utilize the sub-computing of SubCp only to carry out the calculating of n+1 newly-increased time series data, the result merger in n+1 newly-increased time series data and historical computing mode is carried out in the sub-computing of recycling ReduceCP, and deducts the time series data of the 1st time period;
Incremental calculation method with the fixing monotone increasing window of the starting point of state: the historical computing mode of the time series data that described state representation caching server is safeguarded, its window start time point is fixed, and the size of window increases progressively in time; If the starting point of window is the time series data of the 1st time period, and the 1st time series data to n time period has completed and has calculated and deposit in described caching server, in the time having n+1 newly-increased time series data to arrive, according to the historical computing mode of this time series data in caching server, utilize the sub-computing of SubCp only to carry out the calculating of n+1 newly-increased time series data, the result merger in n+1 newly-increased time series data and historical computing mode is carried out in the sub-computing of recycling ReduceCP.
For the ease of understanding, below in conjunction with accompanying drawing, 2-5 is described further the present invention.
As shown in Figure 2, for just following the schematic diagram that the time series data fragmentation scheme providing is provided.As shown in Figure 2, for time series data, can continuous time series data be divided into multiple sections as unit taking section sometime, the time series data computing in each like this unit interval section is a sub-computing.Wherein, the sub-computing after division need to meet the one semigroup character, can carry out Merging to corresponding sub-computing.
As shown in Figure 3, for subsection timing sequence data increment calculation flow chart, this process has been utilized the subsection timing sequence data mechanism of Fig. 2, these computing method comprise two sub-computings of sub-computing: SubCp and the sub-computing of ReduceCP, wherein, the sub-computing of SubCp is for carrying out self-defining sub-computing and intermediate result is preserved subsection timing sequence data respectively; Exemplary, in the subsection timing sequence data of statistics taking sky as unit, certain page access flow of website in each time period.ReduceCP is computing merging phase, the result of calculation merger operation according to self-defining operation to segment data; Exemplary, in the subsection timing sequence data of merger taking sky as unit, the n days total flowing of access of certain page of this website recently.And the state of the sub-computing of above-mentioned SubCp and the sub-computing of ReduceCP is safeguarded by caching server (Cache Server).
The embodiment of the present invention can be saved a large amount of unnecessary double countings by incremental computations, thereby has improved the efficiency of data processing; In the embodiment of the present invention, in conjunction with the correlation properties of subsection timing sequence data increment computing method and subsection timing sequence data, two kinds of moving window incremental calculation methods with state are proposed: fixed width window, the time period number that window comprises is fixed; Monotone increasing window, window start time point is fixed, and passes in time window size and increases progressively.Specific as follows:
Shown in Fig. 4, be the moving window incremental computations of the stationary window width with state, described state refers to the correlation computations state that Cache Server safeguards.In conjunction with correlation properties and the incremental calculation method of subsection timing sequence data in Fig. 2,3, as shown in Figure 4, here the width of supposing window is fixed as n, in the time having n+1 newly-increased time series data to arrive, data (the 1st to n segments order sequenced data) on the left of learning according to the historical computing mode in Cache Server had been calculated, now only need to calculate incremental data (n+1 newly-increased time series data) and and the merger of part historical results just can obtain results needed, after merger, also need to deduct the 1st segments order sequenced data because the width of window is fixed as n; The newly-increased number sequence data of final combination and historical result of calculation can obtain and carry out global data and calculate the same result, and this method can be avoided a large amount of unnecessary double countings, thereby have improved the efficiency of data processing.
As shown in Figure 5, be the incremental computations of the fixing monotone increasing window of the starting point with state, described state refers to the correlation computations state that Cache Server safeguards.In conjunction with Fig. 2, the correlation properties of subsection timing sequence data and incremental calculation method in 3, as shown in Figure 5, the starting point of supposing window is 1, in the time having n+1 newly-increased number sequence data to arrive, data (the 1st to n segments order sequenced data) on the left of learning according to the historical computing mode in Cache Server had been calculated, now only need to calculate incremental data (n+1 newly-increased time series data) and and the merger of part historical results just can obtain results needed, the newly-increased data of final combination and historical result of calculation can obtain and carry out global data and calculate the same result, this method can be avoided a large amount of unnecessary double countings, thereby improve the efficiency of data processing.
On the other hand, the caching server in the embodiment of the present invention also can arrange timing mechanism to the data of inserting, and it is identifying and removing useless legacy data to guarantee the memory database expansion of can breaking after section sometime.
Meanwhile, time series data computational algorithm can also be combined with incremental calculation method provided by the invention; Wherein, time series data computational algorithm comprises the algorithm that following conventional time series is calculated: time series forecasting algorithm, comprises simple sequential average method, moving average method, weighted moving average method etc.; Time Series Similarity metric algorithm, comprises ED, DTW, FastDTW etc.
The technical scheme that the embodiment of the present invention provides compared with prior art, has following beneficial effect:
1) based on Hadoop platform, do not change Hadoop bottom architecture structure, facilitate programming personnel's coding;
2) on Hadoop platform, support the processing of time series data;
3) incremental computations of the time series data of support Hadoop platform, reduces unnecessary double counting, improves incremental data counting yield.
Embodiment bis-
The schematic diagram of a kind of Hadoop platform time series data incremental computations system that Fig. 6 provides for the embodiment of the present invention two.As shown in Figure 6, this system mainly comprises:
Time series data incremental processing module TSI11 in the time starting time series data incremental computations task, obtains the historical computing mode of this time series data from caching server; The subsection timing sequence data increment computing method that comprise SubCp and the sub-computing of ReduceCP according to described historical computing mode utilization are carried out incremental computations; Wherein, the sub-computing of SubCp is for carrying out self-defining sub-computing and intermediate result is preserved subsection timing sequence data respectively; The sub-computing of ReduceCP is computing merging phase, the result of calculation merger operation according to self-defining operation to subsection timing sequence data, and the computing mode of described SubCp and the sub-computing of ReduceCP is safeguarded by caching server;
Caching server 12, for preserving the historical computing mode of time series data.
Further, described subsection timing sequence data increment computing method comprise:
Moving window incremental calculation method with the stationary window width of state: the historical computing mode of the time series data that described state representation caching server is safeguarded, described window width is fixing represents that the time period number comprising fixes; If the width of window is fixed as n, and the 1st time series data to n time period has completed and has calculated and deposit in described caching server, in the time having n+1 newly-increased time series data to arrive, according to the historical computing mode of this time series data in caching server, utilize the sub-computing of SubCp only to carry out the calculating of n+1 newly-increased time series data, the result merger in n+1 newly-increased time series data and historical computing mode is carried out in the sub-computing of recycling ReduceCP, and deducts the time series data of the 1st time period;
Incremental calculation method with the fixing monotone increasing window of the starting point of state: the historical computing mode of the time series data that described state representation caching server is safeguarded, its window start time point is fixed, and the size of window increases progressively in time; If the starting point of window is the time series data of the 1st time period, and the 1st time series data to n time period has completed and has calculated and deposit in described caching server, in the time having n+1 newly-increased time series data to arrive, according to the historical computing mode of this time series data in caching server, utilize the sub-computing of SubCp only to carry out the calculating of n+1 newly-increased time series data, the result merger in n+1 newly-increased time series data and historical computing mode is carried out in the sub-computing of recycling ReduceCP.
Further, described time series data, is divided into multiple segmentations continuous time series data taking section sometime as unit, and the time series data computing in each unit interval section is a sub-computing; Wherein, the time series data after segmentation meets the one semigroup character.
Because native system can be realized based on Hadoop platform, for ease of understanding, above-mentioned module can be combined with existing Hadoop platform.As shown in Figure 7, based on Hadoop platform extension caching server Cache Server and time series data incremental processing module TSI; Caching server is data cached library module, its buffer memory necessary computing mode result, the buffer memory service that comparing Hadoop self provides has abundanter data structure to represent function; TSI module is mainly used in time series data incremental computations.
It should be noted that, in the specific implementation of the function that each functional module comprising in said system realizes each embodiment above, have a detailed description, therefore here repeat no more.
Those skilled in the art can be well understood to, for convenience and simplicity of description, only be illustrated with the division of above-mentioned each functional module, in practical application, can above-mentioned functions be distributed and completed by different functional modules as required, be divided into different functional modules by the inner structure of system, to complete all or part of function described above.
Through the above description of the embodiments, those skilled in the art can be well understood to above-described embodiment and can realize by software, and the mode that also can add necessary general hardware platform by software realizes.Based on such understanding, the technical scheme of above-described embodiment can embody with the form of software product, it (can be CD-ROM that this software product can be stored in a non-volatile memory medium, USB flash disk, portable hard drive etc.) in, comprise that some instructions are in order to make a computer equipment (can be personal computer, server, or the network equipment etc.) carry out the method described in each embodiment of the present invention.
The above; only for preferably embodiment of the present invention, but protection scope of the present invention is not limited to this, is anyly familiar with in technical scope that those skilled in the art disclose in the present invention; the variation that can expect easily or replacement, within all should being encompassed in protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of claims.

Claims (6)

1. a Hadoop platform time series data incremental calculation method, is characterized in that, the method comprises:
In the time starting time series data incremental computations task, from caching server, obtain the historical computing mode of this time series data;
The subsection timing sequence data increment computing method that comprise SubCp and the sub-computing of ReduceCP according to described historical computing mode utilization are carried out incremental computations;
Wherein, the sub-computing of SubCp is for carrying out self-defining sub-computing and intermediate result is preserved subsection timing sequence data respectively; The sub-computing of ReduceCP is computing merging phase, the result of calculation merger operation according to self-defining operation to subsection timing sequence data, and the computing mode of described SubCp and the sub-computing of ReduceCP is safeguarded by caching server.
2. method according to claim 1, is characterized in that, described subsection timing sequence data increment computing method comprise:
Moving window incremental calculation method with the stationary window width of state: the historical computing mode of the time series data that described state representation caching server is safeguarded, described window width is fixing represents that the time period number comprising fixes; If the width of window is fixed as n, and the 1st time series data to n time period has completed and has calculated and deposit in described caching server, in the time having n+1 newly-increased time series data to arrive, according to the historical computing mode of this time series data in caching server, utilize the sub-computing of SubCp only to carry out the calculating of n+1 newly-increased time series data, the result merger in n+1 newly-increased time series data and historical computing mode is carried out in the sub-computing of recycling ReduceCP, and deducts the time series data of the 1st time period;
Incremental calculation method with the fixing monotone increasing window of the starting point of state: the historical computing mode of the time series data that described state representation caching server is safeguarded, its window start time point is fixed, and the size of window increases progressively in time; If the starting point of window is the time series data of the 1st time period, and the 1st time series data to n time period has completed and has calculated and deposit in described caching server, in the time having n+1 newly-increased time series data to arrive, according to the historical computing mode of this time series data in caching server, utilize the sub-computing of SubCp only to carry out the calculating of n+1 newly-increased time series data, the result merger in n+1 newly-increased time series data and historical computing mode is carried out in the sub-computing of recycling ReduceCP.
3. method according to claim 1 and 2, is characterized in that, described time series data is divided into multiple segmentations continuous time series data taking section sometime as unit, and the time series data computing in each unit interval section is a sub-computing; Wherein, the time series data after segmentation meets the one semigroup character.
4. a Hadoop platform time series data incremental computations system, is characterized in that, this system comprises:
Time series data incremental processing module TSI in the time starting time series data incremental computations task, obtains the historical computing mode of this time series data from caching server; The subsection timing sequence data increment computing method that comprise SubCp and the sub-computing of ReduceCP according to described historical computing mode utilization are carried out incremental computations; Wherein, the sub-computing of SubCp is for carrying out self-defining sub-computing and intermediate result is preserved subsection timing sequence data respectively; The sub-computing of ReduceCP is computing merging phase, the result of calculation merger operation according to self-defining operation to subsection timing sequence data, and the computing mode of described SubCp and the sub-computing of ReduceCP is safeguarded by caching server;
Caching server, for preserving the historical computing mode of time series data.
5. system according to claim 4, is characterized in that, described subsection timing sequence data increment computing method comprise:
Moving window incremental calculation method with the stationary window width of state: the historical computing mode of the time series data that described state representation caching server is safeguarded, described window width is fixing represents that the time period number comprising fixes; If the width of window is fixed as n, and the 1st time series data to n time period has completed and has calculated and deposit in described caching server, in the time having n+1 newly-increased time series data to arrive, according to the historical computing mode of this time series data in caching server, utilize the sub-computing of SubCp only to carry out the calculating of n+1 newly-increased time series data, the result merger in n+1 newly-increased time series data and historical computing mode is carried out in the sub-computing of recycling ReduceCP, and deducts the time series data of the 1st time period;
Incremental calculation method with the fixing monotone increasing window of the starting point of state: the historical computing mode of the time series data that described state representation caching server is safeguarded, its window start time point is fixed, and the size of window increases progressively in time; If the starting point of window is the time series data of the 1st time period, and the 1st time series data to n time period has completed and has calculated and deposit in described caching server, in the time having n+1 newly-increased time series data to arrive, according to the historical computing mode of this time series data in caching server, utilize the sub-computing of SubCp only to carry out the calculating of n+1 newly-increased time series data, the result merger in n+1 newly-increased time series data and historical computing mode is carried out in the sub-computing of recycling ReduceCP.
6. according to the system described in claim 4 or 5, it is characterized in that, described time series data, is divided into multiple segmentations continuous time series data taking section sometime as unit, and the time series data computing in each unit interval section is a sub-computing; Wherein, the time series data after segmentation meets the one semigroup character.
CN201410456262.9A 2014-09-09 2014-09-09 A kind of Hadoop platform time series data incremental calculation method and system Expired - Fee Related CN104199942B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410456262.9A CN104199942B (en) 2014-09-09 2014-09-09 A kind of Hadoop platform time series data incremental calculation method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410456262.9A CN104199942B (en) 2014-09-09 2014-09-09 A kind of Hadoop platform time series data incremental calculation method and system

Publications (2)

Publication Number Publication Date
CN104199942A true CN104199942A (en) 2014-12-10
CN104199942B CN104199942B (en) 2017-11-07

Family

ID=52085235

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410456262.9A Expired - Fee Related CN104199942B (en) 2014-09-09 2014-09-09 A kind of Hadoop platform time series data incremental calculation method and system

Country Status (1)

Country Link
CN (1) CN104199942B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105843891A (en) * 2016-03-22 2016-08-10 浙江大学 Incremental online characteristic extraction and analysis method and system
WO2017113865A1 (en) * 2015-12-31 2017-07-06 华为技术有限公司 Method and device for big data increment calculation
CN108846636A (en) * 2018-06-01 2018-11-20 北京字节跳动网络技术有限公司 Data dispatching method, device, computer readable storage medium
CN109948007A (en) * 2019-03-21 2019-06-28 浙江邦盛科技有限公司 A kind of clock synchronization ordinal number maximum processing method for being increased continuously number and number of increments according to statistics
CN110008544A (en) * 2019-03-21 2019-07-12 浙江邦盛科技有限公司 A kind of processing method of clock synchronization ordinal number number of increments and reduced degree according to statistics
CN110019367A (en) * 2017-12-28 2019-07-16 北京京东尚科信息技术有限公司 A kind of method and apparatus of statistical data feature
CN112488412A (en) * 2020-12-11 2021-03-12 北京字跳网络技术有限公司 Duration information determination method and device, electronic equipment and computer storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049556A (en) * 2012-12-28 2013-04-17 中国科学院深圳先进技术研究院 Fast statistical query method for mass medical data
CN103676645A (en) * 2013-12-11 2014-03-26 广东电网公司电力科学研究院 Mining method for association rules in time series data flows
US20140214372A1 (en) * 2013-01-25 2014-07-31 International Business Machines Corporation Interpolation techniques used for time alignment of multiple simulation models

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049556A (en) * 2012-12-28 2013-04-17 中国科学院深圳先进技术研究院 Fast statistical query method for mass medical data
US20140214372A1 (en) * 2013-01-25 2014-07-31 International Business Machines Corporation Interpolation techniques used for time alignment of multiple simulation models
CN103676645A (en) * 2013-12-11 2014-03-26 广东电网公司电力科学研究院 Mining method for association rules in time series data flows

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
刘学军等: "基于滑动窗口的在线数据流增量聚集查询", 《计算机工程》 *
王文胜: "基于集群计算的网络信息采集系统的设计与实现", 《中国优秀硕士学位论文全文数据库》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017113865A1 (en) * 2015-12-31 2017-07-06 华为技术有限公司 Method and device for big data increment calculation
CN106933882A (en) * 2015-12-31 2017-07-07 华为技术有限公司 A kind of big data incremental calculation method and device
CN106933882B (en) * 2015-12-31 2020-09-29 华为技术有限公司 Big data increment calculation method and device
CN105843891A (en) * 2016-03-22 2016-08-10 浙江大学 Incremental online characteristic extraction and analysis method and system
CN110019367A (en) * 2017-12-28 2019-07-16 北京京东尚科信息技术有限公司 A kind of method and apparatus of statistical data feature
CN110019367B (en) * 2017-12-28 2022-04-12 北京京东尚科信息技术有限公司 Method and device for counting data characteristics
CN108846636A (en) * 2018-06-01 2018-11-20 北京字节跳动网络技术有限公司 Data dispatching method, device, computer readable storage medium
CN109948007A (en) * 2019-03-21 2019-06-28 浙江邦盛科技有限公司 A kind of clock synchronization ordinal number maximum processing method for being increased continuously number and number of increments according to statistics
CN110008544A (en) * 2019-03-21 2019-07-12 浙江邦盛科技有限公司 A kind of processing method of clock synchronization ordinal number number of increments and reduced degree according to statistics
CN109948007B (en) * 2019-03-21 2020-07-14 浙江邦盛科技有限公司 Processing method for inquiring maximum continuous increasing times and decreasing times of time sequence data statistics
CN112488412A (en) * 2020-12-11 2021-03-12 北京字跳网络技术有限公司 Duration information determination method and device, electronic equipment and computer storage medium

Also Published As

Publication number Publication date
CN104199942B (en) 2017-11-07

Similar Documents

Publication Publication Date Title
CN104199942A (en) Hadoop platform time series data incremental computation method and system
US10423635B2 (en) Processing time series
US10281284B2 (en) Hybrid road network and grid based spatial-temporal indexing under missing road links
JP6852392B2 (en) Stochastic price and sudden change forecast
CN111339240A (en) Object recommendation method and device, computing equipment and storage medium
CN110633594A (en) Target detection method and device
CN113052265B (en) Moving object track simplifying method based on feature selection
CN103473036A (en) Input method skin push method and system
CN114817651B (en) Data storage method, data query method, device and equipment
US20170061208A1 (en) Determining Severity of a Geomagnetic Disturbance on a Power Grid Using Similarity Measures
Liu et al. Segmented analysis for reducing data movement
CN118246413A (en) Text generation method, device and model based on attention mechanism
CN109657071B (en) Vocabulary prediction method, device, equipment and computer readable storage medium
CN110309974B (en) Logistics transportation destination prediction method and device, electronic equipment and storage medium
US20230224493A1 (en) Segment-wise prediction machine learning frameworks
CN115630733A (en) Order delivery time estimation method, system, medium and electronic device
Chen et al. Discrete wavelet transform application for bike sharing system check-in/out demand prediction
CN105302894A (en) Parallel association rule based tracking method and tracking apparatus for hotspots of public opinions
CN113468866B (en) Method and device for analyzing non-standard JSON string
CN114691918A (en) Radar image retrieval method and device based on artificial intelligence and electronic equipment
Kim et al. Are Self-Attentions Effective for Time Series Forecasting?
Kang et al. Integration of cloud and big data analytics for future smart cities
Zhang et al. Wind power error compensation prediction model based on CEEMD-SE-ELM-TCN
CN115908427B (en) Pavement disease maintenance cost prediction method and system based on semantic segmentation and SVM
CN109815312A (en) A kind of method, apparatus of document query calculates equipment and computer storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20171107