CN106991137B - The method that time series data is indexed based on Hbase hash summary forest - Google Patents
The method that time series data is indexed based on Hbase hash summary forest Download PDFInfo
- Publication number
- CN106991137B CN106991137B CN201710154614.9A CN201710154614A CN106991137B CN 106991137 B CN106991137 B CN 106991137B CN 201710154614 A CN201710154614 A CN 201710154614A CN 106991137 B CN106991137 B CN 106991137B
- Authority
- CN
- China
- Prior art keywords
- time
- tree
- node
- hash
- query
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 54
- 230000008569 process Effects 0.000 claims description 16
- 238000006116 polymerization reaction Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 5
- 238000004380 ashing Methods 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 230000006399 behavior Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000013139 quantization Methods 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
- G06F16/2255—Hash tables
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
- G06F16/2246—Trees, e.g. B+trees
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a kind of methods being indexed based on Hbase hash summary forest to time series data, comprising the following steps: (1) establishes every time quantum tree according to time granularity;(2) hash code of every time quantum tree is sought, and the time quantum tree with hash code is formed into the hash summary forest based on Hbase;(3) time series data of acquisition is inserted into hash summary forest according to hash code;(4) time series data for reading storage is inquired according to time range.The present invention improves the inquiry velocity of time series data converging operation, while providing hash index by generating hash code for cell tree, solves Hbase distributed storage time series data and generate hot issue by combining summary forest tree index scheme.
Description
Technical field
The present invention relates to technical field of memory, and in particular to one kind based on Hbase hash summary forest to time series data into
The method of line index.
Background technique
Time series data is the continuous data indexed with time series, and with popularizing for computer application, time series data is each
A field is also widely used.Such as: as the combination of financial field and internet is more and more closer, financial field is big
It is increasing to the converging operation performance requirement of time series data that the quantization of amount withdraws operation.Such as: in futures when a season
Between market price, disk mouth price or the exchange hand of certain commodity contract etc. in range counted, summed or calculated maximum value
Equal converging operations.How quick and precisely such application scenarios are frequent in finance quantization, and since data volume is huge,
The converging operation result that ground calculates the Financial Time Series in t1~t2 time becomes particularly significant.
In to Au metal forward business data within the scope of certain time for the sum operation of market price:
Select SUM(Last Price)From‘Au’WHERE time>t1AND time<t2
In such an application scenario, it is necessary to which support quickly obtains converging operation result in the time series data of magnanimity.
Traditional Relational DataBase mainly achievees the purpose that accelerate aggregate query by the way of Materialized View or summary table.
Materialized View is pre-processed to the querying command for being related to table connection, and result is stored in view table, and when inquiry is direct
Take out the result pre-processed.Summary table is then to calculate while data are written and save corresponding summary info, to send out
When raw inquiry, directly inquires and return the result from summary table.Such method improves search efficiency, but a disadvantage is that increasing
The expansion rate of database.In NoSQL database, some databases are handled by the way of MapReduce and polymerization pipeline
These converging operations are all the representatives calculated in real time, although not increasing the expansion rate of database, are generated in query process
A large amount of disk and computing cost, inefficient time-consuming are unable to satisfy the demand of extemporaneous inquiry.Some NoSQL databases are by tree-shaped
Index structure fusion, improves search efficiency, reduces magnetic disc access times.
Summary of the invention
In view of above-mentioned, the invention proposes a kind of sides being indexed based on Hbase hash summary forest to time series data
Method accelerates the query time of time series data by establishing tree index, and avoids time series data by hash code and be distributed
The uneven problem of the space distribution that sequential storage generates in formula database.
A method of time series data is indexed based on Hbase hash summary forest, comprising the following steps:
(1) every time quantum tree is established according to time granularity;
(2) hash code of every time quantum tree is sought, and the time quantum tree composition with hash code is based on Hbase
Hash summary forest;
(3) time series data of acquisition is inserted into hash summary forest according to hash code;
(4) time series data for reading storage is inquired according to time range.
In step (1), the process of settling time cell tree are as follows: firstly, predefining the time granularity of time quantum tree;So
Start to carry out recurrence with root node afterwards, establish a new node every time, next, recurrence establishes the left and right child section of this node
Point stops recurrence when the node of creation exceeds the range precalculated, completes the establishment process of whole tree.
In step (1), every time quantum tree is a Kd-Trees, and includes a set time granularity.Pass through control
The tree height of each tree processed controls time granularity.Line segment tree node stores the summary info of the range of nodes, specifically includes that
LBound, RBound, LNode, RNode and Data;Wherein, it includes time model that LBound, RBound, which respectively indicate the node,
The start time point enclosed and termination time point;LNode, RNode respectively indicate the left child of the node and when right child nodes include
Between the midpoint put;Data indicates the summary data value of node storage, each node for the time quantum tree established at this time
Data is empty.
The root node of each tree indicates the polymerization result of time series data in the t time span of this tree carrying, second layer section
Point carries the polymerization result of time series data in t/2 time span, analogizes every node layer and carries upper node layer half the time length
Index data.It is dissipated to realize that the time quantum tree comprising set time granularity can be convenient with same by the way that control tree is high
Column code polymerize the node of each tree, realizes the load balancing of hot spot.
The leaf node of the polymerization result of one time quantum tree representation, one unit time granularity range, each tree indicates
The polymerization summary info of most fine granularity range.Granularity can adjust according to actual needs.
Time quantum tree cover time (TreeBound) calculation formula:
TreeBound=(2^ (TreeMaxLevel-1)) * Leaf Bound
Wherein, TreeMaxLevel is that maximal tree is high, and LeafBound is the time range that leaf node indicates.Such as: one
Time quantum tree time range is set to one day, and tree height is set to 9 layers, and leaf node indicates 6 minutes summary infos.So one
The aggregated data that condominium is had jurisdiction over 1536 minutes is set, realizes that a time quantum tree covers one day time range.
In step (2), the mode for seeking the hash code of every time quantum tree has very much, passes through preferably, choosing
The processing of md5 transcoding is carried out to the information of each tree and generates the correspondence hash code (Hash) of this time cell tree, and is written into
In tree hash table;Calculate the concrete mode of hash code are as follows:
Hash=md5 (tree Info+tree low bound)
Tree info is Data Identification brief information, such as: this data represents Au metallic element futures data, then marks
It is the start time point of time quantum tree for Au, tree low bound.
More time quantum tree composition hash summary forests, the entire time range for hashing the data that summary forest is indexed
To form the sum of time range represented by his time quantum tree.
Hash summary forest based on Hbase is made of two Hbase tables of tree-hash and tree-node, wherein
Tree-hash table is used to search the hash code of time quantum tree, and tree-node table stores the burl of all time quantum trees
Point, and tree-hash table is individually to store with tree-node table, i.e., by the hash code of every time quantum tree and every time
The time series data that cell tree is loaded with separates individually storage, and the when ordinal number that the time quantum tree for possessing same Hash code is loaded with
According to centrally stored.
In step (3), the time series data can be Financial Time Series, forward business data etc. it is any when ordinal number
According to.
In step (3), time series data is inserted into the detailed process of hash summary forest are as follows:
(3-1) by belonging time of time series data found in tree-hash table where this timing data when
Between cell tree tree hash value;
(3-2) finds tree-node table corresponding to the tree hash value, and time series data recurrence is inserted into this tree-
In node table, detailed process are as follows:
Firstly, recurrence is started according to the root node that tree hash value finds locating time quantum tree, ordinal number when then carrying out
According to time point and current queries node time compare, when the time point of time series data be less than the node time when, to this
Left child's recurrence of node is inserted into time series data, when being greater than the node time at the time point of time series data, then the right side of the node
Child's recurrence is inserted into time series data;Until being inserted into the leaf node of time quantum tree.
In step (4), in data query, hash summary forest is due to the data in cell tree at the same time
Rowkey possesses same hash code, so hbase range query can rapidly find out whole time quantum tree and be stored in memory
In, this greatlys save the disk I/O operation after carrying out recurrence to tree.
Carry out the process of data query are as follows:
(a) judge whether query time range (t1, t2) belongs to cell tree range at the same time, if so, executing step
(b), if it is not, executing step (c);
(b) inquiry operation is Query (t1, t2);
(c) inquiry operation is Query (t1, EndUnitTime (t1)), Query (midUnitTime) and Query
(StartUnitTime(t2),t2);
Wherein, StartUnitTime (t1) is the start time point of time quantum locating for time point t1;
EndUnitTime (t2) is the end time point of time quantum locating for time point t2;
MidUnitTime is the time model of several cell trees between the time range of time quantum tree locating for t1 and t2
It encloses;
Query (t1, t2) is expressed as the execution inquiry behaviour of the query time range (t1, t2) in same time quantum tree
Make;
Query (t1, EndUnitTime (t1)) indicates to execute inquiry behaviour in first cell tree in query context
Make;
Query (midUnitTime) indicates to execute in second to the second from the bottom cell tree in query context
Inquiry operation;
Query (StartUnitTime (t2), t2) indicates to execute inquiry in last cell tree in query context
Operation.
The detailed process of Query (t1, t2) are as follows:
(a) go out the hash code of its affiliated time quantum tree by searching for the time reckoning of item and navigate to this time unit
Tree, and using the root node of this time cell tree as current root node;
(b) recursive query since current root node, query task start from (t1, t2);
(c) when recursive query is to present node, parse the time range that the node includes start time point LBound,
Middle time point midTime and termination time point RBound;
(d) judge whether t1 and t2 meet t1=LBound and t2=RBound, if so, recording the node result and exiting
Recurrence, if it is not, executing step (e);
(e) judge whether t1 and t2 meet t1≤midTime and t2≤midTime, if so, the left child of the node is saved
Point is used as current root node, jumps and executes step (b)~step (d), if it is not, executing step (f);
(f) judge whether t1 and t2 meet t1 >=midTime and t2 >=midTime, if so, the right child of the node is saved
Point is used as current root node, executes step (b)~step (d);If it is not, executing step (g);
(g) judge whether time t2 meets LBound < t2 < midTime, if so, using left child as present node, it will
MidTime executes step (b)~step (d) as t2;Using right child nodes as present node, using midTime as t1,
Execute step (b)~step (d).
The present invention improves the inquiry velocity of time series data converging operation, together by combining summary forest tree index scheme
When by generating hash code be that cell tree provides hash index, solve Hbase distributed storage time series data generation hot issue.
Detailed description of the invention
Fig. 1 is that the present invention is based on the flow charts that Hbase hashes the method that summary forest is indexed time series data;
Fig. 2 is the structural schematic diagram for the time quantum tree established;
Fig. 3 is that data throughout comparison diagram is written in the embodiment of the present invention 1;
Fig. 4 is large span time range aggregate query time-consuming comparison diagram in the embodiment of the present invention 2;
Fig. 5 is hash summary depth woods method and the time-consuming comparison of non-hash summary depth woods method inquiry in the embodiment of the present invention 3
Figure.
Specific embodiment
In order to more specifically describe the present invention, with reference to the accompanying drawing and specific embodiment is to technical solution of the present invention
It is described in detail.
As shown in Figure 1, the specific steps for the method being indexed based on Hbase hash summary forest to time series data are as follows:
Step 1, every time quantum tree is established: firstly, predefining the time range of time quantum tree;Then with root section
Point starts to carry out recurrence, establishes a new node every time, next, recurrence establishes the left and right child nodes of this node, works as wound
The node built stops recurrence when exceeding the range precalculated, complete the establishment process of whole tree, and each node is written
In tree node table, the time quantum tree of foundation is as shown in Figure 2;
In this step, every time quantum tree of foundation is a Kd-Trees, and includes a set time granularity.Pass through
The tree height of each tree is controlled to control time granularity.Line segment tree node stores the summary info of the range of nodes, specifically includes that
LBound, RBound, LNode, RNode and Data;Wherein, it includes time model that LBound, RBound, which respectively indicate the node,
The start time point enclosed and termination time point;LNode, RNode respectively indicate the left child of the node and when right child nodes include
Between the midpoint put;Data indicates the summary data value of node storage, each node for the time quantum tree established at this time
Data is empty.
The root node of each tree indicates the polymerization result of time series data in the t time span of this tree carrying, second layer section
Point carries the polymerization result of time series data in t/2 time span, analogizes every node layer and carries upper node layer half the time length
Index data.It is dissipated to realize that the time quantum tree comprising set time granularity can be convenient with same by the way that control tree is high
Column code polymerize the node of each tree, realizes the load balancing of hot spot.
The leaf node of the polymerization result of one time quantum tree representation, one unit time granularity range, each tree indicates
The polymerization summary info of most fine granularity range.Granularity can adjust according to actual needs.
Time quantum tree cover time (TreeBound) calculation formula:
TreeBound=(2^ (TreeMaxLevel-1)) * Leaf Bound
Wherein, TreeMaxLevel is that tree maximal tree is high, and LeafBound is the time range that leaf node indicates.
Step 2, the hash code of every time quantum tree is sought, and the time quantum tree composition with hash code is based on
The hash summary forest of Hbase;
In this step, the concrete mode of hash code is calculated are as follows:
Hash=md5 (tree Info+tree low bound)
Tree info is Data Identification brief information, such as this data represents Au metallic element futures data, then is labeled as
Au;Tree low bound is the start time point of time quantum tree, will be acquired in hash value write-in tree hash table;
Step 3, the time series data of acquisition is inserted into hash summary forest, insertion process are as follows:
Step 3-1, where finding this timing data in tree-hash table by the belonging time of time series data
The tree hash value of time quantum tree;
Step 3-2 finds tree-node table corresponding to the tree hash value, time series data recurrence is inserted into this
In tree-node table, detailed process are as follows:
Firstly, recurrence is started according to the root node that tree hash value finds locating time quantum tree, ordinal number when then carrying out
According to time point and current queries node time compare, when the time point of time series data be less than the node time when, to this
Left child's recurrence of node is inserted into time series data, when being greater than the node time at the time point of time series data, then the right side of the node
Child's recurrence is inserted into time series data;Until being inserted into the leaf node of time quantum tree.
Step 4, the time series data for reading storage, the process of inquiry are inquired according to time range are as follows:
Step 4-1, judges whether query time range (t1, t2) belongs to cell tree range at the same time, if so, executing
Step 4-2, if it is not, executing step 4-3;
Step 4-2, inquiry operation are Query (t1, t2);
Step 4-3, inquiry operation be Query (t1, EndUnitTime (t1)), Query (midUnitTime) and
Query(StartUnitTime(t2),t2)。
Embodiment 1
Using the method for the present invention, Opentsdb open source time series database method and original Hbase method to it is identical when
Ordinal number records the write-in handling capacity of every kind of method according to progress data write-in, as shown in figure 3, from the available present invention side Fig. 3
Method is due to having index (hash code) and achievement process, so writing speed is directly stored in original number than original Hbase method
According to slow but faster than opentsdb open source time series database method writing speed.
When the method for the present invention progress data write-in is shown in table 1, the division situation of the region in Hbase.
Hash summary forest is split into after the region division of triggering Hbase by hashed value multiple as can be seen from Table 1
region.Same time quantum tree for possessing same Hash value can divide in the same region.Hashed value generates at random, newly
It hashes in different region to the tree uniform load built, avoids hot issue.
Embodiment 2
Data are carried out using the method for the present invention, Opentsdb open source time series database method and original Hbase method
The aggregate query of large span time range operates, and Fig. 4 is three kinds of methods in Long time scale aggregate query time-consuming comparison diagram, from figure
4 is available in the aggregate query operation of large span time range, and the method for the present invention performance is preferable.It is simultaneously it can be seen that original
Hbase method is when inquiring the polymerization result of 200000 datas, and time-consuming tens of seconds are unable to satisfy quickly extemporaneous query demand.
Opentsdb increases income time series database method since caching and Indexing Mechanism speed greatly promote.The method of the present invention ratio
Opentsdb inquiry velocity faster, and as query context increases, it is slower compared with other schemes to inquire time-consuming amplification.Illustrate to utilize
The rowkey of hbase is designed, and whole tree is found in use scope inquiry to be put into memory and have optimization function to query performance, is saved
Disk expense.
Embodiment 3
Aggregate query operation is carried out to data using the method for the present invention and non-ashing technique, Fig. 5 is hash summary forest side
Method and non-ashing technique inquire time-consuming comparison diagram.Lack Hash fields from the available non-Hash scheme rowkey of Fig. 5, and
Random challenge is carried out to hbase when query search Kd-Trees each recurrence.Same Kd-Trees may be dispersed in different
In region, the inquiry time-consuming of the method for the present invention is less than non-ashing technique, and as the increase advantage of query context is more obvious.
Technical solution of the present invention and beneficial effect is described in detail in above-described specific embodiment, Ying Li
Solution is not intended to restrict the invention the foregoing is merely presently most preferred embodiment of the invention, all in principle model of the invention
Interior done any modification, supplementary, and equivalent replacement etc. are enclosed, should all be included in the protection scope of the present invention.
Claims (4)
1. a kind of method being indexed based on Hbase hash summary forest to time series data, comprising the following steps:
(1) time range that high, leaf node includes according to tree establishes every time quantum tree comprising set time granularity;
(2) hash code of every time quantum tree is sought, and the time quantum tree with hash code is formed into dissipating based on Hbase
Column summary forest;
(3) time series data of acquisition is inserted into hash summary forest according to hash code;
(4) time series data for reading storage is inquired according to time range;
Every time quantum tree is a Kd-Trees, and line segment tree node stores the summary info of the range of nodes, comprising:
LBound, RBound, LNode, LNode and Data;Wherein, it includes time model that LBound, RBound, which respectively indicate the node,
The start time point enclosed and termination time point;LNode, LNode respectively indicate the left child of the node and when right child nodes include
Between range midpoint;Data indicates the summary data value of node storage, each node for the time quantum tree established at this time
Data is empty;
The hash code Hash's of every time quantum tree seeks formula are as follows:
Hash=md5 (tree Info+tree low bound)
Tree info is Data Identification brief information;Tree low bound is the start time point of time quantum tree;Md5 is
A kind of transcoding mode;
The hash summary forest based on Hbase is made of two Hbase tables of tree-hash and tree-node, wherein
For tree-hash table for storing the corresponding hash code of all time quantum trees, each tree-node table has corresponding time quantum
All leaf nodes of tree, and tree-hash table is individually to store, and will possess same Hash code with tree-node table
The time series data that time quantum tree is loaded with is centrally stored.
2. the method being indexed according to claim 1 based on Hbase hash summary forest to time series data, feature are existed
In: time series data is inserted into the detailed process of hash summary forest are as follows:
Time where (3-1) finds this timing data by the belonging time of time series data in tree-hash table is single
The hash code of member tree;
(3-2) finds tree-node table corresponding to the hash code, and time series data recurrence is inserted into this tree-node table,
Detailed process are as follows:
Start recurrence according to the root node that hash code finds locating time quantum tree, then the time point of progress time series data with work as
The time range of preceding query node compares, and is less than the middle time point of the time range of the node when the time point of time series data
When, into the Data of the left child nodes of the node, recurrence is inserted into time series data, is greater than the node when the time point of time series data
Time range middle time point when, then in the Data of the right child nodes of the node recurrence be inserted into time series data;Until inserting
Enter until the leaf node of time cell tree.
3. the method being indexed according to claim 1 based on Hbase hash summary forest to time series data, feature are existed
In: carry out the process of data query are as follows:
(a) judge whether query time range (t1, t2) belongs to cell tree range at the same time, if so, step (b) is executed,
If it is not, executing step (c);
(b) inquiry operation Query (t1, t2) is executed
(c) inquiry operation Query (t1, EndUnitTime (t1)), Query (midUnitTime) and Query are executed
(StartUnitTime(t2),t2);
Wherein, StartUnitTime (t2) is the initial time for the time range that time quantum tree locating for time point t2 includes
Point;
EndUnitTime (t1) is the end time point for the time range that time quantum tree locating for time point t1 includes;
MidUnitTime is the time range between the time range of time quantum tree locating for t1 and t2;
Query (t1, t2) is expressed as executing inquiry operation in the same time quantum tree belonging to query context t1~t2;
Query (t1, EndUnitTime (t1)) indicates first unit in query context t1~EndUnitTime (t1)
Inquiry operation is executed in tree;
Query (midUnitTime) indicates second to the second from the bottom cell tree in query context midUnitTime
Middle execution inquiry operation;
Query (StartUnitTime (t2), t2) indicates last in query context StartUnitTime (t2)~t2
Inquiry operation is executed in cell tree.
4. the method being indexed according to claim 3 based on Hbase hash summary forest to time series data, feature are existed
In: the detailed process of Query (t1, t2) are as follows:
(a) go out the hash code of its affiliated time quantum tree by searching for the time reckoning of item and navigate to this time cell tree, and
Using the root node of this time cell tree as current root node;
(b) recursive query since current root node, query task start from (t1, t2);
(c) when recursive query is to present node, start time point LBound, the centre of the time range that the node includes are parsed
Time point midTime and termination time point RBound;
(d) judge whether t1 and t2 meet t1=LBound and t2=RBound, passed if so, recording the node result and exiting
Return, if it is not, executing step (e);
(e) judge whether t1 and t2 meet t1≤midTime and t2≤midTime, if so, the left child nodes of the node are made
It for current root node, jumps and executes step (b)~step (d), if it is not, executing step (f);
(f) judge whether t1 and t2 meet t1 >=midTime and t2 >=midTime, if so, the right child nodes of the node are made
For current root node, step (b)~step (d) is executed;If it is not, executing step (g);
(g) judge whether time t2 meets LBound < t2 < midTime, if so, using left child as present node, it will
MidTime executes step (b)~step (d) as t2;Using right child nodes as present node, using midTime as t1,
Execute step (b)~step (d).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710154614.9A CN106991137B (en) | 2017-03-15 | 2017-03-15 | The method that time series data is indexed based on Hbase hash summary forest |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710154614.9A CN106991137B (en) | 2017-03-15 | 2017-03-15 | The method that time series data is indexed based on Hbase hash summary forest |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106991137A CN106991137A (en) | 2017-07-28 |
CN106991137B true CN106991137B (en) | 2019-10-18 |
Family
ID=59413068
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710154614.9A Active CN106991137B (en) | 2017-03-15 | 2017-03-15 | The method that time series data is indexed based on Hbase hash summary forest |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106991137B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113535712B (en) * | 2021-06-04 | 2023-09-29 | 山东大学 | Method and system for supporting large-scale time sequence data interaction based on line segment KD tree |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106202384A (en) * | 2016-07-08 | 2016-12-07 | 清华大学 | A kind of indexing means supporting time series data aggregate function |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101207510B1 (en) * | 2008-12-18 | 2012-12-03 | 한국전자통신연구원 | Cluster Data Management System And Method for Data Restoring Using Shared Read-Only Log in Cluster Data Management System |
-
2017
- 2017-03-15 CN CN201710154614.9A patent/CN106991137B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106202384A (en) * | 2016-07-08 | 2016-12-07 | 清华大学 | A kind of indexing means supporting time series data aggregate function |
Non-Patent Citations (1)
Title |
---|
支持时序数据聚合函数的索引;黄向东等;《清华大学(自然科学)》;20160331;第229-236,245页 * |
Also Published As
Publication number | Publication date |
---|---|
CN106991137A (en) | 2017-07-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103366015B (en) | A kind of OLAP data based on Hadoop stores and querying method | |
US5943677A (en) | Sparsity management system for multi-dimensional databases | |
CN103577440A (en) | Data processing method and device in non-relational database | |
US20150006509A1 (en) | Incremental maintenance of range-partitioned statistics for query optimization | |
CN105574093A (en) | Method for establishing index in HDFS based spark-sql big data processing system | |
CN105975587A (en) | Method for organizing and accessing memory database index with high performance | |
CN103914483B (en) | File memory method, device and file reading, device | |
US10990573B2 (en) | Fast index creation system for cloud big data database | |
CN112015741A (en) | Method and device for storing massive data in different databases and tables | |
CN103678519A (en) | Mixed storage system and mixed storage method for supporting Hive DML (data manipulation language) enhancement | |
CN101963993B (en) | Method for fast searching database sheet table record | |
CN106991137B (en) | The method that time series data is indexed based on Hbase hash summary forest | |
WO2022127418A1 (en) | Data retrieval method and apparatus, electronic device, and storage medium | |
Qu et al. | A real-time materialized view approach for analytic flows in hybrid cloud environments | |
Šalgová et al. | Reducing Data Access Time using Table Partitioning Techniques | |
CN108241624A (en) | The generation method and device of a kind of query script | |
CN102968467A (en) | Optimization method and query method for multiple layers of Bloom Filters | |
Phanluong | A simple and efficient method for computing data cubes | |
KR102047233B1 (en) | Metadata search system by using dual load balancing and database management method by using dual load balancing | |
Cao et al. | Optimized query evaluation using cooperative sorts | |
US20180232416A1 (en) | Distribute execution of user-defined function | |
Faust et al. | Footprint reduction and uniqueness enforcement with hash indices in SAP HANA | |
Phan-Luong | First-half index base for querying data cube | |
Burdakov et al. | Bloom filter cascade application to SQL query implementation on Spark | |
CN108959308A (en) | A kind of reply can supplemental data indexing means |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |