CN102375853A - Distributed database system, method for building index therein and query method - Google Patents

Distributed database system, method for building index therein and query method Download PDF

Info

Publication number
CN102375853A
CN102375853A CN2010102611675A CN201010261167A CN102375853A CN 102375853 A CN102375853 A CN 102375853A CN 2010102611675 A CN2010102611675 A CN 2010102611675A CN 201010261167 A CN201010261167 A CN 201010261167A CN 102375853 A CN102375853 A CN 102375853A
Authority
CN
China
Prior art keywords
index
data
data block
file
query
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102611675A
Other languages
Chinese (zh)
Inventor
齐骥
钱岭
郭磊涛
周大
罗治国
孙少陵
张松波
张卫平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN2010102611675A priority Critical patent/CN102375853A/en
Publication of CN102375853A publication Critical patent/CN102375853A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a distributed database system and a method for building an index in the distributed database system. The distributed database system comprises a plurality of distributed storage units, an index memory, a resolver, an index query module and a parallel processing engine, wherein the distributed storage units store a plurality of data block files by sections; the index memory stores the indexes of the data block files; the resolver resolves a query sentence initiated by a user and selects a corresponding query index; the index query module searches the indexes of the data block files according to the selected query index to obtain at least one query data block set; the query data block set comprises an index key value and records the position information of the data block files corresponding to the index key value in the data block files; and the parallel processing engine splits the at least one query data block set and initiates a parallel scanning task.

Description

Distributed data base system, set up the method and the querying method of index therein
Technical field
The application relates to a kind of distributed data base system, sets up the method and the querying method of index therein.
Background technology
Structural data in enormous quantities is stored in the database, particularly is the data managing method of using always in the relevant database.Simple and direct practice is: dispose ripe data base management system (DBMS), with interface (like SQL) the definition of data table and the data structure of standard, with the data importing of collecting or be inserted in the respective table of database.As required, Database Systems are set up for it and are used when index supplies fast query.During data query,, can select for use suitable index to optimize query performance according to querying condition.
At the management aspect of large-scale data, the data volume and the disk I of visit when the key factor that influences the data query performance is inquiry.Index technology is the important method that improves query performance in the database practice.The indexed data amount is generally little a lot of than actual data volume, and can be organized into the data structure of being convenient to search, like tree or HASH list structure.Filter out the data that major part need not visit rather than directly scan real data through preferentially searching index, can reduce effectively must visit data volume and disk I.The tissue and the storage mode of data are also extremely important to setting up effective index simultaneously, and different index technologies also has different requirement to the tissue and the storage mode of data.Index type such as B-TREE index, HASH index and the BITMAP index etc. used always in the Database Systems are applicable to different occasions respectively, and their principle basically all is to come the memory location of rapidly locating record through the key assignments of inquiry.
Current in many industries, the data volume of generation and accumulation is huge especially, even reaches hundreds of TB or PB level.And these data along with the time in continuous expansion, the speed that produces data along with the development of business is also improving constantly.For example telecommunication service CDR (Call Detail Record) data, Internet of Things sensing data, data of financial transaction, internet daily record data etc.
Mass data have in the following characteristics one of at least:
(1) mostly data are time series data, free label, and according to or roughly produce and storage according to time sequencing.
(2) data are structuring or semi-structured data, and structure is subject to variation;
(3) speed that produces of data very fast (producing 2TB or 5,000,000,000 records every day) like certain system, and data volume is increasing;
(4) repetition rate of the value on a lot of Attribute domains is very high.
Management and application to mass data also have following characteristics:
(1) need to preserve the long period (like half a year), data more of a specified duration are dropped or backup to other medium;
(2) old historical data must be able to be visited, but the chance of being visited is less; From cost consideration, resource (like CPU, internal memory, bandwidth etc.) in the time of except storage resources, should not taking too many operation;
(3) historical data does not generally need to revise, in case data storage is good, just only need read it;
(4) generally can specify the regular hour range of condition to the inquiry of data;
(5), except will supporting fast query manipulation, need support batch data analysis and dredge operation toward contact to same data set.And same analysis and dredge operation to same batch data generally can repeatedly not repeat.
The user will concentrate inquiry to obtain the data of wanting from mass data, continues to use existing database and indexing means thereof unusual difficulty.Database often can't be stored googol like this according to amount, and not too is applicable to the change of semi-structured data or data structure.Intensive complete index not only can make concerning mass data to be set up and safeguards that the expense of index is big, speed is slow, and the data volume of index itself is also very huge, thereby also makes the writing speed of data be difficult to catch up with the generation speed of data.
Summary of the invention
On the one hand, the application discloses a kind of distributed data base system, comprising:
A plurality of distributed storage unit, subregion store a plurality of data block files;
Index store stores the index of said a plurality of data block files;
Resolver is resolved the Client-initiated query statement, and is selected corresponding search index;
The search index module; According to the search index of selecting; The index of searching for said a plurality of data block files is to obtain at least one data query piece collection, and said data query piece collection comprises the index key assignments and write down the positional information of data block file corresponding with said index key assignments in said a plurality of data block file; And
The parallel processing engine splits and initiates parallel scan task with said at least one data query piece collection.
In the application's a embodiment, distributed data base system has defined the basic structure of data organization and storage, is write wherein by order with the data recording that the mode of stream is collected or batch obtains.Comprise data file and corresponding data block index file in the basic structure of said data organization and storage.Many compression data blocks can be deposited in proper order in each data file, and many data recording can be deposited in proper order in each data block.The size of data block can suitably define according to average record length, for example is defined as 1MB; The size of data file also can define flexibly, as is defined as 1GB.Data block adopts compression algorithm commonly used to compress to save the space.Each data file is accompanied by a very data block index file of lightweight, is used for locating fast the data designated piece.The data block index generally generates in the write data file, also can rebuild according to the data file that has existed.The application does not limit data block and its index separate storage in different files, can be stored in the identical file yet.
The index that the application provides is based upon on the foregoing data block index.This is a kind of approximate sparse index structure; The key assignments that is said index is not the memory location that navigates to every record; Occurred on all data blocks of this key assignments and just point to approx, in index, only be recorded in the position that occurs this key assignments in the indication data block for the first time.Because comprise many records in each data block, and same key assignments maybe be in certain data block repeatedly repeats, and the index of setting up so just can become order of magnitude ground to dwindle, and the speed of index is set up in quickening greatly.Also can avoid the serious inhomogeneous index problem of non-uniform that causes simultaneously because key assignments distributes.10000 records are for example arranged in a data block, and have only 100 unique key assignments, just only can produce 100 index.
For the big but Attribute domain of Finite Discrete of span, the for example telephone number among the telecommunication service CDR or the ID of other data centralizations etc., it is very effective to the inquiry of this Attribute domain to set up index.In a data block, no matter how many times appearred in a certain particular value of this Attribute domain, only write down the position that it occurs for the first time.The structure of index is like < Key, BlockLocation >.Because the value repetition rate of this type of Attribute domain is very high, so its index is also very little and sparse.This type of Attribute domain is comparatively common, also often need set up index.Also can set up joint index to a plurality of Attribute domains.
Though said index strategy just is based upon on the data block, has reduced the size of index greatly, need when inquiry, be increased in the expense of carrying out sequential scanning in the finite data piece.In the processing of mass data, the benefit that this compromise obtained is howed a lot than setting up heavy index.In distributed system, adopt under the situation of parallel processing technique, above-mentioned expense will drop to the acceptable reduced levels.
In addition, the application discloses a kind of method of in distributed data base system, setting up index, comprising:
The data that collection will be stored;
Said data block is compressed into a plurality of data blocks and confirms the corresponding data block index;
The form subregion of the data block of compressing according to file is stored in a plurality of distributed storage unit in the said distributed data base system; And
Institute's data blocks stored is set up index file, and wherein, each index in the said index file comprises the positional information of index key assignments and said data block.
The index data of above-mentioned foundation itself can be stored in it in relevant database because capacity is little, in relevant database, the B-TREE index is set up in its key word, can support range query and some inquiry to this Attribute domain so simultaneously.Also can in distributed Key-Value storage system better retractility and stability be provided with index datastore.
As the optional of said index strategy replenished; The data volume that need visit when reducing the data query of crossing over relative broad range (like many days time); Also for the data volume of the bulk statistics analysis that reduces specified scope (like many days time) and the visit of data mining action need; Can carry out the division directory stores to the data file, like subregion by date.Behind the subregion, aforesaid index based on data block can be based upon on the subregion.Subregion can be counted as a kind of coarseness index based on catalogue.
This application also discloses a kind of querying method that is applied in the distributed data base system, and said distributed data base system comprises the index that uses said method to form, and said querying method comprises:
Resolve query statement and determine corresponding search index;
Search index according to selecting is searched for said index file to obtain at least one data query piece collection; And
Initiate parallel scan task with said at least one data query piece collection fractionation and according to the positional information that said data query piece collection comprises.
In one embodiment, when inquiry,, at first judge partition list related in the querying condition (like the date subregion), the subregion scope of dwindling inquiry if comprise the subregion condition in the querying condition.If comprise the Attribute domain of having set up index in the querying condition, the index of this Attribute domain of each relevant partitions of inquiry obtains a set of data blocks earlier, has further dwindled the scope of data block.If there are a plurality of Attribute domains of setting up index in the querying condition, just the corresponding index of inquiry obtains a plurality of set of data blocks respectively, and again according to the logical relation of a plurality of conditions, for example AND or OR obtain the common factor or the union of set of data blocks.At last, the set of data blocks that obtains is initiated and the line scanning matching operation, the result of said matching operation is merged scanning, and the result that will scan is as the result of this inquiry.
Description of drawings
Fig. 1 shows the data storage basic structure according to an embodiment of the application.
Fig. 2 has described according to a method embodiment of the application, that in distributed data base system, set up index.
Fig. 3 shows the logical organization signal according to the Subscriber Number index of an embodiment of the application.
Fig. 4 is the block scheme that shows according to the distributed data base system of an embodiment of the application.
Fig. 5 is the query processing according to another embodiment of the application.
Embodiment
Below, be described in detail with reference to the illustrative embodiments of accompanying drawing the application.
Embodiment among the application is the basis with the distributed file system.Distributed file system is made up of a plurality of storages and computing node; These nodes can be made up of the PC server of a plurality of networkings, number of nodes even can reach several thousand.Under the situation of not break in service, can increase or the deleted data node according to the capacity needs are level and smooth, the fault of minority back end can not cause system service to be interrupted yet.That kind as will be described below, file data are divided into piece and as far as possible balancedly are distributed on each back end, and provide book copying to guarantee the reliability of data.Can be through calling any file and the data of distributed store on each back end thereof in the distributed file system client API Access file system, wherein direct with relevant back end communication to data write in the file.This file system has solved the required problems such as distributed store, load balancing, stability, data reliability, retractility and high-throughput of mass data of handling well.
Fig. 1 shows the data storage basic structure 100 according to an embodiment of the application.This storage organization 100 comprises data file 111 and the data block index file 112 corresponding with it.Data recording writes in this storage organization with the form of journal stream, and compresses (as adopting compression algorithms such as GZIP, LZO) according to user-defined data block size (like 1MB), and the data block after the compression is write in the data file 111 in proper order.In one embodiment, in the write data file, generate in the corresponding index and writing data blocks index file 112.The user can the definition of data file full-size (like 1GB).
Have dual mode to read the data in the storage organization 110: a kind of ID according to specified data block determines its position of piece index in data block index file 112, and according to the indexed search of determining to the position of data block in data file 111.Another kind of mode is directly to read according to the position of data block in data file 111, has saved the expense of read data piece index file like this.If navigate to Record ID concrete in the specified data block, need that order jump to designated recorder ID after navigating to specified data block during read data.
Table 1 shows the data structure of data block index file 112." data block ID " is the parameter that implies, and in the data block index data structure, do not occur.The position of " block offset " expression data block in data file.The size of this data block before " raw data byte number " expression compression is usually slightly larger than or equals user-defined data block size." packed byte number " is the actual storage size that takies of this data block after the compression." record strip number " is a statistical value, representes the summary journal bar number in this data block.In the data block index file, every index is isometric, therefore can be easy to calculate its position hereof according to data block ID.Faster speed can be selected the data block indexed cache in internal memory according to data block ID locator data piece if desired.
Data block ID Block offset The raw data byte number The packed byte number The record strip number ......
Table 1
Be described in the method 200 of setting up index in the distributed data base system with reference to Fig. 2 below.For the purpose of clear, describe with magnanimity telecommunication service CDR data instance below and handle 200, but the present invention be not limited to this.Telecommunication service CDR is the data of the recording user call event that produces in the communication network.For example comprise a lot of information such as Subscriber Number, time tag, type of service, failure cause, about 400 bytes of length among a typical C DR.For example produce about 5,000,000,000 records every day, about 2TB, and need preservation be the data of 2TB*90=180TB in 3 months.According to its cdr logging in special time period of designated user number inquiry is a kind of query demand commonly used.And operator also need carry out batch quantity analysis and excavation to these CDR.
In step S201, at first collect the CDR data.Can adopt existing C DR centralized collection mode to realize, also can adopt the most original CDR of parallel processing (MapReduce) batch processing to collect file and gather the CDR data.
In step S202, the data compression of collecting is become a plurality of data blocks.Every cdr logging can for example be encoded according to compact code form (as adopting compression algorithms such as GZIP, LZO).When packed data, can determine the index column of each data block.
Foundation to index file in step 204 is described.Therefore unclear description above having deleted.
Then, in step S203, the form subregion of the data block of compressing according to file is stored in a plurality of distributed storage unit in the said distributed data base system.For example can carry out the subregion storage to the CDR data by date according to time tag, promptly in distributed file system, the data of same date are not stored under the different catalogues.Among the files of data storage under catalogue/CDR/20100103 catalogue like on January 3rd, 2010.
Then, in step S204, institute's data blocks stored is set up index file, wherein, each index in the said index file comprises the positional information of index key assignments and said data block.In the present embodiment, Subscriber Number is a Finite Discrete and Attribute domain that repetition rate is higher.Whole data set total user number yardage in a period of time is certain, and the cdr logging of same Subscriber Number only can appear in a small amount of limited data block.In a data block, no matter how many times appearred in a Subscriber Number, only write down the position that it occurs for the first time.The indexed data structure is like < Subscriber Number, BlockLocations >.Wherein BlockLocation has directly write down the position of this data block in specific file.BlockLocation also can write down the information such as size of this data block.As a kind of selection, can also in index data, write down the data block ID among the specific file simply.In when inquiry, the data block index file of specifying file be need read earlier like this, disk tracking and IO increased.
Step S204 can adopt that initiate files carries out in each subregion of parallel processing (MapReduce) batch scanning.Carry out when also can state step S203 in realization, to reduce the process of disk scanning.The index data of this generating step is stored in the distributed data library storage system according to subregion.In one embodiment, can for example adopt the storage system of a kind of similar GoogleBigtable to store the index data of generation.The index stores of different subregions correspondences is in different row groups, and for example the index stores of subregion 20100103 is in row group 20100103.
Fig. 3 shows the logical organization signal of the Subscriber Number index that above-mentioned indexing means 200 set up.Subscriber Number is as the key assignments (Key) 301 of index, and its value comprises all Subscriber Numbers that whole data centralization occurred, and for example occurs 1,000 ten thousand Subscriber Numbers altogether, and 1,000 ten thousand line index are just arranged here.Some files (files) 303 have been comprised in each date subregion 302.And the index of specific user's number only writes down the BlockLocations 304 of its data block that in specific files, occurred.Because the cdr logging that the specific user produces is very discrete, perhaps there is not record certain period at all, therefore the logical organization of this index is very sparse.In the storage organization of index, dummy cell 305 does not take any storage space, and total like this index size can keep less.
Fig. 4 is the block scheme that shows according to the distributed data base system 400 of an embodiment of the application.In this system framework, data file is stored in the distributed file system 410, and this document system 410 is made up of a plurality of unit node, and these nodes are made up of the PC server of a plurality of networkings.On the structure, distributed file system 410 comprises a main control unit (not shown) and a plurality of data storage cell.410 pairs of big files of file system adopt the mode of piecemeal (for example every 64MB) that different data blocks is evenly distributed on the different unit node, and to a plurality of backups of each data block store (for example 3 backups).On unit node, data block can be for example with the stored in form of Linux local file on local disk.Main control unit provides unified file system namespace metadata and coordinates and manages whole group system, data storage cell distributed earth storage data block.In distributed system, be prior art through main control unit storage data, therefore repeat no more.
Parallel processing platform (MapReduce framework) 420 can be deployed in when being responsible for setting up index, data query in the same cluster with distributed file system 410, the parallel processing when data analysis and excavation etc.
The index data file storage adopts a kind of distributed memory system of similar Google Bigtable model to store index in index store 430 in the present embodiment, it has set up the B-TREE index at index key, supports to search fast.Index store 430 also can be deployed in the same cluster with distributed file system 410 and parallel processing platform 420.Concrete index data file can be for example with above-mentioned table 1 with shown in Figure 3.
Carry out the engine 440 main execution of being responsible for query manipulation, and can comprise resolver (for example SQL resolver) 440-1, search index module 440-2 and parallel processing engine 440-3.Wherein, resolver 440-1 is responsible for resolving the action statement from user interface 150, like query statement, and selects corresponding search index; Search index module 440-2 is responsible for the data scanning scope that search index obtains dwindling, like index data piece collection; Particularly, search index module 440-2 can in said index store 430, search for said a plurality of data block files according to the search index of selecting index to obtain at least one data query piece collection.Parallel processing engine 440-3 is responsible for that data area to be scanned is carried out logic and splits, and initiates parallel processing task.
Parallel processing platform 420 returns to inquiring client terminal with the process result merging after handling this parallel task.
With reference to Fig. 5, be that example is described the query processing 500 according to an embodiment of the application below to inquire about the cdr logging of certain Subscriber Number (as 13500000002) in certain two days (as 20100103 and 20100104).In addition, for purposes of illustration, be described below processing 500 with system shown in Figure 4 400.Yet the system shown in Figure 4 that is applied to is not limit in query processing 500.
At first, in step S501, the query statement (like the SQL query statement) that the user initiates through user interface 450; Then, in step S502, resolver 440-1 resolves and determines index to query statement.For example, the querying condition in the query statement can relate to partition list (like the date subregion), to dwindle the subregion scope of inquiry.If comprise the Attribute domain of having set up index in the querying condition, then select the index of this Attribute domain of each relevant partitions, obtain a set of data blocks, thereby can further dwindle the scope of data block.If a plurality of Attribute domains of setting up index are arranged in the querying condition, just select corresponding index respectively.
If do not set up available index, perhaps data analysis application need be carried out the batch quantity analysis operation to the data of bulk, then can directly parallel processing engine 440-3 be submitted in this operation and carry out (step S504).
In step S503, search index module 440-2 according to the index file of storage in the result queries index store 430 of resolving to obtain at least one data query piece collection.When in step S501, analyzing the Attribute domain that obtains to have in the querying condition a plurality of index; And in above-mentioned steps S502, selected corresponding index respectively; Then in this step, inquire about corresponding index respectively and obtain a plurality of set of data blocks, obtain the common factor or the union of set of data blocks again according to the logical relation (for example AND or OR) of a plurality of conditions.With index shown in Figure 4 is example, can obtain following set of data blocks:
20100103/file-2/BlockLocation-3
20100104/file-4/BlockLocation-6
20100104/file-4/BlockLocation-7
20100104/file-5/BlockLocation-8
Then, give parallel processing engine 440-3 with above-mentioned set of data blocks and split and initiate parallel scan task to parallel processing platform 420.For example four data blocks in the above-mentioned set of data blocks are assigned respectively to four parallel processing nodes and scan simultaneously.Particularly, in step S504, parallel processing platform 420 is handled above-mentioned querying command according to above-mentioned set of data blocks, returns to inquiring client terminal after the structure that parallel processing engine 440-3 handles parallel processing platform 420 merges.
More than be merely the application's illustrative embodiments, those skilled in the art, can make amendment to above-mentioned each embodiment in the application's scope thereof according to above-mentioned embodiment.

Claims (16)

1. distributed data base system comprises:
A plurality of distributed storage unit, subregion store a plurality of data block files;
Index store stores the index of said a plurality of data block files;
Resolver is resolved the Client-initiated query statement, and is selected corresponding search index;
The search index module; According to the search index of selecting; The index of searching for said a plurality of data block files is to obtain at least one data query piece collection, and said data query piece collection comprises the index key assignments and write down the positional information of data block file corresponding with said index key assignments in said a plurality of data block file; And
The parallel processing engine splits and initiates parallel scan task with said at least one data query piece collection.
2. the system of claim 1, wherein, said query statement comprises querying condition, comprises a plurality of Attribute domains of said index in the said querying condition, and
Wherein, said resolver is selected and the corresponding index of said a plurality of Attribute domains respectively after said query statement is analyzed.
3. system as claimed in claim 2, wherein, said search index module is inquired about respectively and the corresponding index of said a plurality of Attribute domains, obtaining a plurality of index data piece collection, and determines the common factor or the union of said a plurality of index data piece collection through logical operation.
4. the system of claim 1, wherein, said a plurality of data block files according to different attribute store in said a plurality of distributed storage unit under the different files catalogue.
5. the system of claim 1, wherein, said index store data blocks stored file carries out encoding compression according to the compact code form.
6. method of in distributed data base system, setting up index comprises:
The data that collection will be stored;
Said data are divided into a plurality of data blocks and confirm the corresponding data block index;
The data block that to cut apart is stored in a plurality of distributed storage unit in the said distributed data base system according to the form subregion of file; And
Institute's data blocks stored is set up index file, and wherein, each index in the said index file comprises the positional information of index key assignments and said data block.
7. method as claimed in claim 6 wherein, comprises the data block of compression according to the step that the form subregion of file is stored in a plurality of distributed storage unit in the said distributed data base system:
With the compression data block according under the different file directorys in a plurality of distributed storage unit of different data block property store in said distributed data base system.
8. method as claimed in claim 6, wherein, said positional information has write down the position of said data block in said file directory.
9. method as claimed in claim 7, wherein, said data block attribute is the time that said data block generates.
10. method as claimed in claim 6 wherein, becomes said data compression a plurality of data blocks and confirms that the step of corresponding data block index comprises:
Said data are carried out encoding compression and confirmed the corresponding data block index according to the compact code form by piece soon.
11., said data are divided into a plurality of data blocks and confirm that the step of corresponding data block index comprises like any described method among the claim 6-10:
Said data are divided into a plurality of data blocks;
A plurality of data blocks that compression is cut apart; And
Data block specified data piece index for each compression.
12. like any described method in the claim 11, wherein, said index key assignments is directed to all data blocks that this index key assignments occurred, in said index file, only is recorded in the position that occurs this index key assignments in the indication data block for the first time.
13. a querying method that is applied in the distributed data base system, said distributed data base system comprise the index that method as claimed in claim 12 forms, said querying method comprises:
Resolve query statement and determine corresponding search index;
Search index according to selecting is searched for said index file to obtain at least one data query piece collection; And
Initiate parallel scan task with said at least one data query piece collection fractionation and according to the positional information that said data query piece collection comprises.
14. querying method as claimed in claim 13, wherein, said query statement comprises querying condition, and said querying condition comprises partition list, is used to dwindle the subregion scope of inquiry.
15. querying method as claimed in claim 13, wherein, said parsing query statement and the step of determining corresponding search index comprise:
Parse and include a plurality of index attributes territory in the said querying condition, and select respectively and the corresponding index of said a plurality of Attribute domains.
16. querying method as claimed in claim 15, wherein, said search index according to selection is searched for said index file and is comprised with the step that obtains at least one data query piece collection:
The index of inquiring about said correspondence respectively obtains a plurality of index data piece collection; And
Determine the common factor or the union of said a plurality of index data piece collection through the logical operation relation.
CN2010102611675A 2010-08-24 2010-08-24 Distributed database system, method for building index therein and query method Pending CN102375853A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010102611675A CN102375853A (en) 2010-08-24 2010-08-24 Distributed database system, method for building index therein and query method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102611675A CN102375853A (en) 2010-08-24 2010-08-24 Distributed database system, method for building index therein and query method

Publications (1)

Publication Number Publication Date
CN102375853A true CN102375853A (en) 2012-03-14

Family

ID=45794475

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102611675A Pending CN102375853A (en) 2010-08-24 2010-08-24 Distributed database system, method for building index therein and query method

Country Status (1)

Country Link
CN (1) CN102375853A (en)

Cited By (71)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779160A (en) * 2012-06-14 2012-11-14 中金数据系统有限公司 Mass data information indexing system and indexing construction method
CN102833352A (en) * 2012-09-17 2012-12-19 深圳中兴网信科技有限公司 Distributed cache management system and method for implementing distributed cache management
CN102841944A (en) * 2012-08-27 2012-12-26 南京云创存储科技有限公司 Method achieving real-time processing of big data
CN102915324A (en) * 2012-08-09 2013-02-06 深圳中兴网信科技有限公司 Data storing and retrieving device and data storing and retrieving method
CN102968309A (en) * 2012-11-30 2013-03-13 亚信联创科技(中国)有限公司 Method and device for realizing rule matching based on rule engine
CN103002027A (en) * 2012-11-26 2013-03-27 中国科学院高能物理研究所 System and method for data storage on basis of key-value pair system tree-shaped directory achieving structure
CN103036891A (en) * 2012-12-19 2013-04-10 北京时代凌宇科技有限公司 Method and device based on wireless fidelity (Wi-Fi) for accessing to Internet of Things
CN103034734A (en) * 2012-12-27 2013-04-10 上海顶竹通讯技术有限公司 File storage and inquiry agency and information searching method and system
CN103064933A (en) * 2012-12-24 2013-04-24 华为技术有限公司 Data query method and system
CN103309902A (en) * 2012-03-16 2013-09-18 多玩娱乐信息技术(北京)有限公司 Method and device for storing and searching user information in social network
CN103399945A (en) * 2013-08-15 2013-11-20 成都博云科技有限公司 Data structure based on cloud computing database system
CN103473276A (en) * 2013-08-26 2013-12-25 广东电网公司电力调度控制中心 Storage method of very large data and distributed database system and retrieval method thereof
CN103488709A (en) * 2013-09-09 2014-01-01 东软集团股份有限公司 Method and system for building indexes and method and system for retrieving indexes
CN103631539A (en) * 2013-12-13 2014-03-12 百度在线网络技术(北京)有限公司 Distributed storage system and distributed storage method based on erasure coding mechanism
CN103631910A (en) * 2013-11-26 2014-03-12 烽火通信科技股份有限公司 Distributed database multi-column composite query system and method
CN103748578A (en) * 2012-07-26 2014-04-23 华为技术有限公司 Data distribution method, device, and system
CN103902698A (en) * 2014-03-31 2014-07-02 北京车商汇软件有限公司 Data storage system and data storage method
CN103902702A (en) * 2014-03-31 2014-07-02 北京车商汇软件有限公司 Data storage system and data storage method
CN103914483A (en) * 2013-01-07 2014-07-09 深圳市腾讯计算机系统有限公司 File storage method and device and file reading method and device
CN104063376A (en) * 2013-03-18 2014-09-24 阿里巴巴集团控股有限公司 Multi-dimensional grouping operation method and system
CN104133867A (en) * 2014-07-18 2014-11-05 中国科学院计算技术研究所 DOT in-fragment secondary index method and DOT in-fragment secondary index system
CN104239525A (en) * 2014-09-18 2014-12-24 浪潮软件集团有限公司 Distributed storage method based on Internet
CN104331453A (en) * 2014-10-30 2015-02-04 北京思特奇信息技术股份有限公司 Distributed file system and constructing method thereof
CN104536962A (en) * 2014-11-11 2015-04-22 珠海天琴信息科技有限公司 Data query method and data query device used in embedded system
CN104598508A (en) * 2013-09-18 2015-05-06 Ims保健公司 System and method for fast query response
CN104699815A (en) * 2015-03-24 2015-06-10 北京嘀嘀无限科技发展有限公司 Data processing method and system
CN104750690A (en) * 2013-12-25 2015-07-01 中国移动通信集团公司 Query processing method, device and system
CN104951464A (en) * 2014-03-27 2015-09-30 华为技术有限公司 Data storage method and system
CN105117171A (en) * 2015-08-28 2015-12-02 南京国电南自美卓控制系统有限公司 Energy SCADA massive data distributed processing system and method thereof
CN105488085A (en) * 2014-12-27 2016-04-13 北京安天电子设备有限公司 File positioning method and system through log
CN105512200A (en) * 2015-11-26 2016-04-20 华为技术有限公司 Distributed database processing method and device
WO2016119275A1 (en) * 2015-01-30 2016-08-04 深圳市华傲数据技术有限公司 Network account identifying and matching method
CN105843933A (en) * 2016-03-30 2016-08-10 电子科技大学 Index building method for distributed memory columnar database
CN105868253A (en) * 2015-12-23 2016-08-17 乐视网信息技术(北京)股份有限公司 Data importing and query methods and apparatuses
CN105912687A (en) * 2016-04-19 2016-08-31 江苏物联网研究发展中心 Mass distributed database memory cell
WO2016141584A1 (en) * 2015-03-12 2016-09-15 Intel Corporation Method and apparatus for compaction of data received over a network
WO2016165509A1 (en) * 2015-04-15 2016-10-20 Huawei Technologies Co., Ltd. Big data statistics at data-block level
CN106126545A (en) * 2016-06-15 2016-11-16 北京智能管家科技有限公司 Distributed fission querying method and device
CN106250409A (en) * 2016-07-21 2016-12-21 中国农业银行股份有限公司 Data query method and device
CN106503128A (en) * 2016-10-19 2017-03-15 许继集团有限公司 A kind of intelligent electric meter data query method and system
CN103678520B (en) * 2013-11-29 2017-03-29 中国科学院计算技术研究所 A kind of multi-dimensional interval query method and its system based on cloud computing
CN106599127A (en) * 2016-12-01 2017-04-26 深圳市风云实业有限公司 Log storage and query method applied to standalone server
CN106776702A (en) * 2016-11-11 2017-05-31 北京奇虎科技有限公司 A kind of method and apparatus for processing the index in master-slave mode Database Systems
CN106933891A (en) * 2015-12-31 2017-07-07 华为技术有限公司 Access the method for distributed data base and the device of Distributed database service
CN107273540A (en) * 2017-07-05 2017-10-20 北京三快在线科技有限公司 Distributed search and index updating method, system, server and computer equipment
CN107273556A (en) * 2017-08-23 2017-10-20 上海点融信息科技有限责任公司 Block chain data index method and equipment
CN107315806A (en) * 2017-06-26 2017-11-03 杭州时趣信息技术有限公司 A kind of embedded storage method and device based on file system
CN107463632A (en) * 2016-09-21 2017-12-12 广州特道信息科技有限公司 A kind of distributed NewSQL Database Systems and data query method
CN107844542A (en) * 2017-10-26 2018-03-27 山东浪潮通软信息科技有限公司 A kind of distributed document storage method and device
CN108038113A (en) * 2017-10-16 2018-05-15 武汉楚鼎信息技术有限公司 Search method and system based on internet financial intelligent question and answer
CN108052649A (en) * 2017-12-26 2018-05-18 广州泼墨神网络科技有限公司 The data managing method and its system of a kind of distributed file system
CN108256076A (en) * 2018-01-18 2018-07-06 广州大学 Distributed mass data processing method and processing device
CN108804510A (en) * 2017-04-26 2018-11-13 三星电子株式会社 Key assignments file system
CN108932258A (en) * 2017-05-25 2018-12-04 华为技术有限公司 Data directory processing method and processing device
CN109120885A (en) * 2017-06-26 2019-01-01 杭州海康威视数字技术股份有限公司 Video data acquisition methods and device
WO2019080790A1 (en) * 2017-10-26 2019-05-02 Huawei Technologies Co., Ltd. Method and apparatus for storing and retrieving information in a distributed database
CN109828862A (en) * 2017-11-23 2019-05-31 成都华为技术有限公司 A kind of method and apparatus playing back log
CN109948238A (en) * 2019-03-15 2019-06-28 中民筑友科技投资有限公司 A kind of management method, device, system and the storage medium of assembled design document
CN110352414A (en) * 2017-12-29 2019-10-18 北京嘀嘀无限科技发展有限公司 For the system and method for big data addition index
CN110419026A (en) * 2017-03-10 2019-11-05 维萨国际服务协会 Search technique in memory
CN110678854A (en) * 2017-05-31 2020-01-10 华为技术有限公司 Data query method and device
CN110990357A (en) * 2019-12-18 2020-04-10 深圳市商汤科技有限公司 Data processing method, device and system, electronic equipment and storage medium
CN111008200A (en) * 2019-12-18 2020-04-14 北京数衍科技有限公司 Data query method and device and server
CN111026827A (en) * 2019-12-06 2020-04-17 北京地拓科技发展有限公司 Data service method and device for soil erosion factors and electronic equipment
CN111209321A (en) * 2019-12-25 2020-05-29 北京永洪商智科技有限公司 Grouping data mart method for complex query
CN111221851A (en) * 2018-11-27 2020-06-02 北京京东尚科信息技术有限公司 Lucene-based mass data query and storage method and device
CN111382185A (en) * 2018-12-28 2020-07-07 无锡赫名网络技术有限公司 Method for counting and applying mobile phone numbers by adopting partition positioning
CN111444219A (en) * 2020-03-30 2020-07-24 深圳天岳创新科技有限公司 Distributed data processing method and device and electronic equipment
CN112115211A (en) * 2020-09-19 2020-12-22 东方蓝天钛金科技有限公司 Detection report query system and query method thereof
CN112632129A (en) * 2020-12-31 2021-04-09 联想未来通信科技(重庆)有限公司 Code stream data management method, device and storage medium
CN113901293A (en) * 2021-09-30 2022-01-07 苏州浪潮智能科技有限公司 Metadata management method, electronic device, and computer-readable storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101246500A (en) * 2008-03-27 2008-08-20 腾讯科技(深圳)有限公司 Retrieval system and method for implementing data fast indexing
CN101727465A (en) * 2008-11-03 2010-06-09 中国移动通信集团公司 Methods for establishing and inquiring index of distributed column storage database, device and system thereof

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101246500A (en) * 2008-03-27 2008-08-20 腾讯科技(深圳)有限公司 Retrieval system and method for implementing data fast indexing
CN101727465A (en) * 2008-11-03 2010-06-09 中国移动通信集团公司 Methods for establishing and inquiring index of distributed column storage database, device and system thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
李晔锋: "数据仓库的存储研究", 《中国优秀硕士学位论文全文数据库信息科技辑》, 15 October 2009 (2009-10-15), pages 1 - 63 *
董继润: "关系数据库和顺序相关性", 《山东大学学报》, no. 4, 31 December 1983 (1983-12-31), pages 31 - 39 *
谢力军等: "几种索引技术的比较", 《怀化学院学报》, vol. 28, no. 8, 31 August 2009 (2009-08-31), pages 115 - 118 *

Cited By (110)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103309902A (en) * 2012-03-16 2013-09-18 多玩娱乐信息技术(北京)有限公司 Method and device for storing and searching user information in social network
CN102779160B (en) * 2012-06-14 2016-02-03 中金数据系统有限公司 Mass data information index system and index structuring method
CN102779160A (en) * 2012-06-14 2012-11-14 中金数据系统有限公司 Mass data information indexing system and indexing construction method
CN103748578A (en) * 2012-07-26 2014-04-23 华为技术有限公司 Data distribution method, device, and system
CN102915324B (en) * 2012-08-09 2016-08-03 深圳中兴网信科技有限公司 Data storage and retrieval device and data storage and retrieval method
CN102915324A (en) * 2012-08-09 2013-02-06 深圳中兴网信科技有限公司 Data storing and retrieving device and data storing and retrieving method
CN102841944A (en) * 2012-08-27 2012-12-26 南京云创存储科技有限公司 Method achieving real-time processing of big data
CN102833352A (en) * 2012-09-17 2012-12-19 深圳中兴网信科技有限公司 Distributed cache management system and method for implementing distributed cache management
CN103002027B (en) * 2012-11-26 2015-09-02 中国科学院高能物理研究所 Data-storage system and the method for tree directory structure is realized based on key-value pair system
CN103002027A (en) * 2012-11-26 2013-03-27 中国科学院高能物理研究所 System and method for data storage on basis of key-value pair system tree-shaped directory achieving structure
CN102968309A (en) * 2012-11-30 2013-03-13 亚信联创科技(中国)有限公司 Method and device for realizing rule matching based on rule engine
CN102968309B (en) * 2012-11-30 2016-01-20 亚信科技(中国)有限公司 A kind of rule matching method and device realizing rule-based engine
CN103036891A (en) * 2012-12-19 2013-04-10 北京时代凌宇科技有限公司 Method and device based on wireless fidelity (Wi-Fi) for accessing to Internet of Things
CN103064933B (en) * 2012-12-24 2016-06-29 华为技术有限公司 Data query method and system
CN103064933A (en) * 2012-12-24 2013-04-24 华为技术有限公司 Data query method and system
WO2014101445A1 (en) * 2012-12-24 2014-07-03 华为技术有限公司 Data query method and system
CN103034734A (en) * 2012-12-27 2013-04-10 上海顶竹通讯技术有限公司 File storage and inquiry agency and information searching method and system
CN103914483B (en) * 2013-01-07 2018-09-25 深圳市腾讯计算机系统有限公司 File memory method, device and file reading, device
CN103914483A (en) * 2013-01-07 2014-07-09 深圳市腾讯计算机系统有限公司 File storage method and device and file reading method and device
WO2014106418A1 (en) * 2013-01-07 2014-07-10 Tencent Technology (Shenzhen) Company Limited Method and apparatus for storing and reading files
CN104063376A (en) * 2013-03-18 2014-09-24 阿里巴巴集团控股有限公司 Multi-dimensional grouping operation method and system
CN103399945A (en) * 2013-08-15 2013-11-20 成都博云科技有限公司 Data structure based on cloud computing database system
CN103473276B (en) * 2013-08-26 2017-08-25 广东电网公司电力调度控制中心 Ultra-large type date storage method, distributed data base system and its search method
CN103473276A (en) * 2013-08-26 2013-12-25 广东电网公司电力调度控制中心 Storage method of very large data and distributed database system and retrieval method thereof
CN103488709B (en) * 2013-09-09 2017-06-16 东软集团股份有限公司 A kind of index establishing method and system, search method and system
CN103488709A (en) * 2013-09-09 2014-01-01 东软集团股份有限公司 Method and system for building indexes and method and system for retrieving indexes
CN104598508B (en) * 2013-09-18 2021-06-08 Iqvia 公司 System and method for fast query response
CN104598508A (en) * 2013-09-18 2015-05-06 Ims保健公司 System and method for fast query response
CN103631910A (en) * 2013-11-26 2014-03-12 烽火通信科技股份有限公司 Distributed database multi-column composite query system and method
CN103678520B (en) * 2013-11-29 2017-03-29 中国科学院计算技术研究所 A kind of multi-dimensional interval query method and its system based on cloud computing
CN103631539A (en) * 2013-12-13 2014-03-12 百度在线网络技术(北京)有限公司 Distributed storage system and distributed storage method based on erasure coding mechanism
CN103631539B (en) * 2013-12-13 2016-08-24 百度在线网络技术(北京)有限公司 Distributed memory system based on erasure codes mechanism and storage method thereof
CN104750690A (en) * 2013-12-25 2015-07-01 中国移动通信集团公司 Query processing method, device and system
CN104951464A (en) * 2014-03-27 2015-09-30 华为技术有限公司 Data storage method and system
CN104951464B (en) * 2014-03-27 2018-09-11 华为技术有限公司 Date storage method and system
CN103902702B (en) * 2014-03-31 2017-11-28 北京皮尔布莱尼软件有限公司 A kind of data-storage system and storage method
CN103902698B (en) * 2014-03-31 2018-04-13 北京皮尔布莱尼软件有限公司 A kind of data-storage system and storage method
CN103902698A (en) * 2014-03-31 2014-07-02 北京车商汇软件有限公司 Data storage system and data storage method
CN103902702A (en) * 2014-03-31 2014-07-02 北京车商汇软件有限公司 Data storage system and data storage method
CN104133867A (en) * 2014-07-18 2014-11-05 中国科学院计算技术研究所 DOT in-fragment secondary index method and DOT in-fragment secondary index system
CN104239525A (en) * 2014-09-18 2014-12-24 浪潮软件集团有限公司 Distributed storage method based on Internet
CN104331453B (en) * 2014-10-30 2017-10-17 北京思特奇信息技术股份有限公司 A kind of construction method of distributed file system and distributed file system
CN104331453A (en) * 2014-10-30 2015-02-04 北京思特奇信息技术股份有限公司 Distributed file system and constructing method thereof
CN104536962A (en) * 2014-11-11 2015-04-22 珠海天琴信息科技有限公司 Data query method and data query device used in embedded system
CN105488085A (en) * 2014-12-27 2016-04-13 北京安天电子设备有限公司 File positioning method and system through log
WO2016119275A1 (en) * 2015-01-30 2016-08-04 深圳市华傲数据技术有限公司 Network account identifying and matching method
US10701168B2 (en) 2015-03-12 2020-06-30 Intel Corporation Method and apparatus for compaction of data received over a network
CN107409152B (en) * 2015-03-12 2021-10-15 英特尔公司 Method and apparatus for compressing data received over a network
US10015272B2 (en) 2015-03-12 2018-07-03 Intel Corporation Method and apparatus for compaction of data received over a network
WO2016141584A1 (en) * 2015-03-12 2016-09-15 Intel Corporation Method and apparatus for compaction of data received over a network
CN107409152A (en) * 2015-03-12 2017-11-28 英特尔公司 Method and apparatus for compressing the data received by network
CN104699815A (en) * 2015-03-24 2015-06-10 北京嘀嘀无限科技发展有限公司 Data processing method and system
CN107533551B (en) * 2015-04-15 2020-08-07 华为技术有限公司 Big data statistics at data Block level
WO2016165509A1 (en) * 2015-04-15 2016-10-20 Huawei Technologies Co., Ltd. Big data statistics at data-block level
CN107533551A (en) * 2015-04-15 2018-01-02 华为技术有限公司 The other big data statistics of block level
CN105117171A (en) * 2015-08-28 2015-12-02 南京国电南自美卓控制系统有限公司 Energy SCADA massive data distributed processing system and method thereof
US10872101B2 (en) 2015-11-26 2020-12-22 Huawei Technologies Co., Ltd. Distributed database processing method and device
CN105512200A (en) * 2015-11-26 2016-04-20 华为技术有限公司 Distributed database processing method and device
WO2017088358A1 (en) * 2015-11-26 2017-06-01 华为技术有限公司 Distributed database processing method and device
CN105868253A (en) * 2015-12-23 2016-08-17 乐视网信息技术(北京)股份有限公司 Data importing and query methods and apparatuses
CN106933891A (en) * 2015-12-31 2017-07-07 华为技术有限公司 Access the method for distributed data base and the device of Distributed database service
CN105843933A (en) * 2016-03-30 2016-08-10 电子科技大学 Index building method for distributed memory columnar database
CN105843933B (en) * 2016-03-30 2019-01-29 电子科技大学 The index establishing method of distributed memory columnar database
CN105912687A (en) * 2016-04-19 2016-08-31 江苏物联网研究发展中心 Mass distributed database memory cell
CN105912687B (en) * 2016-04-19 2019-05-24 江苏物联网研究发展中心 Magnanimity distributed data base storage unit
CN106126545A (en) * 2016-06-15 2016-11-16 北京智能管家科技有限公司 Distributed fission querying method and device
CN106250409A (en) * 2016-07-21 2016-12-21 中国农业银行股份有限公司 Data query method and device
CN107463632A (en) * 2016-09-21 2017-12-12 广州特道信息科技有限公司 A kind of distributed NewSQL Database Systems and data query method
CN107463632B (en) * 2016-09-21 2020-06-09 云润大数据服务有限公司 Distributed NewSQL database system and data query method
CN106503128A (en) * 2016-10-19 2017-03-15 许继集团有限公司 A kind of intelligent electric meter data query method and system
CN106776702A (en) * 2016-11-11 2017-05-31 北京奇虎科技有限公司 A kind of method and apparatus for processing the index in master-slave mode Database Systems
CN106599127A (en) * 2016-12-01 2017-04-26 深圳市风云实业有限公司 Log storage and query method applied to standalone server
US11687542B2 (en) 2017-03-10 2023-06-27 Visa International Service Association Techniques for in-memory data searching
CN110419026B (en) * 2017-03-10 2024-04-16 维萨国际服务协会 In-memory search technique
CN110419026A (en) * 2017-03-10 2019-11-05 维萨国际服务协会 Search technique in memory
CN108804510A (en) * 2017-04-26 2018-11-13 三星电子株式会社 Key assignments file system
CN108932258B (en) * 2017-05-25 2022-05-24 华为技术有限公司 Data index processing method and device
CN108932258A (en) * 2017-05-25 2018-12-04 华为技术有限公司 Data directory processing method and processing device
CN110678854B (en) * 2017-05-31 2021-10-15 华为技术有限公司 Data query method and device
CN110678854A (en) * 2017-05-31 2020-01-10 华为技术有限公司 Data query method and device
CN107315806B (en) * 2017-06-26 2020-04-10 杭州时趣信息技术有限公司 Embedded storage method and device based on file system
CN107315806A (en) * 2017-06-26 2017-11-03 杭州时趣信息技术有限公司 A kind of embedded storage method and device based on file system
CN109120885A (en) * 2017-06-26 2019-01-01 杭州海康威视数字技术股份有限公司 Video data acquisition methods and device
CN107273540A (en) * 2017-07-05 2017-10-20 北京三快在线科技有限公司 Distributed search and index updating method, system, server and computer equipment
CN107273556A (en) * 2017-08-23 2017-10-20 上海点融信息科技有限责任公司 Block chain data index method and equipment
CN108038113A (en) * 2017-10-16 2018-05-15 武汉楚鼎信息技术有限公司 Search method and system based on internet financial intelligent question and answer
CN107844542A (en) * 2017-10-26 2018-03-27 山东浪潮通软信息科技有限公司 A kind of distributed document storage method and device
US11256676B2 (en) 2017-10-26 2022-02-22 Huawei Technologies Co., Ltd. Method and apparatus for storing and retrieving information in a distributed database
WO2019080790A1 (en) * 2017-10-26 2019-05-02 Huawei Technologies Co., Ltd. Method and apparatus for storing and retrieving information in a distributed database
CN109828862B (en) * 2017-11-23 2023-08-22 成都华为技术有限公司 Method and device for replaying log
CN109828862A (en) * 2017-11-23 2019-05-31 成都华为技术有限公司 A kind of method and apparatus playing back log
CN108052649A (en) * 2017-12-26 2018-05-18 广州泼墨神网络科技有限公司 The data managing method and its system of a kind of distributed file system
CN110352414B (en) * 2017-12-29 2022-11-11 北京嘀嘀无限科技发展有限公司 System and method for adding index to big data
CN110352414A (en) * 2017-12-29 2019-10-18 北京嘀嘀无限科技发展有限公司 For the system and method for big data addition index
CN108256076A (en) * 2018-01-18 2018-07-06 广州大学 Distributed mass data processing method and processing device
CN111221851A (en) * 2018-11-27 2020-06-02 北京京东尚科信息技术有限公司 Lucene-based mass data query and storage method and device
CN111382185A (en) * 2018-12-28 2020-07-07 无锡赫名网络技术有限公司 Method for counting and applying mobile phone numbers by adopting partition positioning
CN111382185B (en) * 2018-12-28 2024-01-26 无锡赫名网络技术有限公司 Method for counting and applying mobile phone numbers by adopting partition positioning
CN109948238A (en) * 2019-03-15 2019-06-28 中民筑友科技投资有限公司 A kind of management method, device, system and the storage medium of assembled design document
CN111026827A (en) * 2019-12-06 2020-04-17 北京地拓科技发展有限公司 Data service method and device for soil erosion factors and electronic equipment
CN111008200B (en) * 2019-12-18 2024-01-16 北京数衍科技有限公司 Data query method, device and server
CN111008200A (en) * 2019-12-18 2020-04-14 北京数衍科技有限公司 Data query method and device and server
CN110990357A (en) * 2019-12-18 2020-04-10 深圳市商汤科技有限公司 Data processing method, device and system, electronic equipment and storage medium
CN111209321A (en) * 2019-12-25 2020-05-29 北京永洪商智科技有限公司 Grouping data mart method for complex query
CN111444219A (en) * 2020-03-30 2020-07-24 深圳天岳创新科技有限公司 Distributed data processing method and device and electronic equipment
CN112115211A (en) * 2020-09-19 2020-12-22 东方蓝天钛金科技有限公司 Detection report query system and query method thereof
CN112632129B (en) * 2020-12-31 2023-11-21 联想未来通信科技(重庆)有限公司 Code stream data management method, device and storage medium
CN112632129A (en) * 2020-12-31 2021-04-09 联想未来通信科技(重庆)有限公司 Code stream data management method, device and storage medium
CN113901293B (en) * 2021-09-30 2024-01-16 苏州浪潮智能科技有限公司 Metadata management method, electronic device, and computer-readable storage medium
CN113901293A (en) * 2021-09-30 2022-01-07 苏州浪潮智能科技有限公司 Metadata management method, electronic device, and computer-readable storage medium

Similar Documents

Publication Publication Date Title
CN102375853A (en) Distributed database system, method for building index therein and query method
US10860598B2 (en) Systems and methods for interest-driven business intelligence systems including event-oriented data
CN102193917B (en) Method and device for processing and querying data
CN107423422B (en) Spatial data distributed storage and search method and system based on grid
CN103020315B (en) A kind of mass small documents storage means based on master-salve distributed file system
CN104252536B (en) A kind of internet log data query method and device based on hbase
US10061834B1 (en) Incremental out-of-place updates for datasets in data stores
CN104239377A (en) Platform-crossing data retrieval method and device
CN103838867A (en) Log processing method and device
CN102332030A (en) Data storing, managing and inquiring method and system for distributed key-value storage system
CN102467572B (en) Data block inquiring method for supporting data de-duplication program
EP2263180A2 (en) Indexing large-scale gps tracks
CN110275920A (en) Data query method, apparatus, electronic equipment and computer readable storage medium
CN102646130A (en) Method for storing and indexing mass historical data
CN103198150B (en) A kind of large data index method and system
CN102779138B (en) The hard disk access method of real time data
CN104424219B (en) A kind of management method and device of data file
CN106528847A (en) Multi-dimensional processing method and system for massive data
CN103473239A (en) Method and device for updating data of non relational database
CN104809182A (en) Method for web crawler URL (uniform resource locator) deduplicating based on DSBF (dynamic splitting Bloom Filter)
CN103353901B (en) The orderly management method of table data based on Hadoop distributed file system and system
CN102890714A (en) Method and device for indexing data
CN102930060A (en) Method and device for performing fast indexing of database
CN104951464A (en) Data storage method and system
CN102968456B (en) A kind of raster data reading and processing method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120314