CN103631910A - Distributed database multi-column composite query system and method - Google Patents

Distributed database multi-column composite query system and method Download PDF

Info

Publication number
CN103631910A
CN103631910A CN201310615977.XA CN201310615977A CN103631910A CN 103631910 A CN103631910 A CN 103631910A CN 201310615977 A CN201310615977 A CN 201310615977A CN 103631910 A CN103631910 A CN 103631910A
Authority
CN
China
Prior art keywords
index
query
data
module
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310615977.XA
Other languages
Chinese (zh)
Inventor
孙杰
阎星娥
赵万亮
杨昆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fiberhome Telecommunication Technologies Co Ltd
Original Assignee
Fiberhome Telecommunication Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fiberhome Telecommunication Technologies Co Ltd filed Critical Fiberhome Telecommunication Technologies Co Ltd
Priority to CN201310615977.XA priority Critical patent/CN103631910A/en
Publication of CN103631910A publication Critical patent/CN103631910A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24553Query execution of query operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures

Abstract

The invention discloses a distributed database multi-column composite query system and method. The distributed database multi-column composite query system is composed of a storage subsystem, an index subsystem, a linear sequence generator, a database entry module and a query module. When data enter a database and indexes are built, a monotone increasing sequential value is generated for each data record, and values of index fields and the monotone increasing sequential values are combined and used as row keys of an index table. When indexes are scanned, returned results are sequenced according to the sequence of the row keys, execution efficiency is high, and occupied system resources are few. Query of index key values, merging of index results and search of the storage subsystem can be concurrently executed, and therefore the query response speed is greatly increased.

Description

A kind of system and method for distributed data base multiple row compound query
Technical field
The application belongs to areas of information technology, relates in particular to a kind of system and method for distributed data base multiple row compound query.
Background technology
Current a lot of industry, every day, along with the development of technology and business, the speed that data produce was constantly accelerated all producing a large amount of data, and data volume constantly expands.For this massive data sets, store and therefrom search fast the data that need, traditional database is not too applicable, so the various distributed data bases that have been born.
In large-scale data management, the key factor that affects data query speed is to need data volume and the disk I/O of access.Index technology is in database practice, to improve the important method of query performance.
In at present common distributed data base system, for multiple row inquiry, querying condition comprises the inquiry of a plurality of index key assignments, conventionally has following several processing mode:
1. according to the inquiry that indexes respectively of each index key assignments, obtain series of results collection, then according to the logical relation between each index key assignments, each result set is got and occured simultaneously or get union, finally obtain a result set that there is no repetition.Whether every result need to searching successively during merging in each result set is present among other result sets, for improving combined efficiency, conventionally has again two kinds of specific implementations:
A) each result set is sorted, the result set after sequence is done and merged again;
B) value of each result set is deposited in HASH container, improve seek rate.
2. from a plurality of index key assignments, choose an inquiry that indexes that selectivity ratios is higher, obtain a result set, scan the total data in this result set, use other these data of index key-value pair that do not index inquiry in querying condition to filter, obtain final query results.
Such as inquiry below:
select*from?user_info?where?username=‘CC’and?sex=‘male’,
Separately according to username, search the number of results obtaining fewer, the selectivity ratios that is username row is higher, so only search according to this condition of username=' CC ', travel through its result set, the result of the sex=' male ' that wherein satisfies condition is returned to inquiring user.
Yet, when prior art is inquired about in multiple row, there is the problems such as inefficiency, resources occupation rate be higher.
In aforementioned processing mode 1 (a), need to sort to each result set, must wait the inquiry of each index key assignments all to finish, just can complete sequence, after having sorted, could start to do and merge and return results.While adopting in this way, if there is the data volume of a result set very large, even if the data volume of all the other result sets is all very little, also cannot return results very soon, its response speed is limited by a slowest subquery.
In aforementioned processing mode 1 (b), each result set need to be deposited in to HASH class container, can take larger internal memory like this, when result set data volume is very large, also can surpass system peak load.
Aforementioned processing mode 2, be only applicable to logical relation between a plurality of index key assignments for situation, if the logical relation between a plurality of index key assignments is or, inapplicable.Secondly, due to the business in actual motion environment and data complicated and changeable, accurately choosing alternative large index key assignments is not easy to accomplish, sometimes or even cannot accomplish, list is done inquiry to an index key assignments and can be obtained a lot of results like this, the data that these indexed results are corresponding all read out and filter from raw data memory module, can cause a large amount of disk I/O, and excessive data access amount and the disk I/O common performance bottleneck place of high-volume database just.
Summary of the invention
The technical matters that present patent application will solve is: a kind of optimization method at distributed data base multiple row compound query is provided, solves current distributed data base system for problems such as multiple row search efficiency are low, resources occupation rate is higher.
In order to solve the problems of the technologies described above, present patent application provides a kind of system and method for distributed data base multiple row compound query.Described in the application system by storage subsystem, index subsystem, linear order maker, enter library module, enquiry module forms, wherein:
Storage subsystem adopts distributed file system, comprises a plurality of data blocks of partitioned storage, for storing complete raw data;
Index subsystem adopts distributed column storage database, for storing the index of data;
Linear order maker is that each data recording generates a monotonically increasing sequential value before data loading;
Enter library module and be responsible for raw data to write storage subsystem, and in index subsystem, set up corresponding index;
Enquiry module is divided into again youngster's submodules such as query parse module, search index module, raw data scan module, and enquiry module is responsible for processing user's inquiry request, returns to Query Result.
When data loading is set up index, for each data recording generates a monotone-increasing sequence value, the value of index field and monotone-increasing sequence value are combined to the line unit as concordance list.During index scanning, return results by row key sequencing.Like this, when inquiring about according to the index key assignments of some appointments, the result obtaining is by its sequential value sequence.Thereby, the Query Result of a plurality of index key assignments is done and merged, be that a plurality of ordered queues are done to merger, its execution efficiency is higher and resources occupation rate is lower, contributes to improve inquiry response speed and the supported concurrent number of system.
During data query, query parse module in enquiry module is decomposed into the sub-condition of multiple queries by query statement, each inquires about sub-condition is an index key assignments, index key assignments can obtain a series of data recording that comprise this index key assignments thus, and the memory location of these data recording, form a result set.Enquiry module merges into one by these result sets.During union operation, can as distinguishing, whether be the foundation of different records with monotone-increasing sequence value or the memory location of record.The result set obtaining according to merging, searches storage subsystem, and the original data record content obtaining is returned to inquiring client terminal.
The application's useful consequence is:
1, because every sub-result set is all to sort according to unified monotone-increasing sequence, so the method union operation execution speed of the distributed data base multiple row compound query described in present patent application is than very fast;
2, during the inquiry returning part result of each index key assignments, just can start these results to do and merge, needn't wait the poll-final of each index key assignments to do and merge again;
3, meanwhile, according to the result set merging, search storage subsystem and also needn't wait and to be combinedly all complete, like this, the inquiry of index key assignments, the merging of indexed results, search storage subsystem and can concurrently carry out, greatly improved inquiry response speed.
4, owing to entering determinant storage, access needed IO amount and be confined to needed field, greatly reduced IO visiting demand.
Through measuring and calculating and simulation, so data access optimization, process optimization and result set calculate after pretrigger, IO request decreased average half, can improve response speed more than one times; If set up, return to transformation, response speed can improve more than ten times.
Accompanying drawing explanation
Accompanying drawing 1 is system architecture diagram
Accompanying drawing 2 is data loading process flow diagram
Accompanying drawing 3 is the concordance list schematic diagram of embodiment 1
Accompanying drawing 4 is data query process flow diagram
Embodiment
The system of a kind of distributed data base multiple row compound query described in present patent application by index subsystem, linear order maker, enter library module, enquiry module forms.Its system architecture diagram as shown in Figure 1.Wherein, enquiry module comprises query parse module, search index module, raw data scan module.
Data loading flow process as shown in Figure 2, before data loading, is a sequential value of each data recording generation.This sequential value is generated by linear order maker, is a monotone-increasing sequence.Preferably, if there is such field in raw readings, its value meets monotone increasing condition and not for empty, linear order maker can directly use the value of this field as sequential value.
During data loading, first deposit raw data in primary data storage subsystem, obtain data storage location, then this data recording is set up to index.
In a raw data table, can set up respectively index to a plurality of fields.While setting up index, will in raw data, need the field that is used as querying condition as index field, each index field is a corresponding concordance list in index subsystem.Every index comprises line unit and two parts of row value, and line unit is comprised of the value of index field and monotone-increasing sequence value two parts of this data recording; Row value is recorded in the memory location in storage subsystem for data, and described data storage location comprises that the position of data recording place data block and data are recorded in the side-play amount in data block, so can be directly targeted to data recording according to this memory location.
During index scanning, the result of returning is by line unit sequence, and while therefore inquiring about with a certain assigned indexes key assignments, the result obtaining sorts by sequential value.
Embodiment 1: have a customer transaction record sheet (ExchangeInfo), each customer transaction information comprises user identification field (UserName), merchandise classification field (Category), transaction value field (Price), in addition be that every record generates a sequential value (Sequence), trading record sheet detailed data is as shown in the table:
Table 1 customer transaction information table
Sequence UserName Category Price
1 Zhang San General merchandise 100
2 Li Si Digital 1000
3 Li Si General merchandise 200
4 King five General merchandise 300
Take user ID and merchandise classification as index field, corresponding two concordance lists in index subsystem, user ID concordance list and merchandise classification concordance list, as shown in Figure 3, concordance list comprises two row, line unit (RowKey), row value (being data recording memory location (RecordLocation)).
Data query flow process as shown in Figure 4.Query parse module is decomposed into the sub-condition of multiple queries by query statement, each inquires about sub-condition is an index key assignments, index key assignments can obtain a series of data recording that comprise this index key assignments thus, and the memory location of these data recording, forms a result set.Enquiry module merges into one by these result sets.
When the logical relation of inquiry between sub-condition be " with " time, each is inquired about to subconditional result set and gets common factor; If have a sub-condition of inquiry poll-final and its Query Result all completed merger, or Query Result quantity reaches the transformation that returns results of setting, stops other and inquires about subconditional inquiry and result set merges operation;
When the logical relation between the sub-condition of inquiry is "or", each is inquired about to subconditional result set and get union.If now Zhi Sheng mono-tunnel result does not have merger to finish, all the other results can directly be put into the result set after merging.
The explanation of above embodiment is only applicable to help to understand the principle of present patent application, simultaneously to one of ordinary skill in the art, according to present patent application embodiment, in embodiment and range of application, all will change, so this description should not be construed as the restriction to present patent application.

Claims (6)

1. a system for distributed data base multiple row compound query, is characterized in that: by storage subsystem, index subsystem, linear order maker, enter library module, enquiry module forms.
2. the system of a kind of distributed data base multiple row compound query as claimed in claim 1, it is characterized in that: storage subsystem adopts distributed file system, index subsystem adopts distributed column storage database, and enquiry module comprises query parse module, search index module and raw data scan module.
3. the method for a distributed data base multiple row compound query, it is characterized in that: when data loading is set up index, for each data recording generates a monotone-increasing sequence value, the value of index field and monotone-increasing sequence value are combined to the line unit as concordance list; During index scanning, return results by row key sequencing.
4. the method for a kind of distributed data base multiple row compound query as claimed in claim 3, it is characterized in that: if existed the value of a field to meet monotone increasing condition in raw readings and not for empty, can directly use the value of this field as sequential value.
5. the method for a kind of distributed data base multiple row compound query as claimed in claim 3, it is characterized in that: during data query, query parse module in enquiry module is decomposed into the sub-condition of multiple queries by query statement, each inquires about sub-condition is an index key assignments, index key assignments can obtain a series of data recording that comprise this index key assignments thus, and the memory location of these data recording, form a result set; Enquiry module merges into one by these result sets, and the result set obtaining according to merging is searched storage subsystem, then the original data record content obtaining is returned to inquiring client terminal.
6. the method for a kind of distributed data base multiple row compound query as claimed in claim 3, is characterized in that:
When the logical relation of inquiry between sub-condition be " with " time, each is inquired about to subconditional result set and gets common factor; If have a sub-condition of inquiry poll-final and its Query Result all completed merger, or Query Result quantity reaches the transformation that returns results of setting, stops other and inquires about subconditional inquiry and result set merges operation;
When the logical relation between the sub-condition of inquiry is "or", each is inquired about to subconditional result set and get union; If Zhi Sheng mono-tunnel result does not have merger to finish, all the other results are directly put into the result set after merging.
CN201310615977.XA 2013-11-26 2013-11-26 Distributed database multi-column composite query system and method Pending CN103631910A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310615977.XA CN103631910A (en) 2013-11-26 2013-11-26 Distributed database multi-column composite query system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310615977.XA CN103631910A (en) 2013-11-26 2013-11-26 Distributed database multi-column composite query system and method

Publications (1)

Publication Number Publication Date
CN103631910A true CN103631910A (en) 2014-03-12

Family

ID=50212951

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310615977.XA Pending CN103631910A (en) 2013-11-26 2013-11-26 Distributed database multi-column composite query system and method

Country Status (1)

Country Link
CN (1) CN103631910A (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462206A (en) * 2014-10-31 2015-03-25 国云科技股份有限公司 General database sequence generating method
CN104462291A (en) * 2014-11-27 2015-03-25 杭州华为数字技术有限公司 Method and device for data processing
CN105574093A (en) * 2015-12-10 2016-05-11 深圳市华讯方舟软件技术有限公司 Method for establishing index in HDFS based spark-sql big data processing system
CN105589915A (en) * 2014-11-06 2016-05-18 郑毓融 Database acceleration method through operation index value and hybrid layer cache
CN105740373A (en) * 2016-01-27 2016-07-06 国网上海市电力公司 Distributed memory based virtual reality platform data query method
CN105930407A (en) * 2016-04-18 2016-09-07 北京思特奇信息技术股份有限公司 Cross-database associated query method and system for distributed database
WO2016180123A1 (en) * 2015-09-25 2016-11-17 中兴通讯股份有限公司 Hbase second-level index creation method and device
CN106250409A (en) * 2016-07-21 2016-12-21 中国农业银行股份有限公司 Data query method and device
CN106445968A (en) * 2015-08-11 2017-02-22 阿里巴巴集团控股有限公司 Data merging method and device
CN106802900A (en) * 2015-11-26 2017-06-06 北京国双科技有限公司 Search method and device based on star database
CN106845263A (en) * 2015-12-04 2017-06-13 阿里巴巴集团控股有限公司 A kind of method for accessing database, device and electronic equipment
CN106844539A (en) * 2016-12-30 2017-06-13 曙光信息产业(北京)有限公司 Real-time data analysis method and system
CN106933206A (en) * 2015-10-09 2017-07-07 费希尔-罗斯蒙特系统公司 The inquiry independently of source in distributed industrial systems
CN108170726A (en) * 2015-10-21 2018-06-15 华为技术有限公司 Data query method and apparatus
CN108959457A (en) * 2018-06-15 2018-12-07 北京文创园投资管理有限公司 A kind of inquiry of certificate, verification method and system
CN109271409A (en) * 2018-11-08 2019-01-25 成都索贝数码科技股份有限公司 Database fragmentation execution method based on container resource allocation
CN109471863A (en) * 2018-11-12 2019-03-15 北京懿医云科技有限公司 Information query method and device, electronic equipment based on distributed data base
CN110019218A (en) * 2017-12-08 2019-07-16 阿里巴巴集团控股有限公司 Data storage and querying method and equipment
CN112104743A (en) * 2020-09-21 2020-12-18 北京金山云网络技术有限公司 Sequence generation method and device and electronic equipment
CN112416925A (en) * 2020-11-02 2021-02-26 浙商银行股份有限公司 Query method based on ordered distributed index structure and distributed database system
CN112445873A (en) * 2020-12-02 2021-03-05 深圳市镜玩科技有限公司 List display processing method, related device, equipment and medium
CN110019212B (en) * 2017-11-29 2021-06-18 杭州海康威视数字技术股份有限公司 Data processing method and device and database server
CN113032400A (en) * 2021-03-31 2021-06-25 上海天旦网络科技发展有限公司 High-performance TopN query method, system and medium for mass data
CN113268502A (en) * 2020-12-23 2021-08-17 上海右云信息技术有限公司 Method and equipment for providing information

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101751406A (en) * 2008-12-18 2010-06-23 赵伟 Method and device for realizing column storage based relational database
CN102375853A (en) * 2010-08-24 2012-03-14 中国移动通信集团公司 Distributed database system, method for building index therein and query method
CN102521406A (en) * 2011-12-26 2012-06-27 中国科学院计算技术研究所 Distributed query method and system for complex task of querying massive structured data
CN102591970A (en) * 2011-12-31 2012-07-18 北京奇虎科技有限公司 Distributed key-value query method and query engine system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101751406A (en) * 2008-12-18 2010-06-23 赵伟 Method and device for realizing column storage based relational database
CN102375853A (en) * 2010-08-24 2012-03-14 中国移动通信集团公司 Distributed database system, method for building index therein and query method
CN102521406A (en) * 2011-12-26 2012-06-27 中国科学院计算技术研究所 Distributed query method and system for complex task of querying massive structured data
CN102591970A (en) * 2011-12-31 2012-07-18 北京奇虎科技有限公司 Distributed key-value query method and query engine system

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462206A (en) * 2014-10-31 2015-03-25 国云科技股份有限公司 General database sequence generating method
CN105589915A (en) * 2014-11-06 2016-05-18 郑毓融 Database acceleration method through operation index value and hybrid layer cache
CN104462291A (en) * 2014-11-27 2015-03-25 杭州华为数字技术有限公司 Method and device for data processing
CN104462291B (en) * 2014-11-27 2018-01-09 杭州华为数字技术有限公司 A kind of method and device of data processing
CN106445968A (en) * 2015-08-11 2017-02-22 阿里巴巴集团控股有限公司 Data merging method and device
WO2016180123A1 (en) * 2015-09-25 2016-11-17 中兴通讯股份有限公司 Hbase second-level index creation method and device
CN106933206A (en) * 2015-10-09 2017-07-07 费希尔-罗斯蒙特系统公司 The inquiry independently of source in distributed industrial systems
CN106933206B (en) * 2015-10-09 2021-07-06 费希尔-罗斯蒙特系统公司 Source independent queries in distributed industrial systems
CN108170726A (en) * 2015-10-21 2018-06-15 华为技术有限公司 Data query method and apparatus
CN106802900A (en) * 2015-11-26 2017-06-06 北京国双科技有限公司 Search method and device based on star database
CN106845263A (en) * 2015-12-04 2017-06-13 阿里巴巴集团控股有限公司 A kind of method for accessing database, device and electronic equipment
WO2017096939A1 (en) * 2015-12-10 2017-06-15 深圳市华讯方舟软件技术有限公司 Method for establishing index on hdfs-based spark-sql big-data processing system
CN105574093B (en) * 2015-12-10 2019-09-10 深圳市华讯方舟软件技术有限公司 A method of index is established in the spark-sql big data processing system based on HDFS
CN105574093A (en) * 2015-12-10 2016-05-11 深圳市华讯方舟软件技术有限公司 Method for establishing index in HDFS based spark-sql big data processing system
CN105740373A (en) * 2016-01-27 2016-07-06 国网上海市电力公司 Distributed memory based virtual reality platform data query method
CN105740373B (en) * 2016-01-27 2019-11-08 国网上海市电力公司 Virtual Reality Platform data query method based on distributed memory
CN105930407A (en) * 2016-04-18 2016-09-07 北京思特奇信息技术股份有限公司 Cross-database associated query method and system for distributed database
CN105930407B (en) * 2016-04-18 2019-05-17 北京思特奇信息技术股份有限公司 A kind of inter-library relation query method of distributed data base and system
CN106250409A (en) * 2016-07-21 2016-12-21 中国农业银行股份有限公司 Data query method and device
CN106844539A (en) * 2016-12-30 2017-06-13 曙光信息产业(北京)有限公司 Real-time data analysis method and system
CN110019212B (en) * 2017-11-29 2021-06-18 杭州海康威视数字技术股份有限公司 Data processing method and device and database server
CN110019218A (en) * 2017-12-08 2019-07-16 阿里巴巴集团控股有限公司 Data storage and querying method and equipment
CN110019218B (en) * 2017-12-08 2023-08-25 阿里巴巴集团控股有限公司 Data storage and query method and equipment
CN108959457B (en) * 2018-06-15 2020-11-13 北京文创园投资管理有限公司 Method and system for inquiring and verifying certificate
CN108959457A (en) * 2018-06-15 2018-12-07 北京文创园投资管理有限公司 A kind of inquiry of certificate, verification method and system
CN109271409B (en) * 2018-11-08 2021-11-02 成都索贝数码科技股份有限公司 Database fragmentation execution method based on container resource allocation
CN109271409A (en) * 2018-11-08 2019-01-25 成都索贝数码科技股份有限公司 Database fragmentation execution method based on container resource allocation
CN109471863A (en) * 2018-11-12 2019-03-15 北京懿医云科技有限公司 Information query method and device, electronic equipment based on distributed data base
CN112104743A (en) * 2020-09-21 2020-12-18 北京金山云网络技术有限公司 Sequence generation method and device and electronic equipment
CN112104743B (en) * 2020-09-21 2022-08-16 北京金山云网络技术有限公司 Sequence generation method and device and electronic equipment
CN112416925A (en) * 2020-11-02 2021-02-26 浙商银行股份有限公司 Query method based on ordered distributed index structure and distributed database system
CN112416925B (en) * 2020-11-02 2024-04-09 浙商银行股份有限公司 Query method based on ordered distributed index structure and distributed database system
CN112445873A (en) * 2020-12-02 2021-03-05 深圳市镜玩科技有限公司 List display processing method, related device, equipment and medium
CN112445873B (en) * 2020-12-02 2024-03-26 深圳市镜玩科技有限公司 List display processing method, related device, equipment and medium
CN113268502A (en) * 2020-12-23 2021-08-17 上海右云信息技术有限公司 Method and equipment for providing information
CN113032400A (en) * 2021-03-31 2021-06-25 上海天旦网络科技发展有限公司 High-performance TopN query method, system and medium for mass data
CN113032400B (en) * 2021-03-31 2022-11-08 上海天旦网络科技发展有限公司 High-performance TopN query method, system and medium for mass data

Similar Documents

Publication Publication Date Title
CN103631910A (en) Distributed database multi-column composite query system and method
CN102201001B (en) Fast retrieval method based on inverted technology
CN102270232B (en) Semantic data query system with optimized storage
US20120047158A1 (en) Method and system for performing query optimization using a hybrid execution plan
US20050165733A1 (en) System and method for an in-memory roll up-on-the-fly OLAP engine with a relational backing store
Zou et al. Pareto-based dominant graph: An efficient indexing structure to answer top-k queries
He et al. Efficient iceberg query evaluation using compressed bitmap index
CN107491487A (en) A kind of full-text database framework and bitmap index establishment, data query method, server and medium
CN103970902A (en) Method and system for reliable and instant retrieval on situation of large quantities of data
Giannakouris et al. MuSQLE: Distributed SQL query execution over multiple engine environments
CN102222099A (en) Methods and devices for storing and searching data
CN102456055A (en) Method and device for retrieving interest points
CN108268612B (en) Pre-verification method and pre-verification system based on OLAP pre-calculation model
CN101963993B (en) Method for fast searching database sheet table record
CN114064660B (en) Data structured analysis method based on ElasticSearch
CN106484815B (en) A kind of automatic identification optimization method based on mass data class SQL retrieval scene
CN102402540A (en) Numerical value and text mixed inverted index algorithm based on multilayer-optimization balanced tree
CN109299143A (en) The knowledge fast indexing method in the data interoperation knowledge on testing library based on Redis caching
CN103593409A (en) Real-time database retrieval method and real-time database retrieval system
JP2001216307A (en) Relational database management system and storage medium stored with same
Cuzzocrea et al. Multidimensional database design via schema transformation: turning tpc h into the tpc h* d multidimensional benchmark
Zhang et al. Improving performance by creating a native join-index for OLAP
Ni et al. An Efficient Method for Improving Query Efficiency in Data Warehouse.
Sangat et al. Atrie group join: A parallel star group join and aggregation for in-memory column-stores
CN110688386A (en) Distributed column data indexing method for novel power supply rail transit big data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140312