CN116662449B - OLAP query optimization method and system based on broadcast sub-query cache - Google Patents

OLAP query optimization method and system based on broadcast sub-query cache Download PDF

Info

Publication number
CN116662449B
CN116662449B CN202310704298.3A CN202310704298A CN116662449B CN 116662449 B CN116662449 B CN 116662449B CN 202310704298 A CN202310704298 A CN 202310704298A CN 116662449 B CN116662449 B CN 116662449B
Authority
CN
China
Prior art keywords
query
cache
sub
result
olap
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310704298.3A
Other languages
Chinese (zh)
Other versions
CN116662449A (en
Inventor
吕彪
程鹏
戚依宁
方崇荣
陈积明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN202310704298.3A priority Critical patent/CN116662449B/en
Publication of CN116662449A publication Critical patent/CN116662449A/en
Application granted granted Critical
Publication of CN116662449B publication Critical patent/CN116662449B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • G06F16/24534Query rewriting; Transformation
    • G06F16/24539Query rewriting; Transformation using cached or materialised query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24552Database cache management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/542Event management; Broadcasting; Multicasting; Notifications
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Memory System Of A Hierarchy Structure (AREA)

Abstract

The invention discloses an OLAP query optimization method and system based on broadcast sub-query caching. In a time sequence type data analysis scene, the existing OLAP system query result caching scheme has very low cache hit rate, and the invention provides a new, finer granularity and flexible caching scheme, namely, local results of sub-query operators are cached, so that the design defect of query result caching is avoided, and the method can be directly applied to a distributed big data analysis system; the invention can be directly applied in the cluster environment through a cache broadcasting mechanism, so that the cache of sub-queries can be used at all nodes, the overall cache hit rate is improved, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.

Description

OLAP query optimization method and system based on broadcast sub-query cache
Technical Field
The invention relates to the field of cloud network observable data analysis, in particular to a method and a system for accelerating OLAP query performance.
Background
OLAP (Online Analytical Processing ) is an online analytical processing system, and OLAP is mainly used for querying data. With the continuous development of OLAP, OLAP products are layered endlessly, and most of OLAP systems are based on a ROLAP (relational database online analysis processing) system or a single MOLAP (multidimensional database online analysis processing) system.
With the rapid growth of commercial data volume, the conventional stand-alone OLAP database cannot meet the needs of users, and the modern OLAP system basically adopts a distributed architecture. In a distributed OLAP system, distributed optimization of SQL queries is achieved, thus supporting queries and analysis of massive data.
In modern OLAP systems, the most common SQL query optimization is mainly "query result cache (Query Result Cache)", i.e. a result set for caching a query statement, and if the same query is subsequently performed, the result is directly read from the result set cache without re-execution, so that the query performance is greatly improved.
However, with the increasing data query volume of the time sequence type, due to the design constraint of the query result cache (Query Result Cache), the SQL of each query needs to be completely consistent to hit the cache, but the query of the time sequence data is only partially consistent and unchanged for the inner sub-query, and the SQL time parameter of the outer layer is inconsistent, so that the cache cannot be effectively hit during each query, thereby influencing the query throughput performance of the whole system.
Disclosure of Invention
The invention aims at overcoming the defects of the prior art, and provides an OLAP query optimization method and system based on broadcast sub-query caching, which are used for accelerating the OLAP query performance. Compared with the existing OLAP system query result caching scheme, the method and the device can be directly applied to a distributed big data analysis system by using a more fine-grained and flexible caching algorithm, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capacity.
The invention aims at realizing the following technical scheme:
according to a first aspect of the present specification, there is provided an OLAP query optimization method based on broadcast sub-query caching, the method including the steps of:
S1, when a management node receives an SQL query request aiming at time series data, splitting the SQL query request into sub-queries and final result convergence queries according to operators, and caching the execution results of the sub-query physical plan to all working nodes of an OLAP cluster by using a broadcasting mechanism;
S2, when the working node executes SQL query for time series data, directly performing cache query of a sub-query physical plan locally, and if the cache is hit, directly performing operator execution of the next step; if the cache is not hit, sub-query operator execution is carried out, the execution result of the sub-query physical plan is locally cached, and the execution result is updated to all working nodes of the OLAP cluster through broadcasting.
Further, the management node receives an SQL query request aiming at time series data, firstly analyzes SQL into a logic plan, optimizes the logic plan into a physical plan, and then splits the physical plan into sub-queries and final result convergence queries according to operators.
Further, the working node performs hash operation on the received sub-query physical plan to obtain an identification ID, and attempts to obtain a cache of the sub-query physical plan from a local cache module by using the identification ID; if the cache is obtained, loading the cache result into an execution flow, and entering an operator computing stage of hash connection; otherwise, the sub-query physical plan is executed locally, an execution result is loaded into an execution flow, an operator calculation stage of hash connection is entered, and meanwhile, the identification ID of the sub-query physical plan and the execution result are combined into a cache structure to be written into a local cache module.
Further, the local cache structure of the working node is expressed as HashMap < hashID, result >, wherein hashID is an MD5 calculated value for sub-query physical planning, and Result is a data structure value in the memory of the working node.
Further, when the query process of the working node does not hit the cache, sub-query physical plans are executed in the working node, and the execution Result is written into a local cache structure HashMap < HashID, result >; and simultaneously acquiring PRC addresses of all working nodes from the management node, and broadcasting HashMap < HashID, result > to all working nodes.
According to a second aspect of the present specification, there is provided an OLAP query optimization system based on broadcast sub-query caching, the system comprising the following modules:
Sub-query module: the system comprises an OLAP cluster working node, a cache module and a sub-query module, wherein the OLAP cluster working node is used for extracting a sub-query physical plan, carrying out hash operation on the physical plan to obtain an identification ID, and using the identification ID to try to obtain the cache of the sub-query physical plan from the cache module; if the cache is obtained, loading the cache result into an execution flow, and entering an operator computing stage of hash connection; otherwise, the sub-query physical plan is executed locally, an execution result is loaded into an execution flow, an operator calculation stage of hash is entered, and meanwhile, the identification ID of the sub-query physical plan and the execution result are combined into a cache structure to be written into a cache module;
and a cache module: the sub-query module is deployed at an OLAP cluster working node and provides the capability of cache writing and parallel reading for the sub-query module; for the first write-in cache, a broadcasting module is called to carry out cache broadcasting on all working nodes of the OLAP cluster;
and a broadcasting module: and providing a cache broadcasting service, acquiring PRC addresses of all working nodes from the management node after receiving a cache broadcasting request from the cache module, and broadcasting an execution result of the sub-query physical plan to all the working nodes.
Further, the cache module provides a cache service based on the LRU policy, and the cache structure is expressed as HashMap < HashID, result > where HashID is an MD5 calculated value for the sub-query physical plan, and Result is a data structure value in the memory of the working node.
Further, the system can be deployed on various forms of computing node platforms, including ECS, docker, physical machine environments.
Compared with the prior art, the invention has the following advantages:
Firstly, aiming at the data query of the time sequence type, a new, finer granularity and flexible caching algorithm is provided, the design defect of query result caching (Query Result Cache) is avoided, the method can be directly applied to a distributed big data analysis system, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.
Secondly, the method can be directly applied in a cluster environment through a cache broadcasting mechanism, so that the cache of sub-queries is available at all nodes, the overall cache hit rate is improved, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.
Drawings
FIG. 1 is a flowchart of an OLAP query optimization method based on broadcast sub-query caching according to an embodiment of the present invention;
Fig. 2 is a block diagram of an OLAP query optimization system based on broadcast sub-query caching according to an embodiment of the present invention.
Detailed Description
In order that the above objects, features and advantages of the invention will be readily understood, a more particular description of the invention will be rendered by reference to the appended drawings.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, but the present invention may be practiced in other ways other than those described herein, and persons skilled in the art will readily appreciate that the present invention is not limited to the specific embodiments disclosed below.
As shown in fig. 1, the method for optimizing OLAP query based on broadcast sub-query caching according to the embodiment of the present application may be specifically implemented in three steps.
(1) A new caching scheme is designed, and local results of sub-query operators are cached instead of SQL query results.
The caching scheme is specifically implemented in a working node Worker of the OLAP cluster. The Worker node is responsible for the execution of the physical schema PHYSICAL PLAN of SQL. The traditional SQL query result caching (Query Result Cache) is to cache the SQL query result at the Proxy layer Proxy, and the cache hit rate is very low in a time sequence type data analysis scene; the scheme is used for carrying out local result caching on the operator layer of PHYSICAL PLAN of SQL, is finer and more flexible, can greatly improve the cache hit rate in a time sequence type data analysis scene, and fully utilizes the distributed cluster capability to accelerate the query performance of the OLAP.
(2) When the management node Master receives the SQL query request aiming at the time series data, the SQL is analyzed into a logic plan logical plan, the logical plan is optimized to PHYSICAL PLAN, and PHYSICAL PLAN is split into sub-queries according to operators, and the final results are gathered. The execution result of the sub-query PHYSICAL PLAN is cached in each Worker node of the distributed system by using a broadcasting mechanism.
The operator split, mainly for PHYSICAL PLAN splits, sub-queries are typically optimized as hash connections HashJoin. The Worker node performs hash operation on the received sub-query PHYSICAL PLAN to obtain an identification ID, and attempts to obtain a cache of the sub-query PHYSICAL PLAN from a local cache module of the Worker node by using the identification ID; if the cache is obtained, loading the cache result into an execution flow Pipeline, and entering an operator calculation stage of HashJoin; if no cache is obtained, the sub-query PHYSICAL PLAN is executed locally, the execution Result is loaded into Pipeline, the operator calculation stage of HashJoin is entered, and meanwhile, the identification ID of the sub-query PHYSICAL PLAN and the execution Result are combined into a cache structure HashMap < HashID, and Result > is written into a local cache module of the Worker node.
In the cache structure HashMap < HashID, result > is the MD5 calculated value for the sub-query PHYSICAL PLAN, and Result is the data structure value in the memory of the workbench node.
(3) When the Worker node executes SQL query for time series data, the cache query of sub query PHYSICAL PLAN is directly carried out locally, and if the cache is hit, the operator execution of the next step is directly carried out; if the cache is not hit, sub-query operator execution is performed, the execution result of the sub-query PHYSICAL PLAN is locally cached, and the execution result is updated to all the workbench nodes of the OLAP cluster through broadcasting.
When the query process does not hit the cache, sub-query PHYSICAL PLAN is executed at the Worker node, and the Result after the execution is written into a local cache structure HashMap < HashID, result >; meanwhile, all the Worker node PRC addresses are acquired from the Master node, and HashMap < HashID, result > is broadcasted to all the Worker nodes. By the method, the sub-query cache is available at all nodes, the overall cache hit rate is improved, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.
As shown in fig. 2, an OLAP query optimization system based on broadcast sub-query caching according to an embodiment of the present application includes the following modules:
Sub-query module: the method comprises the steps of deploying at a workbench node, extracting PHYSICAL PLAN of sub-queries, carrying out hash operation on PHYSICAL PLAN, obtaining an identification ID, and using the identification ID to try to obtain the cache of the sub-queries PHYSICAL PLAN from a cache module; if the cache is obtained, loading the cache result into a Pipeline, and entering an operator calculation stage of HashJoin; if no cache is obtained, the sub-query PHYSICAL PLAN is executed locally, the execution Result is loaded into Pipeline, the operator calculation stage of HashJoin is entered, and meanwhile, the identification ID of the sub-query PHYSICAL PLAN and the execution Result are combined into a cache structure HashMap < HashID, and Result > is written into the cache module.
And a cache module: the system is deployed at a workbench node, provides cache service based on an LRU strategy, and provides cache writing and parallel reading capabilities for a sub-query module. The core is a hash table HashTable, and the read-write lock is used for controlling, and the cache structure is HashMap < HashID, result >. And for the first write-in cache, a broadcasting module is called to broadcast the cache of all the workbench nodes of the OLAP cluster.
And a broadcasting module: and after receiving the cache broadcasting request from the cache module, the cache broadcasting service is provided, all the PRC addresses of the workbench nodes are obtained from the Master node, and the HashMap < HashID, result > is broadcasted to all the workbench nodes. By the method, the sub-query cache is available at all nodes, the overall cache hit rate is improved, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.
The embodiment of the application realizes a prototype system on the ECS platform based on the Arian cloud and tests the effect of the method. However, according to different hardware devices used by the computing node, the embodiment of the application can be popularized and deployed to platforms such as a physical machine and a Docker.
The foregoing is merely a preferred embodiment of the present invention, and the present invention has been disclosed in the above description of the preferred embodiment, but is not limited thereto. Any person skilled in the art can make many possible variations and modifications to the technical solution of the present invention or modifications to equivalent embodiments using the methods and technical contents disclosed above, without departing from the scope of the technical solution of the present invention. Therefore, any simple modification, equivalent variation and modification of the above embodiments according to the technical substance of the present invention still fall within the scope of the technical solution of the present invention.

Claims (6)

1. An OLAP query optimization method based on broadcast sub-query caching, comprising the steps of:
s1, when a management node receives an SQL query request aiming at time series data, firstly analyzing SQL into a logic plan, optimizing the logic plan into a physical plan, splitting the physical plan into sub-queries and final result convergence queries according to operators, and caching the execution results of the sub-query physical plan to all working nodes of an OLAP cluster by using a broadcasting mechanism;
The working node performs hash operation on the received sub-query physical plan to obtain an identification ID, and attempts to obtain a cache of the sub-query physical plan from a local cache module by using the identification ID; if the cache is obtained, loading the cache result into an execution flow, and entering an operator computing stage of hash connection; otherwise, the sub-query physical plan is executed locally, an execution result is loaded into an execution flow, an operator calculation stage of hash connection is entered, and meanwhile, the identification ID of the sub-query physical plan and the execution result are combined into a cache structure to be written into a local cache module;
S2, when the working node executes SQL query for time series data, directly performing cache query of a sub-query physical plan locally, and if the cache is hit, directly performing operator execution of the next step; if the cache is not hit, sub-query operator execution is carried out, the execution result of the sub-query physical plan is locally cached, and the execution result is updated to all working nodes of the OLAP cluster through broadcasting.
2. The method of claim 1, wherein the local cache structure of the working node is expressed as HashMap < hashd, result >, wherein hashd is an MD5 calculated value for the sub-query physical plan, and Result is a data structure value in the working node memory.
3. The method according to claim 2, wherein the sub-query physical plan is executed at the working node when the query process of the working node does not hit the cache, and the execution Result is written into a local cache structure HashMap < hashd, result >; and simultaneously acquiring PRC addresses of all working nodes from the management node, and broadcasting HashMap < HashID, result > to all working nodes.
4. An OLAP query optimization system implemented using the method of any one of claims 1-3, comprising:
Sub-query module: the method comprises the steps of being deployed at an OLAP cluster working node and used for extracting sub-query physical plans, carrying out hash operation on the sub-query physical plans to obtain identification IDs, and attempting to obtain caches of the sub-query physical plans from a cache module by using the identification IDs; if the cache is obtained, loading the cache result into an execution flow, and entering an operator computing stage of hash connection; otherwise, the sub-query physical plan is executed locally, an execution result is loaded into an execution flow, an operator calculation stage of hash is entered, and meanwhile, the identification ID of the sub-query physical plan and the execution result are combined into a cache structure to be written into a cache module;
and a cache module: the sub-query module is deployed at an OLAP cluster working node and provides the capability of cache writing and parallel reading for the sub-query module; for the first write-in cache, a broadcasting module is called to carry out cache broadcasting on all working nodes of the OLAP cluster;
and a broadcasting module: and providing a cache broadcasting service, acquiring PRC addresses of all working nodes from the management node after receiving a cache broadcasting request from the cache module, and broadcasting an execution result of the sub-query physical plan to all the working nodes.
5. The system of claim 4, wherein the caching module provides a cache service based on an LRU policy, and the cache structure is expressed as HashMap < hashd, result >, where hashd is an MD5 calculated value for a sub-query physical plan, and Result is a data structure value in a working node memory.
6. The system of claim 4, wherein the system is capable of being deployed on various forms of computing node platforms, including ECS, docker, physical machine environments.
CN202310704298.3A 2023-06-14 2023-06-14 OLAP query optimization method and system based on broadcast sub-query cache Active CN116662449B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310704298.3A CN116662449B (en) 2023-06-14 2023-06-14 OLAP query optimization method and system based on broadcast sub-query cache

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310704298.3A CN116662449B (en) 2023-06-14 2023-06-14 OLAP query optimization method and system based on broadcast sub-query cache

Publications (2)

Publication Number Publication Date
CN116662449A CN116662449A (en) 2023-08-29
CN116662449B true CN116662449B (en) 2024-06-04

Family

ID=87713596

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310704298.3A Active CN116662449B (en) 2023-06-14 2023-06-14 OLAP query optimization method and system based on broadcast sub-query cache

Country Status (1)

Country Link
CN (1) CN116662449B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103324724A (en) * 2013-06-26 2013-09-25 华为技术有限公司 Method and device for processing data
CN105426467A (en) * 2015-11-16 2016-03-23 北京京东尚科信息技术有限公司 SQL query method and system for Presto
CN114329155A (en) * 2021-12-30 2022-04-12 北京诺司时空科技有限公司 Multi-mode storage caching system containing time sequence database and query method
CN114328598A (en) * 2021-11-29 2022-04-12 浪潮云信息技术股份公司 Cache optimization method and system for pipeline based on ClickHouse database
US11308106B1 (en) * 2018-05-21 2022-04-19 Amazon Technologies, Inc. Caching results for sub-queries to different data store locations
WO2022079856A1 (en) * 2020-10-15 2022-04-21 日本電信電話株式会社 Data processing device, data processing method, and data processing program
CN115221186A (en) * 2022-06-09 2022-10-21 网易(杭州)网络有限公司 Data query method, system and device and electronic equipment
CN115587114A (en) * 2022-09-23 2023-01-10 阿里云计算有限公司 System and query method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8392403B2 (en) * 2009-09-18 2013-03-05 Microsoft Corporation Management of data and computation in data centers

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103324724A (en) * 2013-06-26 2013-09-25 华为技术有限公司 Method and device for processing data
CN105426467A (en) * 2015-11-16 2016-03-23 北京京东尚科信息技术有限公司 SQL query method and system for Presto
US11308106B1 (en) * 2018-05-21 2022-04-19 Amazon Technologies, Inc. Caching results for sub-queries to different data store locations
WO2022079856A1 (en) * 2020-10-15 2022-04-21 日本電信電話株式会社 Data processing device, data processing method, and data processing program
CN114328598A (en) * 2021-11-29 2022-04-12 浪潮云信息技术股份公司 Cache optimization method and system for pipeline based on ClickHouse database
CN114329155A (en) * 2021-12-30 2022-04-12 北京诺司时空科技有限公司 Multi-mode storage caching system containing time sequence database and query method
CN115221186A (en) * 2022-06-09 2022-10-21 网易(杭州)网络有限公司 Data query method, system and device and electronic equipment
CN115587114A (en) * 2022-09-23 2023-01-10 阿里云计算有限公司 System and query method

Also Published As

Publication number Publication date
CN116662449A (en) 2023-08-29

Similar Documents

Publication Publication Date Title
US11971890B2 (en) Database management system for optimizing queries via multiple optimizers
Zheng et al. DistDGL: Distributed graph neural network training for billion-scale graphs
US10572484B2 (en) Duplicate reduction or elimination with hash join operations
US9990398B2 (en) Inferring dimensional metadata from content of a query
US10642831B2 (en) Static data caching for queries with a clause that requires multiple iterations to execute
CN111159252B (en) Transaction execution method and device, computer equipment and storage medium
US20180232417A1 (en) Techniques for evaluating query predicates during in-memory table scans
US7054852B1 (en) Performance of join operations in parallel database systems
KR100745883B1 (en) A transparent edge-of-network data cache
US5812996A (en) Database system with methods for optimizing query performance with a buffer manager
US7814104B2 (en) Techniques for partition pruning
US20030088579A1 (en) Collecting statistics in a database system
US20170249358A1 (en) System and Method for Parallel Optimization of Database Query using Cluster Cache
US11567934B2 (en) Consistent client-side caching for fine grained invalidations
US7949655B2 (en) Joins of relations in an object relational database system
US11487762B2 (en) Workload aware data partitioning
KR20200092095A (en) Transaction control method to synchronize DML statements in relational database to NoSQL database
CN116662449B (en) OLAP query optimization method and system based on broadcast sub-query cache
US10853253B2 (en) Method and systems for master establishment using service-based statistics
CN118410068B (en) Database SQL query optimization method, terminal and storage medium
Chen Efficient partitioner for distributed OLTP DBMS
Basu et al. Centralized versus distributed index schemes in OODBMS-a performance analysis
Barlos et al. A load balanced multicomputer relational database system for highly skewed data
Ali Evolution of Database Emerging to Sybase Adaptive Server Enterprise and Ensuring Better Server Performance Tuning and Query Optimization
CN113312271A (en) Radio frequency assembly automatic test and diagnosis platform based on big data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant