CN116662449B - OLAP query optimization method and system based on broadcast sub-query cache - Google Patents
OLAP query optimization method and system based on broadcast sub-query cache Download PDFInfo
- Publication number
- CN116662449B CN116662449B CN202310704298.3A CN202310704298A CN116662449B CN 116662449 B CN116662449 B CN 116662449B CN 202310704298 A CN202310704298 A CN 202310704298A CN 116662449 B CN116662449 B CN 116662449B
- Authority
- CN
- China
- Prior art keywords
- query
- cache
- sub
- result
- olap
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 26
- 238000005457 optimization Methods 0.000 title claims abstract description 13
- 238000004364 calculation method Methods 0.000 claims description 8
- 238000007405 data analysis Methods 0.000 abstract description 7
- 230000007547 defect Effects 0.000 abstract description 3
- 238000013461 design Methods 0.000 abstract description 3
- 238000007726 management method Methods 0.000 description 5
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/27—Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2453—Query optimisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2453—Query optimisation
- G06F16/24534—Query rewriting; Transformation
- G06F16/24539—Query rewriting; Transformation using cached or materialised query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24552—Database cache management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2474—Sequence data queries, e.g. querying versioned data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/54—Interprogram communication
- G06F9/542—Event management; Broadcasting; Multicasting; Notifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Fuzzy Systems (AREA)
- Mathematical Physics (AREA)
- Probability & Statistics with Applications (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Memory System Of A Hierarchy Structure (AREA)
Abstract
The invention discloses an OLAP query optimization method and system based on broadcast sub-query caching. In a time sequence type data analysis scene, the existing OLAP system query result caching scheme has very low cache hit rate, and the invention provides a new, finer granularity and flexible caching scheme, namely, local results of sub-query operators are cached, so that the design defect of query result caching is avoided, and the method can be directly applied to a distributed big data analysis system; the invention can be directly applied in the cluster environment through a cache broadcasting mechanism, so that the cache of sub-queries can be used at all nodes, the overall cache hit rate is improved, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.
Description
Technical Field
The invention relates to the field of cloud network observable data analysis, in particular to a method and a system for accelerating OLAP query performance.
Background
OLAP (Online Analytical Processing ) is an online analytical processing system, and OLAP is mainly used for querying data. With the continuous development of OLAP, OLAP products are layered endlessly, and most of OLAP systems are based on a ROLAP (relational database online analysis processing) system or a single MOLAP (multidimensional database online analysis processing) system.
With the rapid growth of commercial data volume, the conventional stand-alone OLAP database cannot meet the needs of users, and the modern OLAP system basically adopts a distributed architecture. In a distributed OLAP system, distributed optimization of SQL queries is achieved, thus supporting queries and analysis of massive data.
In modern OLAP systems, the most common SQL query optimization is mainly "query result cache (Query Result Cache)", i.e. a result set for caching a query statement, and if the same query is subsequently performed, the result is directly read from the result set cache without re-execution, so that the query performance is greatly improved.
However, with the increasing data query volume of the time sequence type, due to the design constraint of the query result cache (Query Result Cache), the SQL of each query needs to be completely consistent to hit the cache, but the query of the time sequence data is only partially consistent and unchanged for the inner sub-query, and the SQL time parameter of the outer layer is inconsistent, so that the cache cannot be effectively hit during each query, thereby influencing the query throughput performance of the whole system.
Disclosure of Invention
The invention aims at overcoming the defects of the prior art, and provides an OLAP query optimization method and system based on broadcast sub-query caching, which are used for accelerating the OLAP query performance. Compared with the existing OLAP system query result caching scheme, the method and the device can be directly applied to a distributed big data analysis system by using a more fine-grained and flexible caching algorithm, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capacity.
The invention aims at realizing the following technical scheme:
according to a first aspect of the present specification, there is provided an OLAP query optimization method based on broadcast sub-query caching, the method including the steps of:
S1, when a management node receives an SQL query request aiming at time series data, splitting the SQL query request into sub-queries and final result convergence queries according to operators, and caching the execution results of the sub-query physical plan to all working nodes of an OLAP cluster by using a broadcasting mechanism;
S2, when the working node executes SQL query for time series data, directly performing cache query of a sub-query physical plan locally, and if the cache is hit, directly performing operator execution of the next step; if the cache is not hit, sub-query operator execution is carried out, the execution result of the sub-query physical plan is locally cached, and the execution result is updated to all working nodes of the OLAP cluster through broadcasting.
Further, the management node receives an SQL query request aiming at time series data, firstly analyzes SQL into a logic plan, optimizes the logic plan into a physical plan, and then splits the physical plan into sub-queries and final result convergence queries according to operators.
Further, the working node performs hash operation on the received sub-query physical plan to obtain an identification ID, and attempts to obtain a cache of the sub-query physical plan from a local cache module by using the identification ID; if the cache is obtained, loading the cache result into an execution flow, and entering an operator computing stage of hash connection; otherwise, the sub-query physical plan is executed locally, an execution result is loaded into an execution flow, an operator calculation stage of hash connection is entered, and meanwhile, the identification ID of the sub-query physical plan and the execution result are combined into a cache structure to be written into a local cache module.
Further, the local cache structure of the working node is expressed as HashMap < hashID, result >, wherein hashID is an MD5 calculated value for sub-query physical planning, and Result is a data structure value in the memory of the working node.
Further, when the query process of the working node does not hit the cache, sub-query physical plans are executed in the working node, and the execution Result is written into a local cache structure HashMap < HashID, result >; and simultaneously acquiring PRC addresses of all working nodes from the management node, and broadcasting HashMap < HashID, result > to all working nodes.
According to a second aspect of the present specification, there is provided an OLAP query optimization system based on broadcast sub-query caching, the system comprising the following modules:
Sub-query module: the system comprises an OLAP cluster working node, a cache module and a sub-query module, wherein the OLAP cluster working node is used for extracting a sub-query physical plan, carrying out hash operation on the physical plan to obtain an identification ID, and using the identification ID to try to obtain the cache of the sub-query physical plan from the cache module; if the cache is obtained, loading the cache result into an execution flow, and entering an operator computing stage of hash connection; otherwise, the sub-query physical plan is executed locally, an execution result is loaded into an execution flow, an operator calculation stage of hash is entered, and meanwhile, the identification ID of the sub-query physical plan and the execution result are combined into a cache structure to be written into a cache module;
and a cache module: the sub-query module is deployed at an OLAP cluster working node and provides the capability of cache writing and parallel reading for the sub-query module; for the first write-in cache, a broadcasting module is called to carry out cache broadcasting on all working nodes of the OLAP cluster;
and a broadcasting module: and providing a cache broadcasting service, acquiring PRC addresses of all working nodes from the management node after receiving a cache broadcasting request from the cache module, and broadcasting an execution result of the sub-query physical plan to all the working nodes.
Further, the cache module provides a cache service based on the LRU policy, and the cache structure is expressed as HashMap < HashID, result > where HashID is an MD5 calculated value for the sub-query physical plan, and Result is a data structure value in the memory of the working node.
Further, the system can be deployed on various forms of computing node platforms, including ECS, docker, physical machine environments.
Compared with the prior art, the invention has the following advantages:
Firstly, aiming at the data query of the time sequence type, a new, finer granularity and flexible caching algorithm is provided, the design defect of query result caching (Query Result Cache) is avoided, the method can be directly applied to a distributed big data analysis system, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.
Secondly, the method can be directly applied in a cluster environment through a cache broadcasting mechanism, so that the cache of sub-queries is available at all nodes, the overall cache hit rate is improved, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.
Drawings
FIG. 1 is a flowchart of an OLAP query optimization method based on broadcast sub-query caching according to an embodiment of the present invention;
Fig. 2 is a block diagram of an OLAP query optimization system based on broadcast sub-query caching according to an embodiment of the present invention.
Detailed Description
In order that the above objects, features and advantages of the invention will be readily understood, a more particular description of the invention will be rendered by reference to the appended drawings.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, but the present invention may be practiced in other ways other than those described herein, and persons skilled in the art will readily appreciate that the present invention is not limited to the specific embodiments disclosed below.
As shown in fig. 1, the method for optimizing OLAP query based on broadcast sub-query caching according to the embodiment of the present application may be specifically implemented in three steps.
(1) A new caching scheme is designed, and local results of sub-query operators are cached instead of SQL query results.
The caching scheme is specifically implemented in a working node Worker of the OLAP cluster. The Worker node is responsible for the execution of the physical schema PHYSICAL PLAN of SQL. The traditional SQL query result caching (Query Result Cache) is to cache the SQL query result at the Proxy layer Proxy, and the cache hit rate is very low in a time sequence type data analysis scene; the scheme is used for carrying out local result caching on the operator layer of PHYSICAL PLAN of SQL, is finer and more flexible, can greatly improve the cache hit rate in a time sequence type data analysis scene, and fully utilizes the distributed cluster capability to accelerate the query performance of the OLAP.
(2) When the management node Master receives the SQL query request aiming at the time series data, the SQL is analyzed into a logic plan logical plan, the logical plan is optimized to PHYSICAL PLAN, and PHYSICAL PLAN is split into sub-queries according to operators, and the final results are gathered. The execution result of the sub-query PHYSICAL PLAN is cached in each Worker node of the distributed system by using a broadcasting mechanism.
The operator split, mainly for PHYSICAL PLAN splits, sub-queries are typically optimized as hash connections HashJoin. The Worker node performs hash operation on the received sub-query PHYSICAL PLAN to obtain an identification ID, and attempts to obtain a cache of the sub-query PHYSICAL PLAN from a local cache module of the Worker node by using the identification ID; if the cache is obtained, loading the cache result into an execution flow Pipeline, and entering an operator calculation stage of HashJoin; if no cache is obtained, the sub-query PHYSICAL PLAN is executed locally, the execution Result is loaded into Pipeline, the operator calculation stage of HashJoin is entered, and meanwhile, the identification ID of the sub-query PHYSICAL PLAN and the execution Result are combined into a cache structure HashMap < HashID, and Result > is written into a local cache module of the Worker node.
In the cache structure HashMap < HashID, result > is the MD5 calculated value for the sub-query PHYSICAL PLAN, and Result is the data structure value in the memory of the workbench node.
(3) When the Worker node executes SQL query for time series data, the cache query of sub query PHYSICAL PLAN is directly carried out locally, and if the cache is hit, the operator execution of the next step is directly carried out; if the cache is not hit, sub-query operator execution is performed, the execution result of the sub-query PHYSICAL PLAN is locally cached, and the execution result is updated to all the workbench nodes of the OLAP cluster through broadcasting.
When the query process does not hit the cache, sub-query PHYSICAL PLAN is executed at the Worker node, and the Result after the execution is written into a local cache structure HashMap < HashID, result >; meanwhile, all the Worker node PRC addresses are acquired from the Master node, and HashMap < HashID, result > is broadcasted to all the Worker nodes. By the method, the sub-query cache is available at all nodes, the overall cache hit rate is improved, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.
As shown in fig. 2, an OLAP query optimization system based on broadcast sub-query caching according to an embodiment of the present application includes the following modules:
Sub-query module: the method comprises the steps of deploying at a workbench node, extracting PHYSICAL PLAN of sub-queries, carrying out hash operation on PHYSICAL PLAN, obtaining an identification ID, and using the identification ID to try to obtain the cache of the sub-queries PHYSICAL PLAN from a cache module; if the cache is obtained, loading the cache result into a Pipeline, and entering an operator calculation stage of HashJoin; if no cache is obtained, the sub-query PHYSICAL PLAN is executed locally, the execution Result is loaded into Pipeline, the operator calculation stage of HashJoin is entered, and meanwhile, the identification ID of the sub-query PHYSICAL PLAN and the execution Result are combined into a cache structure HashMap < HashID, and Result > is written into the cache module.
And a cache module: the system is deployed at a workbench node, provides cache service based on an LRU strategy, and provides cache writing and parallel reading capabilities for a sub-query module. The core is a hash table HashTable, and the read-write lock is used for controlling, and the cache structure is HashMap < HashID, result >. And for the first write-in cache, a broadcasting module is called to broadcast the cache of all the workbench nodes of the OLAP cluster.
And a broadcasting module: and after receiving the cache broadcasting request from the cache module, the cache broadcasting service is provided, all the PRC addresses of the workbench nodes are obtained from the Master node, and the HashMap < HashID, result > is broadcasted to all the workbench nodes. By the method, the sub-query cache is available at all nodes, the overall cache hit rate is improved, and the query performance of the OLAP is accelerated by fully utilizing the distributed cluster capability.
The embodiment of the application realizes a prototype system on the ECS platform based on the Arian cloud and tests the effect of the method. However, according to different hardware devices used by the computing node, the embodiment of the application can be popularized and deployed to platforms such as a physical machine and a Docker.
The foregoing is merely a preferred embodiment of the present invention, and the present invention has been disclosed in the above description of the preferred embodiment, but is not limited thereto. Any person skilled in the art can make many possible variations and modifications to the technical solution of the present invention or modifications to equivalent embodiments using the methods and technical contents disclosed above, without departing from the scope of the technical solution of the present invention. Therefore, any simple modification, equivalent variation and modification of the above embodiments according to the technical substance of the present invention still fall within the scope of the technical solution of the present invention.
Claims (6)
1. An OLAP query optimization method based on broadcast sub-query caching, comprising the steps of:
s1, when a management node receives an SQL query request aiming at time series data, firstly analyzing SQL into a logic plan, optimizing the logic plan into a physical plan, splitting the physical plan into sub-queries and final result convergence queries according to operators, and caching the execution results of the sub-query physical plan to all working nodes of an OLAP cluster by using a broadcasting mechanism;
The working node performs hash operation on the received sub-query physical plan to obtain an identification ID, and attempts to obtain a cache of the sub-query physical plan from a local cache module by using the identification ID; if the cache is obtained, loading the cache result into an execution flow, and entering an operator computing stage of hash connection; otherwise, the sub-query physical plan is executed locally, an execution result is loaded into an execution flow, an operator calculation stage of hash connection is entered, and meanwhile, the identification ID of the sub-query physical plan and the execution result are combined into a cache structure to be written into a local cache module;
S2, when the working node executes SQL query for time series data, directly performing cache query of a sub-query physical plan locally, and if the cache is hit, directly performing operator execution of the next step; if the cache is not hit, sub-query operator execution is carried out, the execution result of the sub-query physical plan is locally cached, and the execution result is updated to all working nodes of the OLAP cluster through broadcasting.
2. The method of claim 1, wherein the local cache structure of the working node is expressed as HashMap < hashd, result >, wherein hashd is an MD5 calculated value for the sub-query physical plan, and Result is a data structure value in the working node memory.
3. The method according to claim 2, wherein the sub-query physical plan is executed at the working node when the query process of the working node does not hit the cache, and the execution Result is written into a local cache structure HashMap < hashd, result >; and simultaneously acquiring PRC addresses of all working nodes from the management node, and broadcasting HashMap < HashID, result > to all working nodes.
4. An OLAP query optimization system implemented using the method of any one of claims 1-3, comprising:
Sub-query module: the method comprises the steps of being deployed at an OLAP cluster working node and used for extracting sub-query physical plans, carrying out hash operation on the sub-query physical plans to obtain identification IDs, and attempting to obtain caches of the sub-query physical plans from a cache module by using the identification IDs; if the cache is obtained, loading the cache result into an execution flow, and entering an operator computing stage of hash connection; otherwise, the sub-query physical plan is executed locally, an execution result is loaded into an execution flow, an operator calculation stage of hash is entered, and meanwhile, the identification ID of the sub-query physical plan and the execution result are combined into a cache structure to be written into a cache module;
and a cache module: the sub-query module is deployed at an OLAP cluster working node and provides the capability of cache writing and parallel reading for the sub-query module; for the first write-in cache, a broadcasting module is called to carry out cache broadcasting on all working nodes of the OLAP cluster;
and a broadcasting module: and providing a cache broadcasting service, acquiring PRC addresses of all working nodes from the management node after receiving a cache broadcasting request from the cache module, and broadcasting an execution result of the sub-query physical plan to all the working nodes.
5. The system of claim 4, wherein the caching module provides a cache service based on an LRU policy, and the cache structure is expressed as HashMap < hashd, result >, where hashd is an MD5 calculated value for a sub-query physical plan, and Result is a data structure value in a working node memory.
6. The system of claim 4, wherein the system is capable of being deployed on various forms of computing node platforms, including ECS, docker, physical machine environments.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310704298.3A CN116662449B (en) | 2023-06-14 | 2023-06-14 | OLAP query optimization method and system based on broadcast sub-query cache |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310704298.3A CN116662449B (en) | 2023-06-14 | 2023-06-14 | OLAP query optimization method and system based on broadcast sub-query cache |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116662449A CN116662449A (en) | 2023-08-29 |
CN116662449B true CN116662449B (en) | 2024-06-04 |
Family
ID=87713596
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310704298.3A Active CN116662449B (en) | 2023-06-14 | 2023-06-14 | OLAP query optimization method and system based on broadcast sub-query cache |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116662449B (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103324724A (en) * | 2013-06-26 | 2013-09-25 | 华为技术有限公司 | Method and device for processing data |
CN105426467A (en) * | 2015-11-16 | 2016-03-23 | 北京京东尚科信息技术有限公司 | SQL query method and system for Presto |
CN114329155A (en) * | 2021-12-30 | 2022-04-12 | 北京诺司时空科技有限公司 | Multi-mode storage caching system containing time sequence database and query method |
CN114328598A (en) * | 2021-11-29 | 2022-04-12 | 浪潮云信息技术股份公司 | Cache optimization method and system for pipeline based on ClickHouse database |
US11308106B1 (en) * | 2018-05-21 | 2022-04-19 | Amazon Technologies, Inc. | Caching results for sub-queries to different data store locations |
WO2022079856A1 (en) * | 2020-10-15 | 2022-04-21 | 日本電信電話株式会社 | Data processing device, data processing method, and data processing program |
CN115221186A (en) * | 2022-06-09 | 2022-10-21 | 网易(杭州)网络有限公司 | Data query method, system and device and electronic equipment |
CN115587114A (en) * | 2022-09-23 | 2023-01-10 | 阿里云计算有限公司 | System and query method |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8392403B2 (en) * | 2009-09-18 | 2013-03-05 | Microsoft Corporation | Management of data and computation in data centers |
-
2023
- 2023-06-14 CN CN202310704298.3A patent/CN116662449B/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103324724A (en) * | 2013-06-26 | 2013-09-25 | 华为技术有限公司 | Method and device for processing data |
CN105426467A (en) * | 2015-11-16 | 2016-03-23 | 北京京东尚科信息技术有限公司 | SQL query method and system for Presto |
US11308106B1 (en) * | 2018-05-21 | 2022-04-19 | Amazon Technologies, Inc. | Caching results for sub-queries to different data store locations |
WO2022079856A1 (en) * | 2020-10-15 | 2022-04-21 | 日本電信電話株式会社 | Data processing device, data processing method, and data processing program |
CN114328598A (en) * | 2021-11-29 | 2022-04-12 | 浪潮云信息技术股份公司 | Cache optimization method and system for pipeline based on ClickHouse database |
CN114329155A (en) * | 2021-12-30 | 2022-04-12 | 北京诺司时空科技有限公司 | Multi-mode storage caching system containing time sequence database and query method |
CN115221186A (en) * | 2022-06-09 | 2022-10-21 | 网易(杭州)网络有限公司 | Data query method, system and device and electronic equipment |
CN115587114A (en) * | 2022-09-23 | 2023-01-10 | 阿里云计算有限公司 | System and query method |
Also Published As
Publication number | Publication date |
---|---|
CN116662449A (en) | 2023-08-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11971890B2 (en) | Database management system for optimizing queries via multiple optimizers | |
Zheng et al. | DistDGL: Distributed graph neural network training for billion-scale graphs | |
US10572484B2 (en) | Duplicate reduction or elimination with hash join operations | |
US9990398B2 (en) | Inferring dimensional metadata from content of a query | |
US10642831B2 (en) | Static data caching for queries with a clause that requires multiple iterations to execute | |
CN111159252B (en) | Transaction execution method and device, computer equipment and storage medium | |
US20180232417A1 (en) | Techniques for evaluating query predicates during in-memory table scans | |
US7054852B1 (en) | Performance of join operations in parallel database systems | |
KR100745883B1 (en) | A transparent edge-of-network data cache | |
US5812996A (en) | Database system with methods for optimizing query performance with a buffer manager | |
US7814104B2 (en) | Techniques for partition pruning | |
US20030088579A1 (en) | Collecting statistics in a database system | |
US20170249358A1 (en) | System and Method for Parallel Optimization of Database Query using Cluster Cache | |
US11567934B2 (en) | Consistent client-side caching for fine grained invalidations | |
US7949655B2 (en) | Joins of relations in an object relational database system | |
US11487762B2 (en) | Workload aware data partitioning | |
KR20200092095A (en) | Transaction control method to synchronize DML statements in relational database to NoSQL database | |
CN116662449B (en) | OLAP query optimization method and system based on broadcast sub-query cache | |
US10853253B2 (en) | Method and systems for master establishment using service-based statistics | |
CN118410068B (en) | Database SQL query optimization method, terminal and storage medium | |
Chen | Efficient partitioner for distributed OLTP DBMS | |
Basu et al. | Centralized versus distributed index schemes in OODBMS-a performance analysis | |
Barlos et al. | A load balanced multicomputer relational database system for highly skewed data | |
Ali | Evolution of Database Emerging to Sybase Adaptive Server Enterprise and Ensuring Better Server Performance Tuning and Query Optimization | |
CN113312271A (en) | Radio frequency assembly automatic test and diagnosis platform based on big data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |