CN112328578A - Database query optimization method based on reinforcement learning and graph attention network - Google Patents

Database query optimization method based on reinforcement learning and graph attention network Download PDF

Info

Publication number
CN112328578A
CN112328578A CN202011351761.3A CN202011351761A CN112328578A CN 112328578 A CN112328578 A CN 112328578A CN 202011351761 A CN202011351761 A CN 202011351761A CN 112328578 A CN112328578 A CN 112328578A
Authority
CN
China
Prior art keywords
node
query
graph
bits
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011351761.3A
Other languages
Chinese (zh)
Other versions
CN112328578B (en
Inventor
詹思瑜
周维清
王玉林
卢国明
戴波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN202011351761.3A priority Critical patent/CN112328578B/en
Publication of CN112328578A publication Critical patent/CN112328578A/en
Application granted granted Critical
Publication of CN112328578B publication Critical patent/CN112328578B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/217Database tuning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2246Trees, e.g. B+trees
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/2433Query languages
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the technical field of databases, provides a database query optimization method based on reinforcement learning and a graph attention network, and aims to solve the technical problems that when the connection relation of the existing query sentences is very complex, the query execution plan space is very large, and a large amount of time is consumed for searching the whole query execution space. Randomly generating query statements in a database and executing the query statements, splitting an execution plan tree corresponding to the query statements from a root node, and recording the connection relation of each node; initializing Q-network parameters w in a DQN model, wherein the Q-network in the DQN model adopts a GAT (generic object model) graph attention network, takes a coding feature matrix and a graph description set Edge as network input, and trains the DQN model; and (3) initializing the graph description and the code of a query statement, and generating a connection relation by using the DQN model obtained by training in the step 2 until all tables are connected to generate a complete query plan.

Description

Database query optimization method based on reinforcement learning and graph attention network
Technical Field
The invention relates to a database query optimization method based on reinforcement learning and a graph attention network. Aiming at large-scale multi-connection query, a better database query execution plan can be obtained in a shorter time, so that the execution time of the query in the database is reduced.
Background
For a query statement, the database cannot be executed directly. The database needs to analyze the query statement first, then the optimizer generates a corresponding query execution plan, and finally the plan is handed to the execution engine to execute the plan. The invention provides an effective solution for generating a better query plan aiming at the multi-connection query in a shorter time.
The technical scheme in the two prior arts is most similar to that proposed in the present application:
1. chinese invention patent, patent name: a database multi-connection query optimization method based on an improved SDD-1 algorithm is disclosed, and the application number is as follows: CN 201110043615.9.
Firstly, the improved SDD-1 algorithm is executed, a query execution strategy set is obtained by utilizing the algorithm, and the execution strategy set is used as the basis for the initial population generation of the genetic algorithm. And then, executing a genetic algorithm, and optimizing the result obtained by the SDD-1 algorithm by using the global search capability of the genetic algorithm. And finally, obtaining a relatively ideal query execution strategy. The method specifically comprises the following steps:
step 1: setting initial parameters: initial parameter settings including SDD-1 and genetic algorithms;
step 2: acquiring a query execution policy set: searching beneficial bidirectional half-links from the constructed query graph, selecting the beneficial bidirectional half-links from the beneficial bidirectional half-link candidate set to be connected to the beneficial bidirectional half-link set BS, repeating the steps until no beneficial bidirectional half-links exist in the query graph, adding the value of the obtained beneficial bidirectional half-link set BS to the execution strategy set ES, and repeating the steps until the operation frequency reaches N;
and step 3: constructing an initial population of a genetic algorithm: sequentially executing coding operation on elements in the execution strategy set ES, and taking the obtained result as an initial population of the genetic algorithm;
and 4, step 4: running a genetic algorithm: repeatedly performing crossing, variation and selection operations on the population until the running times reach M;
and 5: outputting a query execution strategy: and (4) outputting the best individual in the population as a final result, and decoding the final result into a query tree, namely a query execution strategy.
2. Chinese invention patent, patent name: a big data real-time query optimization method based on hypergraph and dynamic plan is disclosed as follows: CN 201020231887.2.
A big data real-time query optimization method based on a hypergraph and a dynamic plan comprises an optimal cost model construction process and an execution plan space search process. The optimal cost model construction process comprises the following steps:
step 1: analyzing table data in a metadata server, constructing and generating a column-level statistical information histogram with fine granularity, and storing the column-level statistical information histogram in the metadata server;
step 2: and constructing a corresponding optimal cost model for use in generating the plan by using the statistical information.
Performing the planned space search process includes the steps of:
step 1: and analyzing the database query statement, and storing and querying the result in the hypergraph data structure.
Step 2: the execution plan is initially set for a single relationship and saved in the corresponding dynamic schedule.
And step 3: a compute enumeration policy is defined: each connected subgraph and the connected complement set are generated only once;
and 4, step 4: enumerating connected subgraphs by computing a domain;
and 5: finding a suitable connected complement set for each connected subgraph;
step 6: calculating the cost of the execution plan formed by each pair of connected subgraphs and the complementary set, and updating the execution plan according to the cost model;
and 7: and (5) repeatedly executing the step (4) to the step (7) until the execution plan space formed by the whole left linear tree is searched, and generating an execution plan tree.
The first technical scheme has the following defects:
1. the genetic algorithm is essentially a greedy strategy and is easy to fall into a local optimal solution;
2. the scheme needs to set the iteration times of the genetic algorithm, and when the iteration times are less, a better query execution plan cannot be obtained. When the number of iterations is large, the algorithm execution time needs to be long, and the situation that the local optimization is trapped cannot be avoided.
3. The tree structure information of the query execution plan cannot be captured by encoding the one-dimensional code of the query execution plan.
The second technical scheme has the following disadvantages:
1. the left linear tree is used for enumerating and searching the whole query execution plan space, when the connection relation of the query statements is very complex, the query execution plan space is very huge, and a great amount of time is consumed for searching the whole query execution space.
Disclosure of Invention
The invention aims to solve the technical problems that when the connection relation of the existing query statement is very complex, the query execution plan space is very huge, and a great amount of time is consumed for searching the whole query execution space.
In order to solve the technical problems, the invention adopts the following technical scheme:
the invention provides a database query optimization method based on reinforcement learning and a graph attention network, which comprises the following specific steps of:
step 1: data collection, namely randomly generating query statements in a database and executing the query statements, splitting an execution plan tree corresponding to the query statements from a root node, and recording the connection relation of each node;
step 2: model training, namely performing code description on each node according to the connection relation of each node to obtain a code characteristic matrix, performing graph description on each node to obtain a graph description set Edge, initializing a Q-network parameter w in a DQN model, wherein the Q-network in the DQN model adopts a GAT (generic object model) graph attention network, and training the DQN model by taking the code characteristic matrix and the graph description set Edge as network input;
and step 3: and (3) model application, namely taking each table as a node for the tables related to a query statement, initializing the graph description and the code of each node, selecting the connection of 2 nodes in each step in the query execution plan, generating by using the DQN model obtained by training in the step 2, and at the moment, carrying out state transition, updating the graph description and the code description until all the tables are connected to generate the complete query plan.
In the above technical solution, the drawings are described as follows:
numbering n tables related to the query statement, wherein the table is represented as [1, 2, 3, 4 … n ], each table is used as a Node index, the initial Node indexes are n, all current common Node index sets Node [1, 2.. once, n ]) are stored, and the initial Edge set Edge is null;
selecting 2 nodes i and j which are not marked as connected query plans in the Node set, adding the query plans connected with the nodes i and j into the Node set as a new Node max (index) +1, wherein index belongs to the Node, marking the nodes i and j as connected query plans, representing the nodes i, j and the new Node max (index) +1 into two edges (i, max (index) +1) and (j, max (index) + j), and adding the Edge set to obtain a graph description set Edge.
In the above technical solution, the encoding is described as follows:
for each node, the number of coding bits is n + m + k bits, the initial n bits are table 1-hot codes related to the current node, the middle m bits are column attribute 1-hot codes related to the current node, the last k bits are 1-hot codes of connection types, the coding description n + m bits of each node can be assigned during initialization, the last k bits are initialized to 0, when connection operation is performed, the coding of the newly added node is newly added, the first n + m bits are bits of connection 2 nodes or operation results, the last k bits are set with the 1-hot codes according to the connection types, and finally a coding feature matrix is obtained.
Because the invention adopts the technical scheme, the invention has the following beneficial effects:
first, compared to the dynamic programming algorithm, the dynamic programming algorithm needs to search all possible solutions of the query execution plan, and the solution space of the query execution plan increases exponentially with the number of connections. Although the dynamic programming algorithm may obtain the optimal solution for the query execution plan, it may take a significant amount of time in obtaining the optimal solution. And by using the reinforcement learning DQN model, the model is trained off line without consideration. When the DQN model is used for deciding a query execution plan, the algorithm execution times are linearly related to the connection numbers. If there are n connections, only n-1 algorithm executions at most are needed to obtain a better query execution plan.
Compared with intelligent algorithms such as genetic algorithm and the like, as described in the second technical scheme, the iteration times of the algorithm need to be set firstly. However, for different query statements, the iteration number of the algorithm cannot be adapted, which may result in that if the iteration number is not enough, a better solution cannot be obtained. If the number of iterations is too large, a lot of time is consumed while the local optimum may be involved. And by using the DQN model, the selection of each step is selected by the Q-network trained by using a large amount of data, so that the local optimization is less likely to be involved, and the result can be obtained only by executing the algorithm n-1 times at most.
And thirdly, compared with a common reinforcement learning algorithm, the tree structure characteristics of the query execution plan cannot be described, the connection selection adopted in each step cannot be described, and the influence degree of the final query execution plan is not uniform. And the two features can be better described by replacing the ordinary fully-connected network Q-network in the DQN network with a GAT graph attention network.
Drawings
FIG. 1 is a schematic view of a model structure;
FIG. 2 is a schematic diagram of an execution plan tree split from a root node.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the detailed description and specific examples, while indicating the preferred embodiment of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
In the reinforcement learning model DQN, the most important part is to predict the long-term influence of different single step selections through Q-network, and each time the model selects the selection with the best long-term influence, the best query execution plan is finally considered to be obtained. Therefore, the correctness of the prediction of the long-term impact on each single-step selection is crucial. The common Q-network is a fully connected network, and can only take one-dimensional data as input. If one-dimensional data is used as input, many problems arise in describing current query execution plans. The common encoding method for the query execution plan can lead to that some different query execution plans have the same encoding representation. Such training may make the long-term impact of Q-network on connection selection inaccurate, as the same input will have the same output result as the input. So to capture the structural information of the overall query execution plan, the GAT attention network is selected as the Q-network instead of the normal full-connection network. The brand-new network structure can enable the long-term influence of the trained model on single-step connection selection to be more accurate. The best connection selection can also be selected more accurately each time when using the DQN model.
For a query statement, the database needs to be executed according to its query execution plan. Query statements involve many table join operations, and table join operations all have commutative associativity. Therefore, in the query execution plan, it is necessary to determine which two parts (which may be tables or the result after the tables and the tables are connected) are connected each time, and finally, each table is connected, i.e. the complete query execution plan is formed. Abstracting the query execution plan tree into a Markov decision process: in the start state, all tables are not connected, and it is necessary to select which two tables are connected. After selection, the two tables are replaced by a connection result, and the next state is entered, and the connection result generated in the previous step can be selected as well. I.e. each time a connection is selected, it may be two tables, one table with one connection result or two connection results. Until only one join result remains in all states, a complete query plan is generated.
The invention specifically provides a database query optimization method based on reinforcement learning and a graph attention network, which is characterized by comprising the following specific steps of:
step 1: data collection, namely randomly generating query statements in a database and executing the query statements, splitting an execution plan tree corresponding to the query statements from a root node, and recording the connection relation of each node; as shown in fig. 2, a query execution plan including 6 nodes is illustrated, where the node 6 is a root node, and the splitting is completed each time the root node is deleted in the splitting process until each node is a leaf node.
Step 2: model training, namely performing code description on each node according to the connection relation of each node to obtain a code characteristic matrix, performing graph description on each node to obtain a graph description set Edge, initializing a Q-network parameter w in a DQN model, wherein the Q-network in the DQN model adopts a GAT (generic object model) graph attention network, and training the DQN model by taking the code characteristic matrix and the graph description set Edge as network input;
and step 3: and (3) model application, namely taking each table as a node for the tables related to a query statement, initializing the graph description and the code of each node, selecting the connection of 2 nodes in each step in the query execution plan, generating by using the DQN model obtained by training in the step 2, and at the moment, carrying out state transition, updating the graph description and the code description until all the tables are connected to generate the complete query plan.
The GAT graph attention network is composed of an attention convolution layer, an active layer, an attention convolution layer and an active layer 4. Wherein the first attention convolution layer receiving input part is state+And action+Determined next statet+1. To execute a plan srate on a querytData model, state pair, described as being tractable by a graph attention networktTwo parts are described for graph description and coding:
the figures are described as follows:
numbering n tables related to the query statement, wherein the table is represented as [1, 2, 3, 4 … n ], each table is used as a Node index, the initial Node indexes are n, all current common Node index sets Node [1, 2.. once, n ]) are stored, and the initial Edge set Edge is null;
selecting 2 nodes i and j which are not marked as connected query plans in the Node set, adding the query plans connected with the nodes i and j into the Node set as a new Node max (index) +1, wherein index belongs to the Node, marking the nodes i and j as connected query plans, representing the nodes i, j and the new Node max (index) +1 into two edges (i, max (index) +1) and (j, max (index) + j), and adding the Edge set to obtain a graph description set Edge.
The encoding in the above technical solution is described as follows:
in the query execution plan, any node i is connected with a query plan node at the later stage, and the result of each query plan node is used as a node j;
for each node, the number of coding bits is n + m + k bits, the initial n bits are table 1-hot codes related to the current node, the middle m bits are column attribute 1-hot codes related to the current node, the last k bits are 1-hot codes of connection types, the coding description n + m bits of each node can be assigned during initialization, the last k bits are initialized to 0, when connection operation is performed, the coding of the newly added node is newly added, the first n + m bits are bits of connection 2 nodes or operation results, the last k bits are set with the 1-hot codes according to the connection types, and finally a coding feature matrix is obtained.
Because the invention adopts the technical scheme, the invention has the following characteristics:
1. in the step 1, enough data can be collected, so that a model with enough generalization capability can be obtained through training in the subsequent model training process;
2. in step 1, all operations are performed off-line, that is, although a large amount of data is collected, the time consumed by the operations is the same as the time of generating the query statement plan by using the model in step 3;
3. in step 2, the DQN model is used as a framework in the whole, so that the connection selection of each step is not limited to whether single-step selection is excellent or not, but is focused on a selection which is more beneficial to the generation of the whole query execution plan, and is less prone to falling into local optimization;
4. in step 2, the general full-connection network is replaced by the attention network GAT in the DQN model as a Q-network, and the tree structure information of the query execution plan can be captured.
5. In step 3, for all query statements, the time for generating a complete query execution plan is linearly related to the number of connections of the query statements, so that a better execution plan can be generated effectively in a very short time.

Claims (3)

1. A database query optimization method based on reinforcement learning and graph attention network is characterized by comprising the following specific steps:
step 1: data collection, namely randomly generating query statements in a database and executing the query statements, splitting an execution plan tree corresponding to the query statements from a root node, and recording the connection relation of each node;
step 2: model training, namely performing code description on each node according to the connection relation of each node to obtain a code characteristic matrix, performing graph description on each node to obtain a graph description set Edge, initializing a Q-network parameter w in a DQN model, wherein the Q-network in the DQN model adopts a GAT (generic object model) graph attention network, and training the DQN model by taking the code characteristic matrix and the graph description set Edge as network input;
and step 3: and (3) model application, namely taking each table as a node for the tables related to a query statement, initializing the graph description and the code of each node, selecting the connection of 2 nodes in each step in the query execution plan, generating by using the DQN model obtained by training in the step 2, and at the moment, carrying out state transition, updating the graph description and the code description until all the tables are connected to generate the complete query plan.
2. The database query optimization method based on reinforcement learning and graph attention network as claimed in claim 1, wherein the graph is described as follows:
numbering n tables related to the query statement, wherein the table is represented as [1, 2, 3, 4 … n ], each table is used as a Node index, the initial Node indexes are n, all current common Node index sets Node [1, 2.. once, n ]) are stored, and the initial Edge set Edge is null;
selecting 2 nodes i and j which are not marked as connected query plans in the Node set, adding the query plans connected with the nodes i and j into the Node set as a new Node max (index) +1, wherein index belongs to the Node, marking the nodes i and j as connected query plans, representing the nodes i, j and the new Node max (index) +1 into two edges (i, max (index) +1) and (j, max (index) + j), and adding the Edge set to obtain a graph description set Edge.
3. The database query optimization method based on reinforcement learning and graph attention network as claimed in claim 1, wherein the code is described as follows:
for each node, the number of coding bits is n + m + k bits, the initial n bits are table 1-hot codes related to the current node, the middle m bits are column attribute 1-hot codes related to the current node, the last k bits are 1-hot codes of connection types, the coding description n + m bits of each node can be assigned during initialization, the last k bits are initialized to 0, when connection operation is performed, the coding of the newly added node is newly added, the first n + m bits are bits of connection 2 nodes or operation results, the last k bits are set with the 1-hot codes according to the connection types, and finally a coding feature matrix is obtained.
CN202011351761.3A 2020-11-26 2020-11-26 Database query optimization method based on reinforcement learning and graph attention network Active CN112328578B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011351761.3A CN112328578B (en) 2020-11-26 2020-11-26 Database query optimization method based on reinforcement learning and graph attention network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011351761.3A CN112328578B (en) 2020-11-26 2020-11-26 Database query optimization method based on reinforcement learning and graph attention network

Publications (2)

Publication Number Publication Date
CN112328578A true CN112328578A (en) 2021-02-05
CN112328578B CN112328578B (en) 2023-03-28

Family

ID=74309558

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011351761.3A Active CN112328578B (en) 2020-11-26 2020-11-26 Database query optimization method based on reinforcement learning and graph attention network

Country Status (1)

Country Link
CN (1) CN112328578B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112883066A (en) * 2021-03-29 2021-06-01 电子科技大学 Multidimensional range query cardinality estimation method on database
CN112988802A (en) * 2021-04-29 2021-06-18 电子科技大学 Relational database query optimization method and system based on reinforcement learning
CN113010547A (en) * 2021-05-06 2021-06-22 电子科技大学 Database query optimization method and system based on graph neural network
CN113515540A (en) * 2021-06-09 2021-10-19 清华大学 Query rewriting method for database
CN115168408A (en) * 2022-08-16 2022-10-11 北京永洪商智科技有限公司 Query optimization method, device, equipment and storage medium based on reinforcement learning
CN116383454A (en) * 2023-04-10 2023-07-04 星环信息科技(上海)股份有限公司 Data query method of graph database, electronic equipment and storage medium
CN116561173A (en) * 2023-07-11 2023-08-08 天津博冕科技发展有限公司 Method and system for selecting query execution plan by using relational graph and attention neural network
WO2024110016A1 (en) * 2022-11-22 2024-05-30 Huawei Technologies Co., Ltd. Devices and methods for generating a query plan based on a sql query

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110046775A1 (en) * 2007-09-13 2011-02-24 Lockheed Martin Corporation Facility Wide Mixed Mail Sorting and/or Sequencing System and Components and Methods Thereof
CN106294888A (en) * 2016-10-24 2017-01-04 北京亚控科技发展有限公司 A kind of method for subscribing of object data based on space-time database
CN110084245A (en) * 2019-04-04 2019-08-02 中国科学院自动化研究所 The Weakly supervised image detecting method of view-based access control model attention mechanism intensified learning, system
US20200134491A1 (en) * 2018-06-08 2020-04-30 United States Of America As Represented By The Secretary Of The Navy Swarm System Including an Operator Control Section Enabling Operator Input of Mission Objectives and Responses to Advice Requests from a Heterogeneous Multi-Agent Population Including Information Fusion, Control Diffusion, and Operator Infusion Agents that Controls Platforms, Effectors, and Sensors
CN111581454A (en) * 2020-04-27 2020-08-25 清华大学 Depth map compression algorithm-based parallel query expression prediction system and method
CN111611274A (en) * 2020-05-28 2020-09-01 华中科技大学 Database query optimization method and system
US20200327118A1 (en) * 2020-06-27 2020-10-15 Intel Corporation Similarity search using guided reinforcement learning

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110046775A1 (en) * 2007-09-13 2011-02-24 Lockheed Martin Corporation Facility Wide Mixed Mail Sorting and/or Sequencing System and Components and Methods Thereof
CN106294888A (en) * 2016-10-24 2017-01-04 北京亚控科技发展有限公司 A kind of method for subscribing of object data based on space-time database
US20200134491A1 (en) * 2018-06-08 2020-04-30 United States Of America As Represented By The Secretary Of The Navy Swarm System Including an Operator Control Section Enabling Operator Input of Mission Objectives and Responses to Advice Requests from a Heterogeneous Multi-Agent Population Including Information Fusion, Control Diffusion, and Operator Infusion Agents that Controls Platforms, Effectors, and Sensors
CN110084245A (en) * 2019-04-04 2019-08-02 中国科学院自动化研究所 The Weakly supervised image detecting method of view-based access control model attention mechanism intensified learning, system
CN111581454A (en) * 2020-04-27 2020-08-25 清华大学 Depth map compression algorithm-based parallel query expression prediction system and method
CN111611274A (en) * 2020-05-28 2020-09-01 华中科技大学 Database query optimization method and system
US20200327118A1 (en) * 2020-06-27 2020-10-15 Intel Corporation Similarity search using guided reinforcement learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
WEIQING ZHOU等: "SOAR::a learned join order selector with graph attention mechanism" *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112883066A (en) * 2021-03-29 2021-06-01 电子科技大学 Multidimensional range query cardinality estimation method on database
CN112988802A (en) * 2021-04-29 2021-06-18 电子科技大学 Relational database query optimization method and system based on reinforcement learning
CN112988802B (en) * 2021-04-29 2022-07-15 电子科技大学 Relational database query optimization method and system based on reinforcement learning
CN113010547A (en) * 2021-05-06 2021-06-22 电子科技大学 Database query optimization method and system based on graph neural network
CN113515540A (en) * 2021-06-09 2021-10-19 清华大学 Query rewriting method for database
CN115168408A (en) * 2022-08-16 2022-10-11 北京永洪商智科技有限公司 Query optimization method, device, equipment and storage medium based on reinforcement learning
CN115168408B (en) * 2022-08-16 2024-05-28 北京永洪商智科技有限公司 Query optimization method, device, equipment and storage medium based on reinforcement learning
WO2024110016A1 (en) * 2022-11-22 2024-05-30 Huawei Technologies Co., Ltd. Devices and methods for generating a query plan based on a sql query
CN116383454A (en) * 2023-04-10 2023-07-04 星环信息科技(上海)股份有限公司 Data query method of graph database, electronic equipment and storage medium
CN116383454B (en) * 2023-04-10 2024-01-30 星环信息科技(上海)股份有限公司 Data query method of graph database, electronic equipment and storage medium
CN116561173A (en) * 2023-07-11 2023-08-08 天津博冕科技发展有限公司 Method and system for selecting query execution plan by using relational graph and attention neural network
CN116561173B (en) * 2023-07-11 2023-10-13 太原理工大学 Method and system for selecting query execution plan by using relational graph and attention neural network

Also Published As

Publication number Publication date
CN112328578B (en) 2023-03-28

Similar Documents

Publication Publication Date Title
CN112328578B (en) Database query optimization method based on reinforcement learning and graph attention network
CN111428054B (en) Construction and storage method of knowledge graph in network space security field
CN111597209B (en) Database materialized view construction system, method and system creation method
Zou et al. Finding top-k maximal cliques in an uncertain graph
Zhang et al. Treepi: A novel graph indexing method
CN113010547B (en) Database query optimization method and system based on graph neural network
CN113515539B (en) Method for inquiring data in database
CN113535972B (en) Knowledge graph link prediction model method and device fusing context semantics
CN111581454A (en) Depth map compression algorithm-based parallel query expression prediction system and method
WO2021139753A1 (en) Method and apparatus for processing database
CN105335510A (en) Text data efficient searching method
CN104504018A (en) Top-down real-time big data query optimization method based on bushy tree
CN114911844B (en) Approximate query optimization system based on machine learning
CN113515540A (en) Query rewriting method for database
CN117290376A (en) Two-stage Text2SQL model, method and system based on large language model
CN117149974A (en) Knowledge graph question-answering method for sub-graph retrieval optimization
CN105160046A (en) Text-based data retrieval method
Zou et al. Survey on learnable databases: A machine learning perspective
CN108537581B (en) Energy consumption time series prediction method and device based on GMDH selective combination
CN117474010A (en) Power grid language model-oriented power transmission and transformation equipment defect corpus construction method
Gao et al. Automatic index selection with learned cost estimator
CN109086373B (en) Method for constructing fair link prediction evaluation system
Aghaei Question answering over knowledge graphs
CN117390064B (en) Database query optimization method based on embeddable subgraph
CN116483863A (en) Query optimization method based on tree attention and radix perception and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant