CN112328578A

CN112328578A - Database query optimization method based on reinforcement learning and graph attention network

Info

Publication number: CN112328578A
Application number: CN202011351761.3A
Authority: CN
Inventors: 詹思瑜; 周维清; 王玉林; 卢国明; 戴波
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2020-11-26
Filing date: 2020-11-26
Publication date: 2021-02-05
Anticipated expiration: 2040-11-26
Also published as: CN112328578B

Abstract

The invention relates to the technical field of databases, provides a database query optimization method based on reinforcement learning and a graph attention network, and aims to solve the technical problems that when the connection relation of the existing query sentences is very complex, the query execution plan space is very large, and a large amount of time is consumed for searching the whole query execution space. Randomly generating query statements in a database and executing the query statements, splitting an execution plan tree corresponding to the query statements from a root node, and recording the connection relation of each node; initializing Q-network parameters w in a DQN model, wherein the Q-network in the DQN model adopts a GAT (generic object model) graph attention network, takes a coding feature matrix and a graph description set Edge as network input, and trains the DQN model; and (3) initializing the graph description and the code of a query statement, and generating a connection relation by using the DQN model obtained by training in the step 2 until all tables are connected to generate a complete query plan.

Description

Database query optimization method based on reinforcement learning and graph attention network

Technical Field

The invention relates to a database query optimization method based on reinforcement learning and a graph attention network. Aiming at large-scale multi-connection query, a better database query execution plan can be obtained in a shorter time, so that the execution time of the query in the database is reduced.

Background

For a query statement, the database cannot be executed directly. The database needs to analyze the query statement first, then the optimizer generates a corresponding query execution plan, and finally the plan is handed to the execution engine to execute the plan. The invention provides an effective solution for generating a better query plan aiming at the multi-connection query in a shorter time.

The technical scheme in the two prior arts is most similar to that proposed in the present application:

1. chinese invention patent, patent name: a database multi-connection query optimization method based on an improved SDD-1 algorithm is disclosed, and the application number is as follows: CN 201110043615.9.

Firstly, the improved SDD-1 algorithm is executed, a query execution strategy set is obtained by utilizing the algorithm, and the execution strategy set is used as the basis for the initial population generation of the genetic algorithm. And then, executing a genetic algorithm, and optimizing the result obtained by the SDD-1 algorithm by using the global search capability of the genetic algorithm. And finally, obtaining a relatively ideal query execution strategy. The method specifically comprises the following steps:

step 1: setting initial parameters: initial parameter settings including SDD-1 and genetic algorithms;

step 2: acquiring a query execution policy set: searching beneficial bidirectional half-links from the constructed query graph, selecting the beneficial bidirectional half-links from the beneficial bidirectional half-link candidate set to be connected to the beneficial bidirectional half-link set BS, repeating the steps until no beneficial bidirectional half-links exist in the query graph, adding the value of the obtained beneficial bidirectional half-link set BS to the execution strategy set ES, and repeating the steps until the operation frequency reaches N;

and step 3: constructing an initial population of a genetic algorithm: sequentially executing coding operation on elements in the execution strategy set ES, and taking the obtained result as an initial population of the genetic algorithm;

and 4, step 4: running a genetic algorithm: repeatedly performing crossing, variation and selection operations on the population until the running times reach M;

and 5: outputting a query execution strategy: and (4) outputting the best individual in the population as a final result, and decoding the final result into a query tree, namely a query execution strategy.

2. Chinese invention patent, patent name: a big data real-time query optimization method based on hypergraph and dynamic plan is disclosed as follows: CN 201020231887.2.

A big data real-time query optimization method based on a hypergraph and a dynamic plan comprises an optimal cost model construction process and an execution plan space search process. The optimal cost model construction process comprises the following steps:

step 1: analyzing table data in a metadata server, constructing and generating a column-level statistical information histogram with fine granularity, and storing the column-level statistical information histogram in the metadata server;

step 2: and constructing a corresponding optimal cost model for use in generating the plan by using the statistical information.

Performing the planned space search process includes the steps of:

step 1: and analyzing the database query statement, and storing and querying the result in the hypergraph data structure.

Step 2: the execution plan is initially set for a single relationship and saved in the corresponding dynamic schedule.

And step 3: a compute enumeration policy is defined: each connected subgraph and the connected complement set are generated only once;

and 4, step 4: enumerating connected subgraphs by computing a domain;

and 5: finding a suitable connected complement set for each connected subgraph;

step 6: calculating the cost of the execution plan formed by each pair of connected subgraphs and the complementary set, and updating the execution plan according to the cost model;

and 7: and (5) repeatedly executing the step (4) to the step (7) until the execution plan space formed by the whole left linear tree is searched, and generating an execution plan tree.

The first technical scheme has the following defects:

1. the genetic algorithm is essentially a greedy strategy and is easy to fall into a local optimal solution;

2. the scheme needs to set the iteration times of the genetic algorithm, and when the iteration times are less, a better query execution plan cannot be obtained. When the number of iterations is large, the algorithm execution time needs to be long, and the situation that the local optimization is trapped cannot be avoided.

3. The tree structure information of the query execution plan cannot be captured by encoding the one-dimensional code of the query execution plan.

The second technical scheme has the following disadvantages:

1. the left linear tree is used for enumerating and searching the whole query execution plan space, when the connection relation of the query statements is very complex, the query execution plan space is very huge, and a great amount of time is consumed for searching the whole query execution space.

Disclosure of Invention

The invention aims to solve the technical problems that when the connection relation of the existing query statement is very complex, the query execution plan space is very huge, and a great amount of time is consumed for searching the whole query execution space.

In order to solve the technical problems, the invention adopts the following technical scheme:

the invention provides a database query optimization method based on reinforcement learning and a graph attention network, which comprises the following specific steps of:

step 1: data collection, namely randomly generating query statements in a database and executing the query statements, splitting an execution plan tree corresponding to the query statements from a root node, and recording the connection relation of each node;

step 2: model training, namely performing code description on each node according to the connection relation of each node to obtain a code characteristic matrix, performing graph description on each node to obtain a graph description set Edge, initializing a Q-network parameter w in a DQN model, wherein the Q-network in the DQN model adopts a GAT (generic object model) graph attention network, and training the DQN model by taking the code characteristic matrix and the graph description set Edge as network input;

and step 3: and (3) model application, namely taking each table as a node for the tables related to a query statement, initializing the graph description and the code of each node, selecting the connection of 2 nodes in each step in the query execution plan, generating by using the DQN model obtained by training in the step 2, and at the moment, carrying out state transition, updating the graph description and the code description until all the tables are connected to generate the complete query plan.

In the above technical solution, the drawings are described as follows:

numbering n tables related to the query statement, wherein the table is represented as [1, 2, 3, 4 … n ], each table is used as a Node index, the initial Node indexes are n, all current common Node index sets Node [1, 2.. once, n ]) are stored, and the initial Edge set Edge is null;

selecting 2 nodes i and j which are not marked as connected query plans in the Node set, adding the query plans connected with the nodes i and j into the Node set as a new Node max (index) +1, wherein index belongs to the Node, marking the nodes i and j as connected query plans, representing the nodes i, j and the new Node max (index) +1 into two edges (i, max (index) +1) and (j, max (index) + j), and adding the Edge set to obtain a graph description set Edge.

In the above technical solution, the encoding is described as follows:

for each node, the number of coding bits is n + m + k bits, the initial n bits are table 1-hot codes related to the current node, the middle m bits are column attribute 1-hot codes related to the current node, the last k bits are 1-hot codes of connection types, the coding description n + m bits of each node can be assigned during initialization, the last k bits are initialized to 0, when connection operation is performed, the coding of the newly added node is newly added, the first n + m bits are bits of connection 2 nodes or operation results, the last k bits are set with the 1-hot codes according to the connection types, and finally a coding feature matrix is obtained.

Because the invention adopts the technical scheme, the invention has the following beneficial effects:

first, compared to the dynamic programming algorithm, the dynamic programming algorithm needs to search all possible solutions of the query execution plan, and the solution space of the query execution plan increases exponentially with the number of connections. Although the dynamic programming algorithm may obtain the optimal solution for the query execution plan, it may take a significant amount of time in obtaining the optimal solution. And by using the reinforcement learning DQN model, the model is trained off line without consideration. When the DQN model is used for deciding a query execution plan, the algorithm execution times are linearly related to the connection numbers. If there are n connections, only n-1 algorithm executions at most are needed to obtain a better query execution plan.

Compared with intelligent algorithms such as genetic algorithm and the like, as described in the second technical scheme, the iteration times of the algorithm need to be set firstly. However, for different query statements, the iteration number of the algorithm cannot be adapted, which may result in that if the iteration number is not enough, a better solution cannot be obtained. If the number of iterations is too large, a lot of time is consumed while the local optimum may be involved. And by using the DQN model, the selection of each step is selected by the Q-network trained by using a large amount of data, so that the local optimization is less likely to be involved, and the result can be obtained only by executing the algorithm n-1 times at most.

And thirdly, compared with a common reinforcement learning algorithm, the tree structure characteristics of the query execution plan cannot be described, the connection selection adopted in each step cannot be described, and the influence degree of the final query execution plan is not uniform. And the two features can be better described by replacing the ordinary fully-connected network Q-network in the DQN network with a GAT graph attention network.

Drawings

FIG. 1 is a schematic view of a model structure;

FIG. 2 is a schematic diagram of an execution plan tree split from a root node.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the detailed description and specific examples, while indicating the preferred embodiment of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.

Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.

In the reinforcement learning model DQN, the most important part is to predict the long-term influence of different single step selections through Q-network, and each time the model selects the selection with the best long-term influence, the best query execution plan is finally considered to be obtained. Therefore, the correctness of the prediction of the long-term impact on each single-step selection is crucial. The common Q-network is a fully connected network, and can only take one-dimensional data as input. If one-dimensional data is used as input, many problems arise in describing current query execution plans. The common encoding method for the query execution plan can lead to that some different query execution plans have the same encoding representation. Such training may make the long-term impact of Q-network on connection selection inaccurate, as the same input will have the same output result as the input. So to capture the structural information of the overall query execution plan, the GAT attention network is selected as the Q-network instead of the normal full-connection network. The brand-new network structure can enable the long-term influence of the trained model on single-step connection selection to be more accurate. The best connection selection can also be selected more accurately each time when using the DQN model.

For a query statement, the database needs to be executed according to its query execution plan. Query statements involve many table join operations, and table join operations all have commutative associativity. Therefore, in the query execution plan, it is necessary to determine which two parts (which may be tables or the result after the tables and the tables are connected) are connected each time, and finally, each table is connected, i.e. the complete query execution plan is formed. Abstracting the query execution plan tree into a Markov decision process: in the start state, all tables are not connected, and it is necessary to select which two tables are connected. After selection, the two tables are replaced by a connection result, and the next state is entered, and the connection result generated in the previous step can be selected as well. I.e. each time a connection is selected, it may be two tables, one table with one connection result or two connection results. Until only one join result remains in all states, a complete query plan is generated.

The invention specifically provides a database query optimization method based on reinforcement learning and a graph attention network, which is characterized by comprising the following specific steps of:

step 1: data collection, namely randomly generating query statements in a database and executing the query statements, splitting an execution plan tree corresponding to the query statements from a root node, and recording the connection relation of each node; as shown in fig. 2, a query execution plan including 6 nodes is illustrated, where the node 6 is a root node, and the splitting is completed each time the root node is deleted in the splitting process until each node is a leaf node.

The GAT graph attention network is composed of an attention convolution layer, an active layer, an attention convolution layer and an active layer 4. Wherein the first attention convolution layer receiving input part is state₊And action₊Determined next state_t+1. To execute a plan srate on a query_tData model, state pair, described as being tractable by a graph attention network_tTwo parts are described for graph description and coding:

the figures are described as follows:

The encoding in the above technical solution is described as follows:

in the query execution plan, any node i is connected with a query plan node at the later stage, and the result of each query plan node is used as a node j;

Because the invention adopts the technical scheme, the invention has the following characteristics:

1. in the step 1, enough data can be collected, so that a model with enough generalization capability can be obtained through training in the subsequent model training process;

2. in step 1, all operations are performed off-line, that is, although a large amount of data is collected, the time consumed by the operations is the same as the time of generating the query statement plan by using the model in step 3;

3. in step 2, the DQN model is used as a framework in the whole, so that the connection selection of each step is not limited to whether single-step selection is excellent or not, but is focused on a selection which is more beneficial to the generation of the whole query execution plan, and is less prone to falling into local optimization;

4. in step 2, the general full-connection network is replaced by the attention network GAT in the DQN model as a Q-network, and the tree structure information of the query execution plan can be captured.

5. In step 3, for all query statements, the time for generating a complete query execution plan is linearly related to the number of connections of the query statements, so that a better execution plan can be generated effectively in a very short time.

Claims

1. A database query optimization method based on reinforcement learning and graph attention network is characterized by comprising the following specific steps:

2. The database query optimization method based on reinforcement learning and graph attention network as claimed in claim 1, wherein the graph is described as follows:

3. The database query optimization method based on reinforcement learning and graph attention network as claimed in claim 1, wherein the code is described as follows: