CN116610756A

CN116610756A - Distributed database self-adaptive copy selection method and device

Info

Publication number: CN116610756A
Application number: CN202310868595.1A
Authority: CN
Inventors: 赵衎衎; 冷友方; 魏可伟; 陈磊
Original assignee: Shandong Inspur Database Technology Co Ltd
Current assignee: Shandong Inspur Database Technology Co Ltd
Priority date: 2023-07-17
Filing date: 2023-07-17
Publication date: 2023-08-18
Anticipated expiration: 2043-07-17
Also published as: CN116610756B

Abstract

The invention discloses a self-adaptive copy selection method and a self-adaptive copy selection device for a distributed database, which belong to the field of distributed databases, and the technical problem to be solved by the invention is how to quickly select copies for improving efficiency, and the technical scheme is as follows: s101, a gateway node analyzes an SQL sentence and acquires a data range to be read in the sentence; s102, filtering out the copy of the old version from the copy cache information, and acquiring the Range size with the latest version; s103, transferring the copy information parameters obtained in the step S102 to a genetic evolution algorithm, and then generating population starting iteration about selectable copy distribution information; s104, setting a cut-off mechanism; s105, the gateway node sends the data to the corresponding node for reading; s106, the results of the nodes are collected to the gateway node and returned to the client. Compared with the prior art, the genetic evolution can obtain good optimization results in a short time.

Description

Distributed database self-adaptive copy selection method and device

Technical Field

The invention relates to the field of distributed databases, and particularly provides a distributed database self-adaptive copy selection method and device.

Background

Reading data, which is a common operation in a database and is commonly used for a first operator in an SQL statement execution plan; the read data operator is a basic operator in both OLTP scenarios of various transaction models and OLAP scenarios of computational analysis models.

The performance of the distributed database is mainly reflected in that shared-nothing nodes in the cluster store partial data and can independently perform parallel computation, and when the task of one node is completed and waits for other nodes, the phenomenon of unbalanced load occurs, so that resources are idle.

The distributed database adopts a multi-copy strategy on storage, namely each data after the data table is partitioned is stored in a plurality of nodes at the same time, and the plurality of nodes are selected for reading task allocation. The amount of data per data is different, as is the delay between the node and the gateway node. The main technical implementation scheme for realizing the load balancing based on the combination optimization problem is realized by distributing the reading tasks to the nodes and ensuring the balance of the reading task quantity on the nodes.

The related art of replica selection policies typically employ a proximity principle in a distributed database to select the node that has the shortest delay from the gateway node. I.e. the range of data known to the gateway node to be read, which range involves three copies of each piece of data, i.e. three nodes, to select the node with the shortest delay from the gateway node, reducing the time delay for the request to be sent and returned. And for the strong consistency reading, namely acquiring the latest version data to ensure that the data is up-to-date, considering the consensus algorithm, and selecting a leader node in the consensus algorithm to ensure the strong consistency reading due to the fact that the latest version of the data is ensured.

The performance of the distributed database is mainly reflected in that a plurality of nodes can work in parallel to improve the efficiency of a single machine mode. However, in the task allocation process of the read data, no matter whether the nearby principle or the selection based on the consensus algorithm can not ensure the balance of the read task quantity on the plurality of nodes, the phenomenon that the node with small read task quantity among the plurality of nodes for reading the data firstly completes the node waiting for large read task quantity, and the resource is idle is caused, so that the efficiency of the distributed database is reduced.

In addition, in the generation of the distributed database physical plan, the nodes participating in SQL execution are all the nodes involved in the data reading process. The specific operators into which other operators on the optimal logical plan tree are converted are stacked on these operators reading data, i.e. the spreadsheet operators, in a bottom-up direction of the optimal logical plan tree. The whole physical plan is usually started by reading data, the efficiency of reading the data is low, and the unbalanced allocation of the reading task quantity can seriously influence the execution efficiency of subsequent operators in the physical plan.

How to select a copy is a key for determining nodes participating in reading in a distributed database and selecting a node related to the subsequent execution efficiency of a distributed execution plan, and is a problem to be solved by those skilled in the art.

Disclosure of Invention

Aiming at the defects of the prior art, the invention provides a distributed database self-adaptive copy selection method with strong practicability.

The invention further aims to provide a distributed database self-adaptive copy selection device which is reasonable in design, safe and applicable.

The technical scheme adopted for solving the technical problems is as follows:

the self-adaptive duplicate selection method for the distributed database is characterized in that the distributed database is divided into two types of strong consistency reading and delay reading;

the strong consistency reading self-adaptive hybrid copy selection method comprises the following steps:

s101, a gateway node analyzes an SQL sentence and acquires a data range to be read in the sentence;

s102, filtering out the copy of the old version from the copy cache information, and acquiring the Range size with the latest version;

s103, transferring the copy information parameters obtained in the step S102 to a genetic evolution algorithm, and then generating population starting iteration about selectable copy distribution information;

s104, setting a cut-off mechanism;

s105, the gateway node sends the data to the corresponding node for reading;

s106, collecting results of all the nodes to the gateway node and returning the results to the client;

the delay reading self-adaptive hybrid copy selection method comprises the following steps:

s201, a gateway node analyzes an SQL sentence, and obtains a data range to be read in the SQL sentence;

s202, transmitting the distribution information acquired in the step S201 as parameters into a genetic evolution algorithm for iteration;

s203, judging a genetic evolution algorithm and a truncation mechanism;

s204, the gateway node sends the data to the corresponding node for reading;

s205, the results of the nodes are collected to the gateway node and returned to the client.

Further, in step S101, the gateway node parses the SQL statement, obtains a data Range to be read in the SQL statement, and obtains distribution information of each Range in the data Range according to the tree structure indexed in the distribution layer.

Further, in step S102, the copy of the old version is filtered from the copy cache information, the Range size with the latest version is obtained, and meanwhile, the difference value between the maximum task reading amount and the minimum task reading amount in all the nodes is obtained by parallel running of a default copy selection policy of the system;

the default copy selection policy enables a single simple look-ahead acquisition.

Further, in step S103, the copy information parameters obtained in step S102 are transferred to the genetic evolutionary algorithm, and then the population related to the selectable copy distribution information is generated to start iteration, the iteration process is formed by cross variation, and each iteration round can select the best gene to obtain the difference value between the current maximum task amount and the minimum task amount.

Further, in step S104, a cut-off mechanism is set, that is, the minimum difference obtained in a period of time during which the genetic evolution algorithm operates is smaller than the difference obtained by the default copy selection policy of the system, and is smaller than the amount of reading tasks that can be performed by the hard disk at the same time under the condition of the hardware of the network.

Further, in step S105, the copy selection result obtained in step S104 is obtained by the gateway node in the generation of the physical plan, and then the physical plans of other operators are generated on the read table operator from bottom to top, and then sent to the corresponding node by the gateway node for reading.

Further, in step S201, the gateway node parses the SQL statement to obtain the data range to be read in the statement. Acquiring the distribution information of each Range in the data Range according to the tree structure indexed in the distribution layer;

in step S202, the distribution information obtained in step S201 is also transmitted as parameters to the genetic evolution algorithm for iteration, each gene segment has 3 nodes which can be mutated, and the default copy selection strategy of the parallel operation system obtains the difference value between the maximum task reading amount and the minimum task reading amount in all the nodes;

Further, in step S204, the copy selection result obtained in step S203 is obtained by the gateway node in the generation of the physical plan, and then the physical plans of other operators are generated on the read table operator from bottom to top, and then sent to the corresponding node by the gateway node for reading.

Further, in step S205, the results of the nodes are collected into gateway nodes and returned to the client, and the hash connection result of each node is obtained as a union, that is, the final hash connection result.

A distributed database adaptive replica selection apparatus, comprising: at least one memory and at least one processor;

the at least one memory for storing a machine readable program;

the at least one processor is configured to invoke the machine-readable program to perform a distributed database adaptive replica selection method.

Compared with the prior art, the distributed database self-adaptive copy selection method and device have the following outstanding beneficial effects:

the distribution information of the obtained copy is obtained in the cache, so that time expenditure is not brought. The distribution information of the copies is used as parameters to be transmitted into a genetic evolution algorithm to carry out iteration time with maximum cross variation, the time complexity of population generation and the time complexity of each iteration are respectively(N is the number of data blocks read), spatial complexity is +.>. It can be seen that genetic evolution can give good optimization results in a very short time.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

FIG. 1 is a schematic diagram of a distributed database adaptive replica selection method;

FIG. 2 is a diagram of distribution information of replicas in a distributed database adaptive replica selection method.

Detailed Description

In order to provide a better understanding of the aspects of the present invention, the present invention will be described in further detail with reference to specific embodiments. It will be apparent that the described embodiments are only some, but not all, embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

A preferred embodiment is given below:

in the self-adaptive copy selection method of the distributed database in the embodiment, the client is only connected with one node in the cluster, the node is called a gateway node, and the gateway node converts the SQL request into read-write operation of the bottom layer storage and coordinates node execution of distributed query processing. And is responsible for responding the query results to the client. The storage engine of the switch database is based on a RocksDB (Rocks database) stored by key values, each node is provided with a hard disk, and one hard disk corresponds to one RocksDB (Rocks database) instance. The switch database adopts horizontal data slicing, and the key space of the data is divided into a plurality of disjoint data blocks continuously by using Range partitioning by a distribution layer, wherein the data blocks are called Ranges, and the Range (data Range) is copied for 3 copies and distributed on different nodes in the cluster. As shown in FIG. 1, each table in the open database corresponds to a contiguous portion of the key space, and the distribution layer divides the table into ranges by Range partitions.

The distributed database is divided into two types of strong consistency reading and delay reading;

the method for selecting the self-adaptive hybrid copy with strong consistency comprises the following steps:

the gateway node analyzes the SQL statement and obtains the data range to be read in the statement. And acquiring the distribution information of each Range in the data Range according to the tree structure indexed in the distribution layer. As shown in fig. 2, taking the data Range [ a, d ] as an example, three ranges of distributed information of [ a, b ], [ b, c ], [ c, d ] can be obtained, namely [1,3,4], [2,4,5], [2,3,5].

and filtering the old version copy from the copy cache information, and acquiring the Range size with the latest version. And simultaneously, the default copy selection strategy of the parallel operation system acquires the difference value between the maximum task reading quantity and the minimum task reading quantity in all the nodes.

The default copy selection policy may be obtained in advance due to a single simple implementation.

S103, transferring the copy information parameters obtained in the step S2 to a genetic evolution algorithm, and then generating population starting iteration about selectable copy distribution information;

and (3) transferring the copy information parameters obtained in the step (S2) to a genetic evolution algorithm, and then generating a population starting iteration about selectable copy distribution information. The iteration process consists of cross variation, and each iteration round can select the best gene to obtain the difference value between the current maximum task reading quantity and the minimum task reading quantity.

S104, setting a cut-off mechanism;

as the Range distribution of the switch database is redistributed every time the switch is started, the copy selection of the system can obtain good load balance under some distribution information through experimental verification. In order to enable the system to adaptively select two strategies so that the maximum benefit can be obtained under any distribution condition, a cut-off mechanism is arranged, namely, the minimum difference value which can be obtained within 100ms of running a genetic evolution algorithm is smaller than the difference value obtained by the default copy selection strategy of the system, and the minimum difference value is smaller than the amount of reading tasks which can be carried out by a hard disk under the condition of the hardware and the network in 100 ms.

The phenomenon that the resource is idle can be improved by the copy selection strategy of the genetic evolution algorithm due to the fact that the load imbalance caused by the default copy selection strategy of the system can be judged, and the execution efficiency of operators is improved. If the genetic evolution algorithm fails to obtain good feedback in the setting time, the default copy selection strategy of the system is used for uniformly distributing the reading task to each node, and the phenomenon of resource idling and waiting does not occur.

S105, the gateway node sends the data to the corresponding node for reading;

the copy selection result obtained in the step S4 is obtained by the gateway node in the process of generating the physical plan, then the physical plans of other operators are generated on the meter reading operator from bottom to top, and then the physical plans are sent to the corresponding nodes by the gateway node for reading.

S106, the results of the nodes are collected to the gateway node and returned to the client.

The method for selecting the delay reading self-adaptive hybrid copy comprises the following steps:

the gateway node analyzes the SQL statement and obtains the data range to be read in the statement. And acquiring the distribution information of each Range in the data Range according to the tree structure indexed in the distribution layer.

S202, transmitting the distribution information acquired in the step S1 as parameters into a genetic evolution algorithm for iteration;

because the latest version of the Range copy is not required to be ensured, the distribution information acquired in the step S1 is taken as a parameter and also transmitted to a genetic evolution algorithm for iteration, and under the scene, each gene segment has 3 nodes for variation.

And simultaneously, the default copy selection strategy of the parallel operation system acquires the difference value between the maximum task reading quantity and the minimum task reading quantity in all the nodes. The default copy selection policy may be obtained in advance due to a single simple implementation.

S203, judging a genetic evolution algorithm and a truncation mechanism;

similar to step S4 in the strong consistency reading scenario, to implement the adaptive hybrid copy selection strategy, the genetic evolution algorithm needs to be combined with the judgment of the truncation mechanism, so as to obtain the maximum benefit under various distribution information conditions.

S204, the gateway node sends the data to the corresponding node for reading;

and 3, the copy selection result obtained in the step is obtained by the gateway node in the process of generating the physical plan, the physical plans of other operators are generated on the meter reading operator from bottom to top, and then the physical plans are sent to the corresponding nodes by the gateway node for reading.

S205, the results of the nodes are collected to the gateway node and returned to the client. The hash connection result of each node is the final hash connection result by union.

The following table compares the selection results with respect to an increase in the amount of read data:

the performance of the adaptive replica selection strategy was tested on the cockreach db database and compared to the basic scheme in the system. The table above shows the load balancing degree brought by the two modes of selecting and comparing the copies by using the method of the system and the self-adaptive strategy along with the increase of the number of the data blocks, namely the difference value between the maximum reading task amount and the minimum reading task amount, and can be seen under the condition of any copy distribution. The method can well lead the reading task quantity to be evenly distributed on each node.

Based on the above method, a distributed database adaptive copy selection apparatus in this embodiment includes: at least one memory and at least one processor;

the at least one memory for storing a machine readable program;

The above specific embodiments are merely illustrative of specific cases of the present invention, and the scope of the present invention includes, but is not limited to, the specific embodiments described above, any suitable changes or substitutions made by one of ordinary skill in the art, which are consistent with the present invention, of a distributed database adaptive replica selection method and apparatus claims, and all fall within the scope of the present invention.

Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made therein without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. The self-adaptive duplicate selection method for the distributed database is characterized in that the distributed database is divided into two types of strong consistency reading and delay reading;

s104, setting a cut-off mechanism;

s105, the gateway node sends the data to the corresponding node for reading;

s203, judging a genetic evolution algorithm and a truncation mechanism;

s204, the gateway node sends the data to the corresponding node for reading;

2. The method for adaptively selecting a copy of a distributed database according to claim 1, wherein in step S101, a gateway node parses an SQL statement, obtains a data Range to be read in the SQL statement, and obtains distribution information of each Range in the data Range according to a tree structure indexed in a distribution layer.

3. The method for adaptively selecting a copy of a distributed database according to claim 2, wherein in step S102, an old version of the copy is filtered from the copy cache information, a Range size having the latest version is obtained, and a default copy selection policy of a parallel operation system obtains a difference value between a maximum task amount and a minimum task amount of all nodes;

4. A distributed database adaptive replica selection method as claimed in claim 3, wherein in step S103, the replica information parameters obtained in step S102 are transferred to a genetic evolution algorithm, and then a population start iteration is generated for selectable replica distribution information, the iteration process is composed of cross variation, and each iteration round selects the best gene to obtain the difference between the current maximum task amount and the current minimum task amount.

5. The method according to claim 4, wherein in step S104, a cut-off mechanism is set, that is, a minimum difference obtained in a period of time during which the genetic evolution algorithm operates is smaller than a difference obtained by a default copy selection policy of the system, and is smaller than a read task amount that can be performed by the hard disk in the same environment and in the same time.

6. The method according to claim 5, wherein in step S105, the copy selection result obtained in step S104 is obtained by the gateway node in generating the physical plan, and then the physical plans of other operators are generated on the read table operator from bottom to top, and then the physical plan is sent to the corresponding node by the gateway node for reading.

7. The method for adaptively selecting a copy of a distributed database according to claim 6, wherein in step S201, the gateway node parses an SQL statement, obtains a data Range to be read in the statement, and obtains distribution information of each Range in the data Range according to a tree structure indexed in a distribution layer;

8. The method according to claim 7, wherein in step S204, the copy selection result obtained in step S3 is obtained by the gateway node in generating the physical plan, and then the physical plans of other operators are generated on the read table operator from bottom to top, and then the physical plan is sent to the corresponding node by the gateway node for reading.

9. The method according to claim 8, wherein in step S205, the results of the nodes are collected into gateway nodes and returned to the client, and the hash connection result of each node is obtained as a final hash connection result by combining.

10. A distributed database adaptive replica selection apparatus, comprising: at least one memory and at least one processor;

the at least one memory for storing a machine readable program;

the at least one processor being configured to invoke the machine readable program to perform the method of any of claims 1 to 9.