CN113282607A

CN113282607A - Optimized query method for database

Info

Publication number: CN113282607A
Application number: CN202110613507.4A
Authority: CN
Inventors: 庞永亮; 门合建
Original assignee: Shandong Health Medical Big Data Co ltd
Current assignee: Shandong Health Medical Big Data Co ltd
Priority date: 2021-06-02
Filing date: 2021-06-02
Publication date: 2021-08-20

Abstract

The invention discloses an optimized query method of a database, which relates to the technical field of data query, and the implementation content comprises two parts of optimized data processing and optimized query statement; in the optimized data processing section, the operations performed include: (1.1) deleting duplicate records in the database; (1.2) setting a primary key in the data table; in the optimized query statement section, the operations performed include: (2.1) arranging the data tables in descending order according to the data quantity, and connecting two adjacent data tables by using inner join; (2.2) performing alias operation on the data tables before connecting the plurality of data tables; (2.3) avoiding use of query-all in Select clauses; (2.4) setting clauses which can be replaced in the data table; and (2.5) establishing an index in the data table. The invention can improve the query sequence of the data table, reduce the scanning times of the unrelated data, reduce the occupied resources in the data query process and realize the quick query of the data.

Description

Optimized query method for database

Technical Field

The invention relates to the technical field of data query, in particular to an optimized query method of a database.

Background

The index is a data structure for helping to efficiently acquire data, and can sequence one or more columns of values in a database table, the index can be used for quickly accessing specific information in the database table, the positions of the columns needing to be retrieved are preferentially recorded when a large-data-volume data table is queried, so that all data in the data table is prevented from being scanned, and the speed of querying the data in the database is improved.

In a large project, the amount of data to be processed by a business system is huge, SQL bears huge operating pressure, and the performance of the SQL has huge influence on the performance of the whole business system. Based on the method, an optimized query method of the database is designed and developed, so that the data can be rapidly queried through optimized data processing and optimized query sentences.

Disclosure of Invention

Aiming at the requirements and the defects of the prior art development, the invention provides an optimized query method of a database.

The invention discloses an optimized query method of a database, which adopts the following technical scheme for solving the technical problems:

the optimized query method for the database includes two parts of optimized data processing and optimized query statement;

in the optimized data processing section, the operations performed include: (1.1) deleting duplicate records in the database; (1.2) setting a primary key in the data table;

in the optimized query statement section, the operations performed include: (2.1) arranging the data tables in descending order according to the data quantity, and connecting two adjacent data tables by using inner join; (2.2) performing alias operation on the data tables before connecting the plurality of data tables; (2.3) avoiding use of query-all in Select clauses; (2.4) setting clauses which can be replaced in the data table; and (2.5) establishing an index in the data table.

Further, in the optimized data processing section, the operations performed further include: (1.3) for fields containing only numeric types, numeric types are used and fields containing characters use varchar/nvarchar instead of char/nchar.

Further, when the optimized data processing part is executed (1.1) and the duplicate records in the database are deleted, truncate is used to replace delete, when truncate is used, the rollback section does not store any information which can be replied, and the deleted data cannot be recovered.

Further, in the optimized query statement part, the (2.2) is executed, and after the data table is subjected to the alias operation, the alias of the data table is used, and the alias of the data table is prefixed on each column needing to be queried.

Further, in the optimized query statement part, executing (2.4) a clause which can be replaced in the setting data table, wherein the clause comprises:

replacing the having clause with the where clause: the method comprises the following steps that a change clause is used for searching all records and then filtering a result set, and the change clause is also used for carrying out sequencing operation and total operation on the results;

EXISTS is used for replacing IN, and NOT EXISTS is used for replacing NOT IN;

replace the UNION operator with UNION ALL: the UNION operator will filter out duplicate records after the data table join, and the UNION ALL operation returns simply by merging the results.

Further, in the optimized query statement section, the principle of establishing an index in the data table is performed (2.5) and includes:

(a) setting a maximum column threshold, and establishing an index of a certain column of data when the daily query frequency of the column of data in the data table exceeds the maximum column threshold;

(b) setting a column minimum threshold, and establishing an index of a certain column of data when the modification times of the column of data in the data table is lower than the column minimum threshold;

(c) when the type of a certain row of data is different from the types of the previous row of data and the next row of data in the data table, establishing an index of the row of data;

(d) setting a field threshold value, and establishing an index of a field when the text length of the field is smaller than the field threshold value;

(e) and aiming at the intermediate data tables connected with other data tables, establishing indexes on the fields of the intermediate data tables connected with other data tables.

Further, in the optimized query statement section, the principle of indexing in the data table is performed (2.5), and the principle further includes:

(f) forbidding to use% xx in the established index, and meanwhile, replacing% xx with select t from t where reverse (f) like reverse ('% abc');

(g) in the where clause, use is prohibited! A ═ and < > operator;

(h) the use of is null and is not null in the index column is prohibited to determine whether or not the index column is empty.

Preferably, the number of indexes in one data table does not exceed 6;

aiming at the principles (a), (b) and (d) for establishing the index, when the multi-column data of the data table meet the same principle, establishing the index on the column with the maximum difference with the set threshold value;

aiming at the principle (c) of establishing the index, when the data of multiple columns of the data table meet the same principle, if the type of data of a certain column is different from the types of the data of the upper M columns and the data of the lower N columns, the column corresponding to the maximum M + N is selected, and the index is established on the column.

Compared with the prior art, the optimized query method of the database has the following beneficial effects:

by optimizing the data processing and optimizing the query statement, the query sequence of the data table can be improved, the scanning times of unrelated data are reduced, the resources occupied in the data query process are reduced, and the rapid query of the data is realized.

Detailed Description

In order to make the technical scheme, the technical problems to be solved and the technical effects of the present invention more clearly apparent, the following technical scheme of the present invention is clearly and completely described with reference to the specific embodiments.

The first embodiment is as follows:

the embodiment provides an optimized query method for a database, and the implementation content of the optimized query method comprises two parts, namely optimized data processing and optimized query statement.

In the optimized data processing section, the operations performed include:

and (1.1) deleting the repeated records in the database. When using truncate, the rollback segment no longer stores any information that can be replied to, and the deleted data cannot be recovered.

(1.2) setting a primary key in the data table.

(1.3) for fields containing only numeric types, numeric types are used and fields containing characters use varchar/nvarchar instead of char/nchar.

(II) in the optimization query statement part, the executed operation comprises the following steps:

(2.1) arranging the data tables in descending order of data amount, and connecting two adjacent data tables by using inner join.

And (2.2) performing alias operation on the data tables before connecting the plurality of data tables. And after carrying out alias operation on the data table, using the alias of the data table, and prefixing the alias of the data table on each column needing to be queried.

(2.3) avoid using query-all in Select clauses.

(2.4) setting clauses which can be replaced in the data table, which specifically includes:

EXISTS is used for replacing IN, and NOT EXISTS is used for replacing NOT IN;

(2.5) establishing an index in the data table, wherein the principle of establishing the index comprises the following steps:

(e) aiming at the intermediate data tables connected with other data tables, establishing indexes on fields of the intermediate data tables connected with other data tables;

(g) in the where clause, use is prohibited! A ═ and < > operator;

It should be added that, in order to avoid a large number of indexes in one data table, the number of indexes in one data table is set to be not more than 6. At this time, the process of the present invention,

In summary, by optimizing data processing and query statements, the query sequence of the data table can be improved, the scanning times of unrelated data are reduced, resources occupied in the data query process are reduced, and the data is rapidly queried.

The principles and embodiments of the present invention have been described in detail using specific examples, which are provided only to aid in understanding the core technical content of the present invention. Based on the above embodiments of the present invention, those skilled in the art of data query technology should make any improvements and modifications to the present invention without departing from the principle of the present invention, and therefore, the present invention should fall into the protection scope of the present invention.

Claims

1. The optimized query method of the database is characterized in that the implementation content comprises two parts of optimized data processing and optimized query statements;

2. The method of claim 1, wherein in the optimized data processing portion, the operations further comprise: (1.3) for fields containing only numeric types, numeric types are used and fields containing characters use varchar/nvarchar instead of char/nchar.

3. The method according to claim 1, wherein in the optimized data processing part, when (1.1) the duplicate records in the database are deleted, a truncate is used instead of a delete, when a truncate is used, the rollback segment no longer stores any information that can be replied, and the deleted data cannot be recovered.

4. The optimized query method for database according to claim 1, wherein in the optimized query statement section, performing (2.2), performing alias operation on the data table, using the alias of the data table, and prefixing the alias of the data table to each column to be queried.

5. The optimized query method for database according to claim 1, wherein in the optimized query statement section, executing (2.4) the clauses that can be replaced in the data table are set, and the method comprises:

EXISTS is used for replacing IN, and NOT EXISTS is used for replacing NOT IN;

6. The optimized query method for database according to claim 1, wherein in the optimized query statement section, performing (2.5), the principle of building an index in the data table comprises:

7. The optimized query method for database according to claim 6, wherein in the optimized query statement section, performing (2.5), the principle of building an index in the data table further comprises:

(g) in the where clause, use is prohibited! A ═ and < > operator;

8. The method of claim 7, wherein the number of indexes in a data table is not more than 6;