CN101145158A

CN101145158A - Data base table partition method

Info

Publication number: CN101145158A
Application number: CNA2007101106338A
Authority: CN
Inventors: 任泰云; 马润宏; 陈明
Original assignee: ZTE Corp
Current assignee: ZTE Corp
Priority date: 2007-06-06
Filing date: 2007-06-06
Publication date: 2008-03-19

Abstract

The present invention discloses a partitioned method of a database table, including the following steps: <1> the needed partitioned parameters are predefined, a certain field of the table is designated, and the output value of the field is worked out by calculation; <2> an additional field is added to the table, and the calculated output value of the selected field in <1> is deposited; <3> the table is divided into a plurality of partitions according to the output value in <1>. The output value of the recorded selected field is worked out based on <1>, contrasted with the plurality of partitions and deposited in a corresponding partition. The present invention maintains the core advantage of the prior partitioned mode that all records are well distributed, and fully enhances the autonomy, the flexibility and the adaptability of the partitioning as well.

Description

A kind of database table partitioned method

Technical field

The present invention relates to use the field of software development of database (Database), sybase for example, oracle, mssql or the like as data storage medium.

Background technology

Data query all is a basic function in large-scale relevant database application system, inquires about such as the history alarm in the telecommunication network management system.When data volume is very big, carry out the key element that efficient just becomes data query.Have a lot of methods to can be used to improve search efficiency, the table subregion is exactly one of them.It is set about from the storage organization of data, and the record burst of different characteristics is deposited, and each burst has clear and definite feature, can dwindle the scanning work amount that query manipulation need carry out to a great extent.

Existing data base management system (DBMS) DBMS (as: Sybase, oracle and Microsoft SQLServer) provide following three kinds of typical partition schemes:

1, hash subregion:

1) certain field of option table is in order to calculate the hash value;

2) at this Field Definition hash subregion;

3) the selected field hash logistic by record goes out value, is stored in corresponding subregion;

4) the corresponding different hash output valve of each subregion, the record that all hash values are identical is stored in same subregion;

2, list subregion:

1) certain field of option table is in order to subregion;

2) list by the possible different values of these fields, be divided into some subregions in advance, the corresponding value set of each subregion, set does not overlap mutually;

The selected field and the subregion that 3) will write down compare, the subregion that the value of being stored in equates;

3, Range subregion:

1) certain field of option table is in order to subregion;

2) by the possible different spans of these fields, be divided into some subregions in advance, the corresponding possible interval of each subregion;

The selected field and the subregion that 3) will write down compare, and the value of being stored in falls into interval corresponding subregion;

More than 3 kinds of typical partitioned modes, all good action is arranged to improving efficiency data query.But in the practical application, also all expose significant disadvantages simultaneously:

1), do not need the value of default selected field, so can not produce the problem of two kinds of partitioned modes of present face for the hash subregion.But because the hash algorithm that the deposit position of record is included by DBMS decision, so we can't be by the characteristics of application-specific, specify voluntarily certain bar concrete be recorded in that subregion.

2), need enumerate the selected all possible values of field in the time of owing to the definition subregion, and not all field can be accomplished this point, has therefore limited applicable surface for the list subregion.

3) for the range subregion, if the distribution of the selected field value of record is in a state that constantly changes, preassigned interval in the time of may defining subregion can make the equally distributed effect of record because of this variation has lost over time.

Just because of above shortcoming, make the application of traditional subregion in telecommunicatioin network management software be subjected to considerable restraint.

Summary of the invention

Technical matters solved by the invention is to provide a kind of database table partitioned method, can't be to solve existing different partition method by the characteristics of application-specific, and because definition preassigned interval during subregion can lose problems such as making the equally distributed effect of record because of this variation over time.

In order to address the above problem, the invention provides a kind of database table partitioned method, it is characterized in that, may further comprise the steps:

(1) pre-determine needed partitioned parameters, certain field of named list, and carry out the output valve that calculates this field;

(2), and deposit the selected field of step (1) through the output valve after calculating for table increases a field;

(3) the output valve his-and-hers watches according to step (1) are divided into some subregions, and the selected field of record is calculated output valve according to step (1), compare with described some subregions, are stored in the corresponding subregion.

Method of the present invention, wherein, described database table subregion is the list subregion based on calculated column.

Wherein, carry out described in the step (1) and calculate, comprising:

(111) value with certain field of the specified table of step (1) is converted to the integer type data;

(112) according to the number that pre-determines in the needed partitioned parameters;

(113) data that step (111) conversion is got are carried out: output valve=data % subregion number+1.

Wherein, step (3) comprising:

(311) list the output valve after as calculated of specific field in the step (1), be divided into some subregions, the corresponding output valve set of each subregion, set does not overlap mutually;

(312) the selected field that will write down compares according to step (1) calculating output valve and subregion, the subregion that the value of being stored in equates.

Method of the present invention, wherein, described database table subregion is the range subregion based on calculated column.

Wherein, carry out described in the step (1) and calculate, comprising:

(121) value with certain field of the specified table of step (1) is converted to the integer type data;

(122) according to the interval size that pre-determines in the needed partitioned parameters, the equal and opposite in direction that each is interval;

(123) data that step (121) conversion is got are carried out: the interval size of output valve scope=data % subregion.

Wherein, step (3) comprising:

(321) list the output valve scope after as calculated of specific field in the step (1), be divided into some subregions, the corresponding output valve scope of each subregion;

(322) the selected field that will write down is calculated the output valve scope according to step (1) and subregion compares, and is stored in the subregion of the correspondence that the output valve scope fallen into.

The invention provides the database table partition method after a kind of improve, the core advantage that had both kept existing partitioned mode " all records evenly to be distributed " fully improves independence, dirigibility, the adaptability of subregion again.

Description of drawings

Fig. 1 is the described partition method synoptic diagram based on calculated column of the embodiment of the invention.

Embodiment

The objective of the invention is to introduce a kind of database table partitioned method, both kept the core advantage of existing partitioned mode " all are write down evenly distributes ", fully improve independence, dirigibility, the adaptability of subregion again.Below embodiment is described in detail, but not as a limitation of the invention.

Partition method described in the embodiment of the invention is that the hash thinking is applied on list and the range subregion, and the advantage fusion with them derives a kind of list and range partitioned mode based on calculated column.

In conjunction with the accompanying drawings 1, embodiment 1 is based on the list subregion example of calculated column:

Be the specific implementation method of example explanation List subregion below with the veneer table in the telecommunications optical transport network management system.

1, preliminary work

A) in the veneer table, add a calculated column ID, and select network element to classify the subregion calculated column as;

B) according to actual needs, set the number of subregion.Be assumed to be 100.

2, carry out list subregion for the veneer table based on calculated column

A) round values that will from 1 to 100 is divided into some set on demand, and is integrated into ID according to these and lists definition list subregion;

B) insert the method that writes down:

1) network element of getting the veneer record is numbered, and is assumed to be character string type " 1001 ", and converting thereof into round values is 1001;

2) carry out specific calculating: output valve=1001%100+1=2 to 1001;

3) with the output valve 2 that calculates as the id field value that is inserted into record;

4) carry out to insert, this record will the list subregion of the value of being stored in 2 correspondences in.

C) Cha Xun method

One of common inquiry of telecommunications Optical Transmission Network OTN guard system is " obtaining the veneer that belongs to this network element according to element name ".At this moment need the element name that issues is calculated earlier, then the calculating output valve is added in the where statement, as:

Veneer under the inquiry network element " 1001 ", the condition that constructs is:

where?ID＝2?and?MeName＝‘1001’

Like this, the query analyzer of DBMS will in corresponding subregion scope, improve search efficiency with scanning limit(s) according to this condition of ID=2.

D) method for updating

Similar with querying method, need in being updated the where condition of record, the location add calculated value.As all single board states that upgrade network element " 1001 " are 3, and the condition that constructs is:

where?ID＝2?and?MeName＝’1001’

In conjunction with the accompanying drawings 1, embodiment 2 is based on the range subregion example of calculated column:

For the interval size of output valve scope=data % subregion, the interval size of Range subregion is an integer, is applicable to range (scope) subregion.The size in subregion interval has shown the span that is assigned to same subregion.

For example:, four field A/B/C/D are arranged if any a table.The A field is an integer, now will carry out range (scope) subregion on the A field.Suppose that the interval size of the subregion of determining is 100, definite simultaneously A field value then can be carried out subregion in such a way since 1:

1～100 1 district;

101～200 1 districts;

201～300 1 districts; Or the like.

For the range that describes among the embodiment in this patent (scope) subregion for example, by the HASH calculated column, then be to carry out subregion according to month, the interval size of output valve=data % subregion.For example: since on January 1st, 2007, be assigned to a district annual January, be assigned to a district annual February; As, the data in January in 2008 and the data in January, 2007 are assigned to a district.

Be the range partition method of example explanation based on calculated column with the history alarm table in the telecommunications optical transport network management system more below, concrete steps are as follows:

1, preliminary work

A) in history alarm, add a calculated column ID, and select generation time to classify the subregion calculated column as; The generation time type is that the form of varchar (14) is year (4) month (2) day (2) hour (2) minute (2), such as " 200701021020 ";

B) according to actual needs, set the interval size of subregion.Here hypothesis is want according to month under the generation time all history alarms evenly to be distributed, and then interval size is 100000000, year this position of corresponding character string.

2, carry out range subregion for the veneer table based on calculated column

A) will be by dividing the calculated value interval January to Dec, that is:

1000000～2000000 corresponding subregions 1 (January),

2000000～3000000 corresponding subregions 2 (February),

B) insert the method that writes down:

Get the generation time of history alarm record, be assumed to be " 200701021020 ", converting thereof into round values is 200701021020;

Carry out specific calculating to 200701021020:

Output valve=200701021020%100000000=1021020;

With the output valve 1021020 that calculates as the id field value that is inserted into record;

Carry out to insert, this record will the range subregion of the value of being stored in correspondence in February in.

C) Cha Xun method

One of common inquiry of telecommunications Optical Transmission Network OTN guard system is " history alarm that obtains certain time period ".At this moment need the time period that issues is worth earlier from beginning to end and calculate, then the calculating output valve is added in the where statement, as:

The history alarm record in query time on February 20th, 2007, the time period that issues arrives " 200702210000 " for " 200702200000 " from beginning to end, after these two strings converted round values to and carry out calculating respectively, the output that obtains was 2, so it is added in the where statement:

where?ID>＝2?and?ID<＝2?and......

D) method for updating

Similar with querying method, need in being updated the where condition of record, the location add calculated value.All history alarms as update date " 200702210000 " are expired, and the condition that constructs is:

where?ID＝2?and......

The embodiment of the invention is described to provide a kind of scheme of list and the range subregion based on calculated column, has compared following advantage with existing partitioning technique:

1, through specific calculating, will be listed as (such as the time) can not preset value range, is transformed in the fixing value set, has overcome traditional list and range subregion and has been not easy to shortcoming in these row application.

2, specific calculation method provides sufficient dirigibility, and the subregion number that divide can be set arbitrarily as required, evenly distributes so that all are recorded in each subregion, and controls the scale of each subregion.

Certainly; the present invention also can have other various embodiments; under the situation that does not deviate from spirit of the present invention and essence thereof; those of ordinary skill in the art can make various corresponding changes and distortion according to the present invention, but these corresponding changes and distortion all should belong to the protection domain of the appended claim of the present invention.

Claims

1. a database table partitioned method is characterized in that, may further comprise the steps:

2. the method for claim 1 is characterized in that, described database table subregion is the list subregion based on calculated column.

3. method as claimed in claim 2 is characterized in that, carries out described in the step (1) and calculates, and comprising:

4. method as claimed in claim 3 is characterized in that, step (3) comprising:

5. the method for claim 1 is characterized in that, described database table subregion is the range subregion based on calculated column.

6. method as claimed in claim 5 is characterized in that, carries out described in the step (1) and calculates, and comprising:

7. method as claimed in claim 6 is characterized in that, step (3) comprising: