WO2014101070A1 - Partition expansion method and apparatus - Google Patents
Partition expansion method and apparatus
- Publication number
- WO2014101070A1 (PCT/CN2012/087743)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- partitions
- partition
- data
- physical node
- storage system
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0602—Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
- G06F3/0604—Improving or facilitating administration, e.g. storage management
- G06F3/0607—Improving or facilitating administration, e.g. storage management by facilitating the process of upgrading existing storage systems, e.g. for improving compatibility between host and storage device
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/0223—User address space allocation, e.g. contiguous or non contiguous base addressing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0629—Configuration or reconfiguration of storage systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0629—Configuration or reconfiguration of storage systems
- G06F3/0632—Configuration or reconfiguration of storage systems by initialisation or re-initialisation of storage systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0638—Organizing or formatting or addressing of data
- G06F3/0644—Management of space entities, e.g. partitions, extents, pools
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/067—Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/10—Providing a specific technical effect
- G06F2212/1048—Scalability
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/15—Use in a specific computing environment
- G06F2212/152—Virtualized environment, e.g. logically partitioned system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/15—Use in a specific computing environment
- G06F2212/154—Networked environment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/25—Using a specific main memory architecture
- G06F2212/254—Distributed memory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/26—Using a specific storage system architecture
- G06F2212/261—Storage comprising a plurality of storage devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/26—Using a specific storage system architecture
- G06F2212/263—Network storage, e.g. SAN or NAS
Definitions
- The present invention relates to the field of storage technologies, and in particular to a partition expansion method and apparatus.
- Background
- DHT: Distributed Hash Table.
- The hash space is an end-to-end circular space (a hash ring). The hash ring can be divided into several equal parts, each of which is called a partition, and the partitions are then evenly distributed to the physical nodes. The mappings from data to partitions and from partitions to physical nodes are stored in the DHT.
- Reading and writing data, expanding a distributed cluster, and handling faults all depend on changes to the DHT. The DHT therefore needs to provide uniform load, good reliability, a data layout on the nodes that is conducive to data migration, and good scalability (that is, the system can be expanded multiple times).
- In the prior art, a new physical node is added to the distributed storage system, that is, the number of physical nodes is increased, to expand the system.
- The specific expansion method may include: adding a new physical node to the system and moving partitions from the original physical nodes to the new physical node, thereby expanding the distributed storage system.
- After the expansion, the number of partitions on each physical node is roughly the same.
- The inventor found that at least the following problem exists in the prior art: if the distributed storage system is expanded multiple times, the number of partitions on each physical node becomes smaller and smaller, which limits further expansion (for example, when there are only 4 partitions on each physical node, the system can be expanded to at most 4 times its original capacity). This leads to poor scalability of the distributed storage system.
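The limit described above can be checked with a little arithmetic. The following is an illustrative sketch only: the function names and the assumption that the total number of partitions stays fixed while nodes are added are introduced here for illustration and do not come from the patent text.

```python
def partitions_per_node(total_partitions: int, node_count: int) -> float:
    """Average partitions per node when a fixed set of partitions is spread evenly."""
    if node_count <= 0:
        raise ValueError("node_count must be positive")
    return total_partitions / node_count


def max_expansion_factor(partitions_on_each_node: int) -> int:
    """With k partitions per node, every partition can end up on its own node,
    so the cluster can grow to at most k times its current node count."""
    return partitions_on_each_node


# Hypothetical cluster: 4 nodes with 4 partitions each (16 partitions in total).
total = 4 * 4
for nodes in (4, 8, 16):
    print(nodes, partitions_per_node(total, nodes))
```

Once the average reaches one partition per node, no partition is left to move to a further node, which is the scalability problem the invention addresses.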
- Embodiments of the present invention provide a partition expansion method and apparatus that can increase the number of partitions on the physical nodes when the distributed storage system is expanded, so as to ensure the scalability of the distributed storage system.
- The first aspect of the embodiments of the present invention provides a partition expansion method, including: obtaining an average value of the number of partitions in all physical nodes in the distributed storage system; if the average value of the number of partitions is less than a preset threshold, multiplying the number of partitions in each physical node; and
- adding, to the pre-stored distributed hash table (DHT), the mapping relationship between each newly added partition and the physical node where the newly added partition is located.
- the method further includes:
- the multiplying the number of partitions in each physical node includes:
- the obtaining an average value of the number of partitions in all the physical nodes in the distributed storage system includes:
- An average of the number of partitions in all physical nodes in the distributed storage system is obtained in response to a user's triggering instruction.
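The method of the first aspect can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the DHT is modelled as a plain dictionary from partition id to node name, partition ids are assumed to run from 0 to the old partition count minus 1, and each new partition `p + k * old_count` is assumed to stay on the node that holds partition `p`.

```python
from collections import Counter


def expand_if_needed(dht: dict[int, str], threshold: float, factor: int = 2) -> dict[int, str]:
    """Return a DHT (partition id -> node) with the partition count multiplied
    when the average number of partitions per node is below the threshold."""
    if factor < 2:
        raise ValueError("factor must be at least 2")
    if not dht:
        return {}
    per_node = Counter(dht.values())
    average = len(dht) / len(per_node)
    if average >= threshold:
        return dict(dht)  # average is high enough, nothing to do
    old_count = len(dht)
    expanded = dict(dht)
    for partition, node in dht.items():
        for k in range(1, factor):
            # Assumed id scheme: new partition p + k * old_count stays on p's node.
            expanded[partition + k * old_count] = node
    return expanded
```

For example, `expand_if_needed({0: "node-a", 1: "node-b"}, threshold=2)` doubles the table to four partitions, two on each node, while a threshold of 1 leaves it unchanged.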
- a second aspect of the embodiment of the present invention further provides a partition expansion apparatus, including: a first acquiring unit, configured to acquire an average value of the number of partitions in all physical nodes in the distributed storage system;
- a first multiplication unit configured to multiply the number of partitions in each physical node if an average value of the number of partitions is less than a preset threshold
- a mapping unit configured to add, in the pre-stored distributed hash table DHT, a mapping relationship between the newly added partition and the physical node where the newly added partition is located.
- The partition expansion apparatus further includes:
- a second acquiring unit, configured to acquire a hash value of the data in each of the physical nodes after the first multiplication unit multiplies the number of partitions in the physical nodes;
- a third obtaining unit, configured to obtain a modulus value, namely the remainder of the hash value of the data modulo the number of partitions obtained after expanding the physical node where the data is located;
- a migration unit, configured to determine a destination partition for data migration according to the modulus value of the data and, if the destination partition is different from the partition where the data is currently located, migrate the data to the destination partition.
- The first multiplication unit is further configured to multiply the number of partitions in each physical node to at least 2 times the original number.
- The partition expansion apparatus further includes: a fourth obtaining unit, configured to acquire, after the mapping unit adds the mapping relationship between the newly added partition and the physical node where the newly added partition is located to the pre-stored distributed hash table (DHT), the average value of the number of expanded partitions in all physical nodes in the distributed storage system;
- a second multiplication unit, configured to multiply the number of partitions in each physical node again if the average value of the number of expanded partitions is less than the preset threshold, where the multiple is the same for every multiplication.
- the first acquiring unit is further configured to periodically acquire an average value of the number of partitions in all physical nodes in the distributed storage system.
- An average of the number of partitions in all physical nodes in the distributed storage system is obtained in response to a user's triggering instruction.
- A third aspect of the embodiments of the present invention further provides a partition expansion apparatus, including: a processor, configured to obtain an average value of the number of partitions in all physical nodes in the distributed storage system; if the average value of the number of partitions is less than a preset threshold, multiply the number of partitions in each physical node; and add the mapping relationship between the newly added partition and the physical node where the newly added partition is located to the pre-stored distributed hash table (DHT);
- a memory, configured to store the mapping relationship, added by the processor, between the newly added partition and the physical node where the newly added partition is located.
- The processor is further configured to acquire a hash value of the data in each of the physical nodes; obtain a modulus value, namely the remainder of the hash value of the data modulo the number of partitions obtained after expanding the physical node where the data is located; and determine a destination partition for data migration according to the modulus value of the data, where, if the destination partition is different from the partition where the data is currently located, the data is migrated to the destination partition.
- The processor is further configured to multiply the number of partitions in each physical node to at least twice the original number.
- The processor is further configured to: after adding the mapping relationship between the newly added partition and the physical node where it is located to the DHT, obtain the average value of the number of expanded partitions in all physical nodes in the distributed storage system; and, if that average value is less than the preset threshold, multiply the number of partitions in each physical node again, where the multiple is the same for every multiplication.
- The processor is further configured to periodically acquire an average value of the number of partitions in all physical nodes in the distributed storage system; or,
- An average of the number of partitions in all physical nodes in the distributed storage system is obtained in response to a user's triggering instruction.
- The partition expansion method and apparatus provided by the embodiments of the present invention obtain an average value of the number of partitions in all physical nodes in the distributed storage system; if the average value is less than a preset threshold, multiply the number of partitions in each physical node; and add the mapping relationship between each newly added partition and the physical node where it is located to the pre-stored distributed hash table (DHT). In the prior art, the scalability of the distributed storage system is poor because the number of partitions in each physical node is small. By contrast, when the distributed storage system is expanded here, the number of partitions is multiplied if it is less than the preset value, so the scalability of the distributed storage system can be guaranteed.
- FIG. 1 is a flowchart of a partition expansion method according to Embodiment 1 of the present invention;
- FIG. 2 is a flowchart of a partition expansion method according to Embodiment 2 of the present invention;
- FIG. 3 is a schematic structural diagram of a partition expansion apparatus according to Embodiment 3 of the present invention;
- FIG. 4 is a schematic structural diagram of another partition expansion apparatus according to Embodiment 3 of the present invention;
- FIG. 5 is a schematic structural diagram of a partition expansion apparatus according to Embodiment 4 of the present invention.
- the embodiment of the present invention provides a partition expansion method, as shown in FIG. 1 , including:
- the partition expansion device obtains an average of the number of partitions in all physical nodes in the distributed storage system.
- the hash value Hash of the data corresponding to the data identifier can be calculated by using a hash function according to the data identifier of the different data. According to the hash value, the data is mapped to a hash space.
- the hash space is an end-to-end circular space.
- The hash ring can be divided into several equal parts, each of which is called a partition, and the partitions are then evenly distributed to the physical nodes.
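The mapping from a data identifier to the hash ring, and from the ring to evenly distributed partitions, might look like the following sketch. The ring size of 2^32, the use of SHA-1, the arc-based partition lookup, and the round-robin assignment are all assumptions made for illustration; the patent does not name a hash function, and its later worked example uses the remainder of the hash value rather than arcs of the ring.

```python
import hashlib

RING_SIZE = 2 ** 32  # assumed size of the circular hash space


def ring_position(data_id: str) -> int:
    """Map a data identifier to a position on the ring (SHA-1 is an arbitrary choice)."""
    digest = hashlib.sha1(data_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % RING_SIZE


def partition_of(position: int, partition_count: int) -> int:
    """Index of the equal-sized arc of the ring that contains the position."""
    return position * partition_count // RING_SIZE


def assign_partitions(partition_count: int, nodes: list[str]) -> dict[int, str]:
    """Spread partitions over nodes round-robin so each node gets about the same number."""
    return {p: nodes[p % len(nodes)] for p in range(partition_count)}
```

With 4 partitions and two nodes, `assign_partitions(4, ["n1", "n2"])` places partitions 0 and 2 on `n1` and partitions 1 and 3 on `n2`.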
- The physical node may be a physical address in the distributed storage system: the distributed storage system may divide the physical address space into multiple address segments, and each address segment is a physical node.
- The partition expansion device may obtain the average value of the number of partitions in all physical nodes in the distributed storage system in either of two ways: the partition expansion device periodically acquires the average value; or the partition expansion device obtains the average value in response to a user's trigger instruction.
- Specifically, the partition expansion device may preset an acquisition period and acquire the average value of the number of partitions in all physical nodes in the distributed storage system according to that period; or, when the distributed system is being expanded, the partition expansion device receives the user's trigger instruction and, in response to it, obtains the average value of the number of partitions in all physical nodes in the distributed storage system.
- If the average value of the number of partitions is less than a preset threshold, the partition expansion device multiplies the number of partitions in each physical node.
- the preset threshold may be an average value of the number of partitions in all physical nodes in the distributed storage system when the distributed storage system is expanded at least once.
- The method of the embodiment of the present invention further includes, after the partition expansion apparatus multiplies the number of partitions in each physical node: the partition expansion apparatus acquires a hash value of the data in each partition in each physical node; the partition expansion apparatus obtains a modulus value, namely the remainder of the hash value of the data modulo the number of partitions obtained after expanding the physical node where the data is located; the partition expansion apparatus determines the destination partition for data migration according to the modulus value of the data and, if the destination partition is different from the partition where the data is currently located, migrates the data to the destination partition.
- the partition expansion device adds a mapping relationship between the newly added partition and the physical node where the newly added partition is located in the pre-stored DHT.
- The DHT can be used to determine where data is stored in the distributed storage system. That is, the DHT can store the mapping relationship between data identifiers and partitions, and the mapping relationship between partitions and physical nodes. Therefore, after the partition expansion device multiplies the number of partitions in each physical node, the mapping relationship between each newly added partition and the physical node where it is located needs to be added to the pre-stored DHT.
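The two mappings held by the DHT can be pictured as two lookup tables. The class below is a toy model for illustration only; the class name `Dht`, its field names, and its methods are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class Dht:
    """Toy DHT holding the two mappings the text describes."""
    data_to_partition: dict[str, int] = field(default_factory=dict)
    partition_to_node: dict[int, str] = field(default_factory=dict)

    def locate(self, data_id: str) -> str:
        """Resolve data identifier -> partition -> physical node."""
        return self.partition_to_node[self.data_to_partition[data_id]]

    def add_partition(self, partition: int, node: str) -> None:
        """Record the mapping for a newly added partition."""
        if partition in self.partition_to_node:
            raise ValueError(f"partition {partition} is already mapped")
        self.partition_to_node[partition] = node
```

A lookup only succeeds for partitions that have an entry, which is why the mapping for each newly added partition has to be written to the table before data is placed in it.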
- The partition expansion method provided by the embodiment of the present invention obtains an average value of the number of partitions in all physical nodes in the distributed storage system; if the average value is less than a preset threshold, multiplies the number of partitions in each physical node; and adds the mapping relationship between each newly added partition and the physical node where it is located to the pre-stored distributed hash table (DHT).
- In the prior art, the scalability of the distributed storage system is poor because the number of partitions in each physical node is small. By contrast, when the distributed storage system is expanded here, the number of partitions is multiplied if it is less than the preset value, so the scalability of the distributed storage system can be guaranteed.
- The embodiment of the present invention provides a partition expansion method. As shown in FIG. 2, the method includes:
- 201. The partition expansion apparatus obtains an average value of the number of partitions in all physical nodes in the distributed storage system.
- The method by which the partition expansion device obtains the average value of the number of partitions in all physical nodes in the distributed storage system may specifically be: the partition expansion device queries, in the pre-stored DHT, the number of physical nodes in the distributed storage system and the total number of partitions in all physical nodes; the partition expansion device then calculates the average value of the number of partitions in the physical nodes from the number of physical nodes and the total number of partitions in all physical nodes.
- the number of physical nodes is the total number of physical nodes in the distributed storage system.
- The method by which the partition expansion device calculates the average value of the number of partitions of the physical nodes from the number of physical nodes and the total number of partitions of all physical nodes may specifically be: the partition expansion device divides the total number of partitions of all physical nodes by the number of physical nodes, and the quotient is the average value of the number of partitions in the physical nodes.
- For example, if the number of physical nodes is A and the total number of partitions of all physical nodes is B, then the average number of partitions of the physical nodes is $C = B / A$.
- In a distributed storage system that uses the DHT for data storage, the number of partitions on each physical node is substantially the same. Therefore, the partition expansion device may also query the pre-stored DHT for the number of partitions in any one physical node in the distributed storage system, and take that number as the average value of the number of partitions in all physical nodes in the distributed storage system.
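Both ways of obtaining the average can be sketched against a partition-to-node table. This is an illustration under assumptions: the DHT is modelled as a dictionary from partition id to node name, the function names are hypothetical, and nodes that hold no partition at all are not visible in such a table.

```python
from collections import Counter


def average_partitions(partition_to_node: dict[int, str]) -> float:
    """C = B / A: total partitions B divided by the number of physical nodes A."""
    node_count = len(set(partition_to_node.values()))   # A
    total_partitions = len(partition_to_node)           # B
    return total_partitions / node_count                # C


def approximate_average(partition_to_node: dict[int, str]) -> int:
    """Shortcut from the text: use the partition count of any one node."""
    counts = Counter(partition_to_node.values())
    any_node = next(iter(counts))
    return counts[any_node]
```

The shortcut avoids counting every node, and it matches the exact average only while the partitions are spread evenly, which is the situation the text assumes.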
- The partition expansion device determines whether the average value of the number of partitions is lower than a preset threshold. If the average value is lower than the preset threshold, step 203 is performed; if it is not lower than the preset threshold, step 211 is performed.
- The partition expansion device multiplies the number of partitions in each physical node. Specifically, multiplying the number of partitions in each physical node may include: the partition expansion device expands the number of partitions in each physical node to at least twice the original number.
- the partition expansion apparatus may modify the number of partitions in each physical node in the pre-stored DHT to at least 2 times the number of partitions in each physical node.
- For example, the multiple by which the partition expansion device multiplies the number of partitions in each physical node may be 2.
- The partition expansion device can also expand the number of partitions in each physical node by a factor of three or more.
- This embodiment takes doubling the number of partitions in each physical node as an example, but does not limit the multiple by which the partition expansion device multiplies the number of partitions in each physical node.
- the partition expansion apparatus acquires a hash value of data in each partition in each physical node.
- the partition expansion apparatus may query the hash value of the data in each of the physical nodes in the pre-stored DHT according to the identifier of the data in each partition.
- the hash values of different data in one partition are different.
- For example, partition A contains data 1, data 2, and data 3:
- data 1 has the data identifier a and the hash value Hash1;
- data 2 has the data identifier b and the hash value Hash2;
- data 3 has the data identifier c and the hash value Hash3.
- The partition expansion device obtains a modulus value, namely the remainder of the hash value of the data modulo the number of partitions obtained after expanding the physical node where the data is located.
- The number of partitions obtained after expanding the physical node where the data is located is the product of the number of partitions in that physical node before the expansion and the multiple by which the partition expansion device expands the number of partitions.
- For example, if the number of partitions before expansion in the physical node where data K is located is N, and the partition expansion device multiplies the number of partitions in each physical node by n, then the number of partitions obtained after expanding that physical node is $n \times N$.
- the partition expansion device determines a destination partition of the data migration according to the modulus value of the data, and if the destination partition is different from the current partition of the data, the data is migrated to the destination partition.
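The migration step can be sketched as a re-placement of every stored hash value by its new modulus value. The sketch makes several assumptions that the patent does not state: partition ids equal the remainder of the hash value, ids run from 0 to the old partition count minus 1, and a partition is modelled as a plain list of hash values. The function name `migrate` is hypothetical.

```python
def migrate(partitions: dict[int, list[int]], factor: int) -> tuple[dict[int, list[int]], int]:
    """Re-place every hash value after the partition count is multiplied by `factor`.

    `partitions` maps a partition id (assumed to equal hash % old_count) to the
    hash values stored in it. Returns the new layout and the number of moved items.
    """
    new_count = len(partitions) * factor
    result: dict[int, list[int]] = {p: [] for p in range(new_count)}
    moved = 0
    for current, hashes in partitions.items():
        for h in hashes:
            destination = h % new_count      # the modulus value decides the destination
            if destination != current:
                moved += 1                   # only these items actually migrate
            result[destination].append(h)
    return result, moved
```

For instance, doubling `{0: [4, 6], 1: [5, 7]}` keeps 4 and 5 where they are and moves 6 and 7 to the new partitions 2 and 3.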
- When data is stored, the hash value Hash of each piece of data to be stored may be calculated with a hash function; the data is then allocated to the partitions according to its hash value and the total number of partitions in the distributed storage system, and the partitions are allocated relatively uniformly to the physical nodes.
- For example, suppose the hash values of some data are 45, 46, 47, ..., 75 and the number of partitions N is 3.
- Taking the remainder of each hash value modulo the number of partitions gives the following distribution:
- Partition 1 (the remainder is 0): 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75.
- Partition 2 (the remainder is 1): 46, 49, 52, 55, 58, 61, 64, 67, 70,
- Partition 3 (the remainder is 2): 47, 50, 53, 56, 59, 62, 65, 68, 71,
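The distribution listed above can be reproduced with a few lines of code. The sketch assumes that the hash values run consecutively from 45 to 75 and that N is 3, as the remainders in the list suggest.

```python
from collections import defaultdict

N = 3  # number of partitions in the example
groups: dict[int, list[int]] = defaultdict(list)
for h in range(45, 76):          # assumed hash values 45 .. 75
    groups[h % N].append(h)

for remainder in range(N):
    # The text numbers partitions from 1, so remainder r is "Partition r + 1".
    print(f"Partition {remainder + 1} (remainder {remainder}): {groups[remainder]}")
```

Because the hash values are consecutive, the three partitions receive 11, 10, and 10 values respectively, which illustrates the roughly uniform load.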
- The data migration is described by taking partition 1 above as an example, where the number of partitions N is 3.
- After the number of partitions is doubled, the partition expansion device can migrate the data whose modulus value is 3 to the newly added partition of the physical node where the data is located, while the data whose modulus value is 0 is kept in the original partition.
- When the number of partitions is tripled, the partition expansion device can migrate the data according to its modulus values 0, 3, and 6. The method by which the partition expansion device migrates data to the destination partition in this case is similar to the method used when the number of partitions is doubled, and is not described in detail in this embodiment.
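The modulus values mentioned for partition 1 can be verified directly. The sketch below assumes that partition 1 holds the hash values 45, 48, ..., 75 (remainder 0 for N = 3) and that the expanded partition counts are 2 × 3 = 6 and 3 × 3 = 9; the function name `split_partition` is hypothetical.

```python
def split_partition(hashes: list[int], new_count: int) -> dict[int, list[int]]:
    """Group hash values by their remainder modulo the expanded partition count."""
    result: dict[int, list[int]] = {}
    for h in hashes:
        result.setdefault(h % new_count, []).append(h)
    return result


partition_1 = list(range(45, 76, 3))           # remainder 0 when N = 3

doubled = split_partition(partition_1, 2 * 3)  # moduli 0 and 3
tripled = split_partition(partition_1, 3 * 3)  # moduli 0, 3 and 6
print(sorted(doubled), sorted(tripled))        # [0, 3] [0, 3, 6]
```

Only multiples of the old partition count appear as modulus values, so the data of one original partition is split among that partition and the new partitions derived from it, and data from different original partitions never mixes.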
- the partition expansion device adds a mapping relationship between the newly added partition and the physical node where the newly added partition is located in the pre-stored DHT.
- the partition expansion device obtains an average value of the number of extended partitions among all physical nodes in the distributed storage system.
- the partition expansion device multiplies the number of partitions on all the physical nodes in the distributed storage system, and after completing the migration of the corresponding data in the partition, the average number of the expanded partitions in all the physical nodes in the distributed storage system can be obtained.
- The method by which the partition expansion apparatus obtains the average value of the number of expanded partitions in all physical nodes in the distributed storage system is similar to the method, described earlier in this embodiment, for obtaining the average value of the number of partitions in all physical nodes, and is not described again here.
- The partition expansion apparatus determines whether the average value of the number of expanded partitions is less than the preset threshold; if it is less than the preset threshold, step 210 is performed; if it is not less than the preset threshold, step 211 is performed.
- The partition expansion device multiplies the number of partitions in each physical node again, where the multiple is the same for every multiplication.
- The partition expansion device may multiply the number of partitions in each physical node again.
- The multiple used when the partition expansion device multiplies the number of partitions in each physical node again is the same as the multiple used when it first multiplied the number of partitions.
- The method of this embodiment may further include: the partition expansion device adds the mapping relationship between each newly added partition and the physical node where it is located to the pre-stored DHT.
- the partition expansion apparatus may multiply the number of partitions in the physical node at least once, the multiple of each multiplication is the same, and the multiple of the multiplication is greater than or equal to 2.
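The repeated multiplication with a constant multiple can be sketched as a simple loop. The function name and the idea of tracking only the per-node average (rather than a full DHT) are simplifications introduced here for illustration.

```python
def multiply_until_enough(average: float, threshold: float, factor: int = 2) -> tuple[float, int]:
    """Multiply the per-node partition average by the same factor until it reaches the threshold.

    Returns the final average and the number of multiplications performed.
    """
    if factor < 2:
        raise ValueError("the multiple must be greater than or equal to 2")
    if average <= 0:
        raise ValueError("average must be positive")
    rounds = 0
    while average < threshold:
        average *= factor   # the same multiple is used in every round
        rounds += 1
    return average, rounds
```

Starting from an average of 4 partitions per node with a threshold of 30, doubling three times (4, 8, 16, 32) is enough; an average that already meets the threshold is left unchanged.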
- the partition expansion apparatus may extend the partition of the physical node in the distributed storage system at least once according to the average of the number of partitions of the physical nodes in the distributed storage system.
- the embodiment of the present invention does not limit the number of times of the partition expansion.
- The method by which the partition expansion apparatus performs the second or any subsequent expansion of the partitions of the physical nodes in the distributed storage system is similar to the partition expansion method in this embodiment, and is not described again here.
- The multiplication method provided in this embodiment can be applied not only to the partition expansion process of a distributed storage system, but also to the multiplication of storage nodes in other storage systems; this embodiment does not limit the scope of application of the method of the present invention.
- The partition expansion method provided by the embodiment of the present invention obtains an average value of the number of partitions in all physical nodes in the distributed storage system; if the average value is less than a preset threshold, multiplies the number of partitions in each physical node; and adds the mapping relationship between each newly added partition and the physical node where it is located to the pre-stored distributed hash table (DHT).
- In the prior art, the scalability of the distributed storage system is poor because the number of partitions in each physical node is small. By contrast, when the distributed storage system is expanded here, the number of partitions is multiplied if it is less than the preset value, so the scalability of the distributed storage system can be guaranteed.
- The embodiment of the present invention provides a partition expansion apparatus. As shown in FIG. 3, the apparatus includes: a first obtaining unit 31, a first multiplication unit 32, and a mapping unit 33.
- The first obtaining unit 31 is configured to acquire an average value of the number of partitions in all physical nodes in the distributed storage system.
- The first multiplication unit 32 is configured to multiply the number of partitions in each physical node if the average value of the number of partitions is less than a preset threshold.
- The mapping unit 33 is configured to add, to the pre-stored distributed hash table (DHT), a mapping relationship between each newly added partition and the physical node where the newly added partition is located.
- The partition expansion apparatus may further include: a second obtaining unit 34, a third obtaining unit 35, and a migration unit 36.
- The second obtaining unit 34 is configured to obtain, after the first multiplication unit 32 multiplies the number of partitions in the physical nodes, a hash value of the data in each of the physical nodes.
- The third obtaining unit 35 is configured to obtain a modulus value by taking the hash value of the data modulo the number of partitions that the physical node where the data is located has after expansion.
- The migration unit 36 is configured to determine, according to the modulus value of the data, a destination partition for migrating the data, and, if the destination partition is different from the partition where the data is currently located, migrate the data to the destination partition.
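The migration rule performed by units 34 to 36 can be illustrated with a short Python sketch. The names `stable_hash` and `migrate_after_expansion`, the use of SHA-256 as the hash function, and the modeling of a node's partitions as a list of key lists are all assumptions made for illustration; the source does not specify a hash function or a data layout.

```python
import hashlib


def stable_hash(key: str) -> int:
    """Deterministic hash of a key (SHA-256 is an assumed choice, not from the source)."""
    return int.from_bytes(hashlib.sha256(key.encode("utf-8")).digest()[:8], "big")


def migrate_after_expansion(partitions, new_count):
    """Redistribute the keys of one physical node after its partitions are multiplied.

    partitions: list of key lists; the list index is the current partition number.
    new_count: number of partitions the node has after expansion.
    Returns (expanded partitions, number of keys that changed partition).
    """
    if new_count < len(partitions):
        raise ValueError("new_count must not be smaller than the current partition count")

    expanded = [[] for _ in range(new_count)]
    moved = 0
    for current, keys in enumerate(partitions):
        for key in keys:
            # Destination partition = hash value modulo the expanded partition count.
            destination = stable_hash(key) % new_count
            if destination != current:
                moved += 1  # only data whose destination differs is migrated
            expanded[destination].append(key)
    return expanded, moved
```

If the data was originally placed by the same hash modulo the old partition count (an assumption, not stated in the source), then doubling from n to 2n partitions leaves each key either in its current partition i or moves it to partition i + n, so a key never moves between two pre-existing partitions.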
- The first multiplication unit 32 is further configured to multiply the number of partitions in each of the physical nodes to at least twice the original number of partitions.
- The partition expansion apparatus may further include: a fourth obtaining unit 37 and a second multiplication unit 38.
- The fourth obtaining unit 37 is configured to acquire, after the mapping unit 33 adds the mapping relationship between the newly added partition and the physical node where the newly added partition is located to the pre-stored distributed hash table (DHT), the average value of the number of expanded partitions in all physical nodes in the distributed storage system.
- The second multiplication unit 38 is configured to multiply the number of partitions in each physical node again if the average value of the number of expanded partitions is smaller than the preset threshold, wherein the multiple of each multiplication is the same.
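The repeated check performed by units 37 and 38 amounts to a loop that keeps multiplying by the same multiple until the average reaches the threshold. The Python sketch below is a hedged illustration of that loop only; the function name `multiply_until_threshold` and the per-node count dictionary are hypothetical, and DHT updates and data migration are deliberately left out.

```python
def multiply_until_threshold(counts, threshold, multiple=2):
    """Repeat the multiplication with a fixed multiple until the average reaches threshold.

    counts: dict mapping physical node name -> number of partitions (hypothetical layout).
    Returns (new counts, number of multiplication rounds performed).
    """
    if multiple < 2:
        raise ValueError("multiple must be at least 2")
    counts = dict(counts)  # work on a copy so the caller's data is untouched
    if not counts:
        return counts, 0
    if sum(counts.values()) == 0 and threshold > 0:
        raise ValueError("cannot reach the threshold when no node has any partition")

    rounds = 0
    # Re-check the average after every round, as units 37 and 38 do.
    while sum(counts.values()) / len(counts) < threshold:
        for node in counts:
            counts[node] *= multiple  # the same multiple is used in every round
        rounds += 1
    return counts, rounds
```

With counts of 3 and 5 partitions, a threshold of 20, and a multiple of 2, the average goes 4, 8, 16, 32, so three rounds are needed and the nodes end with 24 and 40 partitions.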
- The first obtaining unit 31 is further configured to periodically acquire the average value of the number of partitions in all the physical nodes in the distributed storage system, or to acquire that average value in response to a trigger instruction of the user.
- The partition expansion apparatus provided by the embodiment of the present invention can obtain an average value of the number of partitions in all physical nodes in the distributed storage system; if the average value is less than a preset threshold, multiply the number of partitions in each physical node; and add, to the pre-stored distributed hash table (DHT), a mapping relationship between each newly added partition and the physical node where it is located. In the prior art, the scalability of the distributed storage system is poor because the number of partitions in the physical nodes is small. By contrast, when the distributed storage system is expanded, if the number of partitions is less than the preset value, the number of partitions is multiplied, so the scalability of the distributed storage system can be guaranteed.
- The embodiment of the present invention provides a partition expansion apparatus which, as shown in FIG. 5, may include: a processor 41 and a memory 42.
- The processor 41 is configured to: obtain an average value of the number of partitions in all physical nodes in the distributed storage system; if the average value of the number of partitions is less than a preset threshold, multiply the number of partitions in each physical node; and add, to the distributed hash table (DHT), a mapping relationship between each newly added partition and the physical node where the newly added partition is located. The memory 42 is configured to store the mapping relationship, added by the processor 41, between the newly added partition and the physical node where the newly added partition is located.
- The processor 41 is further configured to: acquire a hash value of the data in each partition of each physical node; obtain a modulus value by taking the hash value of the data modulo the number of partitions that the physical node where the data is located has after expansion; determine a destination partition for migrating the data according to the modulus value of the data; and, if the destination partition is different from the partition where the data is currently located, migrate the data to the destination partition.
- The processor 41 is further configured to multiply the number of partitions in each physical node to at least twice the original number of partitions. Further, the processor 41 is configured to: after adding, to the DHT, the mapping relationship between the newly added partition and the physical node where the newly added partition is located, acquire the average value of the number of expanded partitions in all physical nodes in the distributed storage system; and, if the average value of the number of expanded partitions is less than the preset threshold, multiply the number of partitions in each physical node again, wherein the multiple of each multiplication is the same.
- The processor 41 is further configured to periodically acquire the average value of the number of partitions in all physical nodes in the distributed storage system, or to acquire that average value in response to a trigger instruction of the user.
- The partition expansion apparatus provided by the embodiment of the present invention can obtain an average value of the number of partitions in all physical nodes in the distributed storage system; if the average value is less than a preset threshold, multiply the number of partitions in each physical node; and add, to the pre-stored distributed hash table (DHT), a mapping relationship between each newly added partition and the physical node where it is located. In the prior art, the scalability of the distributed storage system is poor because the number of partitions in the physical nodes is small. By contrast, when the distributed storage system is expanded, if the number of partitions is less than the preset value, the number of partitions is multiplied, so the scalability of the distributed storage system can be guaranteed.
- The present invention can be implemented by software plus the necessary general-purpose hardware, or of course by hardware alone, but in many cases the former is the better implementation.
- The technical solution of the present invention, in essence or in the part that contributes to the prior art, may be embodied in the form of a software product stored in a readable storage medium, such as a floppy disk, hard disk, or optical disk of a computer, and including instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform the methods described in the various embodiments of the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Priority Applications (10)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA2894623A CA2894623C (en) | 2012-12-27 | 2012-12-27 | Partition extension method and apparatus |
CN201280002874.XA CN103229151B (zh) | 2012-12-27 | 2012-12-27 | 一种分区扩展方法及装置 |
AU2012398262A AU2012398262C1 (en) | 2012-12-27 | 2012-12-27 | Partition extension method and apparatus |
PCT/CN2012/087743 WO2014101070A1 (zh) | 2012-12-27 | 2012-12-27 | 一种分区扩展方法及装置 |
KR1020157017725A KR101677946B1 (ko) | 2012-12-27 | 2012-12-27 | 파티션 확장 방법 및 장치 |
EP12890935.5A EP2921964B8 (en) | 2012-12-27 | 2012-12-27 | Partition expansion method and device |
JP2015549922A JP6020867B2 (ja) | 2012-12-27 | 2012-12-27 | パーティション拡張方法および装置 |
EP17195364.9A EP3376395A1 (en) | 2012-12-27 | 2012-12-27 | Partition extension method and apparatus |
ES12890935.5T ES2658188T3 (es) | 2012-12-27 | 2012-12-27 | Método y aparato de extensión de particiones |
US14/752,068 US9665284B2 (en) | 2012-12-27 | 2015-06-26 | Partition extension method and apparatus |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2012/087743 WO2014101070A1 (zh) | 2012-12-27 | 2012-12-27 | 一种分区扩展方法及装置 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/752,068 Continuation US9665284B2 (en) | 2012-12-27 | 2015-06-26 | Partition extension method and apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014101070A1 true WO2014101070A1 (zh) | 2014-07-03 |
Family
ID=48838330
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2012/087743 WO2014101070A1 (zh) | 2012-12-27 | 2012-12-27 | 一种分区扩展方法及装置 |
Country Status (9)
Country | Link |
---|---|
US (1) | US9665284B2 (zh) |
EP (2) | EP3376395A1 (zh) |
JP (1) | JP6020867B2 (zh) |
KR (1) | KR101677946B1 (zh) |
CN (1) | CN103229151B (zh) |
AU (1) | AU2012398262C1 (zh) |
CA (1) | CA2894623C (zh) |
ES (1) | ES2658188T3 (zh) |
WO (1) | WO2014101070A1 (zh) |
Families Citing this family (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104462124B (zh) * | 2013-09-22 | 2018-04-06 | 中国电信股份有限公司 | 基于线性哈希表的数据存储平台组织方法和数据存储平台 |
CN103634401B (zh) * | 2013-12-03 | 2015-11-25 | 北京京东尚科信息技术有限公司 | 一种存储数据副本的方法和终端装置以及服务器装置 |
CN103810244B (zh) * | 2013-12-09 | 2017-10-27 | 北京理工大学 | 一种基于数据分布的分布式数据存储系统的扩容方法 |
CN103929500A (zh) * | 2014-05-06 | 2014-07-16 | 刘跃 | 一种分布式存储系统的数据分片方法 |
EP3128716B1 (en) * | 2014-11-05 | 2019-09-04 | Huawei Technologies Co., Ltd. | Data processing method and apparatus |
CN106843745A (zh) * | 2015-12-03 | 2017-06-13 | 南京中兴新软件有限责任公司 | 容量扩展方法及装置 |
CN105630419B (zh) * | 2015-12-25 | 2018-09-21 | 华为技术有限公司 | 一种资源池的分区视图发送方法及管理节点 |
US10157193B2 (en) | 2016-03-03 | 2018-12-18 | International Business Machines Corporation | Switching between a non-partitioned hash join and a partitioned hash join based on an amount of available memory |
CN107943412B (zh) * | 2016-10-12 | 2021-10-22 | 阿里巴巴集团控股有限公司 | 一种分区分裂、删除分区中数据文件的方法、装置及系统 |
US10430706B2 (en) * | 2016-12-01 | 2019-10-01 | Via Alliance Semiconductor Co., Ltd. | Processor with memory array operable as either last level cache slice or neural network unit memory |
CN108804018A (zh) * | 2017-04-26 | 2018-11-13 | 华为技术有限公司 | 一种数据分区的方法及装置 |
CN108932256A (zh) * | 2017-05-25 | 2018-12-04 | 中兴通讯股份有限公司 | 分布式数据重分布控制方法、装置及数据管理服务器 |
CN107329692B (zh) * | 2017-06-07 | 2020-02-28 | 杭州宏杉科技股份有限公司 | 一种数据重删的方法及存储设备 |
CN109995813B (zh) * | 2017-12-29 | 2021-02-26 | 华为技术有限公司 | 一种分区扩展方法、数据存储方法及装置 |
KR102093145B1 (ko) * | 2018-06-07 | 2020-03-25 | 한밭대학교 산학협력단 | 생체정보 인식 기반의 데이터 최적화를 위한 오브젝트 스토리지 클라우드 시스템 |
WO2020010502A1 (zh) * | 2018-07-10 | 2020-01-16 | 深圳花儿数据技术有限公司 | 一种基于一致性哈希算法的分布式数据冗余存储方法 |
CN111694693A (zh) * | 2019-03-12 | 2020-09-22 | 上海晶赞融宣科技有限公司 | 数据流存储方法、装置以及计算机存储介质 |
CN110083312B (zh) * | 2019-04-28 | 2021-01-15 | 联想(北京)有限公司 | 磁盘扩容方法、装置及计算机设备 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101374087A (zh) * | 2007-08-20 | 2009-02-25 | 华为技术有限公司 | 一种移动自组网络、节点及其区域划分方法 |
CN102043726A (zh) * | 2010-12-29 | 2011-05-04 | 北京播思软件技术有限公司 | 一种大规模时序数据的存储管理方法 |
CN102591970A (zh) * | 2011-12-31 | 2012-07-18 | 北京奇虎科技有限公司 | 一种分布式键-值查询方法和查询引擎系统 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2167790A1 (en) * | 1995-01-23 | 1996-07-24 | Donald S. Maier | Relational database system and method with high data availability during table data restructuring |
US7469253B2 (en) * | 2003-04-01 | 2008-12-23 | Microsoft Corporation | Associative hash partitioning using pseudo-random number generator |
JP2009295127A (ja) * | 2008-06-09 | 2009-12-17 | Nippon Telegr & Teleph Corp <Ntt> | アクセス方法、アクセス装置及び分散データ管理システム |
US8074049B2 (en) * | 2008-08-26 | 2011-12-06 | Nine Technology, Llc | Online backup system with global two staged deduplication without using an indexing database |
US9996572B2 (en) * | 2008-10-24 | 2018-06-12 | Microsoft Technology Licensing, Llc | Partition management in a partitioned, scalable, and available structured storage |
US8078825B2 (en) * | 2009-03-11 | 2011-12-13 | Oracle America, Inc. | Composite hash and list partitioning of database tables |
EP2485150A1 (en) | 2009-05-18 | 2012-08-08 | Shuhei Nishiyama | Meta-information sharing distributed database system in virtual single memory storage |
JP4362839B1 (ja) * | 2009-05-18 | 2009-11-11 | 修平 西山 | 仮想単一メモリストレージ上におけるメタ情報共有型分散データベース・システム |
US20100312749A1 (en) * | 2009-06-04 | 2010-12-09 | Microsoft Corporation | Scalable lookup service for distributed database |
US8996803B2 (en) * | 2010-07-02 | 2015-03-31 | Futurewei Technologies, Inc. | Method and apparatus for providing highly-scalable network storage for well-gridded objects |
US9684702B2 (en) * | 2010-12-07 | 2017-06-20 | International Business Machines Corporation | Database redistribution utilizing virtual partitions |
WO2012164735A1 (ja) * | 2011-06-03 | 2012-12-06 | 富士通株式会社 | 情報処理装置、データ管理方法およびデータ管理プログラム |
2012
- 2012-12-27 KR KR1020157017725A patent/KR101677946B1/ko active IP Right Grant
- 2012-12-27 JP JP2015549922A patent/JP6020867B2/ja active Active
- 2012-12-27 EP EP17195364.9A patent/EP3376395A1/en not_active Withdrawn
- 2012-12-27 AU AU2012398262A patent/AU2012398262C1/en active Active
- 2012-12-27 EP EP12890935.5A patent/EP2921964B8/en active Active
- 2012-12-27 CA CA2894623A patent/CA2894623C/en active Active
- 2012-12-27 WO PCT/CN2012/087743 patent/WO2014101070A1/zh active Application Filing
- 2012-12-27 CN CN201280002874.XA patent/CN103229151B/zh active Active
- 2012-12-27 ES ES12890935.5T patent/ES2658188T3/es active Active
2015
- 2015-06-26 US US14/752,068 patent/US9665284B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US20150293707A1 (en) | 2015-10-15 |
AU2012398262A1 (en) | 2015-07-02 |
JP6020867B2 (ja) | 2016-11-02 |
CN103229151B (zh) | 2016-08-10 |
CN103229151A (zh) | 2013-07-31 |
EP2921964A1 (en) | 2015-09-23 |
US9665284B2 (en) | 2017-05-30 |
JP2016507813A (ja) | 2016-03-10 |
AU2012398262C1 (en) | 2016-10-13 |
EP2921964B8 (en) | 2018-02-07 |
CA2894623A1 (en) | 2014-07-03 |
ES2658188T3 (es) | 2018-03-08 |
CA2894623C (en) | 2019-09-10 |
EP2921964B1 (en) | 2017-12-20 |
EP3376395A1 (en) | 2018-09-19 |
AU2012398262B2 (en) | 2016-04-14 |
KR20150091161A (ko) | 2015-08-07 |
EP2921964A4 (en) | 2015-12-09 |
KR101677946B1 (ko) | 2016-11-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2014101070A1 (zh) | 一种分区扩展方法及装置 | |
JP6302951B2 (ja) | コンシステントハッシュ法を用いたグローバルネームスペースのための方法およびシステム | |
EP4220395B1 (en) | Method and apparatus for determining virtual machine migration | |
US10218618B2 (en) | Method and apparatus for processing operation request in storage system | |
JP4068473B2 (ja) | ストレージ装置、分担範囲決定方法及びプログラム | |
US9633040B2 (en) | Distributed processing system including a name node and a plurality of data nodes, and method of operating the same | |
US8762534B1 (en) | Server load balancing using a fair weighted hashing technique | |
WO2017124933A1 (zh) | 一种信息处理方法、装置和系统 | |
WO2015142969A1 (en) | Switch-based load balancer | |
WO2014101044A1 (zh) | 分布式存储系统中的分区平衡方法、装置及服务器 | |
US8924513B2 (en) | Storage system | |
CA2941163C (en) | Data processing method and apparatus | |
CN106164898A (zh) | 数据处理方法和装置 | |
JP2019016042A (ja) | データ取得プログラム、装置、及び方法 | |
US20150095494A1 (en) | Server Selection | |
CN106412075A (zh) | 一种基于云计算的资源配置方法及装置 | |
JPWO2014132966A1 (ja) | ストレージシステム、ストレージ装置、ストレージ装置の制御方法及び制御プログラム、管理装置、管理装置の制御方法及び制御プログラム | |
US20130226867A1 (en) | Apparatus and method for converting replication-based file into parity-based file in asymmetric clustering file system | |
WO2013106993A1 (zh) | 扩容的方法和设备、以及访问数据的方法和设备 | |
US20230015979A1 (en) | Storage System, Data Processing Method, Apparatus, Node, and Storage Medium | |
WO2021098268A1 (zh) | 一种mon服务迁移方法、装置、设备及可读存储介质 | |
WO2016065613A1 (zh) | 访问文件的方法、分布式存储系统和网络设备 | |
CN111404820A (zh) | 一种路由查询方法、装置、设备及可读存储介质 | |
WO2016065612A1 (zh) | 访问文件的方法、系统和主机 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 12890935; Country of ref document: EP; Kind code of ref document: A1 |
 | ENP | Entry into the national phase | Ref document number: 2894623; Country of ref document: CA |
 | WWE | Wipo information: entry into national phase | Ref document number: 2012890935; Country of ref document: EP |
 | ENP | Entry into the national phase | Ref document number: 2015549922; Country of ref document: JP; Kind code of ref document: A |
 | NENP | Non-entry into the national phase | Ref country code: DE |
 | ENP | Entry into the national phase | Ref document number: 2012398262; Country of ref document: AU; Date of ref document: 20121227; Kind code of ref document: A. Ref document number: 20157017725; Country of ref document: KR; Kind code of ref document: A |