RU2016124319A

RU2016124319A - METHOD AND DEVICE FOR RESTORING DEDUPLICATED DATA

Info

Publication number: RU2016124319A
Application number: RU2016124319A
Authority: RU
Inventors: Фей Куи; Джиайя Ченг; Нинг Ченг
Original assignee: Зте Корпарейшн
Priority date: 2013-11-26
Filing date: 2014-04-21
Publication date: 2018-01-09
Also published as: RU2665272C1; WO2015078136A1; CN104679746A

Claims

1. A method for restoring deduplicated data, comprising:

receiving the first number of accesses to the first data block, while the first number of accesses is the number of visitors who are currently accessing the file at the same time;

comparing the first number of accesses with the first limit and the second limit, respectively, wherein the first limit is less than the second limit; and

restoring the first data block in the first data medium or the second data medium, in accordance with the comparison results, wherein the first data block is restored in the first data medium when the first number of accesses is greater than the first limit and less than the second limit, and the first a data block is restored in the second storage medium when the first number of accesses is greater than the second limit; the second storage medium is more efficient than the first storage medium.

2. The method according to p. 1, which before receiving the first number of accesses to the file corresponding to the first data block contains:

obtaining a second number of accesses to the first data block, while the second number of accesses represents the number of visitors who are currently and simultaneously accessing the first data block; and

when the second number of accesses is greater than the third limit, search for a file corresponding to the first data block.

3. The method according to p. 2, which before receiving the second number of accesses to the first data block contains:

obtaining a description of the characteristics of the first data block, while the description of the characteristics is used to represent the content that only the first data block has; and informing the current distributed file system and other distributed file systems associated with the current file system about the description of the characteristics, wherein the description of the characteristics is used to perform deduplication processing in the current distributed file system and other distributed file systems.

4. The method according to p. 3, wherein informing the current distributed file system about the description of the signs contains:

informing the node server in the current distributed file system about the description of the signs.

5. The method according to p. 2, in which the restoration of the first data block in the first data medium or second data medium contains:

duplication of the first data block to obtain a second data block; and

duplication of the second data block in the first data medium or second data medium.

6. The method according to claim 5, which after duplication of the second data block in the first data medium or second data medium further comprises:

subtracting the first number of accesses from the second number of accesses to obtain the actual number of accesses to the first data block, and subtracting 1 from the reference count of the first data blocks.

7. A device for recovering deduplicated data, comprising:

a first receiving module configured to obtain a first number of accesses to a file corresponding to the first data block, the first number of accesses being the number of visitors currently accessing the file at the same time;

a comparison module configured to compare a first number of accesses with a first limit and a second limit, respectively, wherein the first limit is less than the second limit; and

a recovery module configured to recover the first data block in the first data medium or second data medium in accordance with the comparison results; wherein the recovery module restores the first data block in the first storage medium when the first number of accesses is greater than the first limit and less than the second limit, and the recovery module restores the first data block in the second storage medium when the first number of accesses is greater, than the second limit; the second storage medium is more efficient than the first storage medium.

8. The device according to claim 7, further comprising:

a second receiving module, configured to obtain a second number of accesses to the first data block, wherein the second number of accesses represents the number of visitors currently accessing the first data block; and

a search module, configured to, when the second number of accesses is greater than the third limit, search for a file corresponding to the first data block.

9. The device according to p. 8, further comprising:

a third receiving module, configured to obtain a description of the characteristics, while the description of the characteristics is used to represent content that is only in the first data block; and

an information module configured to inform the current distributed file system and other distributed file systems associated with the current distributed file system, wherein the description of the features is used to perform deduplicating processing of the current distributed file system and other distributed file systems.

10. The device according to claim 9, further comprising:

a reading module configured to, after duplicating the second data block in the first data medium or second data medium, subtracting the first number of accesses from the second number of accesses to obtain the current number of accesses to the first data block, and subtracting 1 from the value of the counter of the link mechanism first block of data.