CN102523290A - Data processing method, device and system - Google Patents

Info

Publication number: CN102523290A
Authority: CN (China)
Prior art keywords: data, backup, client, historical, random number
Legal status: Granted; Active (the legal status is an assumption and is not a legal conclusion)
Application number: CN2011104266316A
Other languages: Chinese (zh)
Other versions: CN102523290B (en)
Inventor: 张程伟
Current Assignee: Chengdu Huawei Technology Co Ltd (the listed assignees may be inaccurate)
Original Assignee: Huawei Symantec Technologies Co Ltd
Application filed by: Huawei Symantec Technologies Co Ltd
Priority to: CN201110426631.6A

Abstract

The invention provides a data processing method, device and system. The method comprises: receiving a data backup request message sent by a client, the message comprising a user identifier and fingerprint information of the backup data; querying the folder corresponding to the user identifier according to the fingerprint information of the backup data, and judging whether the folder contains data identical to the backup data; if not, querying the folders corresponding to other user identifiers according to the fingerprint information of the backup data, and judging whether those folders contain data identical to the backup data; if so, the data identical to the backup data being second historical data, judging whether the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than a preset threshold; and if the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, sending a backup message to the client, receiving the backup data sent by the client, and generating reference data of the second historical data.

Description

Data processing method, device and system
Technical field
The present invention relates to computer technology, and in particular to a data processing method, device and system.
Background
Cloud storage mainly uses distributed computing techniques to virtualize physically distributed storage devices into a single high-performance, highly reliable storage system that is offered to users in a unified way. In addition, to make the best use of cloud storage, source-side data deduplication is mainly used to reduce the total volume of stored data. Specifically, the cloud compares the data to be stored with the data already stored and stores only the data that differs, which reduces the demand on transmission bandwidth and improves backup efficiency.
For example, suppose user A has backed up first data to the cloud. When user B backs up second data with the same content to the same cloud, the cloud detects that first data with identical content is already stored and performs source-side deduplication. The basic principle of source-side deduplication is as follows: the cloud generates reference data corresponding to the first data, where the reference data points to the first data, and sends a backup success message to user B, so user B does not transmit the second data. As a result, user B can tell whether source-side deduplication took place by monitoring the traffic from its own client to the cloud.
However, when the first data is confidential, this prior-art method lets user B infer from the monitored traffic that source-side deduplication took place, and hence that the cloud already stores data identical in content to the second data. The content of user A's first data is thereby stolen. For example, suppose user A and user B are two vendors competing in the same tender, both use the same cloud storage service, and both use the same quotation template provided by the party issuing the tender. User B can locally generate quotation documents containing various candidate quotes and back them up to the cloud. By detecting for which document the client's traffic to the cloud changes, user B can determine that the deduplicated document is identical to the quotation submitted by user A. The data stored by user A is thus leaked, and the security of data that users store in the cloud cannot be effectively guaranteed.
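The side channel described in the background can be illustrated with a minimal sketch. Everything in it is hypothetical and not taken from the patent: the class `NaiveDedupServer`, the helper names, the quotation template string, and the use of SHA-256 as the fingerprint. It models only an unprotected server whose "no upload needed" reply is what user B observes as missing traffic.

```python
import hashlib


def fingerprint(data: bytes) -> str:
    # Assumption: SHA-256 stands in for the unspecified fingerprint function.
    return hashlib.sha256(data).hexdigest()


class NaiveDedupServer:
    """Hypothetical source-side deduplication server without any protection."""

    def __init__(self):
        self.stored = set()  # fingerprints of data the cloud already holds

    def backup_request(self, fp: str) -> bool:
        """Return True if the client must upload the data."""
        if fp in self.stored:
            return False  # deduplicated: the client sends nothing
        self.stored.add(fp)
        return True


def guess_quote(server, template, candidates):
    """User B probes candidate quotes and watches which one needs no upload."""
    for price in candidates:
        document = template.format(price=price).encode()
        if not server.backup_request(fingerprint(document)):
            return price  # no traffic observed, so this content already exists
    return None


server = NaiveDedupServer()
template = "Bid for project X: {price} CNY"  # illustrative template only
# User A backs up the confidential quotation first.
server.backup_request(fingerprint(template.format(price=730).encode()))
# User B enumerates candidate quotes and recovers user A's value.
leaked = guess_quote(server, template, range(700, 800, 10))
```

In this toy run the probe stops at the first candidate for which the server asks for no upload, which is exactly the information leak the invention sets out to close.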
Summary of the invention
A first aspect of the present invention provides a data processing method, comprising:
receiving a data backup request message sent by a client, the data backup request message comprising a user identifier and fingerprint information of backup data;
querying, according to the fingerprint information of the backup data, the folder corresponding to the user identifier, and judging whether data identical to the backup data exists; if no data identical to the backup data exists, querying, according to the fingerprint information of the backup data, the folders corresponding to other user identifiers, and judging whether data identical to the backup data exists;
if data identical to the backup data exists in the folders corresponding to the other user identifiers, the data identical to the backup data in those folders being second historical data, judging whether the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold;
if the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, sending a backup message to the client, receiving the backup data sent by the client, and generating reference data of the second historical data.
Another aspect of the present invention provides a data processing device, comprising:
a transceiver module, configured to receive a data backup request message sent by a client, the data backup request message comprising a user identifier and fingerprint information of backup data;
a judging module, configured to query, according to the fingerprint information of the backup data, the folder corresponding to the user identifier and judge whether data identical to the backup data exists; if no data identical to the backup data exists, to query, according to the fingerprint information of the backup data, the folders corresponding to other user identifiers and judge whether data identical to the backup data exists; and if data identical to the backup data exists in the folders corresponding to the other user identifiers, the data identical to the backup data in those folders being second historical data, to judge whether the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold;
the transceiver module being further configured to send a backup message to the client and receive the backup data if the judging module determines that the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data;
a reference data generation module, configured to generate reference data of the second historical data.
Another aspect of the present invention provides a data processing system, comprising a client and the data processing device described above.
In the embodiments of the invention, the folder corresponding to the user identifier and the folders of other users are queried according to the fingerprint information of the backup data to determine whether identical data is already stored. If identical data is found in another user's folder, that data being the second historical data, the number of reference data corresponding to it is compared with its random number. Thus, when another user backs up data with guessed content to the cloud, even if the cloud already holds data with the same content, the client still transmits the backup data to the cloud as long as the number of reference data of the second historical data is less than its random number. Other users therefore cannot detect whether identical data has already been backed up, which effectively prevents leakage of user data.
Brief description of the drawings
Fig. 1 is a flowchart of an embodiment of the data processing method of the present invention;
Fig. 2 is a flowchart of another embodiment of the data processing method of the present invention;
Fig. 3 is a schematic structural diagram of an embodiment of the data processing device of the present invention;
Fig. 4 is a schematic structural diagram of another embodiment of the data processing device of the present invention;
Fig. 5 is a schematic structural diagram of an embodiment of the data processing system of the present invention.
Detailed description of the embodiments
Fig. 1 is a flowchart of an embodiment of the data processing method of the present invention. As shown in Fig. 1, the method of this embodiment is executed by a data processing device located in the cloud storage, and comprises:
Step 101: receive a data backup request message sent by a client, the data backup request message comprising a user identifier and the fingerprint information corresponding to the backup data.
Cloud storage, also referred to as the cloud, is a new concept extended and developed from cloud computing. It refers to a system in which a large number of storage devices of different types in a network are brought together by application software, through functions such as cluster applications, grid technology or distributed file systems, to work cooperatively and jointly provide data storage and service access to the outside. The fingerprint information may be a hash (HASH) value of the backup data, or any other value that uniquely represents the characteristics of the data.
In this embodiment, the client takes the backup data as a unit, calculates the fingerprint information of the backup data, and then sends the fingerprint information and the user identifier to the data processing device in the data backup request message.
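As a minimal client-side sketch of step 101, the request can be modelled as a user identifier plus a hash of the backup data. The patent only says the fingerprint may be a hash value; the choice of SHA-256 and the names `BackupRequest` and `make_backup_request` are assumptions made here for illustration.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class BackupRequest:
    """Hypothetical shape of the data backup request message."""
    user_id: str
    fingerprint: str


def make_backup_request(user_id: str, backup_data: bytes) -> BackupRequest:
    # Assumption: SHA-256 is used as the fingerprint; the source names no algorithm.
    digest = hashlib.sha256(backup_data).hexdigest()
    return BackupRequest(user_id=user_id, fingerprint=digest)


req = make_backup_request("user-b", b"quarterly report")
```

Only the identifier and the digest travel in the request; the backup data itself is sent later, and only if the device asks for it.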
Step 102: according to the fingerprint information of the backup data, query the folder corresponding to the user identifier and judge whether data identical to the backup data exists; if no data identical to the backup data exists, query the folders corresponding to other user identifiers according to the fingerprint information of the backup data and judge whether data identical to the backup data exists.
Step 103: if data identical to the backup data exists in the folders corresponding to the other user identifiers, the data identical to the backup data in those folders being second historical data, judge whether the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold.
Step 104: if the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, send a backup message to the client, receive the backup data sent by the client, and generate reference data of the second historical data.
In this embodiment, the reference data is very small, and its content is a pointer to historical data with the same content. When reading the data, the client follows this pointer to find the historical data with identical content and reads that historical data out.
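The role of reference data as a small pointer can be sketched as follows. The names `Reference`, `store` and `read`, and the idea of keying stored data by fingerprint, are illustrative assumptions; the patent does not specify how the pointer is encoded.

```python
from dataclasses import dataclass


@dataclass
class Reference:
    """Hypothetical reference data: a small record pointing at historical data."""
    owner_id: str
    target_fingerprint: str


# Assumption: historical data is looked up by its fingerprint.
store = {"abc123": b"shared content"}


def read(ref: Reference, store: dict) -> bytes:
    # Follow the pointer to the historical data with identical content.
    return store[ref.target_fingerprint]


ref = Reference(owner_id="user-b", target_fingerprint="abc123")
content = read(ref, store)
```

The reference itself holds no payload, which is why creating one is cheap compared with storing a second copy.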
In this embodiment, the random number corresponding to each piece of historical data stored in the data processing device is generated randomly, so the random numbers of any two pieces of historical data may be the same or different. The random number is greater than or equal to the preset threshold. Once data identical to the backup data is found in a folder corresponding to another user identifier, that data is set as the second historical data, and the device judges whether the number of reference data of the second historical data is less than the random number of the second historical data. The technical problem the invention mainly addresses is how to prevent the content of confidential historical data from being stolen. In general, confidential historical data is stored once, and its number of reference data is generally also 1. When the hash value of the backup data equals the hash value of the confidential data, the content of the backup data is identical to the content of the confidential historical data; but as long as the number of reference data is less than the random number, the user is still prompted to save the backup data, so the user cannot tell that the cloud already stores confidential historical data identical to the backup data. If the user later backs up the same data again and infers from the data traffic that identical data is already stored, that is, that the cloud performed source-side deduplication, the user has already stored this backup data once before and does not know the random number, and therefore still cannot determine whether the cloud held confidential historical data identical to the backup data.
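The comparison at the heart of this embodiment can be sketched in a few lines. The function names `must_upload` and `observed_uploads` are hypothetical, and the simulation assumes, as a simplification, that every backup of the same content by a different user adds exactly one reference.

```python
def must_upload(reference_count: int, random_number: int) -> bool:
    """Client must still transmit the data while the count is below the random number."""
    return reference_count < random_number


def observed_uploads(random_number: int, attempts: int, initial_count: int = 1):
    """Simulate what successive users backing up the same content would observe."""
    count = initial_count  # the original stored copy counts as one reference
    trace = []
    for _ in range(attempts):
        trace.append(must_upload(count, random_number))
        count += 1  # each backup adds one reference to the historical data
    return trace


trace = observed_uploads(random_number=3, attempts=4)
# trace is [True, True, False, False]
```

Because the random number is hidden and drawn per piece of data, an early "upload required" reply says nothing about whether the content already exists.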
In this embodiment, a data backup request message carrying the user identifier and the hash value of the backup data is received from the client, and the folder corresponding to the user identifier and the folders of other users are queried according to the fingerprint information of the backup data to determine whether identical data is already stored. If identical data is found in another user's folder, that data being the second historical data, the number of reference data corresponding to it is compared with its random number. Thus, when another user backs up data with guessed content to the cloud, even if the cloud already holds data with the same content, the client still transmits the backup data to the cloud as long as the number of reference data of the second historical data is less than its random number, the random number being greater than or equal to the preset threshold. Other users therefore cannot detect whether identical data has already been backed up, which effectively prevents leakage of user data.
Fig. 2 is a flowchart of another embodiment of the data processing method of the present invention. In this embodiment, the method is executed by a data processing device located in the cloud storage, and the technical solution is described in detail taking a hash value as the fingerprint information. As shown in Fig. 2, the method comprises:
Step 201: receive a data backup request message sent by a client, the data backup request message comprising a user identifier and the hash value of the backup data.
In this embodiment, the client takes the backup data as a unit, calculates the hash value of the backup data, and then sends the hash value and the user identifier to the data processing device in the data backup request message.
Step 202: according to the hash value of the backup data, query the folder corresponding to the user identifier and judge whether data identical to the backup data exists. If not, go to step 203; if so, go to step 207.
Step 203: according to the hash value of the backup data, query the folders corresponding to other user identifiers and judge whether data identical to the backup data exists, where data identical to the backup data in the folders corresponding to other user identifiers is second historical data. If so, go to step 204; if not, go to step 209.
Step 204: judge whether the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data. If it is less, go to step 205; if it is greater than or equal, go to step 208. The random number is greater than or equal to a preset threshold.
In this embodiment, the random number corresponding to each piece of historical data stored in the data processing device is generated randomly, so the random numbers of any two pieces of historical data may be the same or different. The threshold for the random number can be set by statistically analysing how users back up confidential data. For example, users usually save confidential information only once, so the number of reference data for such a user's confidential information usually does not exceed 2. The threshold of the random number can then be set to 2, so that the random number is greater than the number of reference data and no source-side deduplication occurs when other users save the data to the cloud. If statistical analysis shows that users usually back up confidential data one additional time, the number of reference data for such a user's confidential information usually does not exceed 3, and the threshold of the random number can be set to 3.
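A possible way to draw the per-data random number is sketched below. The lower bound is the preset threshold from this paragraph, and the upper bound of 10 follows the example range [2, N] with N = 10 given later in the description. The function name `generate_random_number` and the use of Python's `secrets` module (chosen so the value is hard to predict) are assumptions, not something the patent prescribes.

```python
import secrets


def generate_random_number(threshold: int = 2, upper: int = 10) -> int:
    """Draw an integer uniformly from [threshold, upper], both ends inclusive."""
    if upper < threshold:
        raise ValueError("upper bound must not be below the threshold")
    # secrets.randbelow(n) returns an integer in [0, n).
    return threshold + secrets.randbelow(upper - threshold + 1)


samples = [generate_random_number(threshold=2, upper=10) for _ in range(200)]
```

Raising the threshold makes early deduplication less likely for confidential data, at the cost of more redundant uploads before deduplication starts.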
Step 205: send a backup message to the client, receive the backup data sent by the client, and generate reference data of the second historical data.
Step 206: add 1 to the number of reference data of the second historical data. End.
Step 207: generate reference data of the first historical data, add 1 to the number of reference data of the first historical data, and then send a backup success message to the client. End.
Step 208: generate reference data corresponding to the second historical data, send a backup success message to the client, and go to step 206.
Step 209: send a data backup request acknowledgement message to the client, receive and save the backup data sent by the client, and then generate the random number corresponding to the backup data.
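Steps 201 to 209 can be put together in a compact sketch. It makes several assumptions that the patent leaves open: folders are dictionaries keyed by fingerprint, a reference is modelled as a shared `Record` object placed in the requesting user's folder, the bytes uploaded in step 205 are simply not stored a second time, and the names `Record`, `DataProcessingDevice`, `request` and `upload` are invented for illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class Record:
    data: bytes
    random_number: int
    reference_count: int = 1  # the stored copy counts as one reference


class DataProcessingDevice:
    def __init__(self, threshold=2, upper=10, rng=None):
        self.folders = {}  # user_id -> {fingerprint: Record}
        self.threshold = threshold
        self.upper = upper
        self.rng = rng or random.Random()

    def _second_historical(self, user_id, fp):
        # Step 203: look for identical data in other users' folders.
        for other_id, folder in self.folders.items():
            if other_id != user_id and fp in folder:
                return folder[fp]
        return None

    def request(self, user_id, fp):
        """Steps 201-204, 207, 208: return True if the client must upload."""
        own = self.folders.setdefault(user_id, {})
        if fp in own:                      # step 202 -> step 207
            own[fp].reference_count += 1
            return False
        second = self._second_historical(user_id, fp)
        if second is None:                 # step 203 -> step 209
            return True
        if second.reference_count < second.random_number:
            return True                    # step 204 -> step 205
        own[fp] = second                   # step 208: reference only
        second.reference_count += 1        # step 206
        return False

    def upload(self, user_id, fp, data):
        """Steps 205-206 or step 209, after request() returned True."""
        own = self.folders.setdefault(user_id, {})
        second = self._second_historical(user_id, fp)
        if second is None:                 # step 209: store, draw random number
            own[fp] = Record(data, self.rng.randint(self.threshold, self.upper))
        else:                              # steps 205-206: add a reference
            own[fp] = second
            second.reference_count += 1


# threshold == upper pins the random number to 3 so the run is deterministic.
dev = DataProcessingDevice(threshold=3, upper=3)
fp = "f" * 64
results = []
for user in ["A", "B", "C", "D"]:
    need_upload = dev.request(user, fp)
    results.append(need_upload)
    if need_upload:
        dev.upload(user, fp, b"secret quote")
```

In this run users B and C are still asked to upload even though user A's copy exists, and only user D sees deduplication, once the reference count has reached the random number.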
In this embodiment, for example, when a user backs up a piece of backup data for the first time, the client first takes the backup data as a unit, calculates its hash value, and sends the hash value and the user identifier to the data processing device in a data backup request message. After receiving the data backup request message, the data processing device queries the folder corresponding to the user identifier according to the hash value of the backup data and judges whether data identical to the backup data exists. Because this is the first time the user backs up this data, the folder contains no identical data. The device then queries the folders corresponding to other user identifiers according to the hash value of the backup data. If no identical data exists there either, the device sends a data backup request acknowledgement message to the client, receives and saves the backup data sent by the client, and then generates the random number corresponding to the backup data. The random number may take a value in the range [2, N], where N is an integer; N may be, for example, 10. In addition, when the data processing device saves the backup data, the number of reference data corresponding to the backup data is 1.
If data identical to the backup data exists in the folders corresponding to the other user identifiers, that data being the second historical data, the device judges whether the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data. If it is less, the device sends a backup message to the client, receives the backup data, generates reference data of the second historical data, and adds 1 to the number of reference data corresponding to the second historical data.
When the user backs up the same backup data a second time, the client again takes the backup data as a unit and calculates its hash value, which is identical to the hash value calculated for the first backup, and sends the hash value and the user identifier to the data processing device in a data backup request message. Because this is the second backup by the same user, the data processing device finds data identical to the backup data in the folder corresponding to the user identifier. It sets that data as the first historical data, generates reference data of the first historical data, adds 1 to the number of reference data of the first historical data, and then sends a backup success message to the client.
In this embodiment, a data backup request message carrying the user identifier and the hash value of the backup data is received from the client, and the folder corresponding to the user identifier and the folders of other users are queried according to the hash value of the backup data to determine whether identical data is already stored. If identical data is found in another user's folder, that data being the second historical data, the number of reference data corresponding to it is compared with its random number. Thus, when another user backs up data with guessed content to the cloud, even if the cloud already holds data with the same content, the client still transmits the backup data to the cloud as long as the number of reference data of the second historical data is less than its random number, the random number being greater than or equal to the preset threshold. Other users therefore cannot detect whether identical data has already been backed up, which effectively prevents leakage of user data. In addition, when the number of reference data corresponding to the second historical data is greater than or equal to its random number, only reference data corresponding to the second historical data is generated, which effectively improves client backup performance.
Fig. 3 is a schematic structural diagram of an embodiment of the data processing device of the present invention. As shown in Fig. 3, the device of this embodiment comprises a transceiver module 11, a judging module 12 and a reference data generation module 13. The transceiver module 11 is configured to receive a data backup request message sent by a client, the data backup request message comprising a user identifier and fingerprint information of backup data. The judging module 12 is configured to query, according to the fingerprint information of the backup data, the folder corresponding to the user identifier and judge whether data identical to the backup data exists; if no data identical to the backup data exists, to query the folders corresponding to other user identifiers according to the fingerprint information of the backup data and judge whether data identical to the backup data exists; and if data identical to the backup data exists in the folders corresponding to the other user identifiers, that data being second historical data, to judge whether the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold. The transceiver module 11 is further configured to send a backup message to the client and receive the backup data if the judging module 12 determines that the number of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data. The reference data generation module 13 is configured to generate reference data of the second historical data.
The data processing device of this embodiment can carry out the technical solution of the method embodiment shown in Fig. 1. The principle is similar and is not repeated here.
In this embodiment, a data backup request message carrying the user identifier and the fingerprint information of the backup data is received from the client, and the folder corresponding to the user identifier and the folders of other users are queried according to the fingerprint information of the backup data to determine whether identical data is already stored. If identical data is found in another user's folder, that data being the second historical data, the number of reference data corresponding to it is compared with its random number. Thus, when another user backs up data with guessed content to the cloud, even if the cloud already holds data with the same content, the client still transmits the backup data to the cloud as long as the number of reference data of the second historical data is less than its random number, the random number being greater than or equal to the preset threshold. Other users therefore cannot detect whether source-side deduplication took place, which effectively prevents leakage of user data.
Fig. 4 is a schematic structural diagram of another embodiment of the data processing device of the present invention. As shown in Fig. 4, on the basis of the embodiment shown in Fig. 3, the judging module 12 is further configured to determine whether data identical to the backup data exists in the folder corresponding to the user identifier, such data being first historical data. The reference data generation module 13 is further configured to generate reference data of the first historical data in that case, and the transceiver module 11 is further configured to send a backup success message to the client.
Further, the reference data generation module 13 is also configured to generate reference data corresponding to the second historical data if the judging module 12 determines that the number of reference data corresponding to the second historical data is greater than or equal to the random number corresponding to the second historical data. The transceiver module 11 is also configured to send a backup success message to the client.
Further, the device also comprises a reference data count recording module 14, configured to add 1 to the number of reference data corresponding to the first historical data, or to add 1 to the number of reference data corresponding to the second historical data.
Further, the transceiver module 11 is also configured to send a backup message to the client and receive the backup data sent by the client if the judging module 12 determines that no data identical to the backup data exists in the folders corresponding to the other user identifiers. The device then also comprises a data storage module 15 and a random number generation module 16, where the data storage module 15 is configured to save the backup data and the random number generation module 16 is configured to generate the random number corresponding to the backup data.
The data processing device of this embodiment can carry out the technical solution of the method embodiment shown in Fig. 2. The principle is similar and is not repeated here.
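The division of labour between the judging module 12 and the transceiver module 11 can be sketched as two small classes. The `Verdict` enumeration, the method signatures and the literal message strings are illustrative assumptions; the patent describes the modules only functionally.

```python
from enum import Enum


class Verdict(Enum):
    OWN_DUPLICATE = 1   # first historical data found in the user's own folder
    OTHER_BELOW = 2     # second historical data, count < random number
    OTHER_REACHED = 3   # second historical data, count >= random number
    NEW = 4             # no identical data anywhere


class JudgingModule:
    """Sketch of module 12: decides which case applies."""

    def judge(self, own_has, other_count, other_random) -> Verdict:
        if own_has:
            return Verdict.OWN_DUPLICATE
        if other_count is None:  # nothing found in other users' folders
            return Verdict.NEW
        if other_count < other_random:
            return Verdict.OTHER_BELOW
        return Verdict.OTHER_REACHED


class TransceiverModule:
    """Sketch of module 11: picks the reply sent to the client."""

    def reply(self, verdict: Verdict) -> str:
        if verdict in (Verdict.OTHER_BELOW, Verdict.NEW):
            return "backup message"          # client must transmit the data
        return "backup success message"      # a reference is enough


judging, transceiver = JudgingModule(), TransceiverModule()
reply = transceiver.reply(judging.judge(own_has=False, other_count=1, other_random=3))
```

Note that the client receives the same reply for `NEW` and `OTHER_BELOW`, which is what keeps the two cases indistinguishable from the outside.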
In the present embodiment; Through receiving the data backup requests message of the finger print information that carries ID and Backup Data that client sends; And according to the finger print information of this Backup Data; Inquire about in the data in this ID corresponding file folder and other user folders, whether stored identical data, if judge when obtaining having identical data in the data in other user files; Whether the quantity of judging the reference data that this identical data is corresponding is less than this identical data random number corresponding; Wherein the identical data in other ID corresponding file is second historical data, can the quantity and the random number size of the reference data of second historical data be compared, thereby when making other users will have the Backup Data of guessing content to backup to high in the clouds; Even the data of identical content have been preserved in high in the clouds; Because less than the second historical data random number corresponding, and random number is during more than or equal to predetermined threshold value in the quantity of the corresponding reference data of second historical data, client still will be transmitted this Backup Data to high in the clouds; Therefore make other users to detect and whether produced source end data de-duplication, and then avoided the user's data leakage effectively.In addition, when the quantity of the corresponding application data of second historical data during, generate the corresponding reference data of second historical data, thereby improved the client backup performance effectively more than or equal to the second historical data random number corresponding.
Fig. 5 is a schematic structural diagram of an embodiment of the data processing system of the present invention. As shown in Fig. 5, the system comprises a client 21 and a data processing device 22, where the data processing device 22 may be the device shown in Fig. 3 or Fig. 4 and can execute the technical solution of the method embodiment shown in Fig. 1 or Fig. 2; the principle is similar and is not repeated here.
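For illustration, the client 21 side of this exchange might be sketched as below. Everything named here is an assumption: the text does not specify the fingerprint algorithm (SHA-256 is used only as an example), and `client_backup`, `send_request`, and `upload` are hypothetical stand-ins for whatever transport the client and the data processing device 22 actually use.

```python
import hashlib
from typing import Callable


def fingerprint(data: bytes) -> str:
    """Compute fingerprint information for the backup data (SHA-256 is an assumed choice)."""
    return hashlib.sha256(data).hexdigest()


def client_backup(user_id: str,
                  data: bytes,
                  send_request: Callable[[dict], str],
                  upload: Callable[[str, bytes], None]) -> bool:
    """Run one backup; return True if the data itself was transmitted."""
    # The request carries only the user ID and the fingerprint, not the data.
    reply = send_request({"user_id": user_id, "fingerprint": fingerprint(data)})
    if reply == "backup message":
        # The server asked for the data, either because it is new or because
        # the quantity of reference data is still below the random number.
        upload(user_id, data)
        return True
    # A "backup success message" means the server generated reference data only.
    return False
```

The client cannot distinguish the two reasons for receiving a backup message, which is the property the embodiments rely on.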
Persons of ordinary skill in the art will understand that all or part of the steps of the above method embodiments may be implemented by hardware instructed by a program. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes any medium that can store program code, such as a ROM, a RAM, a magnetic disk, or an optical disc.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, persons of ordinary skill in the art should understand that they may still modify the technical solutions described in the foregoing embodiments, or make equivalent replacements of some or all of the technical features therein, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (9)

1. A data processing method, characterized by comprising:
receiving a data backup request message sent by a client, the data backup request message comprising a user ID and fingerprint information of backup data;
querying, according to the fingerprint information of the backup data, the folder corresponding to the user ID, and determining whether data identical to the backup data exists; if it is determined that no data identical to the backup data exists, querying, according to the fingerprint information of the backup data, the folders corresponding to other user IDs, and determining whether data identical to the backup data exists;
if it is determined that data identical to the backup data exists in the folders corresponding to the other user IDs, wherein the data identical to the backup data in the folders corresponding to the other user IDs is second historical data, determining whether the quantity of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold;
if it is determined that the quantity of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, sending a backup message to the client, receiving the backup data sent by the client, and generating reference data for the second historical data.
2. The data processing method according to claim 1, characterized by further comprising:
if it is determined that data identical to the backup data exists in the folder corresponding to the user ID, wherein the data identical to the backup data in the folder corresponding to the user ID is first historical data, generating reference data for the first historical data and sending a backup success message to the client.
3. The data processing method according to claim 1, characterized by further comprising:
if it is determined that the quantity of reference data corresponding to the second historical data is greater than or equal to the random number corresponding to the second historical data, generating the reference data corresponding to the second historical data and sending a backup success message to the client.
4. The data processing method according to claim 1, characterized by further comprising:
if it is determined that no data identical to the backup data exists in the folders corresponding to the other user IDs, sending a backup message to the client, and receiving and storing the backup data sent by the client;
generating the random number corresponding to the backup data.
5. A data processing device, characterized by comprising:
a transceiver module, configured to receive a data backup request message sent by a client, the data backup request message comprising a user ID and fingerprint information of backup data;
a judging module, configured to query, according to the fingerprint information of the backup data, the folder corresponding to the user ID, and determine whether data identical to the backup data exists; if it is determined that no data identical to the backup data exists, to query, according to the fingerprint information of the backup data, the folders corresponding to other user IDs, and determine whether data identical to the backup data exists; and if it is determined that data identical to the backup data exists in the folders corresponding to the other user IDs, wherein the data identical to the backup data in the folders corresponding to the other user IDs is second historical data, to determine whether the quantity of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold;
the transceiver module being further configured to, if the judging module determines that the quantity of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, send a backup message to the client and receive the backup data;
a reference data generation module, configured to generate reference data for the second historical data.
6. The data processing device according to claim 5, characterized in that the judging module is further configured to determine that data identical to the backup data exists in the folder corresponding to the user ID, wherein the data identical to the backup data in the folder corresponding to the user ID is first historical data;
the reference data generation module is further configured to generate reference data for the first historical data;
the transceiver module is further configured to send a backup success message to the client.
7. The data processing device according to claim 5, characterized in that the reference data generation module is further configured to, if the judging module determines that the quantity of reference data corresponding to the second historical data is greater than or equal to the random number corresponding to the second historical data, generate the reference data corresponding to the second historical data;
the transceiver module is further configured to send a backup success message to the client.
8. The data processing device according to claim 5, characterized in that the transceiver module is further configured to, if the judging module determines that no data identical to the backup data exists in the folders corresponding to the other user IDs, send a backup message to the client and receive the backup data sent by the client;
and the device further comprises: a data storage module, configured to store the backup data;
a random number generation module, configured to generate the random number corresponding to the backup data.
9. A data processing system, characterized by comprising: a client and the data processing device according to any one of claims 5 to 8.
CN201110426631.6A 2011-12-19 2011-12-19 Data processing method, device and system Active CN102523290B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110426631.6A CN102523290B (en) 2011-12-19 2011-12-19 Data processing method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110426631.6A CN102523290B (en) 2011-12-19 2011-12-19 Data processing method, device and system

Publications (2)

Publication Number Publication Date
CN102523290A true CN102523290A (en) 2012-06-27
CN102523290B CN102523290B (en) 2015-04-08

Family

ID=46294077

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110426631.6A Active CN102523290B (en) 2011-12-19 2011-12-19 Data processing method, device and system

Country Status (1)

Country Link
CN (1) CN102523290B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102999400A (en) * 2012-11-22 2013-03-27 中国电信股份有限公司云计算分公司 Data backup method and device of cloud storage system
CN103064757A (en) * 2012-12-12 2013-04-24 鸿富锦精密工业(深圳)有限公司 Method and system for backing up data
CN106250723A (en) * 2016-08-10 2016-12-21 智者四海(北京)技术有限公司 A kind of control method based on page word and device
CN106572177A (en) * 2016-11-07 2017-04-19 广东欧珀移动通信有限公司 Data transmission method and mobile terminal
CN106598765A (en) * 2015-10-15 2017-04-26 北京国双科技有限公司 Data check method and device
CN107276857A (en) * 2017-08-16 2017-10-20 郑州云海信息技术有限公司 A kind of method and device for monitoring flow
CN107562555A (en) * 2017-08-02 2018-01-09 网宿科技股份有限公司 The cleaning method and server of duplicate data
WO2020220536A1 (en) * 2019-04-28 2020-11-05 平安科技(深圳)有限公司 Data backup method and device, and computer readable storage medium
CN112988497A (en) * 2019-12-13 2021-06-18 伊姆西Ip控股有限责任公司 Method, electronic device and computer program product for managing backup system
CN114442904A (en) * 2020-10-30 2022-05-06 伊姆西Ip控股有限责任公司 Method, apparatus and computer program product for managing a storage system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090182789A1 (en) * 2003-08-05 2009-07-16 Sepaton, Inc. Scalable de-duplication mechanism
CN101582076A (en) * 2009-06-24 2009-11-18 浪潮电子信息产业股份有限公司 Data de-duplication method based on data base
CN101882141A (en) * 2009-05-08 2010-11-10 北京众志和达信息技术有限公司 Method and system for implementing repeated data deletion

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Liao Haisheng et al.: "Research and Improvement of Data Deduplication Technology Based on the MD5 Algorithm", Computer Measurement & Control *

Also Published As

Publication number Publication date
CN102523290B (en) 2015-04-08

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C53 Correction of patent of invention or patent application
CB02 Change of applicant information

Address after: 611731 Chengdu high tech Zone, Sichuan, West Park, Qingshui River

Applicant after: HUAWEI DIGITAL TECHNOLOGIES (CHENG DU) Co.,Ltd.

Address before: 611731 Chengdu high tech Zone, Sichuan, West Park, Qingshui River

Applicant before: CHENGDU HUAWEI SYMANTEC TECHNOLOGIES Co.,Ltd.

COR Change of bibliographic data

Free format text: CORRECT: APPLICANT; FROM: CHENGDU HUAWEI SYMANTEC TECHNOLOGIES CO., LTD. TO: HUAWEI DIGITAL TECHNOLOGY (CHENGDU) CO., LTD.

C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220905

Address after: No. 1899 Xiyuan Avenue, high tech Zone (West District), Chengdu, Sichuan 610041

Patentee after: Chengdu Huawei Technologies Co.,Ltd.

Address before: 611731 Qingshui River District, Chengdu hi tech Zone, Sichuan, China

Patentee before: HUAWEI DIGITAL TECHNOLOGIES (CHENG DU) Co.,Ltd.