CN102523290B - Data processing method, device and system - Google Patents

Info

Publication number
CN102523290B
Authority
CN
China
Prior art keywords
data
backup
user
historical
random number
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201110426631.6A
Other languages
Chinese (zh)
Other versions
CN102523290A (en)
Inventor
张程伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Huawei Technology Co Ltd
Original Assignee
Huawei Symantec Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Symantec Technologies Co Ltd filed Critical Huawei Symantec Technologies Co Ltd
Priority to CN201110426631.6A priority Critical patent/CN102523290B/en
Publication of CN102523290A publication Critical patent/CN102523290A/en
Application granted granted Critical
Publication of CN102523290B publication Critical patent/CN102523290B/en

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a data processing method, device and system. The method comprises: receiving a data backup request message sent by a client, the message comprising a user identifier and fingerprint information of the backup data; querying the folder corresponding to the user identifier according to the fingerprint information of the backup data, and determining whether the folder contains data identical to the backup data; if not, querying the folders corresponding to other user identifiers according to the fingerprint information of the backup data, and determining whether those folders contain data identical to the backup data; if so, the data identical to the backup data being second historical data, determining whether the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, the random number being greater than a preset threshold; and, if the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, sending a backup message to the client, receiving the backup data sent by the client, and generating reference data for the second historical data.

Description

Data processing method, device and system
Technical field
The present invention relates to computer technology, and in particular to a data processing method, device and system.
Background
Cloud storage mainly uses distributed computing techniques to virtualize storage devices distributed at the physical layer into a single high-performance, highly reliable storage system that is presented to users in a unified way. In addition, to make the best use of cloud storage capacity, source-side data deduplication is the main technique used to reduce the total volume of stored data. It works as follows: the cloud compares the data about to be stored with the data already stored and stores only the data that is not already present, which reduces the demand on transmission bandwidth and improves backup efficiency.
For example, suppose user A has backed up first data to the cloud. When user B backs up second data with the same content to that cloud, the cloud detects that first data with identical content is already stored and performs source-side deduplication. The principle of this deduplication is as follows: the cloud generates reference data corresponding to the first data, which points to the first data, and sends a backup success message to user B, so user B does not transmit the second data. As a result, user B can determine whether source-side deduplication has occurred by monitoring the traffic from his or her own client to the cloud.
However, when the first data is confidential, the prior-art method lets user B, by monitoring the traffic from his or her own client to the cloud, determine that source-side deduplication has occurred. User B then knows that the cloud stores data identical in content to the second data, and has thereby stolen the content of user A's first data. For example, suppose user A and user B are two vendors bidding in the same tender, both using the same cloud storage service and the same quotation template provided by the tenderer. User B can generate quotation template data for a range of quotes locally, back each one up to the cloud, and observe for which data the traffic from the client to the cloud changes. User B can then conclude that the deduplicated data is identical to the quotation that user A submitted. User A's stored data is thus leaked, and the security of data that users store in the cloud cannot be effectively guaranteed.
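The leak described above can be illustrated with a minimal sketch. The class `NaiveDedupStore`, the function `guess_quote`, the quotation template string and the use of SHA-256 as the fingerprint are all assumptions made for illustration; they are not taken from the patent.

```python
import hashlib


class NaiveDedupStore:
    """Hypothetical store that deduplicates across all users by content hash."""

    def __init__(self):
        self.blobs = {}  # fingerprint -> stored bytes

    def backup(self, data: bytes) -> bool:
        """Return True if the client had to upload the data, False if it was deduplicated."""
        fingerprint = hashlib.sha256(data).hexdigest()
        if fingerprint in self.blobs:
            return False  # only a reference is created, so no upload traffic occurs
        self.blobs[fingerprint] = data
        return True


def guess_quote(store: NaiveDedupStore, template: str, candidates):
    """Return the candidate prices whose backup caused no upload (already stored)."""
    leaked = []
    for price in candidates:
        uploaded = store.backup(template.format(price=price).encode())
        if not uploaded:
            leaked.append(price)
    return leaked


store = NaiveDedupStore()
template = "Bid quotation: {price} CNY"
store.backup(template.format(price=1300).encode())  # user A's confidential quote
print(guess_quote(store, template, [1100, 1200, 1300, 1400]))  # prints [1300]
```

In this sketch user B learns user A's quote purely from which backup produced no upload traffic, which is the side channel the invention aims to close.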
Summary of the invention
A first aspect of the present invention provides a data processing method, comprising:
Receiving a data backup request message sent by a client, the data backup request message comprising a user identifier and fingerprint information of backup data;
Querying, according to the fingerprint information of the backup data, the folder corresponding to the user identifier, and determining whether data identical to the backup data exists; if no data identical to the backup data exists, querying, according to the fingerprint information of the backup data, the folders corresponding to other user identifiers, and determining whether data identical to the backup data exists;
If data identical to the backup data exists in the folders corresponding to the other user identifiers, the data identical to the backup data in those folders being second historical data, determining whether the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold;
If the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, sending a backup message to the client, receiving the backup data sent by the client, and generating reference data for the second historical data.
Another aspect of the present invention provides a data processing device, comprising:
A transceiver module, configured to receive a data backup request message sent by a client, the data backup request message comprising a user identifier and fingerprint information of backup data;
A judging module, configured to query, according to the fingerprint information of the backup data, the folder corresponding to the user identifier, and determine whether data identical to the backup data exists; if no data identical to the backup data exists, to query, according to the fingerprint information of the backup data, the folders corresponding to other user identifiers, and determine whether data identical to the backup data exists; and, if data identical to the backup data exists in the folders corresponding to the other user identifiers, the data identical to the backup data in those folders being second historical data, to determine whether the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold;
The transceiver module being further configured to, if the judging module determines that the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, send a backup message to the client and receive the backup data;
A reference data generation module, configured to generate reference data for the second historical data.
Another aspect of the present invention provides a data processing system, comprising a client and the data processing device described above.
In the embodiments of the present invention, the folder corresponding to the user identifier and the folders of other users are queried according to the fingerprint information of the backup data to determine whether identical data is already stored. If identical data is found in another user's folder, that data is the second historical data, and the number of reference data items corresponding to it is compared with the random number corresponding to it. Consequently, when another user backs up data with guessed content to the cloud, the client must still transmit the backup data even if the cloud already holds data with the same content, as long as the number of reference data items of the second historical data is less than its random number. Other users therefore cannot detect whether identical data has already been backed up in the database, which effectively prevents leakage of user data.
Description of the drawings
Fig. 1 is a flowchart of one embodiment of the data processing method of the present invention;
Fig. 2 is a flowchart of another embodiment of the data processing method of the present invention;
Fig. 3 is a schematic structural diagram of one embodiment of the data processing device of the present invention;
Fig. 4 is a schematic structural diagram of another embodiment of the data processing device of the present invention;
Fig. 5 is a schematic structural diagram of one embodiment of the data processing system of the present invention.
Detailed description of the embodiments
Fig. 1 is a flowchart of one embodiment of the data processing method of the present invention. As shown in Fig. 1, the method of this embodiment is executed by a data processing device located in the cloud storage, and comprises:
Step 101: receive a data backup request message sent by a client, the data backup request message comprising a user identifier and the fingerprint information corresponding to the backup data.
Cloud storage, also simply called the cloud, is a new concept extended and developed from the concept of cloud computing. It refers to a system that uses functions such as cluster applications, grid technology or distributed file systems to make a large number of storage devices of different types in a network work together through application software, jointly providing data storage and service access functions to the outside. The fingerprint information may be the hash value of the backup data; any other value that uniquely characterizes the data may also be used as its fingerprint information.
In this embodiment, the client takes the backup data as the unit, computes the fingerprint information of the backup data, then carries that fingerprint information and the user identifier in a data backup request message and sends the message to the data processing device.
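As a minimal sketch of the client side of step 101, the request could be built as follows. The patent does not specify a hash algorithm or a message encoding; SHA-256, the JSON layout, the field names and the function name `build_backup_request` are assumptions made for illustration.

```python
import hashlib
import json


def build_backup_request(user_id: str, backup_data: bytes) -> str:
    """Build a data backup request carrying the user identifier and the fingerprint.

    The fingerprint is computed over the whole backup data unit; the data
    itself is NOT included in the request.
    """
    fingerprint = hashlib.sha256(backup_data).hexdigest()  # assumed hash choice
    return json.dumps({"user_id": user_id, "fingerprint": fingerprint})


request = build_backup_request("userB", b"example backup data")
print(request)
```

Because only the fingerprint travels in the request, the server can decide whether a transfer is needed before any backup data is sent.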
Step 102: query, according to the fingerprint information of the backup data, the folder corresponding to the user identifier, and determine whether data identical to the backup data exists; if no data identical to the backup data exists, query, according to the fingerprint information of the backup data, the folders corresponding to other user identifiers, and determine whether data identical to the backup data exists.
Step 103: if data identical to the backup data exists in the folders corresponding to the other user identifiers, the data identical to the backup data in those folders being second historical data, determine whether the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold.
Step 104: if the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, send a backup message to the client, receive the backup data sent by the client, and generate reference data for the second historical data.
In this embodiment, the reference data is very small. Its content is a pointer to the historical data that has the same content: when the client reads the data, the historical data with the same content is located by following this pointer and is then read out.
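The pointer behaviour of reference data can be sketched as follows. The storage layout, the `ReferenceData` class, the fingerprint key `fp-001` and the `read` function are hypothetical and serve only to show how a read follows the pointer.

```python
from dataclasses import dataclass
from typing import Dict, Union


@dataclass(frozen=True)
class ReferenceData:
    """A small pointer to historical data with the same content (hypothetical layout)."""
    fingerprint: str


# Hypothetical storage: historical data keyed by fingerprint, and per-user
# folders whose entries are either stored bytes or a ReferenceData pointer.
historical_data: Dict[str, bytes] = {"fp-001": b"quarterly report"}
folders: Dict[str, Dict[str, Union[bytes, ReferenceData]]] = {
    "userA": {"report.doc": historical_data["fp-001"]},
    "userB": {"report.doc": ReferenceData("fp-001")},
}


def read(user_id: str, name: str) -> bytes:
    """Return the content of an entry, following the pointer if it is a reference."""
    entry = folders[user_id][name]
    if isinstance(entry, ReferenceData):
        return historical_data[entry.fingerprint]
    return entry


print(read("userB", "report.doc"))  # prints b'quarterly report'
```

User B's folder holds only the small pointer, yet reading it returns the same content that user A stored.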
In this embodiment, the random number corresponding to each item of historical data stored in the data processing device is generated randomly; that is, the random numbers corresponding to any two items of historical data may or may not be equal. With the random number greater than or equal to the preset threshold, as soon as data identical to the backup data is found in the folder of another user identifier, that data is taken as the second historical data, and it is determined whether the number of reference data items of the second historical data is less than its random number. The main technical problem addressed by the present invention is how to prevent the content of confidential historical data from being stolen. In general, only one copy of confidential historical data is stored, and its number of reference data items is generally also 1. When the hash value of the backup data equals the hash value of the confidential data, the content of the backup data is identical to that of the confidential historical data; but because the number of reference data items is less than the random number, the user is still prompted to store the backup data, so the user cannot learn that the cloud stores confidential historical data identical to the backup data. If the user then backs up the same data to the cloud again, the user may determine from the data traffic during storage that identical data is already held in the database, that is, that the cloud performed source-side deduplication. However, because the user has already stored a copy of the backup data, and the random number is unknown to the user, the user still cannot determine whether the cloud stores confidential historical data identical to the backup data.
In this embodiment, a data backup request message carrying the user identifier and the hash value of the backup data is received from the client, and the folder corresponding to the user identifier and the folders of other users are queried according to the fingerprint information of the backup data to determine whether identical data is already stored. If identical data is found in another user's folder, that data is the second historical data, and the number of reference data items corresponding to it is compared with the random number corresponding to it. Consequently, when another user backs up data with guessed content to the cloud, the client must still transmit the backup data even if the cloud already holds data with the same content, as long as the number of reference data items of the second historical data is less than its random number and the random number is greater than or equal to the preset threshold. Other users therefore cannot detect whether identical data has already been backed up in the database, which effectively prevents leakage of user data.
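The decision of steps 103 and 104 reduces to one comparison, sketched below at the level of the response message. The `Response` enum and the function name are assumptions; the `BACKUP_SUCCESS` branch is not part of this embodiment and is shown only because the Fig. 2 embodiment describes it (step 208).

```python
from enum import Enum


class Response(Enum):
    BACKUP = "backup"                  # client must transmit the backup data
    BACKUP_SUCCESS = "backup_success"  # no transmission needed (Fig. 2, step 208)


def respond_to_cross_user_match(reference_count: int, random_number: int) -> Response:
    """Choose the response when identical data exists in another user's folder.

    While the reference count is below the per-data random number, the server
    behaves exactly as if the data were not stored at all.
    """
    if reference_count < random_number:
        return Response.BACKUP
    return Response.BACKUP_SUCCESS


# A single stored copy (count 1) with any random number >= 2 still forces an upload.
print(respond_to_cross_user_match(reference_count=1, random_number=2))
```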
Fig. 2 is a flowchart of another embodiment of the data processing method of the present invention. In this embodiment, the method is executed by a data processing device located in the cloud storage, and the technical solution is described in detail taking a hash value as the fingerprint information. As shown in Fig. 2, the method comprises:
Step 201: receive a data backup request message sent by a client, the data backup request message comprising a user identifier and the hash value of the backup data.
In this embodiment, the client takes the backup data as the unit, computes the hash value of the backup data, then carries that hash value and the user identifier in a data backup request message and sends the message to the data processing device.
Step 202: query, according to the hash value of the backup data, the folder corresponding to the user identifier, and determine whether data identical to the backup data exists; if not, perform step 203; if so, perform step 207.
Step 203: query, according to the hash value of the backup data, the folders corresponding to other user identifiers, and determine whether data identical to the backup data exists, the data identical to the backup data in the folders corresponding to other user identifiers being second historical data; if so, perform step 204; if not, perform step 209.
Step 204: determine whether the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data; if it is less, perform step 205; if it is greater than or equal, perform step 208. The random number is greater than or equal to a preset threshold.
In this embodiment, the random number corresponding to each item of historical data stored in the data processing device is generated randomly; that is, the random numbers corresponding to any two items of historical data may or may not be equal. The threshold for the random number can be set from a statistical analysis of how users habitually back up confidential data. For example, if users usually save confidential information only once, the number of reference data items for a user's confidential information will usually not exceed 2, so the threshold of the random number can be set to 2. The random number will then be greater than the number of reference data items, and source-side deduplication will not occur when another user stores the data in the cloud. If statistical analysis shows instead that users usually back up confidential data one additional time, the number of reference data items for a user's confidential information will usually not exceed 3, and the threshold of the random number can be set to 3.
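A minimal sketch of drawing the per-data random number is given below. The function name `generate_random_number` and the use of Python's `secrets` module are assumptions; the default upper bound of 10 follows the example range [2, N] with N = 10 given in this description, and the threshold values 2 and 3 follow the examples above.

```python
import secrets


def generate_random_number(threshold: int, upper_bound: int = 10) -> int:
    """Draw an integer uniformly from [threshold, upper_bound].

    `threshold` is the preset threshold (for example 2 or 3, chosen from
    statistics on users' backup habits); `upper_bound` plays the role of N.
    """
    if upper_bound < threshold:
        raise ValueError("upper_bound must be >= threshold")
    # secrets.randbelow(n) returns an integer in [0, n).
    return threshold + secrets.randbelow(upper_bound - threshold + 1)


print(generate_random_number(threshold=2))
```

An unpredictable source is used here on the assumption that the random number should not be guessable by other users; the description only requires that it be random and unknown to the user.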
Step 205: send a backup message to the client, receive the backup data sent by the client, and generate reference data for the second historical data.
Step 206: increase the number of reference data items of the second historical data by 1. End.
Step 207: generate reference data for the first historical data (the data in the folder corresponding to the user identifier that is identical to the backup data), increase the number of reference data items of the first historical data by 1, and then send a backup success message to the client. End.
Step 208: generate the reference data corresponding to the second historical data, send a backup success message to the client, and perform step 206.
Step 209: send a data backup request acknowledgment message to the client, receive and store the backup data sent by the client, and then generate the random number corresponding to the backup data.
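Steps 201 to 209 can be sketched end to end as follows. This is a simplified illustration under stated assumptions, not the patented implementation: the request and the upload are collapsed into one `backup` call that reports whether an upload was required, SHA-256 stands in for the hash, and the names `HistoricalData` and `DedupServer` are hypothetical. The sketch also assumes that a reference generated for a user is recorded in that user's folder, so that a repeated backup by the same user is matched in step 202.

```python
import hashlib
import secrets
from dataclasses import dataclass


@dataclass
class HistoricalData:
    content: bytes
    random_number: int        # drawn once, when the data is first stored
    reference_count: int = 1  # the description sets this to 1 on first storage


class DedupServer:
    def __init__(self, threshold: int = 2, upper_bound: int = 10):
        self.threshold = threshold
        self.upper_bound = upper_bound
        self.store = {}    # fingerprint -> HistoricalData
        self.folders = {}  # user_id -> set of fingerprints visible in that folder

    def backup(self, user_id: str, data: bytes) -> bool:
        """Process one backup; return True if the client had to upload the data."""
        fingerprint = hashlib.sha256(data).hexdigest()            # step 201
        folder = self.folders.setdefault(user_id, set())
        if fingerprint in folder:                                 # step 202
            self.store[fingerprint].reference_count += 1          # step 207
            return False
        folder.add(fingerprint)  # assumption: the new entry lands in this user's folder
        record = self.store.get(fingerprint)                      # step 203
        if record is None:                                        # step 209
            span = self.upper_bound - self.threshold + 1
            random_number = self.threshold + secrets.randbelow(span)
            self.store[fingerprint] = HistoricalData(data, random_number)
            return True
        upload_needed = record.reference_count < record.random_number  # step 204
        record.reference_count += 1           # step 205 or 208, followed by step 206
        return upload_needed


server = DedupServer()
print(server.backup("userA", b"quote"))  # True: first copy is stored (step 209)
print(server.backup("userB", b"quote"))  # True: count 1 < random number (step 205)
print(server.backup("userB", b"quote"))  # False: found in userB's own folder (step 207)
```

With the default threshold of 2, user B observes an upload followed by a deduplicated backup whether or not user A stored the data first, so the traffic pattern alone does not reveal user A's copy.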
For example, in this embodiment, when a user backs up an item of backup data for the first time, the client first takes the backup data as the unit, computes its hash value, carries the hash value and the user identifier in a data backup request message, and sends the message to the data processing device. After receiving the message, the data processing device queries the folder corresponding to the user identifier according to the hash value and determines whether data identical to the backup data exists. Because this is the first time the user has backed up this data, the folder contains no identical data. The device then queries the folders corresponding to other user identifiers according to the hash value to determine whether identical data exists there. If not, it sends a data backup request acknowledgment message to the client, receives and stores the backup data sent by the client, and then generates the random number corresponding to the backup data. The random number may take a value in the range [2, N], where N is an integer; for example, N may be 10. In addition, when the data processing device stores the backup data, the number of reference data items corresponding to the backup data is 1.
If data identical to the backup data exists in the folders corresponding to the other user identifiers, that data is the second historical data, and it must be determined whether the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data. If it is less, a backup message is sent to the client, the backup data is received, reference data for the second historical data is then generated, and the number of reference data items corresponding to the second historical data is increased by 1.
When the user backs up the same backup data a second time, the client again takes the backup data as the unit and computes its hash value, which is identical to the hash value computed for the first backup, then carries the hash value and the user identifier in a data backup request message and sends the message to the data processing device. Because this is the user's second backup of the data, the data processing device determines that data identical to the backup data exists in the folder corresponding to the user identifier. The data in that folder that is identical to the backup data is taken as the first historical data; reference data for the first historical data is generated, the number of reference data items of the first historical data is increased by 1, and a backup success message is then sent to the client.
In this embodiment, a data backup request message carrying the user identifier and the hash value of the backup data is received from the client, and the folder corresponding to the user identifier and the folders of other users are queried according to the hash value of the backup data to determine whether identical data is already stored. If identical data is found in another user's folder, that data is the second historical data, and the number of reference data items corresponding to it is compared with the random number corresponding to it. Consequently, when another user backs up data with guessed content to the cloud, the client must still transmit the backup data even if the cloud already holds data with the same content, as long as the number of reference data items of the second historical data is less than its random number and the random number is greater than or equal to the preset threshold. Other users therefore cannot detect whether identical data has already been backed up in the database, which effectively prevents leakage of user data. In addition, when the number of reference data items corresponding to the second historical data is greater than or equal to the random number corresponding to the second historical data, reference data corresponding to the second historical data is generated without transmitting the backup data, which effectively improves client backup performance.
Fig. 3 is a schematic structural diagram of one embodiment of the data processing device of the present invention. As shown in Fig. 3, the device of this embodiment comprises a transceiver module 11, a judging module 12 and a reference data generation module 13. The transceiver module 11 is configured to receive a data backup request message sent by a client, the data backup request message comprising a user identifier and fingerprint information of backup data. The judging module 12 is configured to query, according to the fingerprint information of the backup data, the folder corresponding to the user identifier, and determine whether data identical to the backup data exists; if no data identical to the backup data exists, to query, according to the fingerprint information of the backup data, the folders corresponding to other user identifiers, and determine whether data identical to the backup data exists; and, if data identical to the backup data exists in the folders corresponding to the other user identifiers, the data identical to the backup data in those folders being second historical data, to determine whether the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold. The transceiver module 11 is further configured to, if the judging module 12 determines that the number of reference data items corresponding to the second historical data is less than the random number corresponding to the second historical data, send a backup message to the client and receive the backup data. The reference data generation module 13 is configured to generate reference data for the second historical data.
The data processing device of this embodiment can carry out the technical solution of the method embodiment shown in Fig. 1; the principle is similar and is not repeated here.
In this embodiment, a data backup request message carrying the user identifier and the fingerprint information of the backup data is received from the client, and the folder corresponding to the user identifier and the folders of other users are queried according to the fingerprint information of the backup data to determine whether identical data is already stored. If identical data is found in another user's folder, that data is the second historical data, and the number of reference data items corresponding to it is compared with the random number corresponding to it. Consequently, when another user backs up data with guessed content to the cloud, the client must still transmit the backup data even if the cloud already holds data with the same content, as long as the number of reference data items of the second historical data is less than its random number and the random number is greater than or equal to the preset threshold. Other users therefore cannot detect whether source-side deduplication has occurred, which effectively prevents leakage of user data.
Fig. 4 is a schematic structural diagram of another embodiment of the data processing device of the present invention. As shown in Fig. 4, on the basis of the embodiment shown in Fig. 3, the judging module 12 is further configured to determine that data identical to the backup data exists in the folder corresponding to the user identifier, the data identical to the backup data in that folder being first historical data; the reference data generation module 13 is further configured to generate reference data for the first historical data; and the transceiver module 11 is further configured to send a backup success message to the client.
Further, the reference data generation module 13 is also configured to generate the reference data corresponding to the second historical data if the judging module 12 determines that the number of reference data items corresponding to the second historical data is greater than or equal to the random number corresponding to the second historical data; the transceiver module 11 is also configured to send a backup success message to the client.
Further, the device also comprises a reference data count recording module 14, configured to increase the number of reference data items corresponding to the first historical data by 1, or to increase the number of reference data items corresponding to the second historical data by 1.
Further, the transceiver module 11 is also configured to, if the judging module 12 determines that no data identical to the backup data exists in the folders corresponding to the other user identifiers, send a backup message to the client and receive the backup data sent by the client. The device then also comprises a data storage module 15 and a random number generation module 16: the data storage module 15 is configured to store the backup data, and the random number generation module 16 is configured to generate the random number corresponding to the backup data.
The data processing device of this embodiment can carry out the technical solution of the method embodiment shown in Fig. 2; the principle is similar and is not repeated here.
In this embodiment, a data backup request message carrying the user identifier and the fingerprint information of the backup data is received from the client, and the folder corresponding to the user identifier and the folders of other users are queried according to the fingerprint information of the backup data to determine whether identical data is already stored. If identical data is found in another user's folder, that data is the second historical data, and the number of reference data items corresponding to it is compared with the random number corresponding to it. Consequently, when another user backs up data with guessed content to the cloud, the client must still transmit the backup data even if the cloud already holds data with the same content, as long as the number of reference data items of the second historical data is less than its random number and the random number is greater than or equal to the preset threshold. Other users therefore cannot detect whether source-side deduplication has occurred, which effectively prevents leakage of user data. In addition, when the number of reference data items corresponding to the second historical data is greater than or equal to the random number corresponding to the second historical data, reference data corresponding to the second historical data is generated without transmitting the backup data, which effectively improves client backup performance.
Fig. 5 is a schematic structural diagram of an embodiment of the data processing system of the present invention. As shown in Fig. 5, the system comprises a client 21 and a data processing device 22. The data processing device 22 may be the device shown in Fig. 3 or Fig. 4 and can carry out the technical solution of the method embodiment shown in Fig. 1 or Fig. 2; the principle is similar and is not repeated here.
A person of ordinary skill in the art will understand that all or some of the steps of the foregoing method embodiments may be implemented by hardware controlled by program instructions. The program may be stored in a computer-readable storage medium. When executed, the program performs the steps of the foregoing method embodiments. The storage medium includes any medium that can store program code, such as a ROM, a RAM, a magnetic disk, or an optical disc.
Finally, it should be noted that the foregoing embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions described in those embodiments may still be modified, or some or all of their technical features may be replaced with equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (9)

1. A data processing method, characterized by comprising:
Receiving a data backup request message sent by a client, the data backup request message comprising a user ID and fingerprint information of backup data;
Querying, according to the fingerprint information of the backup data, the folder corresponding to the user ID, and judging whether data identical to the backup data exists; if it is judged that no data identical to the backup data exists, querying, according to the fingerprint information of the backup data, the folders corresponding to other user IDs, and judging whether data identical to the backup data exists;
If it is judged that data identical to the backup data exists in the folders corresponding to the other user IDs, the data identical to the backup data in those folders being second historical data, judging whether the quantity of reference data corresponding to the second historical data is less than a random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold;
If it is judged that the quantity of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data, sending a backup message to the client, receiving the backup data sent by the client, and generating reference data of the second historical data.
2. The data processing method according to claim 1, characterized by further comprising:
If it is judged that data identical to the backup data exists in the folder corresponding to the user ID, the data identical to the backup data in that folder being first historical data, generating reference data of the first historical data and sending a backup success message to the client.
3. The data processing method according to claim 1, characterized by further comprising:
If it is judged that the quantity of reference data corresponding to the second historical data is greater than or equal to the random number corresponding to the second historical data, generating the reference data corresponding to the second historical data and sending a backup success message to the client.
4. The data processing method according to claim 1, characterized by further comprising:
If it is judged that no data identical to the backup data exists in the folders corresponding to the other user IDs, sending a backup request acknowledgment message to the client, and receiving and saving the backup data sent by the client;
Generating the random number corresponding to the backup data.
5. A data processing device, characterized by comprising:
A transceiver module, configured to receive a data backup request message sent by a client, the data backup request message comprising a user ID and fingerprint information of backup data;
A judging module, configured to query, according to the fingerprint information of the backup data, the folder corresponding to the user ID and judge whether data identical to the backup data exists; if it is judged that no data identical to the backup data exists, to query, according to the fingerprint information of the backup data, the folders corresponding to other user IDs and judge whether data identical to the backup data exists; and if it is judged that data identical to the backup data exists in the folders corresponding to the other user IDs, the data identical to the backup data in those folders being second historical data, to judge whether the quantity of reference data corresponding to the second historical data is less than a random number corresponding to the second historical data, wherein the random number is greater than or equal to a preset threshold;
The transceiver module being further configured to send a backup message to the client and receive the backup data if the judging module judges that the quantity of reference data corresponding to the second historical data is less than the random number corresponding to the second historical data;
A reference data generation module, configured to generate reference data of the second historical data.
6. The data processing device according to claim 5, characterized in that the judging module is further configured to judge that data identical to the backup data exists in the folder corresponding to the user ID, the data identical to the backup data in that folder being first historical data;
The reference data generation module is further configured, in that case, to generate reference data of the first historical data;
The transceiver module is further configured to send a backup success message to the client.
7. The data processing device according to claim 5, characterized in that the reference data generation module is further configured to generate the reference data corresponding to the second historical data if the judging module judges that the quantity of reference data corresponding to the second historical data is greater than or equal to the random number corresponding to the second historical data;
The transceiver module is further configured to send a backup success message to the client.
8. The data processing device according to claim 5, characterized in that the transceiver module is further configured to send a backup request acknowledgment message to the client, and to receive the backup data sent by the client, if the judging module judges that no data identical to the backup data exists in the folders corresponding to the other user IDs; the device further comprising:
A data storage module, configured to save the backup data;
A random number generation module, configured to generate the random number corresponding to the backup data.
9. A data processing system, characterized by comprising a client and the data processing device according to any one of claims 5 to 8.
CN201110426631.6A 2011-12-19 2011-12-19 Data processing method, device and system Active CN102523290B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110426631.6A CN102523290B (en) 2011-12-19 2011-12-19 Data processing method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110426631.6A CN102523290B (en) 2011-12-19 2011-12-19 Data processing method, device and system

Publications (2)

Publication Number Publication Date
CN102523290A CN102523290A (en) 2012-06-27
CN102523290B true CN102523290B (en) 2015-04-08

Family

ID=46294077

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110426631.6A Active CN102523290B (en) 2011-12-19 2011-12-19 Data processing method, device and system

Country Status (1)

Country Link
CN (1) CN102523290B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102999400A (en) * 2012-11-22 2013-03-27 中国电信股份有限公司云计算分公司 Data backup method and device of cloud storage system
CN103064757A (en) * 2012-12-12 2013-04-24 鸿富锦精密工业(深圳)有限公司 Method and system for backing up data
CN106598765A (en) * 2015-10-15 2017-04-26 北京国双科技有限公司 Data check method and device
CN106250723B (en) * 2016-08-10 2020-09-25 智者四海(北京)技术有限公司 Control method and device based on page characters
CN106572177A (en) * 2016-11-07 2017-04-19 广东欧珀移动通信有限公司 Data transmission method and mobile terminal
CN107562555A (en) * 2017-08-02 2018-01-09 网宿科技股份有限公司 The cleaning method and server of duplicate data
CN107276857A (en) * 2017-08-16 2017-10-20 郑州云海信息技术有限公司 A kind of method and device for monitoring flow
CN110096388A (en) * 2019-04-28 2019-08-06 平安科技(深圳)有限公司 A kind of method, apparatus and computer storage medium of data backup
CN114442904A (en) * 2020-10-30 2022-05-06 伊姆西Ip控股有限责任公司 Method, apparatus and computer program product for managing a storage system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101582076A (en) * 2009-06-24 2009-11-18 浪潮电子信息产业股份有限公司 Data de-duplication method based on data base
CN101882141A (en) * 2009-05-08 2010-11-10 北京众志和达信息技术有限公司 Method and system for implementing repeated data deletion

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8280926B2 (en) * 2003-08-05 2012-10-02 Sepaton, Inc. Scalable de-duplication mechanism

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101882141A (en) * 2009-05-08 2010-11-10 北京众志和达信息技术有限公司 Method and system for implementing repeated data deletion
CN101582076A (en) * 2009-06-24 2009-11-18 浪潮电子信息产业股份有限公司 Data de-duplication method based on data base

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
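Research and improvement of data de-duplication technology based on the MD5 algorithm; Liao Haisheng et al.; Computer Measurement & Control (《计算机测量与控制》); 20100331; Vol. 18 (No. 3); pp. 635-638 *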

Also Published As

Publication number Publication date
CN102523290A (en) 2012-06-27

Similar Documents

Publication Publication Date Title
CN102523290B (en) Data processing method, device and system
US9026679B1 (en) Methods and apparatus for persisting management information changes
KR102460096B1 (en) Method and apparatus for managing encryption keys for cloud service
CN103095687B (en) metadata processing method and device
CN103019960B (en) Distributed caching method and system
US9342370B2 (en) Server migration
US8375200B2 (en) Embedded device and file change notification method of the embedded device
CN103765373B (en) Date storage method, data storage device and memory device
JP2016510148A (en) Data processing method and device in distributed file storage system
US10795860B1 (en) WAN optimized micro-service based deduplication
CN109857710A (en) File memory method and terminal device
CN103095843A (en) Method and client of data backup based on version vectors
CN103988201A (en) Efficient backup replication
CN103227818A (en) Terminal, server, file transferring method, file storage management system and file storage management method
CN104735110A (en) Metadata management method and system
CN113742135B (en) Data backup method, device and computer readable storage medium
CN103118104A (en) Data restoration method based on version vector, and server
CN113687964B (en) Data processing method, device, electronic equipment, storage medium and program product
WO2013165388A1 (en) Segment combining for deduplication
US20170269847A1 (en) Method and Device for Differential Data Backup
CN109597903A (en) Image file processing apparatus and method, document storage system and storage medium
US10915251B2 (en) Dynamic parallelism
CN105471955A (en) Writing method of distributed file system, client device and distributed file system
CN102082791A (en) Data backup implementation method, client, server and system
US11416447B2 (en) Deduplicating distributed erasure coded objects

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C53 Correction of patent of invention or patent application
CB02 Change of applicant information

Address after: 611731 Chengdu high tech Zone, Sichuan, West Park, Qingshui River

Applicant after: HUAWEI DIGITAL TECHNOLOGIES (CHENG DU) Co.,Ltd.

Address before: 611731 Chengdu high tech Zone, Sichuan, West Park, Qingshui River

Applicant before: CHENGDU HUAWEI SYMANTEC TECHNOLOGIES Co.,Ltd.

COR Change of bibliographic data

Free format text: CORRECT: APPLICANT; FROM: CHENGDU HUAWEI SYMANTEC TECHNOLOGIES CO., LTD. TO: HUAWEI DIGITAL TECHNOLOGY (CHENGDU) CO., LTD.

C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220905

Address after: No. 1899 Xiyuan Avenue, high tech Zone (West District), Chengdu, Sichuan 610041

Patentee after: Chengdu Huawei Technologies Co.,Ltd.

Address before: 611731 Qingshui River District, Chengdu hi tech Zone, Sichuan, China

Patentee before: HUAWEI DIGITAL TECHNOLOGIES (CHENG DU) Co.,Ltd.
