CN109086172A - A kind of method and relevant apparatus of data processing - Google Patents

A kind of method and relevant apparatus of data processing Download PDF

Info

Publication number
CN109086172A
CN109086172A CN201811108304.4A CN201811108304A CN109086172A CN 109086172 A CN109086172 A CN 109086172A CN 201811108304 A CN201811108304 A CN 201811108304A CN 109086172 A CN109086172 A CN 109086172A
Authority
CN
China
Prior art keywords
target data
identifier
data
storage equipment
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811108304.4A
Other languages
Chinese (zh)
Other versions
CN109086172B (en
Inventor
何孝金
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd filed Critical Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201811108304.4A priority Critical patent/CN109086172B/en
Publication of CN109086172A publication Critical patent/CN109086172A/en
Application granted granted Critical
Publication of CN109086172B publication Critical patent/CN109086172B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1464Management of the backup or restore process for networked environments

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the present application discloses a kind of method of data processing, comprising: the first storage equipment reads target data;First storage equipment detection target data whether there is corresponding first identifier and corresponding second identifier, wherein second identifier is for marking target data by compression processing;If there are corresponding first identifier and corresponding second identifier, the first storage equipment to send first object data packet to the second storage equipment for target data.The embodiment of the present application also discloses a kind of data processing equipment.The embodiment of the present application alleviates the burden of main storage device and backup storage device data processing.

Description

A kind of method and relevant apparatus of data processing
Technical field
This application involves field of data storage more particularly to the methods and relevant apparatus of a kind of data processing.
Background technique
Remote copy technology is a kind of remote data backup technology based on storage equipment, is generally divided into synchronous remote copy And asynchronous remote copy.Synchronous remote copy cardinal principle is that data need while writing main storage device and backup storage device On, the cardinal principle of asynchronous remote copy is then first to write data on main storage device, it is subsequent by data from main storage device It copies on backup storage device.
In big data era, the data storage of magnanimity occupies a large amount of memory space.It is deleted at processing technique and compression again Reason technology is can currently to reduce the core technology of data space, especially in the full flash memory storage battle array of memory space higher cost It arranges (all flash array, AFA), deletes processing technique again and compression processing technology has become characteristic indispensable in AFA. It is usual that processing technique is deleted again are as follows: the data being newly written are calculated into a cryptographic Hash, are then compared with stored cryptographic Hash, if It was found that there is identical cryptographic Hash, then the position of identical cryptographic Hash corresponding data is recorded, current data is not written into storage equipment.
However, carrying out target data between main storage device and backup storage device in existing remote copy technology Duplication when, not perceive target data whether deleted processing and compression processing again, cause data replicate when, even if Target data has been carried out and deletes processing and compression processing again, and main storage device is sent after still unziping it to target data To backup storage device, backup storage device is deleted processing and compression processing to the target data after decompression again again.By This, has not only aggravated the burden of main storage device and backup storage device data processing, while main storage device and backup storage The data volume transmitted between equipment is very big, causes to restore point target (recovery point when asynchronous remote copy Objective, RPO) it is very high.
Summary of the invention
The embodiment of the present application provides a kind of method of data processing, for storing the remote data backup of equipment.
In view of this, the application first aspect provides a kind of method of data processing, comprising:
First storage equipment reads target data;
The first storage equipment detects the target data and whether there is corresponding first identifier and corresponding second identifier, Wherein, the second identifier is for marking the target data by compression processing;
If there are the corresponding first identifier and the corresponding second identifier, the first storage equipment for the target data First object data packet is sent to the second storage equipment, so that the second storage equipment is according to the first object data packet to this Target data is handled;
Wherein, the target data, the first identifier and the second identifier are at least carried in the first object data packet, it should First identifier is used to indicate the second storage equipment and is deleted processing again to the target data according to the first identifier, second mark Knowledge is used to indicate the second storage equipment and carries out write-in processing to the target data.
In conjunction with the embodiment of the present application in a first aspect, in the first possible implementation of the first aspect, this first After storage equipment detects the target data with the presence or absence of corresponding first identifier and corresponding second identifier, this method is also wrapped It includes:
If the corresponding first identifier and the corresponding second identifier is not present in the target data, which is set It is standby to send the second target packet to the second storage equipment, so that the second storage equipment is according to second target packet The target data is handled;
Wherein, the target data and third mark are at least carried in second data packet, third mark is used to indicate The second storage equipment carries out compression processing to the target data.
In conjunction with the first possible implementation of the first aspect of the embodiment of the present application, second in first aspect can In the implementation of energy, after which reads the target data, this method further include:
If the target data is stored in the buffer zone of the first storage equipment, which second deposits to this Storage equipment sends second target packet, so that the second storage equipment is according to second target packet to the number of targets According to being handled;
Wherein, the target data and third mark are at least carried in second data packet.
In conjunction with the embodiment of the present application in a first aspect, in a third possible implementation of the first aspect, this first After storage equipment detects the target data with the presence or absence of corresponding first identifier and corresponding second identifier, this method is also wrapped It includes:
If the target data is there are the corresponding first identifier and the corresponding second identifier is not present, first storage Equipment sends third target packet to the second storage equipment, so that the second storage equipment is according to the third target data Packet handles the target data;
Wherein, the target data, the first identifier and third mark are at least carried in the third target packet.
In conjunction with the embodiment of the present application in a first aspect, in a fourth possible implementation of the first aspect, this first After storage equipment detects the target data with the presence or absence of corresponding first identifier and corresponding second identifier, this method is also wrapped It includes:
If the target data is there is no the corresponding first identifier and there are the corresponding second identifier, first storages Equipment sends the 4th target packet to the second storage equipment, so that the second storage equipment is according to the 4th target data Packet handles the target data;
Wherein, the target data and the second identifier are at least carried in the 4th target packet.
The application second aspect provides a kind of data processing equipment, which includes:
Read module, for reading target data;
Detection module whether there is corresponding first identifier and corresponding second identifier for detecting the target data, Wherein, the second identifier is for marking the target data by compression processing;
Sending module, if for the target data there are the corresponding first identifier and the corresponding second identifier, The sending module sends first object data packet to the second storage equipment, so that the second storage equipment is according to the first object Data packet handles the target data;
Wherein, the target data, the first identifier and the second identifier are at least carried in the first object data packet, it should First identifier is used to indicate the second storage equipment and is deleted processing again to the target data according to the first identifier, second mark Knowledge is used to indicate the second storage equipment and carries out write-in processing to the target data.
In conjunction with the second aspect of the embodiment of the present application, in the first possible implementation of the second aspect, provide A kind of data processing equipment, comprising:
The sending module, if being also used to the target data, there is no the corresponding first identifier and corresponding second marks Know, then the sending module to this second storage equipment send the second target packet so that this second storage equipment according to be somebody's turn to do Second target packet handles the target data;
Wherein, the target data and third mark are at least carried in second data packet, third mark is used to indicate The second storage equipment carries out compression processing to the target data.
In conjunction with the first possible implementation of the second aspect of the embodiment of the present application, second in second aspect can In the implementation of energy, a kind of data processing equipment is provided, comprising:
The sending module, if being also used to the buffer zone that the target data is stored in the first storage equipment, the transmission Module sends second target packet to the second storage equipment, so that the second storage equipment is according to second number of targets The target data is handled according to packet;
Wherein, the target data and third mark are at least carried in second data packet.
In conjunction with the second aspect of the embodiment of the present application, in the third possible implementation of the second aspect, provide A kind of data processing equipment, comprising:
The sending module, if be also used to the target data there are the corresponding first identifier and there is no it is corresponding this second Mark, then the sending module to this second storage equipment send third target packet so that this second storage equipment according to The third target packet handles the target data;
Wherein, the target data, the first identifier and third mark are at least carried in the third target packet.
In conjunction with the second aspect of the embodiment of the present application, in the fourth possible implementation of the second aspect, provide A kind of data processing equipment, comprising:
The sending module, if be also used to the target data there is no the corresponding first identifier and there are it is corresponding this second Mark, then the sending module to this second storage equipment send the 4th target packet so that this second storage equipment according to 4th target packet handles the target data;
Wherein, the target data and the second identifier are at least carried in the 4th target packet.
As can be seen from the above technical solutions, the embodiment of the present application has the advantage that
The embodiment of the present application provides a kind of method of data processing, for storing the remote data backup of equipment.Mitigate The burden of main storage device and backup storage device data processing, at the same reduce main storage device and backup storage device it Between the data volume transmitted, reduce recovery point target when asynchronous remote copy.
Detailed description of the invention
Fig. 1 is the network frame schematic diagram that equipment is stored in the embodiment of the present application;
Fig. 2 is a flow diagram of data processing in the application scenarios of the application;
Fig. 3 is one embodiment schematic diagram of the method for data processing in the embodiment of the present application;
Fig. 4 is one embodiment schematic diagram of data processing equipment in the embodiment of the present application.
Specific embodiment
The embodiment of the present application provides a kind of method of data processing, for storing the remote data backup of equipment.Mitigate The burden of main storage device and backup storage device data processing, at the same reduce main storage device and backup storage device it Between the data volume transmitted, reduce recovery point target when asynchronous remote copy.
The description and claims of this application and term " first ", " second ", " third ", " in above-mentioned attached drawing The (if present)s such as four " are to be used to distinguish similar objects, without being used to describe a particular order or precedence order.It should manage The data that solution uses in this way are interchangeable under appropriate circumstances, so that the embodiments described herein can be in addition to illustrating herein Or the sequence other than the content of description is implemented.In addition, term " includes " and " having " and their any deformation, it is intended that Cover it is non-exclusive include, for example, containing the process, method, system, product or equipment of a series of steps or units need not limit In step or unit those of is clearly listed, but may include be not clearly listed or for these process, methods, produce The other step or units of product or equipment inherently.
Data processing equipment provided by the present application can be deployed in the number by main storage device and backup storage device foundation According in backup network frame, in order to make it easy to understand, referring to Fig. 1, Fig. 1 is the network frame for storing equipment in the embodiment of the present application Schematic diagram.Although in Fig. 1 including a main storage device and a backup storage device it should be appreciated that main memory The type and quantity of the type and quantity and backup storage device of storing up equipment should all be determined according to actual scene, in practical application In, the type and quantity of the type of main storage device and quantity and backup storage device are not defined, main storage device And backup storage device is either individually storage equipment is also possible to the storage array of multiple storage equipment compositions, wherein Equipment is stored either solid state hard disk (solid state drive, SSD) is also possible to hybrid hard disk (hybrid hard Drive, HHD) it is also possible to mechanical hard disk (hard disk drive, HDD) and is also possible to CD server and tape library etc., when When main storage device and backup storage device are the storage array of multiple storage equipment compositions, can by above-mentioned SSD, HHD, One or more compositions of HDD, CD server and tape library, are not construed as limiting herein.Main storage device and backup storage device it Between data communication can pass through transmission control protocol/Internet Protocol (transmission control Protocol/internet protocol, TCP/IP) transmission.
The application can be applied to the remote copy technology of data, wherein the remote copy technology of data is generally divided into synchronization Remote copy and asynchronous remote copy.Synchronous remote copy refers to through remote mirroring software, by the data of main storage device with The mode of synchronous mirror copies to backup storage device, and input/output (in/out, I/O) affairs of each main storage device are equal The completion confirmation message for needing to wait for remote copy, is just discharged.It is multiple that synchronous mirror require telecopy can with local The content of system matches.When main storage device breaks down, after the application program of user is switched to backup storage device, by mirror The remote copy of picture can guarantee that business is continued to execute without the loss of data.Asynchronous remote copy refers to guaranteeing updating The basic I/O operation of main storage system is completed before backup storage device, the I/O operation of main storage device is not set by backup storage Standby I/O operation influences.Long-range data duplication is carried out in a manner of background synchronization, this is subject to local system performance Very little is influenced, transmission range is long (up to 1000 kilometers or more), small to network bandwidth requirement.
It is described in detail below from the angle of main storage device and backup storage device.In order to make it easy to understand, below The application scenarios of a kind of method of data processing will be introduced in conjunction with Fig. 2, referring to Fig. 2, Fig. 2 is number in the application scenarios of the application According to a flow diagram of processing, as shown, specifically:
In step S1, during the remote copy of data, when data need to carry out data synchronization job, main storage device A difference bitmap can be generated, which, should for marking data different from backup storage device on main storage device Data are usually the data being newly written on main storage device, need to be written the data at this time on backup storage device, complete number According to synchronization job, main storage device first can find the logical volume address for needing the data replicated according to difference bitmap, in reality In the application of border, it will include the target data that data, which are usually stored in storage equipment in the form of data block (data block), Data block be known as target data block, since in software level, the processing to data is operated by logical volume address, because This is in the method for the data processing that the application proposes, the logical volume address of target data block first in acquisition main storage device;
It is readable according to the address in getting main storage device behind the logical volume address of target data block in step S2 Get target data block, due to data through it is overweight delete processing when data can be calculated first, generate corresponding finger print information, The finger print information be used to indicate main storage device using the finger print information search in the main storage device with the presence or absence of it is identical The finger print information of storage then records the address of identical stored finger print information corresponding data if it exists, to be currently written Data are not written into storage equipment, establish mapping relations with the address of the corresponding data of record, processing is deleted in completion again, and usually this refers to Line information is stored in the build of the data block of the data.After data carry out compression processing, a mark can be generated, for identifying this Data pass through compression processing, and the mark of the usual compression processing is stored in the build of the data block of the data.Getting main memory In storage equipment behind the logical volume address of target data block, according to the address, target data block may be read into, by searching for number of targets Compression is deleted again according to judging whether the data have been done with the presence or absence of the corresponding mark of finger print information and compression processing in block with this.If Find in target data block there are finger print information and the corresponding mark of compression processing, judge the data done delete again processing with And compression processing, judging result be it is yes, S4 is entered step, if finding in target data block there is no finger print information and compression Handle corresponding mark, judge that the data are not done and delete processing and compression processing again, judging result be it is no, enter step S3.
In step S3, when finding in target data block there is no after finger print information and compression processing corresponding mark, Data corresponding in target data block are directly sent to backup storage device by main storage device, and backup storage device is according to itself The data received are further processed in business processing demand.
In step S4, when finding in target data block, there are finger print information and the corresponding mark of compression processing, judgements The data have been done delete processing and compression processing again after, main storage device is by target data block address lookup target data block The target data stored at build deletes finger print information again, if needing to enter step when the data block for replicating multiple batches simultaneously at this time Rapid S5 enters step S6 if the data block for needing to replicate at this time is individual data block;
In step S5, after inquiring the corresponding finger print information of target data, when current needs while multiple batches are replicated Data block when, the fingerprint, which can be used, in main storage device compares the finger print information of data block that is other while needing to replicate Right, identical fingerprint, then retain a data block, and record the information of data block that is other while needing to replicate if it exists, into Row duplicate removal processing.
In step S6, find the target data it is corresponding it is heavy delete finger print information after, due to the target data Overcompression processing, therefore the target data corresponding with finger print information is deleted again read, for the target data compressed.
In step S7, after reading the target data compressed, main storage device is by the logical volume of target data block Location, the compressed data deleting fingerprint again and reading are sent to backup storage device in the form of data packet, and backup storage is set It is standby that the data received are further processed according to own service process demand.
In the present solution, main storage device, before sending target data to backup storage device, meeting is first to the target data institute Target data block inquired, inquiry then will if it exists with the presence or absence of the mark of finger print information and compression processing is deleted again This is heavy to delete the mark of finger print information and compression processing and is sent to backup in the form of data packet with the target data compressed and deposits Equipment is stored up, backup storage device can be not repeated to calculate number of targets according to the mark for deleting finger print information and compression processing again According to directly being deleted processing again using the heavy finger print information of deleting.Backup storage device can also be according to the compression processing received Mark, judge that compression processing has been carried out in the target data being currently received, therefore do not need to carry out compression processing again, It can write direct.The burden of main storage device and backup storage device data processing is alleviated, while reducing primary storage and setting The standby data volume transmitted between backup storage device reduces recovery point target when asynchronous remote copy.
Referring to Fig. 3, Fig. 3 is one embodiment schematic diagram of the method for data processing in the embodiment of the present application, the application One embodiment of the method for data processing includes: in embodiment
101, the first storage equipment reads target data;
In the present embodiment, the first storage equipment is different by obtaining record between the first storage equipment and the second storage equipment The difference bitmap or differentiated identification of data determine target data, and according to the logical volume address where the target data, read The target data, wherein logical volume address is a kind of position code, and the location information of data is indicated by Arabic numerals, such as: The corresponding logical volume address of data 1 is 1, and the corresponding logical volume address of data 2 is 2, and so on.
102, the first storage equipment detection target data whether there is corresponding first identifier and corresponding second identifier;
In the present embodiment, when target data is deleted processing again, storage equipment can carry out Hash operation to target data To generate corresponding cryptographic Hash (hash), which is known as the finger print information of the data, in the present embodiment the referred to as first mark Know, carries out Hash operation and need using hash algorithm, the hash algorithm being applicable in the application may include: xxhash algorithm, MD Hash algorithm, SHA-1 hash algorithm, SHA-2 hash algorithm, MD5 hash algorithm etc., are not construed as limiting herein.When target data into After row compression processing, storage equipment can generate one and identify for marking the target data to have been carried out compression processing, the mark It is known as second identifier in the present embodiment.The logical volume address of first identifier, second identifier and target data is target data Metadata, metadata are normally stored in the build for the data block that target data is stored, also can setting according to different storage manufacturers It sets and is stored in different regions, for example, metadata is stored in storage equipment in nonvolatile memory, be not construed as limiting herein. First storage equipment detects in this storage equipment with the presence or absence of first identifier and second identifier.
If 103, target data is there are corresponding first identifier and corresponding second identifier, and the first storage equipment is to the Two storage equipment send first object data packet;
In the present embodiment, there is the first mark corresponding with target data when the first storage equipment detects in this storage equipment Know and second identifier corresponding with target data after, first storage equipment using get first identifier, second identifier with And target data makes first object data packet, also includes the logical volume address of target data in first object data packet.The First object data packet is sent to the second storage equipment by ICP/IP protocol by one storage equipment, and the second storage equipment is receiving To after the first object data packet, the first identifier in the first object data packet can be used, detection second stores in equipment With the presence or absence of finger print information identical with first identifier, and if it exists, then the target data in first object data packet is not written into, and The logical volume address for recording the target data, and there are the addresses of the corresponding data of identical fingerprints information to establish mapping relations, complete Processing is deleted again in the second storage equipment at target data.Second storage equipment is according to the second mark in first object data packet Know, can determine that the target data being currently received is compressed data, therefore do not need again to carry out the target data Compression processing.
In the embodiment of the present application, the first storage equipment, can be first to the mesh before sending target data to the second storage equipment Target data block where mark data is detected, and detects whether there is the mark for deleting finger print information and compression processing again, if In the presence of then the heavy mark for deleting finger print information and compression processing is sent in the form of data packet with the target data compressed To the second storage equipment.Second storage equipment can be not repeated to count according to the mark for deleting finger print information and compression processing again Target data is calculated, is directly deleted processing again using the heavy finger print information of deleting.Second storage equipment can also be according to receiving The mark of compression processing judges that compression processing has been carried out in the target data being currently received, therefore does not need to carry out again Compression processing can be write direct.The burden for alleviating the first storage equipment and the processing of the second storage device data, reduces simultaneously The data volume transmitted between first storage equipment and the second storage equipment, reduces return contact mesh when asynchronous remote copy Mark.
Optionally, on the basis of Fig. 3 corresponding embodiment, the side of second of data processing provided by the embodiments of the present application In the embodiment of method, the first storage equipment detection target data whether there is corresponding first identifier and corresponding second identifier Later, method further include:
If corresponding first identifier and corresponding second identifier is not present in target data, first stores equipment to second It stores equipment and sends the second target packet, so that the second storage equipment carries out target data according to the second target packet Processing;
Wherein, target data and third mark are at least carried in the second data packet, third mark is used to indicate second and deposits It stores up equipment and compression processing is carried out to target data.
In the present embodiment, the first storage equipment detection target data is with the presence or absence of corresponding first identifier and corresponding the Two mark after, if target data be not present corresponding first identifier and corresponding second identifier, i.e., the target data without It is overweight delete processing and compression processing after, first storage equipment obtain third mark, and using third mark and number of targets It also include the logical volume address of target data in the second target packet according to the second target packet is made.Wherein third mark Know for the first storage equipment after detecting the uncompressed processing of the target data, the newly-built mark target data is without pressure The mark of contracting processing data.Second storage equipment is identified according to the third carried in the second target packet received, is determined Target data in second target packet is the data of uncompressed processing, simultaneously because not carrying in the second target packet First identifier, therefore the second storage equipment can choose whether to need to be deleted processing again to the target data according to self-demand And compression processing.
In the embodiment of the present application, when the first storage equipment detects target data, there is no first identifier and second identifiers Later, the second target packet is sent to the second storage equipment, carries target data and third mark in the second target packet Know.The method of data processing of the target data without deleting processing and compression processing again is provided, the realization spirit of scheme is improved Activity.
Optionally, on the basis of the embodiment of the method for second of data processing provided by the embodiments of the present application, this Shen Please embodiment provide the third data processing method embodiment in, first storage equipment read target data after, side Method further include:
If target data is stored in the buffer zone of the first storage equipment, the first storage equipment is sent out to the second storage equipment The second target packet is sent, so that the second storage equipment is handled target data according to the second target packet;
Wherein, target data and third mark are at least carried in the second data packet.
In the present embodiment, after the first storage equipment reads target data, the first storage equipment may determine that target The currently stored position of data whether be the first storage equipment buffer zone (cache), if so, skip detection first store The step of whether there is first identifier and second identifier in equipment sends the second target packet to the second storage equipment.The Two storage equipment receive the process flow executed after the second target packet, similar second of number provided by the embodiments of the present application According to the embodiment of the method for processing, details are not described herein again.
In the embodiment of the present application, when in the buffer zone that target data is stored in the first storage equipment, due to buffer area Data in domain are to delete processing and compression processing without overweight, can directly judge the target data for without deleting processing again And the data of compression processing, the first storage equipment send the second target packet to the second storage equipment.Provide a kind of mesh Mark data are when being stored in buffer zone, the method for data processing, simplify the first storage equipment to the process flow of target data, Improve the feasibility of scheme.
Optionally, on the basis of Fig. 3 corresponding embodiment, the side of the 4th kind of data processing provided by the embodiments of the present application In the embodiment of method, the first storage equipment detection target data whether there is corresponding first identifier and corresponding second identifier Later, method further include:
If target data is there are corresponding first identifier and corresponding second identifier is not present, the first storage equipment is to the Two storage equipment send third target packets so that second storage equipment according to third target packet to target data into Row processing;
Wherein, target data, first identifier and third mark are at least carried in third target packet.
In the present embodiment, when the first storage equipment detects target data there are corresponding first identifier and correspondence is not present Second identifier when, i.e., target data through it is overweight delete processing and uncompressed processing, at this point, first storage equipment to second storage Equipment sends third target packet, and target data, first identifier, third mark and mesh are carried in the third target packet Mark the logical volume address of data.After second storage equipment receives third target packet, this is can be used in the second storage equipment First identifier in first object data packet, detection second stores in equipment to be believed with the presence or absence of fingerprint identical with first identifier Breath, and if it exists, then the target data in first object data packet is not written into, and records the logical volume address of the target data, with There are the addresses of the corresponding data of identical fingerprints information to establish mapping relations, completes weight of the target data in the second storage equipment Processing is deleted, the second storage equipment is identified according to the third carried in the third target packet received, determines third number of targets It is the data of uncompressed processing according to the target data in packet, the second storage equipment can choose whether needs according to self-demand Compression processing is carried out to the target data.
In the embodiment of the present application, providing target data is to delete processing and without the data processing of compression processing through overweight Method, the second storage equipment deleted processing to target data according to the third target packet received again, and according to oneself The demand of body chooses whether to carry out compression processing to target data, improves the realization flexibility of scheme.
Optionally, on the basis of Fig. 3 corresponding embodiment, the side of the 5th kind of data processing provided by the embodiments of the present application In the embodiment of method, the first storage equipment detection target data whether there is corresponding first identifier and corresponding second identifier Later, method further include:
If target data, there is no corresponding first identifier and there are corresponding second identifier, the first storage equipment is to the Two storage equipment send the 4th target packets so that second storage equipment according to the 4th target packet to target data into Row processing;
Wherein, target data and second identifier are at least carried in the 4th target packet.
In the present embodiment, when the first storage equipment detects target data there is no corresponding first identifier and there is correspondence Second identifier when, i.e., target data deletes processing and through compression processing without overweight, and the first storage equipment is to the second storage equipment The 4th target packet is sent, the logical volume of target data, second identifier and target data is carried in the 4th target packet Address.Due to not carrying first identifier in the 4th target packet, the second storage equipment can be selected according to self-demand Whether need to be deleted processing again to the target data.Second storage equipment according to the second identifier in the 4th target packet, It can determine that the target data being currently received is compressed data, therefore not need again to compress the target data Processing.
In the embodiment of the present application, providing target data is to delete processing and data processing Jing Guo compression processing without overweight Method, second storage equipment is chosen whether to delete target data again processing according to the demand of itself, improve scheme Realize flexibility.
Data processing equipment in the application is described in detail below, referring to Fig. 4, Fig. 4 is in the embodiment of the present application One embodiment schematic diagram of data processing equipment, one embodiment of data processing equipment 20 provided by the embodiments of the present application In, data processing equipment 20 includes:
Read module 201, for reading target data;
Detection module 202 whether there is corresponding first identifier and corresponding second identifier for detecting target data, Wherein, second identifier is for marking target data by compression processing;
Sending module 203, if being sent for target data there are corresponding first identifier and corresponding second identifier Module 203 sends first object data packet to the second storage equipment, so that the second storage equipment is according to first object data packet Target data is handled.
In the present embodiment, read module 201 read target data, detection module 202 detect target data with the presence or absence of pair The first identifier and corresponding second identifier answered, wherein second identifier is for marking target data by compression processing, if mesh Marking data, there are corresponding first identifier and corresponding second identifiers, then are sent out by sending module 203 to the second storage equipment First object data packet is sent, so that the second storage equipment is handled target data according to first object data packet.
In the embodiment of the present application, the first storage equipment, can be first to the mesh before sending target data to the second storage equipment Target data block where mark data is detected, and detects whether there is the mark for deleting finger print information and compression processing again, if In the presence of then the heavy mark for deleting finger print information and compression processing is sent in the form of data packet with the target data compressed To the second storage equipment.Second storage equipment can be not repeated to count according to the mark for deleting finger print information and compression processing again Target data is calculated, is directly deleted processing again using the heavy finger print information of deleting.Second storage equipment can also be according to receiving The mark of compression processing judges that compression processing has been carried out in the target data being currently received, therefore does not need to carry out again Compression processing can be write direct.The burden for alleviating the first storage equipment and the processing of the second storage device data, reduces simultaneously The data volume transmitted between first storage equipment and the second storage equipment, reduces return contact mesh when asynchronous remote copy Mark.
Optionally, on the basis of Fig. 4 corresponding embodiment, second of data processing equipment provided by the embodiments of the present application Embodiment in,
Sending module 203, if being also used to target data is not present corresponding first identifier and corresponding second identifier, Sending module 203 sends the second target packet to the second storage equipment, so that the second storage equipment is according to the second number of targets Target data is handled according to packet;
Wherein, target data and third mark are at least carried in the second data packet, third mark is used to indicate second and deposits It stores up equipment and compression processing is carried out to target data.
In the embodiment of the present application, when the first storage equipment detects target data, there is no first identifier and second identifiers Later, the second target packet is sent to the second storage equipment, carries target data and third mark in the second target packet Know.The method of data processing of the target data without deleting processing and compression processing again is provided, the realization spirit of scheme is improved Activity.
Optionally, on the basis of the embodiment of second of data processing equipment provided by the embodiments of the present application, the application In the embodiment for the third data processing equipment that embodiment provides,
Sending module 203, if being also used to the buffer zone that target data is stored in the first storage equipment, sending module 203 send the second target packet to the second storage equipment, so that the second storage equipment is according to the second target packet to mesh Mark data are handled;
Wherein, target data and third mark are at least carried in the second data packet.
In the embodiment of the present application, when in the buffer zone that target data is stored in the first storage equipment, due to buffer area Data in domain are to delete processing and compression processing without overweight, can directly judge the target data for without deleting processing again And the data of compression processing, the first storage equipment send the second target packet to the second storage equipment.Provide a kind of mesh Mark data are when being stored in buffer zone, the method for data processing, simplify the first storage equipment to the process flow of target data, Improve the feasibility of scheme.
Optionally, on the basis of Fig. 4 corresponding embodiment, the 4th kind of data processing equipment provided by the embodiments of the present application Embodiment in,
Sending module 203, if being also used to target data there are corresponding first identifier and corresponding second identifier being not present, Then sending module 203 sends third target packet to the second storage equipment, so that the second storage equipment is according to third target Data packet handles target data;
Wherein, target data, first identifier and third mark are at least carried in third target packet.
In the embodiment of the present application, providing target data is to delete processing and without the data processing of compression processing through overweight Method, the second storage equipment deleted processing to target data according to the third target packet received again, and according to oneself The demand of body chooses whether to carry out compression processing to target data, improves the realization flexibility of scheme.
Optionally, on the basis of Fig. 4 corresponding embodiment, the 5th kind of data processing equipment provided by the embodiments of the present application Embodiment in,
Sending module 203, if being also used to target data there is no corresponding first identifier and there are corresponding second identifier, Then sending module 203 sends the 4th target packet to the second storage equipment, so that the second storage equipment is according to the 4th target Data packet handles target data;
Wherein, target data and second identifier are at least carried in the 4th target packet.
In the embodiment of the present application, providing target data is to delete processing and data processing Jing Guo compression processing without overweight Method, second storage equipment is chosen whether to delete target data again processing according to the demand of itself, improve scheme Realize flexibility.
It is apparent to those skilled in the art that for convenience and simplicity of description, the system of foregoing description, The specific work process of device and unit, can refer to corresponding processes in the foregoing method embodiment, and details are not described herein.
In several embodiments provided herein, it should be understood that disclosed system, device and method can be with It realizes by another way.For example, the apparatus embodiments described above are merely exemplary, for example, the unit It divides, only a kind of logical function partition, there may be another division manner in actual implementation, such as multiple units or components It can be combined or can be integrated into another system, or some features can be ignored or not executed.Another point, it is shown or The mutual coupling, direct-coupling or communication connection discussed can be through some interfaces, the indirect coupling of device or unit It closes or communicates to connect, can be electrical property, mechanical or other forms.
The unit as illustrated by the separation member may or may not be physically separated, aobvious as unit The component shown may or may not be physical unit, it can and it is in one place, or may be distributed over multiple In network unit.It can select some or all of unit therein according to the actual needs to realize the mesh of this embodiment scheme 's.
It, can also be in addition, each functional unit in each embodiment of the application can integrate in one processing unit It is that each unit physically exists alone, can also be integrated in one unit with two or more units.Above-mentioned integrated list Member both can take the form of hardware realization, can also realize in the form of software functional units.
If the integrated unit is realized in the form of SFU software functional unit and sells or use as independent product When, it can store in a computer readable storage medium.Based on this understanding, the technical solution of the application is substantially The all or part of the part that contributes to existing technology or the technical solution can be in the form of software products in other words It embodies, which is stored in a storage medium, including some instructions are used so that a computer Equipment (can be personal computer, server or the network equipment etc.) executes the complete of each embodiment the method for the application Portion or part steps.And storage medium above-mentioned include: USB flash disk, mobile hard disk, read-only memory (read-only memory, ROM), random access memory (random access memory, RAM), magnetic or disk etc. are various can store program The medium of code.
The above, above embodiments are only to illustrate the technical solution of the application, rather than its limitations;Although referring to before Embodiment is stated the application is described in detail, those skilled in the art should understand that: it still can be to preceding Technical solution documented by each embodiment is stated to modify or equivalent replacement of some of the technical features;And these It modifies or replaces, the spirit and scope of each embodiment technical solution of the application that it does not separate the essence of the corresponding technical solution.

Claims (10)

1. a kind of method of data processing characterized by comprising
First storage equipment reads target data;
The first storage equipment detects the target data and whether there is corresponding first identifier and corresponding second identifier, Wherein, the second identifier is for marking the target data by compression processing;
If there are the corresponding first identifier and the corresponding second identifier, first storages for the target data Equipment sends first object data packet to the second storage equipment, so that the second storage equipment is according to the first object number The target data is handled according to packet;
Wherein, the target data, the first identifier and second mark are at least carried in the first object data packet Know, the first identifier is used to indicate the second storage equipment and is deleted again according to the first identifier to the target data Processing, the second identifier are used to indicate the second storage equipment and carry out write-in processing to the target data.
2. the method according to claim 1, wherein whether the first storage equipment detects the target data There are after corresponding first identifier and corresponding second identifier, the method also includes:
If the corresponding first identifier and the corresponding second identifier is not present in the target data, described first is deposited It stores up equipment and sends the second target packet to the second storage equipment, so that the second storage equipment is according to described second Target packet handles the target data;
Wherein, the target data and third mark are at least carried in second data packet, the third mark is for referring to Show that the second storage equipment carries out compression processing to the target data.
3. according to the method described in claim 2, it is characterized in that, the first storage equipment read the target data it Afterwards, the method also includes:
If the target data is stored in the buffer zone of the first storage equipment, the first storage equipment is to described the Two storage equipment send second target packet, so that the second storage equipment is according to second target packet The target data is handled;
Wherein, the target data and third mark are at least carried in second data packet.
4. the method according to claim 1, wherein whether the first storage equipment detects the target data There are after corresponding first identifier and corresponding second identifier, the method also includes:
If the target data is there are the corresponding first identifier and the corresponding second identifier is not present, described first It stores equipment and sends third target packet to the second storage equipment, so that the second storage equipment is according to described the Three target packets handle the target data;
Wherein, the target data, the first identifier and the third mark are at least carried in the third target packet Know.
5. the method according to claim 1, wherein whether the first storage equipment detects the target data There are after corresponding first identifier and corresponding second identifier, the method also includes:
If the target data is there is no the corresponding first identifier and there are the corresponding second identifier, described first It stores equipment and sends the 4th target packet to the second storage equipment, so that the second storage equipment is according to described the Four target packets handle the target data;
Wherein, the target data and the second identifier are at least carried in the 4th target packet.
6. a kind of data processing equipment characterized by comprising
Read module, for reading target data;
Detection module whether there is corresponding first identifier and corresponding second identifier for detecting the target data, In, the second identifier is for marking the target data by compression processing;
Sending module, if for the target data there are the corresponding first identifier and the corresponding second identifier, Then the sending module sends first object data packet to the second storage equipment, so that the second storage equipment is according to First object data packet handles the target data;
Wherein, the target data, the first identifier and second mark are at least carried in the first object data packet Know, the first identifier is used to indicate the second storage equipment and is deleted again according to the first identifier to the target data Processing, the second identifier are used to indicate the second storage equipment and carry out write-in processing to the target data.
7. data processing equipment according to claim 6, which is characterized in that
The sending module, if being also used to the target data, there is no the corresponding first identifier and corresponding described the Two marks, then the sending module sends the second target packet to the second storage equipment, so that second storage Equipment is handled the target data according to second target packet;
Wherein, the target data and third mark are at least carried in second data packet, the third mark is for referring to Show that the second storage equipment carries out compression processing to the target data.
8. data processing equipment according to claim 7, which is characterized in that
The sending module, it is described if being also used to the buffer zone that the target data is stored in the first storage equipment Sending module sends second target packet to the second storage equipment, so that the second storage equipment is according to institute The second target packet is stated to handle the target data;
Wherein, the target data and third mark are at least carried in second data packet.
9. data processing equipment according to claim 6, which is characterized in that
The sending module, if being also used to the target data there are the corresponding first identifier and being not present corresponding described Second identifier, then the sending module sends third target packet to the second storage equipment, so that described second deposits Storage equipment is handled the target data according to the third target packet;
Wherein, the target data, the first identifier and the third mark are at least carried in the third target packet Know.
10. data processing equipment according to claim 6, which is characterized in that
The sending module, if being also used to, the corresponding first identifier is not present in the target data and there are corresponding described Second identifier, then the sending module sends the 4th target packet to the second storage equipment, so that described second deposits Storage equipment is handled the target data according to the 4th target packet;
Wherein, the target data and the second identifier are at least carried in the 4th target packet.
CN201811108304.4A 2018-09-21 2018-09-21 Data processing method and related device Active CN109086172B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811108304.4A CN109086172B (en) 2018-09-21 2018-09-21 Data processing method and related device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811108304.4A CN109086172B (en) 2018-09-21 2018-09-21 Data processing method and related device

Publications (2)

Publication Number Publication Date
CN109086172A true CN109086172A (en) 2018-12-25
CN109086172B CN109086172B (en) 2022-12-06

Family

ID=64842307

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811108304.4A Active CN109086172B (en) 2018-09-21 2018-09-21 Data processing method and related device

Country Status (1)

Country Link
CN (1) CN109086172B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025289A (en) * 2017-04-14 2017-08-08 腾讯科技(深圳)有限公司 The method and relevant device of a kind of data processing
CN107179878A (en) * 2016-03-11 2017-09-19 伊姆西公司 The method and apparatus of data storage based on optimizing application
CN107193503A (en) * 2017-05-27 2017-09-22 杭州宏杉科技股份有限公司 A kind of data delete method and storage device again
CN107229420A (en) * 2017-05-27 2017-10-03 郑州云海信息技术有限公司 Date storage method, read method, delet method and data operation system
WO2018121455A1 (en) * 2016-12-29 2018-07-05 华为技术有限公司 Cached-data processing method and device, and storage controller
CN108268219A (en) * 2018-02-01 2018-07-10 杭州宏杉科技股份有限公司 A kind of method and device for handling I/O request

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107179878A (en) * 2016-03-11 2017-09-19 伊姆西公司 The method and apparatus of data storage based on optimizing application
WO2018121455A1 (en) * 2016-12-29 2018-07-05 华为技术有限公司 Cached-data processing method and device, and storage controller
CN107025289A (en) * 2017-04-14 2017-08-08 腾讯科技(深圳)有限公司 The method and relevant device of a kind of data processing
CN107193503A (en) * 2017-05-27 2017-09-22 杭州宏杉科技股份有限公司 A kind of data delete method and storage device again
CN107229420A (en) * 2017-05-27 2017-10-03 郑州云海信息技术有限公司 Date storage method, read method, delet method and data operation system
CN108268219A (en) * 2018-02-01 2018-07-10 杭州宏杉科技股份有限公司 A kind of method and device for handling I/O request

Also Published As

Publication number Publication date
CN109086172B (en) 2022-12-06

Similar Documents

Publication Publication Date Title
US20200150890A1 (en) Data Deduplication Method and Apparatus
US6397309B2 (en) System and method for reconstructing data associated with protected storage volume stored in multiple modules of back-up mass data storage facility
CN105339929B (en) Select the storage for cancelling repeated data
CN106201771B (en) Data-storage system and data read-write method
US7516286B1 (en) Conversion between full-data and space-saving snapshots
CN109327539A (en) A kind of distributed block storage system and its data routing method
US8458145B2 (en) System and method of storage optimization
US20120323864A1 (en) Distributed de-duplication system and processing method thereof
US8745744B2 (en) Storage system and storage system management method
CN106407040A (en) Remote data copy method and system
CN107038092B (en) Data copying method and device
CN106407224B (en) The method and apparatus of file compacting in a kind of key assignments storage system
CN103959256A (en) Fingerprint-based data deduplication
EP3862883A1 (en) Data backup method and apparatus, and system
US20100070724A1 (en) Storage system and method for operating storage system
CN107451013B (en) Data recovery method, device and system based on distributed system
US10572335B2 (en) Metadata recovery method and apparatus
CN107193503B (en) Data deduplication method and storage device
US20110282843A1 (en) Method and system for data backup and replication
CN109582245A (en) Data processing method, device and equipment
CN107203331A (en) Write the method and device of data
CN105824846A (en) Data migration method and device
CN104935469A (en) Distributive storage method and system for log information
CN104520802A (en) Data sending method, data receiving method and storage device
CN109753381B (en) Continuous data protection method based on object storage

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant