CN102467514A - Data deduplication system - Google Patents

Publication number
CN102467514A
Authority
CN
China
Prior art keywords
module
file
client
data
access request
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010105357854A
Other languages
Chinese (zh)
Inventor
刘杰
王云松
陈志丰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inventec Corp
Original Assignee
Inventec Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Corp
Priority to CN2010105357854A
Publication of CN102467514A
Legal status: Pending

Abstract

The invention relates to a data deduplication system, which is applied to a file backup program. A server is electrically connected with a file storage device; the server additionally comprises a server interface module and a second deduplication service module; a client interface module of a client is used for receiving a file access request and a corresponding target file; a first deduplication service module is connected with the second deduplication service module through a network and is used for sending out the file access request; and a driver module is connected with the first deduplication service module and is used for intercepting the file access request and the corresponding target file, modifying the file access request and the target file according to an access means and transmitting the modified file access request and the modified target file to the first deduplication service module.

Description

Data deduplication system
Technical field
The present invention relates to a file backup system, and in particular to data deduplication processing added to a file backup system.
Background
As the capacity of storage devices grows, there is more freedom in choosing which kinds of data to keep. The existing backup approach is to back up all data. This preserves the data completely, but it occupies a large amount of storage space. When the amount of data grows, new storage devices must be added, which raises operating power consumption and management labor and wastes server operating resources.
To solve the above data backup problems, manufacturers have spared no effort in proposing various solutions. In particular, Windows Server 2008, released by Microsoft, has a built-in archival backup (Windows Server Backup) that provides data backup processing for each connected client.
Please refer to Fig. 1, which is a schematic diagram of the archival backup architecture of the prior art. The operand of the archival backup in Windows 2008 is a data block or a disk volume. The archival backup automatically processes the content to be backed up into a volume set, and afterwards appends the changed data incrementally at the corresponding backup location. The server treats each volume set as an independent disk block. During a backup in Windows 2008, the archival backup therefore transfers data on a disk-block basis, so the transfer is fast. For Windows 2008, the existing data backup/restore function that takes ordinary data files as its operand transfers data one file at a time, and backing up data this way is naturally not fast.
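The block-based incremental behavior described above can be pictured with a short sketch. It is not how Windows Server Backup is implemented; the tiny block size, the use of SHA-256 to detect changed blocks, and the names `split_blocks` and `changed_blocks` are all assumptions chosen for illustration.

```python
import hashlib

BLOCK_SIZE = 4  # hypothetical, tiny block size so the example is easy to follow


def split_blocks(volume: bytes, block_size: int = BLOCK_SIZE) -> list:
    """Cut a volume image into fixed-size blocks (the last block may be shorter)."""
    return [volume[i:i + block_size] for i in range(0, len(volume), block_size)]


def changed_blocks(previous: bytes, current: bytes) -> dict:
    """Return {block index: block data} for blocks that differ from the last backup.

    Assumption: a block counts as changed when its SHA-256 digest differs from
    the digest at the same index in the previous backup, or when it is new.
    """
    old_digests = [hashlib.sha256(b).digest() for b in split_blocks(previous)]
    delta = {}
    for index, block in enumerate(split_blocks(current)):
        is_new = index >= len(old_digests)
        if is_new or hashlib.sha256(block).digest() != old_digests[index]:
            delta[index] = block
    return delta


if __name__ == "__main__":
    # Only block 1 (modified) and block 3 (appended) need to be transferred.
    print(changed_blocks(b"AAAABBBBCCCC", b"AAAAXXXXCCCCDD"))
```

In this sketch only the changed and appended blocks are transferred, which is the sense in which an incremental, block-based backup is faster than copying every file again. Identical blocks that appear at different positions or in different backups are still stored more than once.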
Although Windows 2008 can restrict backup processing to the incremental part, which speeds up data backup and restoration, much of the backed-up data is still duplicated.
Summary of the invention
In view of the above problem, the technical problem to be solved by the present invention is to provide a data deduplication system that supports the archival backup (Windows Server Backup) built into the operating system, so that the archival backup gains the ability to delete duplicate data.
To achieve the above object, the data deduplication system disclosed by the present invention comprises a server and a client. The server is electrically connected to a file storage device and further comprises a server interface module and a second deduplication service module. The client comprises a client interface module (Client Graphical User Interface, Client GUI), a first deduplication service module, and a driver module. The client interface module receives the user's operating instructions (which may include settings for the content). The first deduplication service module (Client Service) is connected to the second deduplication service module through a network and cooperates with it to carry out data deduplication. The driver module (Driver) is connected to the first deduplication service module. The driver module intercepts the file access request and its corresponding target file, modifies the file access request and the target file according to the access means, and forwards the modified file access request and target file to the first deduplication service module.
The data deduplication system provided by the present invention is applied to the archival backup of Microsoft Windows. By intercepting the real-time I/O operations directed at the archival backup, the system performs data deduplication on the archival backup without modifying the archival backup itself.
The present invention is described below with reference to the accompanying drawings and specific embodiments, which are not intended to limit the invention.
Brief description of the drawings
Fig. 1 is a schematic diagram of the archival backup architecture of the prior art;
Fig. 2 is a schematic diagram of the architecture of the present invention.
Reference numerals:
210 server
211 server interface module
212 second deduplication service module
220 client
221 client interface module
222 first deduplication service module
223 driver module
230 archival backup (file backup program)
Detailed description of the embodiments
The structural and operating principles of the present invention are described in detail below with reference to the accompanying drawings:
The present invention is applied to the archival backup (Windows Server Backup) in Microsoft Windows Server 2008 (Windows 2008), in order to reduce the duplicate data that the archival backup produces during backup processing.
Please refer to Fig. 2, which is a schematic diagram of the architecture of the present invention. The data deduplication system of the present invention comprises a server 210 and a client 220. The server 210 and the client 220 may be linked through the Internet, or the client 220 and the server 210 may be set up on the same host. The server 210 is electrically connected to a file storage device (internal or external). The file storage device is not limited to a single disk; it may also be a redundant array of independent disks (RAID) or Internet Small Computer System Interface (iSCSI) storage.
The server 210 further comprises a server interface module 211 and a second deduplication service module 212. The second deduplication service module 212 interacts with the clients 220 and performs deduplication on the received data. Based on the progress of the second deduplication service module 212 in processing the data, the server interface module 211 displays the connection status of the clients 220 currently interacting with the server and the backup status, sets the capacity of the deduplication volume (Dedup.Volume), and disconnects clients 220.
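The server-side role described above can be sketched as a small chunk store. The patent does not specify the deduplication algorithm, so everything here is an assumption made for illustration: fixed-size chunks, SHA-256 fingerprints, an in-memory dictionary standing in for the file storage device, and the hypothetical names `DedupStore`, `put_file`, and `get_file`. The `capacity_bytes` limit is one possible reading of the deduplication volume capacity set through the server interface module.

```python
import hashlib


class DedupStore:
    """Hypothetical in-memory stand-in for the second deduplication service module."""

    def __init__(self, capacity_bytes: int, chunk_size: int = 4096):
        self.capacity_bytes = capacity_bytes  # assumed meaning of the Dedup.Volume capacity
        self.chunk_size = chunk_size          # assumed fixed-size chunking
        self.chunks = {}                      # fingerprint -> unique chunk data
        self.recipes = {}                     # file name -> ordered list of fingerprints
        self.logical_bytes = 0                # total bytes received from clients

    @property
    def stored_bytes(self) -> int:
        """Bytes actually occupied by unique chunks."""
        return sum(len(chunk) for chunk in self.chunks.values())

    def put_file(self, name: str, data: bytes) -> int:
        """Store a file and return the number of new bytes that had to be written."""
        recipe, new_chunks = [], {}
        for offset in range(0, len(data), self.chunk_size):
            chunk = data[offset:offset + self.chunk_size]
            fingerprint = hashlib.sha256(chunk).hexdigest()
            recipe.append(fingerprint)
            if fingerprint not in self.chunks:
                new_chunks[fingerprint] = chunk  # duplicates inside the file collapse here
        written = sum(len(chunk) for chunk in new_chunks.values())
        if self.stored_bytes + written > self.capacity_bytes:
            raise OSError("deduplication volume is full")
        self.chunks.update(new_chunks)
        self.recipes[name] = recipe
        self.logical_bytes += len(data)
        return written

    def get_file(self, name: str) -> bytes:
        """Rebuild a file from its recipe of chunk fingerprints."""
        return b"".join(self.chunks[fp] for fp in self.recipes[name])
```

Comparing `logical_bytes` with `stored_bytes` gives the kind of progress and savings figure a server interface could display; how the actual system computes or presents such figures is not stated in the source.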
The file backup program 230 (the archival backup) runs on each client 220. The client 220 receives the target file to be backed up, processes it through the following elements, and sends it to the server 210. The client 220 comprises a client interface module 221 (Client GUI), a first deduplication service module 222, and a driver module 223. The client interface module 221 reports the connection status of the server 210, the backup status of the target file, and statistics on all target files. The client interface module 221 is also used to set the storage path of the target file in the file backup system and the communication port number of the server 210.
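The settings and status information handled by the client interface module can be modeled as two small records. This is only a sketch: the field names, the default host, the validation rules, and the state labels ("pending", "done", "failed") are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class ClientSettings:
    """Hypothetical settings a user might enter through the client interface."""
    backup_path: str                 # storage path of target files in the backup system
    server_port: int                 # communication port number of the server
    server_host: str = "127.0.0.1"   # assumption: client and server may share one host

    def __post_init__(self):
        if not self.backup_path:
            raise ValueError("backup_path must not be empty")
        if not 1 <= self.server_port <= 65535:
            raise ValueError("server_port must be in 1..65535")


@dataclass
class ClientStatus:
    """Hypothetical status the client interface could report to the user."""
    connected: bool = False                           # connection status of the server
    file_states: dict = field(default_factory=dict)   # target file -> backup state

    def summary(self) -> dict:
        """Count target files per backup state (the 'statistics on all target files')."""
        counts = {}
        for state in self.file_states.values():
            counts[state] = counts.get(state, 0) + 1
        return counts
```

Validating the port and path at construction time keeps a bad setting from reaching the deduplication service; whether the actual client performs such checks is not stated in the source.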
The first deduplication service module 222 (Client Service) is connected to the second deduplication service module 212 through a network, and cooperates with the second deduplication service module 212 to carry out the access operations of data deduplication. The client interface module 221 is used to set the backup path and the port number or communication protocol used to reach the server 210. The driver module 223 (Driver) is connected to the first deduplication service module 222. The driver module 223 intercepts the file access request and its corresponding target file.
The driver module 223 modifies the file access request and the target file according to the access means, and forwards the modified file access request and target file to the first deduplication service module 222. In other words, the storage path and capacity used by the archival backup 230 are set on the server 210, and the file directory of the server 210 to be accessed is set on the client 220. When the client 220 backs up a target file, the data deduplication system performs deduplication on the target file and forwards the processed target file to the corresponding directory on the server 210 for storage.
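The intercept, modify, and forward flow of the driver module can be sketched in user space as follows. The source does not say what the modification consists of; this sketch assumes it amounts to redirecting the request path from a local backup location to the server directory configured on the client. The names `FileAccessRequest`, `DriverModule`, `local_root`, and `server_dir` are hypothetical, and a real driver would presumably sit in the operating system's I/O path rather than in application code.

```python
from dataclasses import dataclass, replace
from typing import Callable


@dataclass(frozen=True)
class FileAccessRequest:
    operation: str     # e.g. "write" or "read"; the set of operations is assumed
    path: str          # path the backup program is trying to access
    data: bytes = b""  # payload of the target file, if any


class DriverModule:
    """Hypothetical user-space model of driver module 223."""

    def __init__(self, local_root: str, server_dir: str,
                 forward: Callable[[FileAccessRequest], object]):
        self.local_root = local_root.rstrip("/")
        self.server_dir = server_dir.rstrip("/")
        self.forward = forward  # entry point of the first deduplication service module

    def intercept(self, request: FileAccessRequest):
        """Rewrite requests under local_root and forward them; leave others alone."""
        if not request.path.startswith(self.local_root + "/"):
            return None  # not a backup request, so it is not touched
        relative = request.path[len(self.local_root):]
        modified = replace(request, path=self.server_dir + relative)
        return self.forward(modified)


if __name__ == "__main__":
    received = []  # stands in for the first deduplication service module
    driver = DriverModule("/backup", "/dedup/client-a", received.append)
    driver.intercept(FileAccessRequest("write", "/backup/vol1.vhd", b"payload"))
    print(received[0].path)
```

Because the rewrite happens between the backup program and the deduplication service, the backup program itself needs no changes, which matches the stated goal of deduplicating without modifying the archival backup.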
The data deduplication system provided by the present invention is applied to the archival backup 230 of Microsoft Windows. By intercepting the real-time I/O operations directed at the archival backup 230, the system performs data deduplication on the archival backup 230 without modifying the archival backup 230 itself.
Of course, the present invention may have various other embodiments. Without departing from the spirit and essence of the present invention, those of ordinary skill in the art can make various corresponding changes and modifications according to the present invention, but all such changes and modifications shall fall within the protection scope of the appended claims of the present invention.

Claims (3)

1. A data deduplication system that supports the archival backup built into an operating system so that the archival backup gains the ability to delete duplicate data, characterized in that the data deduplication system comprises:
a server, electrically connected to a file storage device, the server further comprising a server interface module and a second deduplication service module, wherein the server interface module receives a file access request, and the second deduplication service module records a corresponding target file to the file storage device according to the file access request; and
at least one client, connected to the server through a network, the client further comprising a client interface module, a first deduplication service module, and a driver module, wherein the client interface module receives the file access request and the corresponding target file; the first deduplication service module is connected to the second deduplication service module through the network and sends the file access request; and the driver module is connected to the first deduplication service module, intercepts the file access request and its corresponding target file, modifies the file access request and the target file according to an access means, and forwards the modified file access request and target file to the first deduplication service module.
2. The data deduplication system according to claim 1, characterized in that the client interface module reports the connection status of the server, the backup status of the target file, and statistics on all target files.
3. The data deduplication system according to claim 1, characterized in that the client interface module is used to set the storage path of the target file in a file backup system and the communication port number of the server.
CN2010105357854A 2010-11-04 2010-11-04 Data deduplication system Pending CN102467514A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010105357854A CN102467514A (en) 2010-11-04 2010-11-04 Data deduplication system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010105357854A CN102467514A (en) 2010-11-04 2010-11-04 Data deduplication system

Publications (1)

Publication Number Publication Date
CN102467514A true CN102467514A (en) 2012-05-23

Family

ID=46071158

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010105357854A Pending CN102467514A (en) 2010-11-04 2010-11-04 Data deduplication system

Country Status (1)

Country Link
CN (1) CN102467514A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107315653A (en) * 2017-03-02 2017-11-03 陈辉 Portable storage device with deduplication computing and processing functions, and implementation method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101216850A (en) * 2008-01-11 2008-07-09 清华大学 File systems accessing register dynamic collection method
CN101873342A (en) * 2010-06-02 2010-10-27 深圳市迪菲特科技股份有限公司 Data access method, data access system and disk array storage system
CN101908077A (en) * 2010-08-27 2010-12-08 华中科技大学 Duplicated data deleting method applicable to cloud backup

Similar Documents

Publication Publication Date Title
JP5260333B2 (en) Method and apparatus for backing up data with NAS and CAS integration
US7430616B2 (en) System and method for reducing user-application interactions to archivable form
US7370083B2 (en) System and method for providing virtual network attached storage using excess distributed storage capacity
US9558075B2 (en) Synthetic full backup generation
US8260913B2 (en) Reading a file from a cloud storage solution
US8234372B2 (en) Writing a file to a cloud storage solution
CN104239493A (en) Cross-cluster data migration method and system
CN101694637A (en) Method and system for restoring database
US8392437B2 (en) Method and system for providing deduplication information to applications
CN102193926A (en) Method and system for managing manuscript based on online automatic manuscript storage
CN107347062A Log data processing method, electronic device, and readable storage medium
JP2013504806A (en) Method, apparatus and system for file transfer based on file directory
US9846622B1 (en) Parallel computer system recovery
US9817834B1 (en) Techniques for performing an incremental backup
US20130339307A1 (en) Managing system image backup
CN106843760A Asynchronous remote replication system and method based on deduplication
CN102377688A (en) File transmission method and equipment
CN102467514A (en) Data deduplication system
CN111726401A (en) File transmission method and device
CN105187489A (en) File transfer method and system capable of clustering and supporting multiple users to upload simultaneously
US10761742B1 (en) Dynamic redundancy in storage systems
CN103488768A (en) File management method and file management system based on cloud computing
US11537480B1 (en) Systems and methods of backup and recovery of journaling systems
CN103037031A (en) Internet protocol (IP) address administration method of internet small computer system interface (ISCSI) target device
US11169960B2 (en) Data transfer appliance method and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120523
