CN103793293A - Incremental backup method for archived data files - Google Patents

Info

Publication number
CN103793293A
Authority
CN
China
Prior art keywords
file
backed
backup
filing data
current
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210432786.5A
Other languages
Chinese (zh)
Inventor
顾胜溢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHANGHAI OUPENG SOFTWARE Co Ltd
Original Assignee
SHANGHAI OUPENG SOFTWARE Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI OUPENG SOFTWARE Co Ltd
Priority to CN201210432786.5A
Publication of CN103793293A

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an incremental backup method for archived data files. The method comprises the following steps: first, creating a global backup of the archived data files to be backed up and generating a file hash value for every file in the global backup; second, traversing all the archived data files, generating a file hash value for each file as it is traversed, searching the global backup for an identical file hash value, and, if none exists, adding the file to a list of files to be backed up; third, backing up all the archived data files in that list as the current incremental backup. The method offers short backup time, low backup space consumption, high backup efficiency, and ease of use.

Description

Incremental backup method for archived data files
Technical field
The present invention relates to the field of computer backup storage of archived data files, and in particular to an incremental backup method for archived data files.
Background art
At present, archived data files are generally backed up by direct copying. For large volumes of archived data, however, every copy consumes a great deal of storage space, and because there are many files, copying them also takes a great deal of time.
Summary of the invention
The technical problem to be solved by the present invention is to provide an incremental backup method for archived data files that is fast, uses little backup space, is efficient, and is easy to use.
To solve the above technical problem, the present invention adopts the following technical solution:
An incremental backup method for archived data files, implemented in the following steps:
1) create in advance a global backup of the archived data files to be backed up, and generate a file hash value for every file in the global backup;
2) traverse all the archived data files, generate a file hash value for the archived data file currently being traversed, and search the global backup for an identical file hash value; if no identical file hash value exists, add the currently traversed archived data file to the list of files to be backed up;
3) back up all the archived data files in the list of files to be backed up as the current incremental backup of the archived data files.
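Steps 1) and 2) above can be illustrated with a minimal Python sketch. It is not the patent's implementation: the function names (`file_md5`, `index_global_backup`, `select_files_to_back_up`) and the directory layout are hypothetical, MD5 is used because the text later names it as the hash, and each file is read whole for brevity.

```python
import hashlib
from pathlib import Path


def file_md5(path: Path) -> str:
    """Return the hex MD5 digest of a file's contents (whole-file read, for brevity)."""
    return hashlib.md5(path.read_bytes()).hexdigest()


def index_global_backup(global_backup_dir: Path) -> set[str]:
    """Step 1: generate a hash value for every file in the global backup."""
    return {file_md5(p) for p in global_backup_dir.rglob("*") if p.is_file()}


def select_files_to_back_up(archive_dir: Path, known_hashes: set[str]) -> list[Path]:
    """Step 2: list the archived files whose hash is absent from the global backup."""
    return [
        p
        for p in sorted(archive_dir.rglob("*"))
        if p.is_file() and file_md5(p) not in known_hashes
    ]
```

Keeping the global backup's hashes in a set makes each lookup a constant-time membership test, so the cost of step 2 is dominated by hashing the archived files rather than by the search.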
Further improvements of the above technical solution:
The file hash value is specifically the MD5 file checksum of the archived data file.
The detailed steps of step 3) comprise:
3.1) take a file from the list of files to be backed up as the current file to be backed up;
3.2) attempt to read the current file to be backed up into memory by a DMA operation; if the read fails, retry a preset number of times; if the file is read into memory successfully, write it from memory to the backup directory of the current backup and then jump to step 3.3); if the read is still unsuccessful, output an alarm message and jump to step 3.3);
3.3) if the current file to be backed up is the last file in the list of files to be backed up, exit; otherwise take the next file from the list as the current file to be backed up and jump to step 3.2).
The preset number of times in step 3.2) is 2.
Step 3) further comprises the step of storing the list of files to be backed up as a backup log.
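The loop in steps 3.1) to 3.3) might look roughly like the following Python sketch. Several points are assumptions rather than statements from the patent: an ordinary file read stands in for the DMA read (Python does not expose DMA control), "retry a preset number of times" is read as one initial attempt plus 2 retries, files are written flat by name into the backup directory, and all function names are hypothetical.

```python
from pathlib import Path
from typing import List, Optional

MAX_RETRIES = 2  # the preset number of repeat attempts given in the text


def read_with_retries(path: Path, retries: int = MAX_RETRIES) -> Optional[bytes]:
    """Try to read a file; on failure, retry up to `retries` more times."""
    for _ in range(1 + retries):
        try:
            return path.read_bytes()  # stand-in for the DMA read (assumption)
        except OSError:
            continue
    return None


def run_incremental_backup(files: List[Path], backup_dir: Path) -> List[Path]:
    """Steps 3.1-3.3: back up each listed file and return the ones that failed."""
    backup_dir.mkdir(parents=True, exist_ok=True)
    failed = []
    for source in files:  # 3.1 / 3.3: take each file in turn until the list ends
        data = read_with_retries(source)  # 3.2: read the file into memory
        if data is None:
            print(f"WARNING: could not read {source}, skipping")  # alarm message
            failed.append(source)
            continue
        (backup_dir / source.name).write_bytes(data)  # write to the backup directory
    return failed
```

A failed file does not stop the run: as in step 3.2), the loop reports it and moves on to the next entry in the list.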
The present invention has the following advantages:
The present invention creates a global backup, traverses all the archived data files, generates a file hash value for each file as it is traversed, and searches the global backup for an identical file hash value; if none exists, the file is added to the list of files to be backed up, and all the files in that list are then backed up as the current incremental backup. Incremental backup is therefore performed only on files that have been modified since the global backup was created, which gives the method short backup time, low backup space consumption, high backup efficiency, and ease of use.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flow chart of the basic procedure of an embodiment of the present invention.
Detailed description of the embodiments
The preferred embodiment of the present invention is described in detail below with reference to the drawing, so that the advantages and features of the invention can be more readily understood by those skilled in the art and the scope of protection of the invention can be defined more clearly.
As shown in Fig. 1, the incremental backup method for archived data files of this embodiment is implemented in the following steps:
1) create in advance a global backup of the archived data files to be backed up, and generate a file hash value for every file in the global backup;
2) traverse all the archived data files, generate a file hash value for the archived data file currently being traversed, and search the global backup for an identical file hash value; if no identical file hash value exists, add the currently traversed archived data file to the list of files to be backed up;
3) back up all the archived data files in the list of files to be backed up as the current incremental backup of the archived data files.
In this embodiment, the file hash value is specifically the MD5 file checksum of the archived data file. MD5 checksums are fast to generate, which improves the processing speed of the backup.
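For large archived files, the MD5 checksum can be computed in fixed-size chunks so that the whole file never has to sit in memory at once. The sketch below uses Python's standard `hashlib`; the function name `md5_of_file` and the 1 MiB default chunk size are illustrative choices, not details taken from the patent.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1024 * 1024) -> str:
    """Compute a file's MD5 checksum without loading the whole file into memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns b"" at end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

The result is the same as hashing the full contents in one call, because `update()` on successive chunks is equivalent to hashing their concatenation.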
In this embodiment, the detailed steps of step 3) comprise:
3.1) take a file from the list of files to be backed up as the current file to be backed up;
3.2) attempt to read the current file to be backed up into memory by a DMA operation; if the read fails, retry a preset number of times; if the file is read into memory successfully, write it from memory to the backup directory of the current backup and then jump to step 3.3); if the read is still unsuccessful, output an alarm message and jump to step 3.3);
3.3) if the current file to be backed up is the last file in the list of files to be backed up, exit; otherwise take the next file from the list as the current file to be backed up and jump to step 3.2).
In this embodiment, the preset number of times in step 3.2) is 2.
In this embodiment, step 3) further comprises the step of storing the list of files to be backed up as a backup log.
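The patent does not specify a format for the backup log. As one possible illustration, the list of files to be backed up could be stored as a small JSON record with a timestamp; the layout, the field names `created` and `files`, and both function names below are assumptions.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Iterable, List


def write_backup_log(files: Iterable[Path], log_path: Path) -> None:
    """Store the list of backed-up files, with a UTC timestamp, as a JSON log."""
    record = {
        "created": datetime.now(timezone.utc).isoformat(),
        "files": [str(f) for f in files],
    }
    log_path.write_text(json.dumps(record, indent=2), encoding="utf-8")


def read_backup_log(log_path: Path) -> List[str]:
    """Return the file paths recorded in a log written by write_backup_log."""
    return json.loads(log_path.read_text(encoding="utf-8"))["files"]
```

Such a log records which files each incremental backup contains, which is the information needed to locate a file among several incremental backups later.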
The above is only the preferred embodiment of the present invention. The scope of protection of the present invention is not limited to the above embodiment; every technical solution that falls under the principle of the invention belongs to its scope of protection. Improvements and modifications made by those skilled in the art without departing from the principle of the present invention should also be regarded as falling within the scope of protection of the present invention.

Claims (5)

1. An incremental backup method for archived data files, characterized in that it is implemented in the following steps:
1) create in advance a global backup of the archived data files to be backed up, and generate a file hash value for every file in the global backup;
2) traverse all the archived data files, generate a file hash value for the archived data file currently being traversed, and search the global backup for an identical file hash value; if no identical file hash value exists, add the currently traversed archived data file to the list of files to be backed up;
3) back up all the archived data files in the list of files to be backed up as the current incremental backup of the archived data files.
2. The incremental backup method for archived data files according to claim 1, characterized in that the file hash value is specifically the MD5 file checksum of the archived data file.
3. The incremental backup method for archived data files according to claim 2, characterized in that the detailed steps of step 3) comprise:
3.1) take a file from the list of files to be backed up as the current file to be backed up;
3.2) attempt to read the current file to be backed up into memory by a DMA operation; if the read fails, retry a preset number of times; if the file is read into memory successfully, write it from memory to the backup directory of the current backup and then jump to step 3.3); if the read is still unsuccessful, output an alarm message and jump to step 3.3);
3.3) if the current file to be backed up is the last file in the list of files to be backed up, exit; otherwise take the next file from the list as the current file to be backed up and jump to step 3.2).
4. The incremental backup method for archived data files according to claim 3, characterized in that the preset number of times in step 3.2) is 2.
5. The incremental backup method for archived data files according to claim 1, 2, 3 or 4, characterized in that step 3) further comprises the step of storing the list of files to be backed up as a backup log.
CN201210432786.5A 2012-11-03 2012-11-03 Incremental backup method for archived data files Pending CN103793293A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210432786.5A CN103793293A (en) 2012-11-03 2012-11-03 Incremental backup method for archived data files

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210432786.5A CN103793293A (en) 2012-11-03 2012-11-03 Incremental backup method for archived data files

Publications (1)

Publication Number Publication Date
CN103793293A true CN103793293A (en) 2014-05-14

Family

ID=50669001

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210432786.5A Pending CN103793293A (en) 2012-11-03 2012-11-03 Incremental backup method for archived data files

Country Status (1)

Country Link
CN (1) CN103793293A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104077053A (en) * 2014-06-30 2014-10-01 珠海市君天电子科技有限公司 Terminal information operating method and device and mobile terminal
CN105068891A (en) * 2015-08-14 2015-11-18 惠州Tcl移动通信有限公司 Method and terminal for repairing eMMC file
CN105302675A (en) * 2015-11-25 2016-02-03 上海爱数信息技术股份有限公司 Method and device for data backup
WO2016070655A1 (en) * 2014-11-05 2016-05-12 中兴通讯股份有限公司 Information backup method, device, server and terminal
CN110908972A (en) * 2019-11-19 2020-03-24 加和(北京)信息科技有限公司 Log data preprocessing method and device, electronic equipment and storage medium

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104077053A (en) * 2014-06-30 2014-10-01 珠海市君天电子科技有限公司 Terminal information operating method and device and mobile terminal
CN104077053B (en) * 2014-06-30 2017-05-24 珠海市君天电子科技有限公司 Terminal information operating method and device and mobile terminal
WO2016070655A1 (en) * 2014-11-05 2016-05-12 中兴通讯股份有限公司 Information backup method, device, server and terminal
CN105068891A (en) * 2015-08-14 2015-11-18 惠州Tcl移动通信有限公司 Method and terminal for repairing eMMC file
CN105068891B (en) * 2015-08-14 2020-09-29 Tcl移动通信科技(宁波)有限公司 Method and terminal for repairing eMMC file
CN105302675A (en) * 2015-11-25 2016-02-03 上海爱数信息技术股份有限公司 Method and device for data backup
CN110908972A (en) * 2019-11-19 2020-03-24 加和(北京)信息科技有限公司 Log data preprocessing method and device, electronic equipment and storage medium
CN110908972B (en) * 2019-11-19 2022-09-02 加和(北京)信息科技有限公司 Log data preprocessing method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140514