CN117194109A - Method and system for data backup and recovery - Google Patents

Method and system for data backup and recovery Download PDF

Info

Publication number
CN117194109A
CN117194109A CN202311201866.4A CN202311201866A CN117194109A CN 117194109 A CN117194109 A CN 117194109A CN 202311201866 A CN202311201866 A CN 202311201866A CN 117194109 A CN117194109 A CN 117194109A
Authority
CN
China
Prior art keywords
file
backup
data
area
files
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202311201866.4A
Other languages
Chinese (zh)
Other versions
CN117194109B (en
Inventor
董志华
朱东涛
陶倩倩
戴雨晨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Yangji Software Technology Co ltd
Original Assignee
Zhejiang Yangji Software Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Yangji Software Technology Co ltd filed Critical Zhejiang Yangji Software Technology Co ltd
Priority to CN202311201866.4A priority Critical patent/CN117194109B/en
Publication of CN117194109A publication Critical patent/CN117194109A/en
Application granted granted Critical
Publication of CN117194109B publication Critical patent/CN117194109B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the technical field of data backup, in particular to a system for data backup and recovery, which comprises an input module, a data storage module and a data storage module, wherein the input module inputs file data; a backup area that performs backup storage for an input file; the central control module is respectively connected with the input module and the backup module and is used for identifying and processing the files, and comprises an input detection area which is used for identifying the importance degree of the input files; a bad block detection area for detecting file data damage of the file in the backup area under the use state; and the recovery processing area is used for recovering the file in the damaged backup area, and outputting and reminding the file. The invention reduces the probability of file loss by setting the hierarchical backup of the file data with different importance degrees, and simultaneously, the invention also provides a data backup and recovery method which is a more rapid method for users to know the use method of the system.

Description

Method and system for data backup and recovery
Technical Field
The present invention relates to the field of data backup technologies, and in particular, to a method and a system for data backup and recovery.
Background
The traditional data backup mainly adopts an internal or external tape drive to carry out cold backup. However, this method can only prevent human failures such as misoperation and the like, and the recovery time is long. With the continuous development of technology, the mass of data increases, and many enterprises begin to adopt network backup. Network backup is typically implemented through specialized data storage management software in combination with corresponding hardware and storage devices.
Chinese patent publication No.: CN110196787a. The recovery system comprises front-end equipment, a communication module, a server group and a management terminal; the front-end equipment performs data interaction with the server group through the communication module, and backups data in the server group; the management terminal comprises a management system and a system management module, wherein the management system manages the work of the server group, and the system management module manages the management system; the management system consists of an equipment management module, a backup management module, a recovery management module, a bad block management module and a query statistics module. It follows that the data backup and recovery system has the following problems: the data can not be backed up according to the conditions, so that the data is redundant and difficult to process.
Disclosure of Invention
Therefore, the invention provides a data backup and recovery method and system, which are used for solving the problems that in the prior art, the data backup and recovery system cannot perform data case-by-case backup processing, so that the data backup is redundant and difficult to process.
To achieve the above object, the present invention provides a system for data backup and restore, comprising,
an input module that inputs file data;
the backup area is used for carrying out backup storage on the input files and is divided into three according to the input files, and comprises a backup cloud end, a backup first area and a backup second area; the backup cloud is a general file integral backup position and an important file key backup position, the backup first area is an important file integral backup position, and the backup second area is a secret file compression backup position;
the central control module is respectively connected with the input module and the backup module and used for identifying and processing the files, and comprises,
the input detection area is used for identifying the importance degree of the input file and carrying out regional backup of the backup area on the file according to the detection result;
a bad block detection area, which detects file data damage to the files in the backup area in the use state and judges whether the files in the backup area are damaged;
The recovery processing area judges the damage processing degree of the file in the damaged backup area, restores the damaged file, calculates the file restoration degree according to the damage degree, and outputs and reminds the file;
the central control module performs information preprocessing on the file input by the input module in an input detection area, sorts and marks file information, and calculates a backup information initial value; identifying the marked content of the file and judging the importance degree of the file; judging backup areas of the files to backup the files one by one according to the importance degree of the files; calculating a backup information change value for the initial value and the existence time of the file backup information of the backup area, judging whether the file of the backup area needs to be re-backed up according to the backup information change value, and simultaneously, judging whether the importance degree of the file needing to be re-backed up is changed again, and determining the backup position of the re-backed up file; performing operation monitoring on files in the backup area, and judging whether to trigger file data damage degree detection of a bad block detection area; judging whether the file for detecting the damage degree of the file data can recover the file data or not; and detecting a recovery result of the file data which is subjected to recovery processing by any triggering recovery processing area, calculating the recovery degree of the recovery file and outputting a prompt.
Further, for the file input by the input module, the input detection area of the central control module carries out information preprocessing on the file,
for any input file, an initial value is set in the file management system according to the size proportion of the file, weighting processing is carried out according to file data information, the initial value of backup information of the file is calculated according to the initial value of the input file and the added value of the weighting processing, and the initial value of the backup information and the backup file are marked;
the file data information comprises a file source, a file destination, keywords in the file and timeliness of the file;
the central control module is internally provided with a vocabulary library, and the vocabulary library identifies keywords in the file;
the timeliness of the file comprises the timeliness of manual setting and the timeliness of the file built-in, and for the file without the timeliness of manual setting, the timeliness supplementary mark is built in according to the importance degree of the file.
Further, a keyword identification pointer aiming at the file data is stored in the central control module, and identification marking is carried out according to keywords in the file data information;
The identification mark has three levels, namely a general mark, an important mark and a secret mark; in judging the key degree of any file, judging the importance degree of the file after identifying three mark levels one by one;
for any of the file data items,
if the unique identification mark exists, the unique identification mark is marked as a whole result of the unique mark;
if the non-unique identification mark exists, the identification mark is marked to the greatest extent;
the maximum mark is that when three levels of marks are stored in the file, the file is judged to be a secret file, and the secret data amount in the file is calculated; when any two level marks are stored in the file, judging that the file is the largest level of the two levels; if the maximum grade is the important mark, judging the file as an important file, and calculating the important data quantity in the file; if the maximum grade is the secret mark, judging the file as a secret file, and calculating the secret data amount in the file;
the central control module is internally provided with ageing parameters set according to the file importance degree, and the ageing parameters comprise primary ageing calculation parameters and secondary ageing calculation parameters, wherein the primary ageing calculation parameters are larger than the secondary ageing calculation parameters, the primary ageing calculation parameters are positively correlated according to the secret data volume in the secret file, and the secondary ageing calculation parameters are positively correlated according to the important data volume in the important file;
For any file that does not have a time-effect set by the person,
if the file is a secret file, marking the secret file as a first-stage aging calculation parameter; if it is an important file, it is marked as a secondary aging calculation parameter.
Further, for the files judged to be general in the central control module, the copied backup is stored in a backup cloud;
for the files which are judged to be important in the central control module, storing the data in a backup area, converting all information of the data backup into keys and storing the keys in a backup cloud;
and compressing and storing the files which are judged to be secret in the central control module in the backup second area.
Further, in the central control module, a backup information change value is calculated according to the backup information initial value, the backup existence time length and the aging parameter of the backup area file mark of the backup area file, whether the backup area file needs to be backed up again is judged according to the backup information change value, and a backup information change threshold value is stored in the central control module;
for any of the files in the backup area,
if the change value of the backup information is larger than or equal to the change threshold value of the backup information, the file is backed up again;
If the change value of the backup information is smaller than the change threshold value of the backup information, continuing to calculate the change value of the backup information for the file;
judging whether the importance degree of the file to be re-backed up is changed or not again according to the file to be re-backed up, and determining the backup position of backup information, wherein a reduced initial value is stored in the central control module, and the reduced initial value is related to the importance degree of the file in the backup area;
the two reduced initial values are stored, and the two reduced initial values comprise a first reduced initial value and a second reduced initial value, wherein the first reduced initial value is smaller than the second reduced initial value;
the first reduction initial value is a reduction time length initial value for reducing the important file to the general file, and the second reduction initial value is a reduction time length initial value for reducing the confidential file to the important file.
Further, in the central control module, there is an operation detection program for detecting the state of the file in the backup area, and the operation detection program detects and judges the use state of the file in the backup area;
triggering the file data damage degree detection of the bad block detection area for the file in the backup area in the use state, and judging whether the file data damage degree detection is transmitted to the recovery processing area for processing;
and continuing to perform operation detection on the files in the backup area in the non-operation state.
Further, in the central control module, damage detection is carried out on any file data triggering the bad block detection area, the existing content of the file data is compared with the initial input data of the backup area one by one in the damage detection, the damage degree of the file is calculated, whether the file is transmitted to the recovery processing area for processing is judged according to the comparison result, and a damage degree threshold value exists in the central control module;
any file data that triggers the bad block detection zone,
if the damage degree is less than or equal to the damage degree threshold value, judging that the file data can be recovered, transmitting the file data to a recovery processing area, and carrying out recovery processing;
if the damage degree is larger than the damage degree threshold, judging that the file data cannot be recovered, and directly reminding the data damage.
Further, in the central control module, detecting a recovery result of the file data which is subjected to recovery processing by any triggering recovery processing area, and calculating a recovery degree ratio of the recovery result to the damage degree;
and marking the recovered file with a recovery degree ratio, outputting the file, and reminding the file which is not recovered.
The invention also provides a data backup and recovery method, which comprises the following steps,
Step S1, a user inputs file data to a data backup and recovery system through an input module;
s2, an input detection area of a central control module in the data backup and recovery system identifies an entered file, and performs information preprocessing on data information;
step S3, preprocessing according to the information of the file, continuously identifying the importance degree of the file data, backing up according to the identification result and storing the file data in a backup area;
step S4, when the user uses the files of the backup database, the condition of the data backup information is inquired, and the bad block detection area detects bad blocks of the searched files of the backup area;
step S5, judging whether recovery processing is carried out or not according to the detection condition of the bad block detection area;
s6, recovering and recovering degree detection can be carried out on the file subjected to recovery processing in a recovery processing area, and user file extraction processing is carried out;
and S7, carrying out user reminding processing on the file which cannot be recovered.
Compared with the prior art, the invention has the beneficial effects that the input files are detected, identified, regulated and controlled, subjected to different backup treatments according to different importance degrees, the backup files in the use state are subjected to damage analysis, the damaged files are subjected to recovery treatment, and users are prompted when the files are extracted, so that the loss of certain important files due to data loss is prevented, and the flexibility in the use of the system is improved. Furthermore, the invention preprocesses the input file information data, captures the relevant information of the input file, marks the relevant information, lays a cushion for other processing of the file information, refines the attribute of the file and is convenient for processing the relevant information of the file.
Furthermore, the invention marks the key words of the input files, judges the importance of the input files, classifies the input files, refines the importance of the files, facilitates the processing of files with different confidentiality degrees, reduces the risks of data leakage and virus invasion, and improves the data disaster tolerance degree in the data backup process.
Furthermore, the invention processes files with different confidentiality degrees, performs different backup processes, reduces risks of data leakage and loss, and simultaneously performs compression key processing on files with higher confidentiality degrees, increases the reserve of a backup area of a system, reduces redundancy of system data and aims at the problem of data processing.
Further, the invention judges whether the files with higher confidentiality degree are needed to be changed into files with lower confidentiality degree by calculating the change value of the backup information of the file with the existence time length, so as to improve the inquiry degree of a user on the existing files, reduce the confidentiality time length of frequently changing the files by the user, reduce manual operation and ensure higher flexibility of system application.
Further, the invention monitors the use state of the files in the backup area in real time, carries out detail processing according to the running state of the files, and reduces the running program in the system processing.
Further, in the invention, bad block detection is carried out on the backup area file in the use state, whether the backup area file can be restored or not is judged, restoration is carried out on the restorable data, reminding is carried out on the restorable data, the fact that the damaged file can be processed in time is ensured, and the user is prompted to process the file by himself or herself if the damaged file cannot be processed.
Further, the invention detects and calculates the recovery result of the damaged file in the backup area file extracted by the user, outputs the file, reminds the unrecovered file content, and prevents the data reminding messy code caused by the unrecovered file by the user, thereby causing data loss and reducing the unknown of the file extraction result of the user.
Furthermore, the invention provides a data backup and recovery method, which is convenient for users to familiarize with the use of the data backup and recovery system, and simultaneously prompts the key points in the use process of the users.
Drawings
FIG. 1 is a schematic diagram of the internal structure of software of a file management system according to an embodiment;
FIG. 2 is a schematic diagram illustrating an internal structure of control module software in the file management system according to the embodiment;
FIG. 3 is a flow chart illustrating a method for data backup and restore according to an embodiment.
Detailed Description
In order that the objects and advantages of the invention will become more apparent, the invention will be further described with reference to the following examples; it should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.
Preferred embodiments of the present invention are described below with reference to the accompanying drawings. It should be understood by those skilled in the art that these embodiments are merely for explaining the technical principles of the present invention, and are not intended to limit the scope of the present invention.
It should be noted that, in the description of the present invention, terms such as "upper," "lower," "left," "right," "inner," "outer," and the like indicate directions or positional relationships based on the directions or positional relationships shown in the drawings, which are merely for convenience of description, and do not indicate or imply that the apparatus or elements must have a specific orientation, be constructed and operated in a specific orientation, and thus should not be construed as limiting the present invention.
Furthermore, it should be noted that, in the description of the present invention, unless explicitly specified and limited otherwise, the terms "mounted," "connected," and "connected" are to be construed broadly, and may be either fixedly connected, detachably connected, or integrally connected, for example; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present invention can be understood by those skilled in the art according to the specific circumstances.
Referring to fig. 1 and fig. 2, an internal structure of software of the file management system according to the embodiment of fig. 1 is shown, and fig. 2 is a schematic internal structure of software of a central control module of the file management system according to the embodiment.
The present invention provides a system for data backup and recovery, comprising,
an input module that inputs file data;
the backup area is used for carrying out backup storage on the input files and is divided into three according to the input files, and comprises a backup cloud end, a backup first area and a backup second area; the backup cloud is a general file integral backup position and an important file key backup position, the backup first area is an important file integral backup position, and the backup second area is a secret file compression backup position;
the central control module is respectively connected with the input module and the backup module and used for identifying and processing the files, and comprises,
the input detection area is used for identifying the importance degree of the input file and carrying out regional backup of the backup area on the file according to the detection result;
a bad block detection area, which detects file data damage to the files in the backup area in the use state and judges whether the files in the backup area are damaged;
the recovery processing area judges the damage processing degree of the file in the damaged backup area, restores the damaged file, calculates the file restoration degree according to the damage degree, and outputs and reminds the file;
The central control module performs information preprocessing on the file input by the input module in an input detection area, sorts and marks file information, and calculates a backup information initial value; identifying the marked content of the file and judging the importance degree of the file; judging backup areas of the files to backup the files one by one according to the importance degree of the files; calculating a backup information change value for the initial value and the existence time of the file backup information of the backup area, judging whether the file of the backup area needs to be re-backed up according to the backup information change value, and simultaneously, judging whether the importance degree of the file needing to be re-backed up is changed again, and determining the backup position of the re-backed up file; performing operation monitoring on files in the backup area, and judging whether to trigger file data damage degree detection of a bad block detection area; judging whether the file for detecting the damage degree of the file data can recover the file data or not; and detecting a recovery result of the file data which is subjected to recovery processing by any triggering recovery processing area, calculating the recovery degree of the recovery file and outputting a prompt.
The invention carries out detection and identification processing on the input files, carries out different backup processing according to different importance degrees, carries out damage analysis on the backup files in a use state, carries out recovery processing on the damaged files, prompts a user when extracting the files, prevents the loss of certain important files caused by the loss of the data, and improves the flexibility of the system in use.
Specifically, in this embodiment, for the file input by the input module, the input detection area of the central control module performs information preprocessing on the file,
for any input file, an initial value is set in the file management system according to the size proportion of the file, weighting processing is carried out according to file data information, the initial value of backup information of the file is calculated according to the initial value of the input file and the added value of the weighting processing, and the initial value of the backup information and the backup file are marked;
the file data information comprises a file source, a file destination, keywords in the file and timeliness of the file;
the central control module is internally provided with a vocabulary library, and the vocabulary library identifies keywords in the file;
the timeliness of the file comprises the timeliness of manual setting and the timeliness of the file built-in, and for the file without the timeliness of manual setting, the timeliness supplementary mark is built in according to the importance degree of the file.
For any input file W1, if the file size is 100M, the backup information initial value C1 of the input file W1 is 20.
The initial value of the backup information of the input file W1 is Bw1:
Bw1=20+L1+Q1+G1+T1;
bw1 is the initial value of the backup information of the file W1;
l1 is the source of file W1;
q1 is the destination of the file W1;
g1 is a keyword of the file W1;
t1 is the timeliness of the file W1.
For any input file W2, if the file size is 200M, the backup information initial value C2 of the input file W2 is 10.
The initial value of the backup information of the input file W2 is Bw2:
Bw1=10+L2+Q2+G2+T2;
wherein Bw2 is the initial value of backup information of the file W2;
l2 is the source of file W2;
q2 is the destination of the file W2;
g2 is a keyword of the file W2;
t2 is the timeliness of the file W2.
The invention preprocesses the input file information data, captures the relevant information of the input file, marks the relevant information, lays a cushion for other processing of the file information, refines the attribute of the file and facilitates the processing of the relevant information of the file.
Specifically, in this embodiment, a keyword identification pointer for the file data is stored in the central control module, and identification marking is performed according to keywords in the file data information;
the identification mark has three levels, namely a general mark, an important mark and a secret mark; in judging the key degree of any file, judging the importance degree of the file after identifying three mark levels one by one;
For any of the file data items,
if the unique identification mark exists, the unique identification mark is marked as a whole result of the unique mark;
if the non-unique identification mark exists, the identification mark is marked to the greatest extent;
the maximum mark is that when three levels of marks are stored in the file, the file is judged to be a secret file, and the secret data amount in the file is calculated; when any two level marks are stored in the file, judging that the file is the largest level of the two levels; if the maximum grade is the important mark, judging the file as an important file, and calculating the important data quantity in the file; if the maximum grade is the secret mark, judging the file as a secret file, and calculating the secret data amount in the file;
the central control module is internally provided with ageing parameters set according to the file importance degree, and the ageing parameters comprise primary ageing calculation parameters and secondary ageing calculation parameters, wherein the primary ageing calculation parameters are larger than the secondary ageing calculation parameters, the primary ageing calculation parameters are positively correlated according to the secret data volume in the secret file, and the secondary ageing calculation parameters are positively correlated according to the important data volume in the important file;
For any file that does not have a time-effect set by the person,
if the file is a secret file, marking the secret file as a first-stage aging calculation parameter; if it is an important file, it is marked as a secondary aging calculation parameter.
The central control module is internally provided with a keyword identification pointer aiming at file data, and identification marking is carried out according to keywords in file data information;
the identification marks are of three levels, namely a general mark B1, an important mark B2 and a secret mark B3;
for any input file W1, if the file stores keywords G1, wherein the keywords G1 are a plurality of,
if the input file W1 has a unique identification tag stored therein, the input file W is marked with the entire result of the unique tag.
If all the keywords in the keywords G1 are the general marks B1, the input file W1 is a general file; if all the keywords in the keywords G1 are important marks B2, inputting a file W1 to be an important file; if all the keywords in the keywords G1 are security marks B3, the input file W1 is a security file.
If the input file W1 has a non-unique identification mark, the identification mark is marked to the maximum extent.
If the keywords in the keyword G1 include the general mark B1, the important mark B2, and the security mark B3, the input file W1 is determined to be a security file.
If the keywords in the keywords G12 comprise the general marks B1 and the important marks B2, judging that the input file W1 is an important file; if the keywords in the keywords G1 comprise the general marks B1 and the secret marks B3, judging the input file W1 as a secret file; if the keywords in the keyword G1 include the important mark B2 and the security mark B3, the input file W1 is judged as a security file.
And ageing parameters set according to the file importance degree are stored in the central control module, wherein the ageing parameters comprise a primary ageing calculation parameter t1 and a secondary ageing calculation parameter t2, and the primary ageing calculation parameter t1 is larger than the secondary ageing calculation parameter t2.
When it has been determined as a secret file for any one of the input files W1, the primary calculation parameter t1 is related to the secret data amount Sj1 in the secret file W1, the larger the secret data amount Sj1 is, the larger the primary calculation parameter t1 is, and the secret file W1 is marked as the primary age calculation parameter t1.
When it has been determined for any one of the input files W1 that it is important, the secondary aging calculation parameter t2 is related to the important data amount Zy1 in the important file W1, the larger the important data amount Zy1 is, the larger the secondary aging calculation parameter t2 is, and the important file W1 is marked as the secondary aging calculation parameter t2.
The invention marks the key words of the input files, judges the important condition of the input files, classifies the input files, refines the importance of the files, facilitates the processing of the files with different confidentiality degrees, reduces the risks of data leakage and virus invasion, and improves the data disaster tolerance degree in the data backup process.
Specifically, in this embodiment, for the file determined to be a general file in the central control module, the copied backup is stored in the backup cloud;
for the files which are judged to be important in the central control module, storing the data in a backup area, converting all information of the data backup into keys and storing the keys in a backup cloud;
and compressing and storing the files which are judged to be secret in the central control module in the backup second area.
The invention processes files with different confidentiality degrees, performs different backup processes, reduces risks of data leakage and loss, and simultaneously performs compression key processing on files with higher confidentiality degrees, increases the reserve of a backup area of a system, reduces redundancy of system data and aims at the problem of data processing.
Specifically, in this embodiment, in the central control module, a backup information change value is calculated according to a backup information initial value, a backup existence time length and an aging parameter of a backup area file mark of a backup area file, whether the backup area file needs to be backed up again is judged according to the backup information change value, and a backup information change threshold is stored in the central control module;
For any of the files in the backup area,
if the change value of the backup information is larger than or equal to the change threshold value of the backup information, the file is backed up again;
if the change value of the backup information is smaller than the change threshold value of the backup information, continuing to calculate the change value of the backup information for the file;
judging whether the importance degree of the file to be re-backed up is changed or not again according to the file to be re-backed up, and determining the backup position of backup information, wherein a reduced initial value is stored in the central control module, and the reduced initial value is related to the importance degree of the file in the backup area;
the two reduced initial values are stored, and the two reduced initial values comprise a first reduced initial value and a second reduced initial value, wherein the first reduced initial value is smaller than the second reduced initial value;
the first reduction initial value is a reduction time length initial value for reducing the important file to the general file, and the second reduction initial value is a reduction time length initial value for reducing the confidential file to the important file.
The central control module is internally provided with a reduced initial value D0, wherein the reduced initial value D0 is related to the importance degree of the backup area file, and the reduced initial value D0 is two kinds of the reduced initial values, and comprises a first reduced initial value D01 and a second reduced initial value D02, and the first reduced initial value D01 is smaller than the second reduced initial value D02. The first reduction initial value D01 is a reduction time length initial value for reducing the important file to the general file, the second reduction initial value D02 is a reduction time length initial value for reducing the confidential file to the important file, and the first reduction initial value D01 is smaller than the second reduction initial value D02.
For any input file W1, its backup information initial value is Bw1.
If the input file W1 is an important file, then,
backup information change value Wg1:
Wg1=Bw1-Ts×t2;
wherein Wg1 is a backup information change value of the important file W1;
bw1 is the initial value of the backup information of the important file W1;
ts is the duration of the existence of the important file W1;
t2 is a secondary aging calculation parameter of the existence duration of the important file W1, the secondary aging calculation parameter t2 is related to the existence duration Ts of the backup area file, and the larger the existence duration Ts of the backup area file is, the larger the value of the time compensation value t2 is.
If the input file W1 is a confidential file, then,
backup information change value Wg1:
Wg1=Bw1-Ts’×t1;
wherein Wg1 is a backup information change value of the confidential file W1;
bw1 is the initial value of the backup information of the confidential file W1;
ts' is the duration of the existence of the security document W1;
t1 is a primary calculation parameter of the existence duration of the confidential file W1, the primary calculation parameter t1 is related to the existence duration Ts 'of the backup area file, and the larger the existence duration Ts' of the backup area file is, the larger the value of the time compensation value t1 is.
And the central control module is internally provided with a backup information change threshold Wg0.
For any one of the files W1 of the backup area,
if the change value Wg1 of the backup information is larger than or equal to the change threshold Wg0 of the backup information, the file is backed up again;
If the change value Wg1 of the backup information of the important file W1 is equal to or less than the first reduction initial value D01, the important file W1 is reduced to a normal file, and the original file W1 is backed up to the backup position of the normal file, i.e., the backup copied by the file W1 is stored in the backup cloud.
And if the change value Wg1 of the backup information of the confidential file W1 is smaller than or equal to the second reduction initial value D02, the confidential file W1 will be reduced to be an important file, the original file W1 is backed up to the backup position of the important file, namely the file W1 data is stored in a backup area, and all the information of the data backup is converted into a key to be stored in the backup cloud.
If the backup information change value Wg1 is smaller than the backup information change threshold Wg0, the file is continued to be calculated as the backup information change value.
The invention judges whether the files with higher confidentiality degree are needed to be changed into the files with lower confidentiality degree by calculating the change value of the backup information of the files with the existence time length in the backup area so as to improve the query degree of users on the existing files, and simultaneously, reduces the confidentiality time length of frequently changing the files by the users, reduces manual operation and ensures higher flexibility of system application.
Specifically, in this embodiment, in the central control module, there is an operation detection program for detecting a state of a file in the backup area, where the operation detection program detects and determines a use state of the file in the backup area;
triggering the file data damage degree detection of the bad block detection area for the file in the backup area in the use state;
and continuing to perform operation detection on the files in the backup area in the non-operation state.
The invention monitors the use state of the files in the backup area in real time, carries out detail processing according to the running state of the files, and reduces the running program in the system processing.
Specifically, in this embodiment, in the central control module, damage detection is performed on any file data triggering the bad block detection area, the damage detection compares the existing content of the file data with the initial input data of the backup area one by one, calculates the damage degree of the file, determines whether to transmit the file to the recovery processing area for processing according to the comparison result, and a damage degree threshold is stored in the central control module;
any file data that triggers the bad block detection zone,
if the damage degree is less than or equal to the damage degree threshold value, judging that the file data can be recovered, transmitting the file data to a recovery processing area, and carrying out recovery processing;
If the damage degree is larger than the damage degree threshold, judging that the file data cannot be recovered, and directly reminding the data damage.
And carrying out damage detection on the file data W3 triggering any bad block detection area, wherein a damage degree threshold S0 is stored in the central control module.
Degree of damage S3:
S3=N/M;
s3 is a damage degree value of the file data W3;
m is the initial input data of the file data W3;
n is the existing data amount of the file data W3.
If the damage degree S3 is smaller than or equal to the damage degree threshold S0, judging that the file data can be recovered, transmitting the file data to the recovery processing area, and carrying out recovery processing;
if the damage degree S3 is greater than the damage degree threshold S0, judging that the file data cannot be recovered, and carrying out data damage reminding.
In the invention, the bad block detection is carried out on the backup area file in the use state, whether the file can be restored or not is judged, the restoration of the restorable data is carried out, the restoration is reminded, the file which is stored with the damage can be timely processed, and the prompt is carried out for the processing of the file which cannot be processed by the user.
Specifically, in this embodiment, in the central control module, recovery result detection is performed on file data that is subjected to recovery processing by any one of the trigger recovery processing areas, and a recovery degree ratio is calculated between a recovery result and a damage degree of the file data;
And marking the recovered file with a recovery degree ratio, outputting the file, and reminding the file which is not recovered.
File data W3, recovery result is P1, and damage degree is (M-N).
The recovery ratio is P2:
P2=P1/(M-N);
wherein P2 is the recovery degree ratio of the file data W3;
p1 is a recovery result of the damaged portion of the file data W3;
m is the initial input data amount of the file data W3;
n is the existing data amount of the file data W3.
According to the invention, the recovery result detection calculation is carried out on the damaged file in the backup area file extracted by the user, and the file content which is not recovered is output, so that the data reminding messy code caused by the fact that the file is not recovered by the user is prevented, the data loss is caused, and the unknowing of the file extraction result of the user is reduced.
Referring to fig. 3, fig. 3 is a flowchart illustrating a method for data backup and restore according to an embodiment.
The invention also provides a data backup and recovery method, which comprises the following steps,
step S1, a user inputs file data to a data backup and recovery system through an input module;
s2, an input detection area of a central control module in the data backup and recovery system identifies an entered file, and performs information preprocessing on data information;
Step S3, preprocessing according to the information of the file, continuously identifying the importance degree of the file data, backing up according to the identification result and storing the file data in a backup area;
step S4, when the user uses the files of the backup database, the condition of the data backup information is inquired, and the bad block detection area detects bad blocks of the searched files of the backup area;
step S5, judging whether recovery processing is carried out or not according to the detection condition of the bad block detection area;
s6, recovering and recovering degree detection can be carried out on the file subjected to recovery processing in a recovery processing area, and user file extraction processing is carried out;
and S7, carrying out user reminding processing on the file which cannot be recovered.
The invention provides the data backup and recovery method, which is convenient for users to familiarize and use the data backup and recovery system, and simultaneously prompts the key points in the use process of the users.
Thus far, the technical solution of the present invention has been described in connection with the preferred embodiments shown in the drawings, but it is easily understood by those skilled in the art that the scope of protection of the present invention is not limited to these specific embodiments. Equivalent modifications and substitutions for related technical features may be made by those skilled in the art without departing from the principles of the present invention, and such modifications and substitutions will be within the scope of the present invention.
The foregoing description is only of the preferred embodiments of the invention and is not intended to limit the invention; various modifications and variations of the present invention will be apparent to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (9)

1. A system for data backup and recovery, comprising,
an input module that inputs file data;
the backup area is used for carrying out backup storage on the input files and is divided into three according to the input files, and comprises a backup cloud end, a backup first area and a backup second area; the backup cloud is a general file integral backup position and an important file key backup position, the backup first area is an important file integral backup position, and the backup second area is a secret file compression backup position;
the central control module is respectively connected with the input module and the backup module and used for identifying and processing the files, and comprises,
the input detection area is used for identifying the importance degree of the input file and carrying out regional backup of the backup area on the file according to the detection result;
a bad block detection area, which detects file data damage to the files in the backup area in the use state and judges whether the files in the backup area are damaged;
The recovery processing area judges the damage processing degree of the file in the damaged backup area, restores the damaged file, calculates the file restoration degree according to the damage degree, and outputs and reminds the file;
the central control module performs information preprocessing on the file input by the input module in an input detection area, sorts and marks file information, and calculates a backup information initial value; identifying the marked content of the file and judging the importance degree of the file; judging backup areas of the files to backup the files one by one according to the importance degree of the files; calculating a backup information change value for the initial value and the existence time of the file backup information of the backup area, judging whether the file of the backup area needs to be re-backed up according to the backup information change value, and simultaneously, judging whether the importance degree of the file needing to be re-backed up is changed again, and determining the backup position of the re-backed up file; performing operation monitoring on files in the backup area, and judging whether to trigger file data damage degree detection of a bad block detection area; judging whether the file for detecting the damage degree of the file data can recover the file data or not; and detecting a recovery result of the file data which is subjected to recovery processing by any triggering recovery processing area, calculating the recovery degree of the recovery file and outputting a prompt.
2. The system for data backup and restore according to claim 1, wherein, for the file input by the input module, the input detection area of the central control module performs information preprocessing on the file,
for any input file, an initial value is set in the file management system according to the size proportion of the file, weighting processing is carried out according to file data information, the initial value of backup information of the file is calculated according to the initial value of the input file and the added value of the weighting processing, and the initial value of the backup information and the backup file are marked;
the file data information comprises a file source, a file destination, keywords in the file and timeliness of the file;
the central control module is internally provided with a vocabulary library, and the vocabulary library identifies keywords in the file;
the timeliness of the file comprises the timeliness of manual setting and the timeliness of the file built-in, and for the file without the timeliness of manual setting, the timeliness supplementary mark is built in according to the importance degree of the file.
3. The system for backing up and restoring data according to claim 2, wherein a keyword identification pointer for file data is stored in the central control module, and identification marking is performed according to keywords in file data information;
The identification mark has three levels, namely a general mark, an important mark and a secret mark; in judging the key degree of any file, judging the importance degree of the file after identifying three mark levels one by one;
for any of the file data items,
if the unique identification mark exists, the unique identification mark is marked as a whole result of the unique mark;
if the non-unique identification mark exists, the identification mark is marked to the greatest extent;
the maximum mark is that when three levels of marks are stored in the file, the file is judged to be a secret file, and the secret data amount in the file is calculated; when any two level marks are stored in the file, judging that the file is the largest level of the two levels; if the maximum grade is the important mark, judging the file as an important file, and calculating the important data quantity in the file; if the maximum grade is the secret mark, judging the file as a secret file, and calculating the secret data amount in the file;
the central control module is internally provided with a time length parameter set according to the file importance degree, and comprises a primary aging calculation parameter and a secondary aging calculation parameter, wherein the primary aging calculation parameter is larger than the secondary aging calculation parameter, the primary aging calculation parameter is positively correlated according to the confidential data amount in the confidential file, and the secondary aging calculation parameter is positively correlated according to the important data amount in the important file;
For any file that does not have a time-effect set by the person,
if the file is a secret file, marking the secret file as a first-stage aging calculation parameter; if it is an important file, it is marked as a secondary aging calculation parameter.
4. The system for data backup and restore according to claim 3, wherein, for files determined to be general in the central control module, the duplicated backup is stored in the backup cloud;
for the files which are judged to be important in the central control module, storing the data in a backup area, converting all information of the data backup into keys and storing the keys in a backup cloud;
and compressing and storing the files which are judged to be secret in the central control module in the backup second area.
5. The system for data backup and restore according to claim 4, wherein in the central control module, a backup information change value is calculated according to a backup information initial value, a backup existence time length and an aging parameter of a backup area file mark of the backup area file, whether the backup area file needs to be backed up again is judged according to the backup information change value, and a backup information change threshold is stored in the central control module;
For any of the files in the backup area,
if the change value of the backup information is larger than or equal to the change threshold value of the backup information, the file is backed up again;
if the change value of the backup information is smaller than the change threshold value of the backup information, continuing to calculate the change value of the backup information for the file;
judging whether the importance degree of the file to be re-backed up is changed or not again according to the file to be re-backed up, and determining the backup position of backup information, wherein a reduced initial value is stored in the central control module, and the reduced initial value is related to the importance degree of the file in the backup area;
the two reduced initial values are stored, and the two reduced initial values comprise a first reduced initial value and a second reduced initial value, wherein the first reduced initial value is smaller than the second reduced initial value;
the first reduction initial value is a reduction time length initial value for reducing the important file to the general file, and the second reduction initial value is a reduction time length initial value for reducing the confidential file to the important file.
6. The system for data backup and restore according to claim 5, wherein an operation detection program for detecting the state of the files in the backup area is stored in the central control module, and the operation detection program performs a use state detection judgment on the files in the backup area;
Triggering the file data damage degree detection of the bad block detection area for the file in the backup area in the use state, and judging whether the file data damage degree detection is transmitted to the recovery processing area for processing;
and continuing to perform operation detection on the files in the backup area in the non-operation state.
7. The system for data backup and recovery according to claim 6, wherein in the central control module, damage detection is performed on any file data triggering the bad block detection area, the damage detection compares existing contents of the file data with initial input data of the backup area one by one, calculates damage degree of the file, judges whether the file is transmitted to the recovery processing area for processing according to a comparison result, and a damage degree threshold is stored in the central control module;
any file data that triggers the bad block detection zone,
if the damage degree is less than or equal to the damage degree threshold value, judging that the file data can be recovered, transmitting the file data to a recovery processing area, and carrying out recovery processing;
if the damage degree is larger than the damage degree threshold, judging that the file data cannot be recovered, and directly reminding the data damage.
8. The system for data backup and restoration according to claim 7, wherein in the central control module, restoration result detection is performed on any file data which triggers restoration processing in the restoration processing area, and a restoration degree ratio is calculated between the restoration result and the damage degree;
And marking the recovered file with a recovery degree ratio, outputting the file, and reminding the file which is not recovered.
9. A method for data backup and restore, based on the system of any one of claims 1-8, characterized by comprising the following steps,
step S1, a user inputs file data to a data backup and recovery system through an input module;
s2, an input detection area of a central control module in the data backup and recovery system identifies an entered file, and performs information preprocessing on data information;
step S3, preprocessing according to the information of the file, continuously identifying the importance degree of the file data, backing up according to the identification result and storing the file data in a backup area;
step S4, when the user uses the files of the backup database, the condition of the data backup information is inquired, and the bad block detection area detects bad blocks of the searched files of the backup area;
step S5, judging whether recovery processing is carried out or not according to the detection condition of the bad block detection area;
s6, recovering and recovering degree detection can be carried out on the file subjected to recovery processing in a recovery processing area, and user file extraction processing is carried out;
and S7, carrying out user reminding processing on the file which cannot be recovered.
CN202311201866.4A 2023-09-18 2023-09-18 Method and system for data backup and recovery Active CN117194109B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311201866.4A CN117194109B (en) 2023-09-18 2023-09-18 Method and system for data backup and recovery

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311201866.4A CN117194109B (en) 2023-09-18 2023-09-18 Method and system for data backup and recovery

Publications (2)

Publication Number Publication Date
CN117194109A true CN117194109A (en) 2023-12-08
CN117194109B CN117194109B (en) 2024-02-23

Family

ID=88995930

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311201866.4A Active CN117194109B (en) 2023-09-18 2023-09-18 Method and system for data backup and recovery

Country Status (1)

Country Link
CN (1) CN117194109B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110225141A1 (en) * 2010-03-12 2011-09-15 Copiun, Inc. Distributed Catalog, Data Store, and Indexing
CN108415794A (en) * 2018-01-30 2018-08-17 河南职业技术学院 File backup method and file backup device
CN111339564A (en) * 2020-03-27 2020-06-26 河北凯通信息技术服务有限公司 Cloud service analysis management system based on big data
CN114201341A (en) * 2021-11-24 2022-03-18 江苏金农股份有限公司 Automatic data backup system, method and device based on cloud platform
CN116126593A (en) * 2023-01-10 2023-05-16 华南高科(广东)股份有限公司 Data backup system and method in cloud platform environment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110225141A1 (en) * 2010-03-12 2011-09-15 Copiun, Inc. Distributed Catalog, Data Store, and Indexing
CN108415794A (en) * 2018-01-30 2018-08-17 河南职业技术学院 File backup method and file backup device
CN111339564A (en) * 2020-03-27 2020-06-26 河北凯通信息技术服务有限公司 Cloud service analysis management system based on big data
CN114201341A (en) * 2021-11-24 2022-03-18 江苏金农股份有限公司 Automatic data backup system, method and device based on cloud platform
CN116126593A (en) * 2023-01-10 2023-05-16 华南高科(广东)股份有限公司 Data backup system and method in cloud platform environment

Also Published As

Publication number Publication date
CN117194109B (en) 2024-02-23

Similar Documents

Publication Publication Date Title
Liang et al. Failure prediction in ibm bluegene/l event logs
US20020128997A1 (en) System and method for estimating the point of diminishing returns in data mining processing
CN115577701B (en) Risk behavior identification method, device, equipment and medium aiming at big data security
Hirose et al. Network anomaly detection based on eigen equation compression
CN104503434B (en) Fault diagnosis method based on active fault symptom pushing
CN111291046B (en) Computer big data storage control system and method
CN108734201B (en) Classification method and system for experience feedback events of nuclear power plant based on hierarchical reason analysis method
CN111274227B (en) Database auditing system and method based on cluster analysis and association rule
CN115618085B (en) Interface data exposure detection method based on dynamic tag
CN102455952B (en) Data backup and recovery method, device and system
CN113220946B (en) Fault link searching method, device, equipment and medium based on reinforcement learning
CN105260469A (en) Sitemap processing method, apparatus and device
CN110399485B (en) Data tracing method and system based on word vector and machine learning
CN117194109B (en) Method and system for data backup and recovery
CN112579781B (en) Text classification method, device, electronic equipment and medium
Klemettinen et al. A data mining methodology and its application to semi-automatic knowledge acquisition
Fard Determination of minimal cut sets of a complex fault tree
CN116450745A (en) Multi-device-based note file operation method, system and readable storage medium
CN113572860B (en) Method and device for tracking leaked data, storage system, equipment and storage medium
CN110727538B (en) Fault positioning system and method based on model hit probability distribution
Zhang et al. New malicious code detection based on n-gram analysis and rough set theory
CN117828644B (en) Computer storage system with information security protection mode
Li et al. Web document duplicate removal algorithm based on keyword sequences
CN118264488B (en) Data security management system based on Internet of things
CN112989793B (en) Article detection method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant