WO2011103791A1 - Backup storage system, backup system, data backup method and recovery method - Google Patents

Backup storage system, backup system, data backup method and recovery method Download PDF

Info

Publication number
WO2011103791A1
WO2011103791A1 PCT/CN2011/071070 CN2011071070W WO2011103791A1 WO 2011103791 A1 WO2011103791 A1 WO 2011103791A1 CN 2011071070 W CN2011071070 W CN 2011071070W WO 2011103791 A1 WO2011103791 A1 WO 2011103791A1
Authority
WO
WIPO (PCT)
Prior art keywords
backup
storage subsystem
data
storage
backup data
Prior art date
Application number
PCT/CN2011/071070
Other languages
French (fr)
Chinese (zh)
Inventor
汪野
方德辉
何非
Original Assignee
成都市华为赛门铁克科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 成都市华为赛门铁克科技有限公司 filed Critical 成都市华为赛门铁克科技有限公司
Publication of WO2011103791A1 publication Critical patent/WO2011103791A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0646Horizontal data movement in storage systems, i.e. moving data in between storage devices or systems
    • G06F3/065Replication mechanisms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/061Improving I/O performance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0683Plurality of storage devices
    • G06F3/0685Hybrid storage combining heterogeneous device types, e.g. hierarchical storage, hybrid arrays

Definitions

  • the application is submitted to the Chinese Patent Office on February 24, 2010, the application number is 201010114868.6, and the invention name is "backup storage system, backup system and data backup method and recovery method" The priority of the Chinese Patent Application, the entire contents of which is incorporated herein by reference.
  • the present invention relates to the field of data backup technologies, and in particular, to a backup storage system, a backup system, and a data backup method and a recovery method.
  • the business system performs a backup every day, but because the data in the system needs to be read during the backup, it is often required to consume a large number of application systems. Resources, if data backup is performed when the system business is busy, not only seriously affects the system business performance, but also the backup performance is not good. Therefore, in the prior art, the backup window (the backup window is an ideal backup window) is used as the backup time (after the backup window is the ideal backup window) after the first day of work to the next day, to effectively complete the backup of the data in the business system.
  • the disk array is a random access medium, the data positioning time is short, and the read and write performance of the disk array is much higher than that of the general physical tape library; the performance of the backup storage system can be improved to meet the requirements of the user for the backup window.
  • the use of a disk array as a backup storage system increases the backup performance while causing a significant increase in the cost and power consumption of the backup storage system, making the backup storage system using the disk array less cost-effective.
  • the embodiments of the present invention provide a backup storage system, a backup system, a data backup method, and a recovery method, which can effectively improve the performance of the backup storage system and meet the existing business system without significantly increasing system cost and energy consumption. Demand.
  • An embodiment of the present invention provides a backup storage system, including:
  • N storage subsystems wherein the i-th storage subsystem is connected to the i+1th storage subsystem, and the performance of the i-th storage subsystem is higher than the i+1 storage subsystem, the i-th storage subsystem
  • the capacity of the i-th storage subsystem is higher than that of the i+1th storage subsystem, N > 2, 1 N-1.
  • the embodiment of the present invention further provides a backup system, including a backup client and a backup server, and the backup storage system.
  • the backup server is configured to receive backup data sent by the client, and store the backup data.
  • the backup server sends a data request to the backup storage system according to the recovery requirement of the user, to obtain and read from the backup storage system
  • the data requests the corresponding backup data; and sends the corresponding backup data to the client to complete the recovery operation of the backup data.
  • the embodiment of the invention further provides a data backup method, including:
  • the first storage subsystem receives and caches the backup data sent by the backup server; sends the copied backup data to the second storage subsystem;
  • the performance of the i-th storage subsystem is higher than the i+1th storage subsystem, the i-th storage subsystem
  • the capacity of the i-th storage subsystem is higher than that of the i+1th storage subsystem.
  • the embodiment of the invention further provides a data recovery method, including:
  • the backup storage system, the backup system, the data backup method, and the recovery method of the embodiment of the present invention use the N storage subsystems sequentially connected with the read/write performance and the unit capacity cost, and sequentially increase the capacity, as the backup storage system, and The first storage subsystem with the highest read/write performance is used as an interface with the backup server to implement storage and recovery processing of backup data.
  • the performance of the backup system can be effectively improved without significantly increasing the cost and power consumption of the backup system. Thereby effectively improving the cost performance of the backup storage system.
  • FIG. 1 is a structural diagram of a backup storage system according to Embodiment 1 of the present invention
  • FIG. 2 is a flowchart of a data backup method according to Embodiment 4 of the present invention
  • 3 is a flowchart of a data recovery method according to Embodiment 5 of the present invention
  • FIG. 4 is a flowchart of a data recovery method according to Embodiment 6 of the present invention
  • FIG. 5 is a structural diagram of a backup system according to Embodiment 7 of the present invention.
  • FIG. 1 is a structural diagram of a backup storage system according to Embodiment 1 of the present invention. As shown in FIG. 1, the backup storage system of this embodiment includes N storage subsystems.
  • N storage subsystems wherein the i-th storage subsystem is connected to the i+1th storage subsystem, and the performance of the i-th storage subsystem is higher than the i+1 storage subsystem, the i-th storage subsystem
  • the capacity of the i-th storage subsystem is lower than that of the i+1th storage subsystem, N > 2, K i ⁇ Nl , and N and i are positive integers.
  • this embodiment introduces a backup storage system, a first storage subsystem 11, and a second storage subsystem 12 - until the Nth storage subsystem 13 is sequentially connected. That is to say, each storage subsystem is only connected to its previous nearest neighbor storage subsystem and the next nearest neighbor storage subsystem to implement data exchange.
  • the backup storage system further includes a controller 10 for controlling the exchange of backup data between adjacent storage subsystems.
  • the first storage subsystem 11 of the backup storage system of this embodiment has the highest read/write performance and the highest unit capacity cost, but its capacity is small.
  • the first storage subsystem 11 can select a small-capacity virtual tape library or a disk array, and with the development of high-tech, a storage medium with high read/write performance such as a FLASH chip array may be used.
  • a storage medium with high read/write performance such as a FLASH chip array may be used.
  • the backup storage system can select a high-performance storage medium with a small capacity as the first storage subsystem 11, and mainly uses For high-speed read and write performance.
  • the capacity of the first storage subsystem 11 it is usually required to accommodate at least one full backup data set, that is, At least one full backup process can be supported, and its capacity is sufficient to store the backup data needed to complete a backup storage operation.
  • the first storage subsystem 11 of a suitable capacity size can be selected according to actual conditions.
  • the first storage subsystem 11 and the second storage subsystem 12, the third storage subsystem, the Nth storage subsystem 13, and the nearest neighbor storage subsystem read and write performance and units
  • the capacity costs are reduced in turn, and the capacity is increased in turn.
  • the first storage subsystem 11 is mainly used to reflect the read and write performance of the backup storage system;
  • the second storage subsystem 12 to the N-1th storage subsystem can function as a cache.
  • the read and write performance between the nearest neighbor storage subsystems can be matched to meet the smooth read and write.
  • the current storage subsystem has a read/write performance of 1000 mb/s, and the read/write performance of the nearest neighbor storage subsystem is 100 mb/s.
  • One or more storage subsystems with read/write performance between 1000 mb/s and 100 mb/s can be added in the middle to achieve normal matching.
  • the second storage subsystem 12 to the N-1 storage subsystem in the middle are used to match the read/write performance of the first storage subsystem 11 and the Nth storage subsystem 13.
  • the read/write performance and unit capacity cost between the nearest neighbor storage subsystems are sequentially reduced, and the capacity is sequentially increased, but a storage subsystem with the same read/write performance and/or unit capacity cost is also possible.
  • the Nth storage subsystem 13 is used to represent the storage data of the backup storage system.
  • the first N-1 storage subsystems are used to cache backup data, which is equivalent to the backup data being stored in the N-1 level cache and finally stored in the Nth storage subsystem.
  • existing backup storage systems typically contain only one Nth storage subsystem 13 of the backup storage system of the present embodiment.
  • the capacity of the first N-1 storage subsystems is small compared to the capacity of the Nth storage subsystem.
  • the storage capacity of the Nth storage subsystem 13 is tens of thousands of times the total capacity of the first N-1 storage subsystems, and the cost per unit capacity is the first N-1 storage subsystem units.
  • the cost of the capacity is a fraction of the total cost.
  • the cost of the backup storage system in this embodiment is less than one percent of the total cost due to the addition of the first N-1 storage subsystems. It can be seen that the backup storage system of this embodiment hardly increases the cost compared to the existing backup storage system. In addition, since the capacity of the first N-1 storage subsystems is not large, the total power consumption is not high.
  • the Nth storage subsystem uses a physical tape library.
  • the backup storage system of this embodiment is compared with the backup storage system of the present embodiment and the backup storage system using the disk array having the same capacity as the Nth storage subsystem.
  • the read and write performance of the disk array can also be achieved, but the cost and power consumption are much smaller than the backup storage system using the disk array of the same capacity.
  • the backup storage system of this embodiment is configured by sequentially adopting N storage subsystems in which the read/write performance is sequentially decreased, the capacity is sequentially increased, and the unit capacity cost is sequentially increased. It can meet the system performance and capacity requirements, and can effectively improve the performance of the backup storage system without significantly increasing the cost and energy consumption, thereby effectively improving the cost performance of the backup storage system.
  • Embodiment 2 of the present invention provides a backup storage system.
  • the backup storage system of this embodiment further describes the performance of the backup storage of the backup storage system of the first embodiment on the basis of the backup storage system of the first embodiment.
  • the first storage system 11 has the highest read and write performance, and is used to reflect the read and write performance of the entire backup storage system.
  • the first storage subsystem 11 is configured to receive and cache backup data sent by the backup server as an interface between the entire backup storage system and the backup server. Because the capacity of the first backup storage system 11 is small, after receiving the backup data sent by the backup server, the backup data is cached, then the backup data is copied and sent to the second storage subsystem 12, and then After the two storage subsystems 12 receive and cache the backup data, the backup data is copied and sent to the next nearest neighbor backup storage subsystem, and the above operations are performed in sequence until the backup data is finally sent to the Nth storage subsystem.
  • the backup data is stored by the Nth storage subsystem 13.
  • the transmission and reception of backup data between the nearest neighbor storage subsystems is completed under the scheduling of the controller 10.
  • the read and write performance of the first storage subsystem 11 determines the read and write performance of the entire backup storage system. After the first storage subsystem 11 receives the backup data, the backup data is copied and cached one level at a time, and belongs to the operation inside the backup storage system, and does not affect the overall read and write performance of the backup storage system.
  • the backup storage system of the present implementation adopts the first storage subsystem with the highest read and write performance as the interface for communicating with the backup server, and then sequentially caches the backup data to the subsequent storage subsystems, and finally stores the backup data.
  • the backup storage system in this embodiment has high backup performance and short backup time; it can meet the requirements of existing business systems. It should be noted that, on the basis of the second embodiment, when the Li N-2, JLN > 3, the i+1th storage subsystem is specifically configured to receive the replication sent by the i th storage subsystem.
  • the storage subsystem directly receives the backup data sent by the last nearest neighbor storage subsystem to directly overwrite the backup data first stored in the storage subsystem.
  • the first stored backup data has been copied to the next nearest neighbor storage subsystem, meaning no storage, so it can be overwritten.
  • the backup data in the storage subsystem if it has not been copied to the next storage subsystem, must be replicated before it is allowed to be overwritten.
  • the i+1th storage subsystem is specifically configured to receive the copied backup data sent by the i th storage subsystem, and overwrite the duplicate backup data with the i+1th Backup data that has expired in the storage subsystem.
  • the copied backup data sent by the received N-1th storage subsystem directly overwrites the expired backup in the Nth storage subsystem 13. data.
  • the expired backup data described here refers to data that exceeds the backup storage time.
  • the backup data of the backup storage system has a certain storage time, for example, three months, half a year, or one year. When the backup storage time is exceeded, the backup data is no longer useful, and there is no need to save it. These backup data are called backup data that has expired. With this technical solution, the security of all backup data stored in the backup storage system can be guaranteed.
  • Embodiment 3 of the present invention provides a backup storage system.
  • the backup storage system of the second embodiment is mainly used for receiving backup data of the backup server and storing the data.
  • This embodiment describes the backup storage system from the perspective of data recovery on the basis of the backup storage system of the first embodiment. The technical solution of the present embodiment will be described in detail below.
  • the first storage subsystem 11 is configured to receive a data request sent by the backup server. When the backup data corresponding to the data request is cached in the first storage subsystem 11, the first storage subsystem 11 returns the backup data to the backup server.
  • the i+1th storage subsystem is further configured to receive the data request sent by the i th storage subsystem when the backup data corresponding to the data request is not cached in the first storage subsystem , And when the backup data corresponding to the data request is cached in the (i+1)th storage subsystem, returning the backup data to the backup server.
  • the target storage subsystem in the embodiment of the present invention is a storage subsystem that caches backup data corresponding to the data request.
  • the target storage subsystem returns the backup data corresponding to the data request to the backup server, which is a level-1 return, that is, the target storage subsystem first returns the backup data corresponding to the data request.
  • the previous nearest neighbor storage subsystem of the target storage subsystem is then forwarded back by the previous nearest neighbor storage subsystem of the target storage subsystem. That is, each storage subsystem returns the backup data corresponding to the data request sent by the next nearest neighbor storage subsystem to its previous nearest neighbor storage subsystem until the backup data corresponding to the data request is returned to the first
  • the storage subsystem is finally returned to the backup server by the first storage subsystem.
  • N storage subsystems return error information to the backup server.
  • the return of the error message is the same as the return of the backup data corresponding to the above data request, and is also a level one return, and finally returned by the first storage subsystem 11 to the backup server, telling the backup server that the data corresponding to the recovery request cannot be found.
  • the transmission of the data request during the data recovery process and the return of the backup data corresponding to the data request are all performed under the scheduling of the controller 10. Since the write data operation is performed in the storage process, the storage record of the backup data is saved in the controller 10. When the data read operation is performed during the process of restoring data, the storage record of the backup data saved in the controller 10 can be referred to, so as to more conveniently find the backup data corresponding to the data request. Preferably, however, the backup data corresponding to the data request is returned to the first level, until the first storage subsystem 11 is returned to the backup server by the first storage subsystem.
  • the storage subsystem storing the backup data corresponding to the data request may be found according to the storage record of the backup data saved in the controller 10, and the storage subsystem may be directly transported to the storage subsystem.
  • the backup server is localized and docked with the backup server to implement backup data recovery.
  • the first storage subsystem 11 communicates with the backup server, receives a data request sent by the server, or returns the requested backup data to the server, so as to achieve efficient recovery of the backup data and satisfy the existing backup.
  • the i+1th storage subsystem is further configured to: when the backup data corresponding to the data request is not cached in the first storage subsystem 11 Receiving, in the middle, the data request sent by the i th storage subsystem, when the backup data corresponding to the data request is cached in the i+1th storage subsystem, storing the i+1th storage The subsystem stores the address of the backup data back to the backup server.
  • the first storage subsystem 11 sends the data request to the second storage subsystem 12, if the second storage subsystem 12, the backup data corresponding to the data request is cached, and the second storage subsystem 12 returns the address of the backup data corresponding to the data request (that is, the pointer of the backup data corresponding to the data request) to the backup server.
  • the backup server returns the channel formed in the process of returning to the backup server according to the address of the corresponding backup data (that is, the pointer of the backup data corresponding to the data request), and restores the data.
  • the process of returning the address of the backup data corresponding to the data request corresponding to the data request in the target storage subsystem (that is, the pointer of the backup data corresponding to the data request) to the backup server is returned by each storage subsystem level first level, because each A storage subsystem is only connected to the nearest storage subsystem and the nearest nearest neighbor storage subsystem of its nearest neighbor for data exchange. Therefore, after each storage subsystem receives the address sent by the next nearest neighbor storage subsystem, the storage subsystem caches the address and returns the address in the storage subsystem for caching the address to the previous nearest neighbor storage. Subsystem. Such a level of return until the second storage subsystem 12 returns an address to the first storage subsystem 11, and then the first storage subsystem 11 will store the address of the address returned by the second storage subsystem 12.
  • the process of returning the address forms a channel from the target storage subsystem to the backup server, and the backup server can find the backup data corresponding to the recovery request according to the channel.
  • the backup storage system of the embodiment can quickly and efficiently find the backup data corresponding to the backup server recovery request, and return the pointer corresponding to the target storage subsystem of the backup data corresponding to the recovery request to the backup server, thereby directly
  • the data is restored in the target storage subsystem that stores the backup data corresponding to the recovery request.
  • the backup data corresponding to the recovery request does not need to be copied and returned step by step, and the data recovery requirement of the existing service system can be satisfactorily satisfied.
  • the backup storage system embodiments described above are merely illustrative, wherein the subsystems described as separate components may or may not be physically separate, that is, may be located in one place, or may be distributed in multiple locations. Place. Some or all of the subsystems may be selected according to actual needs to achieve the objectives of the solution of the embodiment. Those of ordinary skill in the art can understand and implement without undue labor.
  • FIG. 2 is a flowchart of a data backup method according to Embodiment 4 of the present invention; as shown in FIG. 2, the data backup method in this embodiment may specifically include the following steps:
  • Step 100 The first storage subsystem receives and caches backup data sent by the backup server, and sends the copied backup data to the second storage subsystem.
  • N is the number of storage subsystems, N > 2, K i Nl , N and i are positive integers;
  • the performance of the i-th storage subsystem is higher than that of the i+1th storage subsystem, and the capacity of the i-th storage subsystem is smaller than the i+1th storage subsystem, and the unit capacity of the i-th storage subsystem is high. On the i+1th storage subsystem.
  • the data backup method of the embodiment is a technical solution for implementing the backup storage performance of the second embodiment in the backup storage system provided in the first embodiment.
  • the first storage subsystem 11 is configured to receive backup data sent by the backup server, and cache the backup data, and then copy the backup data to the second storage subsystem 12, starting from the second storage subsystem 12, each storage sub-
  • the system receives and caches the replicated backup data sent by the previous nearest neighbor storage subsystem, and then copies the backup data and sends it to the next nearest neighbor backup storage subsystem until it is finally backed up by the N-1th storage subsystem.
  • the data is sent to the Nth storage subsystem 13, and the Nth storage subsystem 13 stores the backup data.
  • the first storage subsystem 11 to the N-1 storage subsystem are used to cache the backup data; the Nth storage subsystem 13 is configured to store the backup data. Because of the first storage subsystem 11 As an interface between the entire backup storage system and the backup server, the read and write performance of the first storage subsystem 11 is equivalent to the read and write performance of the entire backup storage system. Starting from the second storage subsystem 12, the performance and unit capacity costs of each storage subsystem are sequentially reduced, but the capacity is sequentially increased. Until the Nth storage subsystem 13 mainly reflects the function of the memory of the backup storage system, the performance and unit capacity cost are the lowest, but the storage capacity is the largest. For the backup storage system, reference may be made to the foregoing embodiment 1, and details are not described herein again.
  • the first storage subsystem 11 is used as an interface between the backup storage system and the backup server, and the backup data sent by the backup server is received and cached. Because the first storage subsystem 11 has high read and write performance. Therefore, the entire backup storage system also has high backup performance.
  • the mechanism used in the data backup method of this embodiment is the same as that of the first embodiment and the second embodiment.
  • the data backup method of the embodiment is used to store backup data for a backup storage system composed of N storage subsystems in which the read/write performance and the unit capacity cost are sequentially decreased, the storage capacity is sequentially increased, and the logically connected N storage subsystems are sequentially stored;
  • the first storage subsystem with write performance serves as an interface for communication with the backup server, improving the performance of backup processing, effectively shortening the backup window, and meeting the needs of existing business systems.
  • the i+1th storage subsystem receives the copied backup data sent by the i th storage subsystem; when the i+1th storage subsystem has no free storage space And the i+1th storage subsystem directly overwrites the backup data stored in the i+1th storage subsystem.
  • the storage subsystem will receive the replication sent by the last nearest neighbor storage subsystem.
  • the backup data directly covers the backup data that was first stored in the storage subsystem.
  • the first backup data stored has been copied to the next nearest neighbor storage subsystem, meaning no storage, so it can be overwritten.
  • the backup data in the storage subsystem if it has not been copied to the next storage subsystem, must be replicated before it is allowed to be overwritten.
  • the copied backup data sent by the received N-1th storage subsystem directly overwrites the backup data that has expired in the Nth storage subsystem 13. .
  • the expired backup data described here refers to backup data that exceeds the backup storage time.
  • the backup data of the backup storage system has a certain backup storage time, such as three months, six months, or one year, when the backup storage is exceeded. At the time, the corresponding backup data is no longer useful, and there is no need to save it. These backup data are called backup data that has expired. With this technical solution, the security of all backup data stored in the backup storage system can be guaranteed.
  • FIG. 3 is a flowchart of a data recovery method according to Embodiment 5 of the present invention.
  • the present embodiment provides a technical solution for implementing data recovery in the backup storage system of Embodiment 3 in detail in the foregoing embodiment.
  • the data recovery method in this embodiment may specifically include the following steps:
  • Step 200 The first storage subsystem receives a data request sent by the backup server, and returns the backup data to the backup server when the backup data corresponding to the data request is cached in the first storage subsystem. ;
  • the first storage subsystem 11 is configured to receive a data request sent by the backup server. When the backup data corresponding to the data request is cached in the first storage subsystem, the first storage subsystem 11 returns the backup data to the backup server.
  • the storage scheme of the embodiment of the present invention described in the second embodiment when data is stored, it is sequentially cached from front to back in the N storage subsystems. However, the capacity of the storage subsystem is incremented. When a storage subsystem has no storage space, it uses the backup data stored first to ensure the security of the stored backup data. It can be seen that the storage subsystem with the N storage subsystems has a smaller capacity, and generally stores the latest backup data, because some backup data with a longer storage time has been overwritten. The lower the storage subsystem, the larger the capacity, the smaller the probability of overwriting the backup data, and the more commonly saved backup data.
  • the backup data corresponding to the recovery request when the backup data corresponding to the recovery request is the latest backup data, it may be found in the first storage subsystem 11, and then the backup data corresponding to the recovery request may be returned to the backup server.
  • the backup data corresponding to the recovery request When the backup data corresponding to the recovery request is not in the first storage subsystem 11, the backup data may be stored long ago, that is, the storage time is long. At this time, the following steps 201 and 202 are performed to recover the data.
  • Step 201 When the backup data corresponding to the data request is not cached in the first storage subsystem In the system, assign i to 1;
  • N is the number of storage subsystems, N > 2;
  • step 201 and step 202 when the backup data corresponding to the recovery request is not in the first storage subsystem 11, the value of i is first set to 1, and the execution of the loop is performed to send the data request to the first level. Until the backup data corresponding to the data request is found or until the Nth storage subsystem is found, it is still not found.
  • i is the same as Embodiment 3 above, and N is used to indicate the number of storage subsystems, N > 2; i represents the label of one of the storage subsystems, where 1 N-1.
  • the second storage subsystem 12 After the value of i is set to 1, the second storage subsystem 12 receives the data request sent by the first storage subsystem 11, and when the backup data corresponding to the data request is cached in the second storage subsystem 12, the backup is performed. The data is returned to the backup server.
  • the backup data return process is specifically performed by the second storage subsystem 12 to return the backup data to the first storage subsystem 11, and then returned to the backup server by the first storage subsystem 11.
  • the backup data corresponding to the data request is cached in the i+1th storage subsystem, the backup data is returned to the backup server; and the process of returning backup data is similar to the foregoing embodiment 3, The level returns forward until the backup data is returned to the first storage subsystem 11, and then returned to the backup server by the first storage subsystem 11.
  • the target backup storage subsystem of the backup data receives the data request and finds the backup data to be restored. Then, the target backup storage system returns the backup data to the backup server, and the process of returning the backup data is similar to the foregoing embodiment 3, and the first level returns forward until the backup data is returned to the first storage subsystem 11. It is returned to the backup server by the first storage subsystem 11.
  • the Nth storage subsystem 13 If the data request is not sent to the Nth storage subsystem by the N-1th storage subsystem 13 . However, when the Nth storage subsystem 13 still does not store the backup data corresponding to the data request, an error message is returned to the backup server. The return of the error message is the same as the return of the backup data described above, and is also a return of the first level, and finally returned by the first storage subsystem 11 to the backup server, telling the backup server that the data corresponding to the recovery request cannot be found.
  • the implementation mechanism of the data recovery method in this embodiment is the same as that in the foregoing embodiment 3.
  • the data recovery method of the embodiment of the present invention can support the high-efficiency recovery of the backup data by using the first storage subsystem 11 to communicate with the backup server, receiving the data request sent by the server or returning the requested backup data to the server. There is a need for a business system.
  • Embodiment 4 is a flowchart of a data recovery method according to Embodiment 6 of the present invention.
  • the data recovery method of this embodiment is different from the data recovery method of the foregoing Embodiment 5 in that the return pointer is used in this embodiment.
  • the data recovery method in this embodiment may specifically include the following steps:
  • Step 300 The first storage subsystem receives a data request sent by the backup server, and when the backup data corresponding to the data request is cached in the first storage subsystem, returning the address of the backup data to the Backup server
  • the first storage subsystem After the first storage subsystem receives the data request sent by the backup server, and determines that the backup data corresponding to the data request is cached in the first storage subsystem, the first storage subsystem addresses the backup data. Return to the backup server for the backup server to recover data based on this address.
  • the pointer corresponding to the first storage subsystem may be returned to the backup server.
  • the data is restored by the backup server based on the channel established by the return pointer.
  • Step 301 When the backup data corresponding to the data request is not cached in the first storage subsystem, assign i to 1;
  • N is the number of storage subsystems, N > 2;
  • the steps 301 and 302 are different from the steps 201 and 202 in the foregoing embodiment, in which, if the backup data corresponding to the recovery request is stored in a target storage subsystem, when the recovery is found, When the corresponding backup data is obtained, the backup data address corresponding to the data request is returned to the backup server. Similar to the return of the aforementioned backup data, the address return here is also a level one return.
  • Each storage subsystem receives the address sent by the nearest nearest neighbor storage subsystem and caches the address, and then sends the address of the address sent by the cached nearest neighbor storage subsystem to the previous nearest neighbor storage subsystem; until the first The storage subsystem returns the address used to store the address sent by the second storage subsystem to the backup server.
  • a channel is formed according to the return of the address.
  • the backup server returns the backup data corresponding to the data request according to the received address and returns the channel formed by the process.
  • the Nth storage subsystem 13 still does not store the backup data corresponding to the data request, at which time the Nth storage subsystem returns an error message to the backup server.
  • the return of the error information is the same as that of the third embodiment described above, and details are not described herein again.
  • the implementation mechanism of the data recovery method in this embodiment is the same as that in the foregoing embodiment 3.
  • the data recovery method of the embodiment can quickly and efficiently find the data requested by the backup server, and return the pointer corresponding to the target storage subsystem of the backup data corresponding to the recovery request to the backup server, thereby directly recovering from the storage.
  • the data is restored in the target storage subsystem of the corresponding backup data; the backup data corresponding to the recovery request does not need to be copied and returned step by step, which can well meet the data recovery requirements of the existing service system.
  • FIG. 5 is a structural diagram of a backup system according to Embodiment 7 of the present invention; as shown in FIG. 5, the backup system of this embodiment includes a backup client 20, a backup server 21, and a backup storage storage system 22.
  • the backup server 21 is configured to receive the backup data sent by the client 20, and store the backup data in the backup storage system 22 to complete the storage operation of the backup data; and/or the backup server 21 restores the request according to the data of the user. And sending a data request to the backup storage system 22 to obtain backup data corresponding to the data request from the backup storage system 22, and send the corresponding backup data to the client 20 to complete the recovery operation of the backup data.
  • the backup client 20 when a write operation, that is, backup data is performed, the backup client 20 reads data from the business system, and transmits the read data to the backup server 21. After receiving the data sent by the client, the backup server 21 stores the data in the backup storage system 22 to complete the operation of storing the backup data.
  • the backup server 21 sends a data request to the backup storage system 22 according to the user's recovery request.
  • the backup storage system 22 After receiving the data request sent by the backup server 21, the backup storage system 22 performs a data recovery operation in each storage subsystem to acquire backup data corresponding to the recovery request.
  • the backup storage system 22 then returns the acquired backup data to the backup server 21, which in turn sends the backup data to the client 20.
  • the backup client 20 writes the backup data into the service system to complete the operation of restoring the backup data.
  • the backup storage system 22 of the present embodiment may be configured with the backup storage system of any one of the foregoing implementations to the third embodiment, and the data backup method may be implemented by using the data backup method described in the fourth embodiment.
  • the data recovery method described in Embodiment 6 implements recovery of backup data. For details, refer to the related descriptions of the first embodiment to the sixth embodiment, and details are not described herein again.
  • the backup system of this embodiment can effectively improve the performance of the backup system, shorten the backup window, and meet the requirements of the service system by adopting the above backup storage system, without significantly increasing the system cost and energy consumption.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Techniques For Improving Reliability Of Storages (AREA)

Abstract

A backup storage system, a backup system, a data backup method and a recovery method are provided. The backup storage system comprises N storage subsystems, wherein an i(th) storage subsystem is connected with an i+1(th) storage subsystem, the performance of the i(th) storage subsystem is superior to that of the i+1(th) storage subsystem, the capacity of the i(th) storage subsystem is less than that of the i+1(th) storage subsystem, the cost of the unit capacity of the i(th) storage subsystem is higher than that of the i+1(th) storage subsystem, wherein the N is not less than 2, and the i is not less than 1 and not more than N-1. The technical solution can effectively improve the performance of the system without increasing the total cost of the system significantly, thereby improving the cost performance of the system.

Description

备份存储系统、 备份系统及数据备份方法和恢复方法 本申请要求于 2010 年 2 月 24 日提交中国专利局、 申请号为 201010114868.6、 发明名称为"备份存储系统、 备份系统及数据备份方法和 恢复方法"的中国专利申请的优先权, 其全部内容通过引用结合在本申请中。 技术领域 本发明涉及数据备份技术领域, 尤其涉及一种备份存储系统、备份系统及 数据备份方法和恢复方法。  Backup storage system, backup system and data backup method and recovery method The application is submitted to the Chinese Patent Office on February 24, 2010, the application number is 201010114868.6, and the invention name is "backup storage system, backup system and data backup method and recovery method" The priority of the Chinese Patent Application, the entire contents of which is incorporated herein by reference. The present invention relates to the field of data backup technologies, and in particular, to a backup storage system, a backup system, and a data backup method and a recovery method.
背景技术 Background technique
随着信息技术的发展, 数据的安全性也越来越重要。 然而, 硬件故障、 软 件损坏、 病毒侵袭、 黑客骚扰、 错误操作以及其他意想不到的原因时时都在威 胁着应用系统中的数据。通过采用备份的方式将应用系统中的数据另外保存一 个副本, 当应用系统中的数据丟失时, 就使用这个数据副本恢复应用系统中的 数据, 从而有效避免数据丟失可能造成的损失。 因此, 备份成为提高数据安全 性的一个非常有效的方法。  With the development of information technology, the security of data is becoming more and more important. However, hardware failures, software corruption, virus attacks, hacking, erroneous operations, and other unexpected causes are constantly threatening the data in the application. By using backup to save another copy of the data in the application system, when the data in the application system is lost, the data copy is used to restore the data in the application system, thereby effectively avoiding the loss of data loss. Therefore, backup is a very effective way to improve data security.
随着现有业务系统中数据量的急剧增长,为了有效地保证业务数据的安全 性, 业务系统每天进行一次备份, 但是由于备份时需要读取系统中的数据, 所 以往往需要消耗大量的应用系统资源,如果在系统业务比较繁忙的时候进行数 据备份, 不仅严重影响系统业务性能, 而且备份性能也不好。 所以现有技术中 通常采用将第 1天下班之后到第二天上班之前作为备份时间即备份窗口 (此备 份窗口为理想备份窗口), 来有效地完成对业务系统中数据的备份。  With the rapid increase of the amount of data in the existing business system, in order to effectively ensure the security of the business data, the business system performs a backup every day, but because the data in the system needs to be read during the backup, it is often required to consume a large number of application systems. Resources, if data backup is performed when the system business is busy, not only seriously affects the system business performance, but also the backup performance is not good. Therefore, in the prior art, the backup window (the backup window is an ideal backup window) is used as the backup time (after the backup window is the ideal backup window) after the first day of work to the next day, to effectively complete the backup of the data in the business system.
传统的备份存储系统采用物理磁带库来存储备份数据。物理磁带库中的物 理磁带为线性访问介质, 物理磁带库各部件需要完成如机械臂移动定位磁带、 磁带机加载磁带以及绕带等大量的机械运动来完成一次完整的写操作以实现 对数据的备份。由于其备份性能非常低,所以不能在理想备份窗口内完成备份, 无法满足客户的备份需求。 为了有效地缩短备份窗口, 提高备份的性能, 现有 技术中采用磁盘阵列代替物理磁带库来存储备份数据。磁盘阵列为随机访问介 质, 数据定位时间短, 且磁盘阵列的读写性能远高于一般的物理带库; 能够提 高备份存储系统的性能, 满足用户对备份窗口的要求。 然而, 采用磁盘阵列作 为备份存储系统,在提高备份性能的同时,造成备份存储系统的成本和能耗都 急剧升高, 使得采用磁盘阵列的备份存储系统的性价比较低。 Traditional backup storage systems use physical tape libraries to store backup data. The physical tapes in the physical tape library are linear access media. The physical tape library components need to complete a large number of mechanical movements such as robotic arm positioning tapes, tape drive loading tapes, and tape windings to complete a complete write operation to achieve data. Backup. Due to its very low backup performance, backups cannot be completed within the ideal backup window and cannot meet the backup needs of customers. In order to effectively shorten the backup window and improve the performance of the backup, existing In the technology, disk arrays are used instead of physical tape libraries to store backup data. The disk array is a random access medium, the data positioning time is short, and the read and write performance of the disk array is much higher than that of the general physical tape library; the performance of the backup storage system can be improved to meet the requirements of the user for the backup window. However, the use of a disk array as a backup storage system increases the backup performance while causing a significant increase in the cost and power consumption of the backup storage system, making the backup storage system using the disk array less cost-effective.
发明内容 Summary of the invention
本发明实施例提供一种备份存储系统、备份系统及数据备份方法和恢复方 法, 实现了在不显著增加系统成本和能耗的同时,有效地提升备份存储系统的 性能, 以满足现有业务系统的需求。  The embodiments of the present invention provide a backup storage system, a backup system, a data backup method, and a recovery method, which can effectively improve the performance of the backup storage system and meet the existing business system without significantly increasing system cost and energy consumption. Demand.
本发明实施例提供一种备份存储系统, 包括:  An embodiment of the present invention provides a backup storage system, including:
N个存储子系统, 其中, 第 i个存储子系统与第 i+1个存储子系统相连接, 第 i个存储子系统的性能高于第 i+ 1个存储子系统,第 i个存储子系统的容量小于 第 i+1个存储子系统, 第 i个存储子系统的单位容量成本高于第 i+1个存储子系 统, N > 2, 1 N-1。  N storage subsystems, wherein the i-th storage subsystem is connected to the i+1th storage subsystem, and the performance of the i-th storage subsystem is higher than the i+1 storage subsystem, the i-th storage subsystem The capacity of the i-th storage subsystem is higher than that of the i+1th storage subsystem, N > 2, 1 N-1.
本发明实施例还提供一种备份系统, 包括备份客户端、备份服务器, 还包 括上述的备份存储系统; 所述备份服务器用于接收所述客户端发送的备份数 据, 并将所述备份数据存储在所述备份存储系统中, 以完成备份数据的存储操 作; 和 /或所述备份服务器根据用户的恢复需求, 向所述备份存储系统发送数 据请求, 以从所述备份存储系统获取与所述数据请求对应的备份数据; 并将所 述对应的备份数据发送至所述客户端, 以完成备份数据的恢复操作。  The embodiment of the present invention further provides a backup system, including a backup client and a backup server, and the backup storage system. The backup server is configured to receive backup data sent by the client, and store the backup data. In the backup storage system, to complete the storage operation of the backup data; and/or the backup server sends a data request to the backup storage system according to the recovery requirement of the user, to obtain and read from the backup storage system The data requests the corresponding backup data; and sends the corresponding backup data to the client to complete the recovery operation of the backup data.
本发明实施例还提供一种数据备份方法, 包括:  The embodiment of the invention further provides a data backup method, including:
第 1个存储子系统接收并緩存备份服务器发送的备份数据; 将复制的备份 数据发送给第 2个存储子系统;  The first storage subsystem receives and caches the backup data sent by the backup server; sends the copied backup data to the second storage subsystem;
从第 2个存储子系统开始, 第 i+1个存储子系统接收并緩存第 i个存储子系 统发送的复制的备份数据, 重复执行该步骤, 直至 i=N-l ; 第 N个存储子系统 接收并存储第 N-1个存储子系统发送的复制的备份数据, N为存储子系统的个 数, N > 2, K i < N-l ;  Starting from the second storage subsystem, the i+1th storage subsystem receives and caches the copied backup data sent by the i th storage subsystem, and repeats the step until i=N1; the Nth storage subsystem receives And storing the copied backup data sent by the N-1th storage subsystem, where N is the number of storage subsystems, N > 2, K i < Nl;
其中, 第 i个存储子系统的性能高于第 i+1个存储子系统, 第 i个存储子系统 的容量小于第 i+1个存储子系统, 第 i个存储子系统的单位容量成本高于第 i+1 个存储子系统。 Among them, the performance of the i-th storage subsystem is higher than the i+1th storage subsystem, the i-th storage subsystem The capacity of the i-th storage subsystem is higher than that of the i+1th storage subsystem.
本发明实施例还提供一种数据恢复方法, 包括:  The embodiment of the invention further provides a data recovery method, including:
第 1个存储子系统接收备份服务器发送的数据请求, 当所述数据请求对应 的备份数据緩存在所述第 1个存储子系统中时, 将所述备份数据返回给所述备 份服务器;  Receiving, by the first storage subsystem, a data request sent by the backup server, and returning the backup data to the backup server when the backup data corresponding to the data request is cached in the first storage subsystem;
当所述数据请求对应的备份数据没有緩存在所述第 1个存储子系统中时, 将 i赋值为 1 , 其中 l i N-l , N为存储子系统的个数, N > 2;  When the backup data corresponding to the data request is not cached in the first storage subsystem, assign i to 1 , where l i N-l , N is the number of storage subsystems, N > 2;
第 i+1个存储子系统接收第 i个存储子系统发送的所述数据请求, 当所述数 据请求对应的备份数据緩存在所述第 i+1个存储子系统中时, 将所述备份数据 的信息返回给所述备份服务器, 以供所述备份服务器恢复数据; 当所述数据请 求对应的备份数据没有緩存在第 i+1个存储子系统时, 将 i赋值为 i+1 ; 重复执行 该步骤, 直至 i=N-l , 返回错误信息; 其中 l i N-l , N为存储子系统的个数, N > 2。  The i+1th storage subsystem receives the data request sent by the i th storage subsystem, and when the backup data corresponding to the data request is cached in the i+1th storage subsystem, the backup is performed The information of the data is returned to the backup server for the backup server to recover data; when the backup data corresponding to the data request is not cached in the i+1th storage subsystem, the value of i is assigned to i+1; Perform this step until i=Nl, returning an error message; where li Nl , N is the number of storage subsystems, N > 2.
本发明实施例的备份存储系统、备份系统及数据备份方法和恢复方法,通 过采用读写性能和单位容量成本依次降低、 容量依次增大的依次相连的 N个存 储子系统作为备份存储系统, 并将具有最高读写性能的第 1个存储子系统作为 与备份服务器的接口, 以实现对备份数据的存储及恢复处理。采用本发明实施 例的技术方案, 能够在不显著提高备份系统成本和能耗的基石出上,有效地提高 了备份系统的性能。 从而有效地提高了备份存储系统的性价比。  The backup storage system, the backup system, the data backup method, and the recovery method of the embodiment of the present invention use the N storage subsystems sequentially connected with the read/write performance and the unit capacity cost, and sequentially increase the capacity, as the backup storage system, and The first storage subsystem with the highest read/write performance is used as an interface with the backup server to implement storage and recovery processing of backup data. By adopting the technical solution of the embodiment of the present invention, the performance of the backup system can be effectively improved without significantly increasing the cost and power consumption of the backup system. Thereby effectively improving the cost performance of the backup storage system.
附图说明 为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施 例或现有技术描述中所需要使用的附图作一简单地介绍,显而易见地, 下面描 述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不 付出创造性劳动性的前提下, 还可以根据这些附图获得其他的附图。 BRIEF DESCRIPTION OF THE DRAWINGS In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, a brief description of the drawings used in the embodiments or the prior art description will be briefly described below. The drawings are only some of the embodiments of the present invention, and those skilled in the art can obtain other drawings based on these drawings without any inventive labor.
图 1为本发明实施例一的备份存储系统的结构图; 图 2为本发明实施例四提供的数据备份方法的流程图; 图 3为本发明实施例五提供的数据恢复方法的流程图; 图 4为本发明实施例六提供的数据恢复方法的流程图; 图 5为本发明实施例七提供的备份系统结构图。 1 is a structural diagram of a backup storage system according to Embodiment 1 of the present invention; FIG. 2 is a flowchart of a data backup method according to Embodiment 4 of the present invention; 3 is a flowchart of a data recovery method according to Embodiment 5 of the present invention; FIG. 4 is a flowchart of a data recovery method according to Embodiment 6 of the present invention; and FIG. 5 is a structural diagram of a backup system according to Embodiment 7 of the present invention.
具体实施方式 detailed description
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清 楚、 完整地描述, 显然, 所描述的实施例仅仅是本发明一部分实施例, 而不是 全部的实施例。基于本发明中的实施例, 本领域普通技术人员在没有作出创造 性劳动前提下所获得的所有其他实施例, 都属于本发明保护的范围。 BRIEF DESCRIPTION OF THE DRAWINGS The technical solutions in the embodiments of the present invention will be described in detail below with reference to the accompanying drawings. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative work are within the scope of the present invention.
图 1为本发明实施例一的备份存储系统的结构图; 如图 1所示, 本实施例的 备份存储系统, 包括 N个存储子系统。  1 is a structural diagram of a backup storage system according to Embodiment 1 of the present invention; as shown in FIG. 1, the backup storage system of this embodiment includes N storage subsystems.
N个存储子系统, 其中, 第 i个存储子系统与第 i+1个存储子系统相连接, 第 i个存储子系统的性能高于第 i+ 1个存储子系统,第 i个存储子系统的容量小于 第 i+1个存储子系统, 第 i个存储子系统的单位容量成本高于第 i+1个存储子系 统, N > 2, K i < N-l , N和 i均为正整数。  N storage subsystems, wherein the i-th storage subsystem is connected to the i+1th storage subsystem, and the performance of the i-th storage subsystem is higher than the i+1 storage subsystem, the i-th storage subsystem The capacity of the i-th storage subsystem is lower than that of the i+1th storage subsystem, N > 2, K i < Nl , and N and i are positive integers.
具体地, 本实施例介绍一种备份存储系统, 第 1个存储子系统 11、 第 2个存 储子系统 12—直到第 N个存储子系统 13依次相连。 也就是说, 各存储子系统仅 与其前一个最近邻的存储子系统和后一个最近邻的存储子系统相连 ,实现数据 交换。 需要说明的是, 备份存储系统中还包括一控制器 10, 控制器 10用于控制 各相邻的存储子系统之间备份数据的交换。  Specifically, this embodiment introduces a backup storage system, a first storage subsystem 11, and a second storage subsystem 12 - until the Nth storage subsystem 13 is sequentially connected. That is to say, each storage subsystem is only connected to its previous nearest neighbor storage subsystem and the next nearest neighbor storage subsystem to implement data exchange. It should be noted that the backup storage system further includes a controller 10 for controlling the exchange of backup data between adjacent storage subsystems.
本实施例备份存储系统的第 1个存储子系统 11读写性能最高、 单位容量成 本最高, 但是其容量较小。 比如第 1个存储子系统 11可以选取小容量的虚拟磁 带库、 或者磁盘阵列, 以及随着高新技术的发展, 可能还会采用 FLASH芯片 阵列等等具有较高读写性能的存储介质。考虑到高读写性能的存储介质其成本 一般非常高,单位存储容量的成本较高, 本实施例备份存储系统可以选取容量 较小的高性能存储介质作为第 1个存储子系统 11 ,主要用于实现高速读写性能。 对于第 1个存储子系统 11的容量通常要求要至少容纳一个全备份数据集, 也就 是至少能够支持一次全备份过程,其容量足以保证能够存储完成一次备份存储 操作所需要存储的备份数据。实际使用中可以根据实际情况选择合适容量大小 的第 1个存储子系统 11。 The first storage subsystem 11 of the backup storage system of this embodiment has the highest read/write performance and the highest unit capacity cost, but its capacity is small. For example, the first storage subsystem 11 can select a small-capacity virtual tape library or a disk array, and with the development of high-tech, a storage medium with high read/write performance such as a FLASH chip array may be used. Considering the storage medium with high read/write performance, the cost is generally very high, and the cost per unit storage capacity is high. In this embodiment, the backup storage system can select a high-performance storage medium with a small capacity as the first storage subsystem 11, and mainly uses For high-speed read and write performance. For the capacity of the first storage subsystem 11, it is usually required to accommodate at least one full backup data set, that is, At least one full backup process can be supported, and its capacity is sufficient to store the backup data needed to complete a backup storage operation. In actual use, the first storage subsystem 11 of a suitable capacity size can be selected according to actual conditions.
第 1个存储子系统 11以及依次位于其后的第 2个存储子系统 12、 第 3个存储 子系统 第 N个存储子系统 13 , 各最近邻的存储子系统之间的读写性能和 单位容量成本依次降低、 容量依次增大。 从性能的角度来分析, 第 1存储子系 统 11主要用于体现备份存储系统的读写性能; 中间的第 2个存储子系统 12到第 N-1个存储子系统可以起緩存的作用。 而且各最近邻的存储子系统之间的读写 性能要能够相匹配, 以满足读写顺畅。 比如当前存储子系统的读写性能为 1000mb/s, 而其后最近邻的存储子系统的读写性能为 100mb/s。 两者相差太多, 无法实现正常匹配, 亦不能实现高效率的备份; 可以在中间增加一个或者多个 读写性能 1000mb/s至 100mb/s之间的存储子系统, 以实现正常的匹配。 比如中 间的第 2个存储子系统 12到第 N- 1个存储子系统就是用于将第 1个存储子系统 11 和第 N个存储子系统 13的读写性能的匹配, 这里需要说明, 各最近邻的存储子 系统之间的读写性能和单位容量成本依次降低、容量依次增大,但是中间出现 读写性能和 /或单位容量成本相同的存储子系统也是可能的。  The first storage subsystem 11 and the second storage subsystem 12, the third storage subsystem, the Nth storage subsystem 13, and the nearest neighbor storage subsystem read and write performance and units The capacity costs are reduced in turn, and the capacity is increased in turn. From the performance point of view, the first storage subsystem 11 is mainly used to reflect the read and write performance of the backup storage system; the second storage subsystem 12 to the N-1th storage subsystem can function as a cache. Moreover, the read and write performance between the nearest neighbor storage subsystems can be matched to meet the smooth read and write. For example, the current storage subsystem has a read/write performance of 1000 mb/s, and the read/write performance of the nearest neighbor storage subsystem is 100 mb/s. There is too much difference between the two, and normal matching cannot be achieved, and efficient backup cannot be achieved. One or more storage subsystems with read/write performance between 1000 mb/s and 100 mb/s can be added in the middle to achieve normal matching. For example, the second storage subsystem 12 to the N-1 storage subsystem in the middle are used to match the read/write performance of the first storage subsystem 11 and the Nth storage subsystem 13. The read/write performance and unit capacity cost between the nearest neighbor storage subsystems are sequentially reduced, and the capacity is sequentially increased, but a storage subsystem with the same read/write performance and/or unit capacity cost is also possible.
从容量的角度来分析, 第 N个存储子系统 13用于体现备份存储系统的存储 数据。 前 N-1个存储子系统都是用于緩存备份数据, 相当于备份数据经过 N-1 级的緩存最终存储到第 N个存储子系统。 从成本的角度来分析, 现有的备份存 储系统通常仅包含一个本实施例备份存储系统的第 N个存储子系统 13。 优选 地, 和第 N个存储子系统的容量相比, 前 N-1个存储子系统的容量都很小。 比 如, 优选地, 第 N个存储子系统 13的存储容量是前 N-1个存储子系统的容量总 和的成千上万倍, 而其单位容量的成本为前 N-1个存储子系统单位容量的成本 的几分之一, 此种情况下, 本实施例备份存储系统由于增加前 N-1个存储子系 统而增加的成本还不到总成本的百分之一。可以看出, 和现有的备份存储系统 相比, 本实施例的备份存储系统几乎不会增加成本。 另外, 由于前 N-1个存储 子系统的容量都不大, 其功耗总量也不高。  From the perspective of capacity, the Nth storage subsystem 13 is used to represent the storage data of the backup storage system. The first N-1 storage subsystems are used to cache backup data, which is equivalent to the backup data being stored in the N-1 level cache and finally stored in the Nth storage subsystem. From a cost perspective, existing backup storage systems typically contain only one Nth storage subsystem 13 of the backup storage system of the present embodiment. Preferably, the capacity of the first N-1 storage subsystems is small compared to the capacity of the Nth storage subsystem. For example, preferably, the storage capacity of the Nth storage subsystem 13 is tens of thousands of times the total capacity of the first N-1 storage subsystems, and the cost per unit capacity is the first N-1 storage subsystem units. The cost of the capacity is a fraction of the total cost. In this case, the cost of the backup storage system in this embodiment is less than one percent of the total cost due to the addition of the first N-1 storage subsystems. It can be seen that the backup storage system of this embodiment hardly increases the cost compared to the existing backup storage system. In addition, since the capacity of the first N-1 storage subsystems is not large, the total power consumption is not high.
当本实施例的备份存储系统的第 1个存储子系统采用磁盘阵列, 第 N个存 储子系统采用物理磁带库。 将本实施例的备份存储系统和与第 N个存储子系统 具有相等容量的采用磁盘阵列的备份存储系统相比,本实施例的备份存储系统 同样可以达到磁盘阵列的读写性能,但其成本和功耗要比按同等容量的采用磁 盘阵列的备份存储系统要小很多。 When the first storage subsystem of the backup storage system of this embodiment adopts a disk array, the Nth storage subsystem uses a physical tape library. The backup storage system of this embodiment is compared with the backup storage system of the present embodiment and the backup storage system using the disk array having the same capacity as the Nth storage subsystem. The read and write performance of the disk array can also be achieved, but the cost and power consumption are much smaller than the backup storage system using the disk array of the same capacity.
本实施例的备份存储系统, 通过采用读写性能依次降低、 容量依次增大、 单位容量成本依次升高的依次逻辑相连的 N个存储子系统构成。 可以满足系统 性能和容量的需求,在成本和能耗都不显著增高的情况下, 能够有效地提高备 份存储系统的性能, 从而有效地提高备份存储系统的性价比。  The backup storage system of this embodiment is configured by sequentially adopting N storage subsystems in which the read/write performance is sequentially decreased, the capacity is sequentially increased, and the unit capacity cost is sequentially increased. It can meet the system performance and capacity requirements, and can effectively improve the performance of the backup storage system without significantly increasing the cost and energy consumption, thereby effectively improving the cost performance of the backup storage system.
本发明实施例二提供一种备份存储系统。本实施例的备份存储系统在上述 实施例一的备份存储系统的基础上,进一步介绍实施例一的备份存储系统的备 份存储的性能。  Embodiment 2 of the present invention provides a backup storage system. The backup storage system of this embodiment further describes the performance of the backup storage of the backup storage system of the first embodiment on the basis of the backup storage system of the first embodiment.
本实施例的备份存储系统中的第 1个存储子系统 11用于接收并緩存备份服 务器发送的备份数据; 当 i+l<N时, 第 i+1个存储子系统用于接收并緩存第 i个 存储子系统发送的复制的备份数据; 当 i+l=N时, 第 i+1个存储子系统, 用于接 收并存储第 i个存储子系统发送的复制的备份数据。  The first storage subsystem 11 in the backup storage system of the embodiment is configured to receive and cache backup data sent by the backup server; when i+l<N, the i+1th storage subsystem is configured to receive and cache the first The backup backup data sent by the i storage subsystems; when i+l=N, the i+1th storage subsystem is configured to receive and store the copied backup data sent by the i th storage subsystem.
具体地, 第 1个存储系统 11的读写性能最高, 用来体现整个备份存储系统 的读写性能。第 1个存储子系统 11用于接收并緩存备份服务器发送的备份数据, 作为整个备份存储系统与备份服务器的接口。 因为第 1个备份存储系统 11的容 量较小, 接收完备份服务器发送的备份数据之后, 将备份数据緩存下来, 然后 将备份数据复制一份, 并发送给第 2个存储子系统 12, 然后第 2个存储子系统 12 接收并緩存备份数据之后,再将备份数据复制并发送至下一个最近邻的备份存 储子系统, 依次执行上述操作, 直至将备份数据最终发送至第 N个存储子系统 Specifically, the first storage system 11 has the highest read and write performance, and is used to reflect the read and write performance of the entire backup storage system. The first storage subsystem 11 is configured to receive and cache backup data sent by the backup server as an interface between the entire backup storage system and the backup server. Because the capacity of the first backup storage system 11 is small, after receiving the backup data sent by the backup server, the backup data is cached, then the backup data is copied and sent to the second storage subsystem 12, and then After the two storage subsystems 12 receive and cache the backup data, the backup data is copied and sent to the next nearest neighbor backup storage subsystem, and the above operations are performed in sequence until the backup data is finally sent to the Nth storage subsystem.
13 , 由第 N个存储子系统 13存储该备份数据。 各最近邻存储子系统之间备份数 据的发送与接收时在控制器 10的调度下完成的。 13. The backup data is stored by the Nth storage subsystem 13. The transmission and reception of backup data between the nearest neighbor storage subsystems is completed under the scheduling of the controller 10.
从整个备份存储系统来看, 与备份服务器相连的仅仅是第 1个存储子系统 From the perspective of the entire backup storage system, only the first storage subsystem is connected to the backup server.
11 , 第 1个存储子系统 11的读写性能决定了整个备份存储系统的读写性能。 而 第 1个存储子系统 11接收到备份数据之后, 该备份数据经一级一级地复制并緩 存, 属于在备份存储系统内部的操作, 不影响备份存储系统整体的读写性能。 11, the read and write performance of the first storage subsystem 11 determines the read and write performance of the entire backup storage system. After the first storage subsystem 11 receives the backup data, the backup data is copied and cached one level at a time, and belongs to the operation inside the backup storage system, and does not affect the overall read and write performance of the backup storage system.
本实施的备份存储系统, 采用读写性能最高的第 1个存储子系统作为与备 份服务器相通信的接口,然后再将备份数据依次向其后的各个存储子系统中緩 存, 最终将备份数据存储在第 N个存储子系统。 本实施例的备份存储系统备份 性能较高, 完成备份时间较短; 能够满足现有业务系统的需求。 需要说明的是, 在本实施例二的基石出上, 当 l i N-2, JLN > 3时, 所述 第 i+1个存储子系统具体用于接收第 i个存储子系统发送的复制的备份数据, 将 所述复制的备份数据覆盖最先存储在所述第 i+1个存储子系统中的备份数据; 具体地,当第 1个存储子系统 11至第 N- 1个存储子系统中某一个存储子系统 没有空闲存储空间时 ,该存储子系统将接收到的上一个最近邻存储子系统发送 的复制的备份数据直接覆盖最先存储在该存储子系统中的备份数据。通常情况 下, 最先存储的备份数据已经复制到下一个最近邻的存储子系统中, 没有存储 的意义, 因此可以被覆盖。 然而, 存储子系统中的备份数据, 如果还没有复制 到下一个存储子系统, 则必须先完成复制才允许被覆盖。 采用上述方案, 可以 保证各存储子系统中保存的都是最新的备份数据,有效地保证存储的备份数据 的安全性。 The backup storage system of the present implementation adopts the first storage subsystem with the highest read and write performance as the interface for communicating with the backup server, and then sequentially caches the backup data to the subsequent storage subsystems, and finally stores the backup data. At the Nth storage subsystem. The backup storage system in this embodiment has high backup performance and short backup time; it can meet the requirements of existing business systems. It should be noted that, on the basis of the second embodiment, when the Li N-2, JLN > 3, the i+1th storage subsystem is specifically configured to receive the replication sent by the i th storage subsystem. Backing up the data, overwriting the backup data that is first stored in the i+1th storage subsystem; specifically, when the first storage subsystem 11 to the N-1 storage subsystem When one of the storage subsystems has no free storage space, the storage subsystem directly receives the backup data sent by the last nearest neighbor storage subsystem to directly overwrite the backup data first stored in the storage subsystem. Normally, the first stored backup data has been copied to the next nearest neighbor storage subsystem, meaning no storage, so it can be overwritten. However, the backup data in the storage subsystem, if it has not been copied to the next storage subsystem, must be replicated before it is allowed to be overwritten. By adopting the above solution, it is ensured that the latest backup data is stored in each storage subsystem, thereby effectively ensuring the security of the stored backup data.
而当 i=N- 1时,所述第 i+1个存储子系统具体用于接收第 i个存储子系统发送 的复制的备份数据 , 将所述复制的备份数据覆盖所述第 i+1个存储子系统中已 经过期的备份数据。  When i=N-1, the i+1th storage subsystem is specifically configured to receive the copied backup data sent by the i th storage subsystem, and overwrite the duplicate backup data with the i+1th Backup data that has expired in the storage subsystem.
具体地,即当第 N个存储子系统 13没有空闲存储空间时,将接收到的第 N-1 个存储子系统发送的复制的备份数据直接覆盖第 N个存储子系统 13中已经过 期的备份数据。这里所述的已经过期的备份数据指的是超过备份存储时间的数 据。 备份存储系统的备份数据都有一定的存储时间, 比如为三个月、 半年或者 一年, 当超过这个备份存储时间, 这些备份数据就再也没有使用价值了, 没有 再保存的必要性。这些备份数据称之为已经过期的备份数据。采用此技术方案, 可以保证存储在备份存储系统中所有备份数据的安全性。  Specifically, when the Nth storage subsystem 13 has no free storage space, the copied backup data sent by the received N-1th storage subsystem directly overwrites the expired backup in the Nth storage subsystem 13. data. The expired backup data described here refers to data that exceeds the backup storage time. The backup data of the backup storage system has a certain storage time, for example, three months, half a year, or one year. When the backup storage time is exceeded, the backup data is no longer useful, and there is no need to save it. These backup data are called backup data that has expired. With this technical solution, the security of all backup data stored in the backup storage system can be guaranteed.
本发明实施例三提供一种备份存储系统。上述实施例二的备份存储系统主 要用于接收备份服务器的备份数据并进行存储。本实施例在上述实施例一备份 存储系统的基础上从数据恢复的角度来描述备份存储系统。下面详细描述本实 施例的技术方案。  Embodiment 3 of the present invention provides a backup storage system. The backup storage system of the second embodiment is mainly used for receiving backup data of the backup server and storing the data. This embodiment describes the backup storage system from the perspective of data recovery on the basis of the backup storage system of the first embodiment. The technical solution of the present embodiment will be described in detail below.
第 1个存储子系统 11用于接收备份服务器发送的数据请求。 当所述数据请 求对应的备份数据緩存在第 1个存储子系统 11中时,第 1个存储子系统 11将所述 备份数据返回给所述备份服务器。  The first storage subsystem 11 is configured to receive a data request sent by the backup server. When the backup data corresponding to the data request is cached in the first storage subsystem 11, the first storage subsystem 11 returns the backup data to the backup server.
所述第 i+1个存储子系统, 还用于当所述数据请求对应的备份数据没有緩 存在所述第 1个存储子系统中时, 接收第 i个存储子系统发送的所述数据请求, 当所述数据请求对应的备份数据緩存在所述第 i+1个存储子系统中时, 将所述 备份数据返回给所述备份服务器。 The i+1th storage subsystem is further configured to receive the data request sent by the i th storage subsystem when the backup data corresponding to the data request is not cached in the first storage subsystem , And when the backup data corresponding to the data request is cached in the (i+1)th storage subsystem, returning the backup data to the backup server.
具体地, 当所述数据请求对应的备份数据没有緩存在所述第 1个存储子系 统 11中时, 将所述数据请求发给第 2个存储子系统 12以在第 2个存储子系统 12 中获取与所述数据请求对应的备份数据。 若第 2个存储子系统 12中仍没有緩存 所述数据请求对应的备份数据,再将所述数据请求发给下一个最近邻存储子系 统;依此类推, 直至将所述数据请求发送给某一个緩存有所述数据请求对应的 备份数据的目标存储子系统;然后由该目标存储子系统将所述数据请求对应的 备份数据返回给备份服务器。 为便于描述, 本发明实施例中的目标存储子系统 为緩存有数据请求对应的备份数据的存储子系统。  Specifically, when the backup data corresponding to the data request is not cached in the first storage subsystem 11, the data request is sent to the second storage subsystem 12 to be in the second storage subsystem 12. Obtaining backup data corresponding to the data request. If the backup data corresponding to the data request is still not cached in the second storage subsystem 12, the data request is sent to the next nearest neighbor storage subsystem; and so on, until the data request is sent to the A target storage subsystem that caches the backup data corresponding to the data request; and then the target storage subsystem returns the backup data corresponding to the data request to the backup server. For convenience of description, the target storage subsystem in the embodiment of the present invention is a storage subsystem that caches backup data corresponding to the data request.
需要说明的是,本实施例中的目标存储子系统将所述数据请求对应的备份 数据返回给备份服务器是一级一级的返回,即先由目标存储子系统将数据请求 对应的备份数据返回给目标存储子系统的前一个最近邻存储子系统,然后再由 目标存储子系统的前一个最近邻存储子系统依次向前返回。也就是,每一个存 储子系统将接收到的后一个最近邻存储子系统发送的数据请求对应的备份数 据返回给其前一个最近邻存储子系统 ,直到将数据请求对应的备份数据返回至 第 1个存储子系统, 最后由第 1个存储子系统返回给备份服务器。  It should be noted that, in the embodiment, the target storage subsystem returns the backup data corresponding to the data request to the backup server, which is a level-1 return, that is, the target storage subsystem first returns the backup data corresponding to the data request. The previous nearest neighbor storage subsystem of the target storage subsystem is then forwarded back by the previous nearest neighbor storage subsystem of the target storage subsystem. That is, each storage subsystem returns the backup data corresponding to the data request sent by the next nearest neighbor storage subsystem to its previous nearest neighbor storage subsystem until the backup data corresponding to the data request is returned to the first The storage subsystem is finally returned to the backup server by the first storage subsystem.
若直到由第 N-1个存储子系统将所述数据请求发送给第 N个存储子系统 13 , 但是第 N个存储子系统 13仍没有存储所述数据请求对应的备份数据时, 此 时第 N个存储子系统返回错误信息给所述备份服务器。 错误信息的返回同上述 数据请求对应的备份数据的返回一样, 也是一级级的返回, 最后由第 1个存储 子系统 11返回给备份服务器, 告诉备份服务器找不到与恢复请求对应的数据。  If the data request is sent to the Nth storage subsystem 13 by the N-1th storage subsystem, but the Nth storage subsystem 13 still does not store the backup data corresponding to the data request, N storage subsystems return error information to the backup server. The return of the error message is the same as the return of the backup data corresponding to the above data request, and is also a level one return, and finally returned by the first storage subsystem 11 to the backup server, telling the backup server that the data corresponding to the recovery request cannot be found.
数据恢复过程中数据请求的发送以及所述数据请求对应的备份数据返回 都是在控制器 10的调度下完成。 因为在存储过程中进行写数据操作的时候,控 制器 10中就保存有备份数据的存储记录。在进行恢复数据过程中进行读数据操 作的时候, 可以参考控制器 10中保存的备份数据的存储记录, 以更方便的找到 数据请求对应的备份数据。但是优选地,数据请求对应的备份数据还是一级级 的向前返回, 直到给第 1个存储子系统 11 , 再由第 1个存储子系统返回给备份服 务器。 在容灾应用中, 可以根据控制器 10中保存的备份数据的存储记录, 找到 存储数据请求对应的备份数据的存储子系统,可以直接将该存储子系统运输到 备份服务器的本地, 与所述备份服务器对接实现备份数据的恢复。 The transmission of the data request during the data recovery process and the return of the backup data corresponding to the data request are all performed under the scheduling of the controller 10. Since the write data operation is performed in the storage process, the storage record of the backup data is saved in the controller 10. When the data read operation is performed during the process of restoring data, the storage record of the backup data saved in the controller 10 can be referred to, so as to more conveniently find the backup data corresponding to the data request. Preferably, however, the backup data corresponding to the data request is returned to the first level, until the first storage subsystem 11 is returned to the backup server by the first storage subsystem. In the disaster recovery application, the storage subsystem storing the backup data corresponding to the data request may be found according to the storage record of the backup data saved in the controller 10, and the storage subsystem may be directly transported to the storage subsystem. The backup server is localized and docked with the backup server to implement backup data recovery.
本发明实施例的备份存储系统,第 1个存储子系统 11与备份服务器相通信, 接收服务器发送的数据请求或者向服务器返回请求的备份数据,以实现备份数 据的高效率恢复, 满足现有备份存储系统对业务的需求。  In the backup storage system of the embodiment of the present invention, the first storage subsystem 11 communicates with the backup server, receives a data request sent by the server, or returns the requested backup data to the server, so as to achieve efficient recovery of the backup data and satisfy the existing backup. The storage system's demand for business.
需要说明的是, 在本实施例的上述技术方案的基础上, 所述第 i+1个存储 子系统, 还用于当所述数据请求对应的备份数据没有緩存在第 1个存储子系统 11中时, 接收第 i个存储子系统发送的所述数据请求, 当所述数据请求对应的 备份数据緩存在所述第 i+1个存储子系统中时, 将所述第 i+1个存储子系统存储 所述备份数据的地址返回给所述备份服务器。  It should be noted that, based on the foregoing technical solution of the embodiment, the i+1th storage subsystem is further configured to: when the backup data corresponding to the data request is not cached in the first storage subsystem 11 Receiving, in the middle, the data request sent by the i th storage subsystem, when the backup data corresponding to the data request is cached in the i+1th storage subsystem, storing the i+1th storage The subsystem stores the address of the backup data back to the backup server.
具体地,当数据请求对应的备份数据没有緩存在第 1个存储子系统 11中时, 第 1个存储子系统 11将数据请求发给第 2个存储子系统 12 , 若第 2个存储子系统 12中緩存有数据请求对应的备份数据, 则将第 2个存储子系统 12将存储该数据 请求对应的备份数据的地址(即该数据请求对应的备份数据的指针)返回给备 份服务器。若第 2个存储子系统 12中仍没有緩存所述数据请求对应的备份数据, 再将所述数据请求发给下一个最近邻存储子系统;依此类推, 直至将所述数据 请求发送给某一个緩存有所述数据请求对应的备份数据的目标存储子系统;然 后将该目标存储子系统中存储该数据请求对应的备份数据的地址(即该数据请 求对应的备份数据的指针)返回给备份服务器。 此时对应的, 备份服务器根据 该数据请求对应的备份数据的地址 (即该数据请求对应的备份数据的指针 )返 回给备份服务器的过程中形成的通道, 恢复数据。  Specifically, when the backup data corresponding to the data request is not cached in the first storage subsystem 11, the first storage subsystem 11 sends the data request to the second storage subsystem 12, if the second storage subsystem 12, the backup data corresponding to the data request is cached, and the second storage subsystem 12 returns the address of the backup data corresponding to the data request (that is, the pointer of the backup data corresponding to the data request) to the backup server. If the backup data corresponding to the data request is still not cached in the second storage subsystem 12, the data request is sent to the next nearest neighbor storage subsystem; and so on, until the data request is sent to the a target storage subsystem that caches the backup data corresponding to the data request; and then returns an address of the backup data corresponding to the data request in the target storage subsystem (ie, a pointer of the backup data corresponding to the data request) to the backup server. Correspondingly, the backup server returns the channel formed in the process of returning to the backup server according to the address of the corresponding backup data (that is, the pointer of the backup data corresponding to the data request), and restores the data.
其中将该目标存储子系统中存储该数据请求对应的备份数据的地址(即该 数据请求对应的备份数据的指针)返回给备份服务器的过程是由各存储子系统 一级级的返回,因为每一个存储子系统仅与其最近邻的前一个存储子系统和后 一个最近邻存储子系统相连以实现数据交换。 所以,每一个存储子系统接收到 后一个最近邻存储子系统发送的地址之后,该存储子系统会緩存该地址并将该 存储子系统中用于緩存该地址的地址返回给前一个最近邻存储子系统。这样一 级级的返回, 直到第 2个存储子系统 12返回给第 1个存储子系统 11一个地址, 再 由第 1个存储子系统 11将存储第 2个存储子系统 12返回的地址的地址返回给备 份服务器, 地址返回的过程形成一个由目标存储子系统到备份服务器的通道, 备份服务器根据该通道便能够找到恢复请求对应的备份数据。 本实施例的备份存储系统,能够快速有效地找到备份服务器恢复请求对应 的备份数据后,并将存储有恢复请求对应的备份数据的目标存储子系统对应的 指针返回给备份服务器,即可直接从存储有恢复请求对应的备份数据的目标存 储子系统中恢复数据; 而不需要将恢复请求对应的备份数据逐级复制并返回, 能够艮好地满足现有业务系统的数据恢复需求。 The process of returning the address of the backup data corresponding to the data request corresponding to the data request in the target storage subsystem (that is, the pointer of the backup data corresponding to the data request) to the backup server is returned by each storage subsystem level first level, because each A storage subsystem is only connected to the nearest storage subsystem and the nearest nearest neighbor storage subsystem of its nearest neighbor for data exchange. Therefore, after each storage subsystem receives the address sent by the next nearest neighbor storage subsystem, the storage subsystem caches the address and returns the address in the storage subsystem for caching the address to the previous nearest neighbor storage. Subsystem. Such a level of return until the second storage subsystem 12 returns an address to the first storage subsystem 11, and then the first storage subsystem 11 will store the address of the address returned by the second storage subsystem 12. Returning to the backup server, the process of returning the address forms a channel from the target storage subsystem to the backup server, and the backup server can find the backup data corresponding to the recovery request according to the channel. The backup storage system of the embodiment can quickly and efficiently find the backup data corresponding to the backup server recovery request, and return the pointer corresponding to the target storage subsystem of the backup data corresponding to the recovery request to the backup server, thereby directly The data is restored in the target storage subsystem that stores the backup data corresponding to the recovery request. The backup data corresponding to the recovery request does not need to be copied and returned step by step, and the data recovery requirement of the existing service system can be satisfactorily satisfied.
以上所描述的备份存储系统实施例仅仅是示意性的,其中所述作为分离部 件说明的子系统可以是或者也可以不是物理上分开的, 即可以位于一个地方, 或者也可以分布式分布在多个地方上。可以根据实际的需要选择其中的部分或 者全部子系统来实现本实施例方案的目的。本领域普通技术人员在不付出创造 性的劳动的情况下, 即可以理解并实施。  The backup storage system embodiments described above are merely illustrative, wherein the subsystems described as separate components may or may not be physically separate, that is, may be located in one place, or may be distributed in multiple locations. Place. Some or all of the subsystems may be selected according to actual needs to achieve the objectives of the solution of the embodiment. Those of ordinary skill in the art can understand and implement without undue labor.
图 2为本发明实施例四提供的数据备份方法的流程图; 如图 2所示, 本实施 例的数据备份方法, 具体可以包括以下步骤:  2 is a flowchart of a data backup method according to Embodiment 4 of the present invention; as shown in FIG. 2, the data backup method in this embodiment may specifically include the following steps:
步骤 100、第 1个存储子系统接收并緩存备份服务器发送的备份数据; 将复 制的备份数据发送给第 2个存储子系统;  Step 100: The first storage subsystem receives and caches backup data sent by the backup server, and sends the copied backup data to the second storage subsystem.
步骤 101、 从 i=l开始, 第 i+1个存储子系统接收并緩存第 i个存储子系统发 送的复制的备份数据, 重复执行该步骤, 直至 i=N-l , 第 N个存储子系统接收 并存储第 N-1个存储子系统发送的复制的备份数据; N为存储子系统的个数, N > 2, K i N-l , N和 i均为正整数;  Step 101: Starting from i=l, the i+1th storage subsystem receives and caches the copied backup data sent by the i th storage subsystem, and repeats the step until i=N1, and the Nth storage subsystem receives And storing the copied backup data sent by the N-1th storage subsystem; N is the number of storage subsystems, N > 2, K i Nl , N and i are positive integers;
其中, 第 i个存储子系统的性能高于第 i+1个存储子系统, 第 i个存储子系统 的容量小于第 i+1个存储子系统, 第 i个存储子系统的单位容量成本高于第 i+1 个存储子系统。  The performance of the i-th storage subsystem is higher than that of the i+1th storage subsystem, and the capacity of the i-th storage subsystem is smaller than the i+1th storage subsystem, and the unit capacity of the i-th storage subsystem is high. On the i+1th storage subsystem.
具体地,本实施例的数据备份方法是在上述实施例一所提供的备份存储系 统中, 实现上述实施例二的备份存储性能的技术方案。 第 1个存储子系统 11用 于接收备份服务器发送的备份数据, 并緩存备份数据, 然后复制备份数据发送 到第 2个存储子系统 12,从第 2个存储子系统 12开始,每一个存储子系统都接收 并緩存前一个最近邻存储子系统发送的复制的备份数据,然后再复制备份数据 并发送至下一个最近邻的备份存储子系统, 直至最后由第 N-1个存储子系统将 备份数据发送至第 N个存储子系统 13 , 由第 N个存储子系统 13存储备份数据。  Specifically, the data backup method of the embodiment is a technical solution for implementing the backup storage performance of the second embodiment in the backup storage system provided in the first embodiment. The first storage subsystem 11 is configured to receive backup data sent by the backup server, and cache the backup data, and then copy the backup data to the second storage subsystem 12, starting from the second storage subsystem 12, each storage sub- The system receives and caches the replicated backup data sent by the previous nearest neighbor storage subsystem, and then copies the backup data and sends it to the next nearest neighbor backup storage subsystem until it is finally backed up by the N-1th storage subsystem. The data is sent to the Nth storage subsystem 13, and the Nth storage subsystem 13 stores the backup data.
由上述可知,第 1个存储子系统 11至第 N- 1个存储子系统均用于緩存所述备 份数据; 第 N个存储子系统 13用于存储所述备份数据。 因为第 1个存储子系统 11作为整个备份存储系统与所述备份服务器相互通信的接口, 第 1个存储子系 统 11的读写性能即相当于整个备份存储系统地读写性能。 从第 2个存储子系统 12开始, 各存储子系统的性能和单位容量成本依次降低, 但是容量依次增大。 直到第 N个存储子系统 13主要体现备份存储系统的存储器的功能, 性能和单位 容量成本最低,但存储容量最大的。关于备份存储系统可以参照上述实施例一, 在此不再赘述。 It can be seen from the above that the first storage subsystem 11 to the N-1 storage subsystem are used to cache the backup data; the Nth storage subsystem 13 is configured to store the backup data. Because of the first storage subsystem 11 As an interface between the entire backup storage system and the backup server, the read and write performance of the first storage subsystem 11 is equivalent to the read and write performance of the entire backup storage system. Starting from the second storage subsystem 12, the performance and unit capacity costs of each storage subsystem are sequentially reduced, but the capacity is sequentially increased. Until the Nth storage subsystem 13 mainly reflects the function of the memory of the backup storage system, the performance and unit capacity cost are the lowest, but the storage capacity is the largest. For the backup storage system, reference may be made to the foregoing embodiment 1, and details are not described herein again.
采用本实施例的存储方法, 采用第 1个存储子系统 11作为备份存储系统与 备份服务器的接口, 接收并緩存备份服务器发送的备份数据。 由于第 1个存储 子系统 11具有较高读写性能。 所以整个备份存储系统也具有较高的备份性能。  With the storage method of this embodiment, the first storage subsystem 11 is used as an interface between the backup storage system and the backup server, and the backup data sent by the backup server is received and cached. Because the first storage subsystem 11 has high read and write performance. Therefore, the entire backup storage system also has high backup performance.
本实施例的数据备份方法所采用的机制与上述实施一和实施例二相同,具 体可以参照实施例一和实施例二的描述。  The mechanism used in the data backup method of this embodiment is the same as that of the first embodiment and the second embodiment. For details, refer to the descriptions of the first embodiment and the second embodiment.
本实施例的数据备份方法, 用以对读写性能和单位容量成本依次降低、存 储容量依次增大、 依次逻辑相连的 N个存储子系统组成的备份存储系统存储备 份数据; 并将具有最高读写性能的第 1个存储子系统作为与备份服务器相通信 的接口, 提高了备份处理的性能, 能够有效地缩短备份窗口; 满足现有业务系 统的需求。  The data backup method of the embodiment is used to store backup data for a backup storage system composed of N storage subsystems in which the read/write performance and the unit capacity cost are sequentially decreased, the storage capacity is sequentially increased, and the logically connected N storage subsystems are sequentially stored; The first storage subsystem with write performance serves as an interface for communication with the backup server, improving the performance of backup processing, effectively shortening the backup window, and meeting the needs of existing business systems.
需要说明的是, 在本实施例四的基石出上; 还可以包括:  It should be noted that, in the foundation of the fourth embodiment, it may also include:
当 l<i<N-2, N > 3时, 第 i+1个存储子系统接收第 i个存储子系统发送的 复制的备份数据; 当第 i+1个存储子系统没有空闲存储空间时, 第 i+1个存储子 系统将所述复制的所述备份数据直接覆盖最先存储在所述第 i+1个存储子系统 中的备份数据。  When l<i<N-2, N>3, the i+1th storage subsystem receives the copied backup data sent by the i th storage subsystem; when the i+1th storage subsystem has no free storage space And the i+1th storage subsystem directly overwrites the backup data stored in the i+1th storage subsystem.
具体地,当第 1个存储子系统 11至第 N- 1个存储子系统中某一个存储子系统 没有空闲存储空间时 ,该存储子系统将接收到的上一个最近邻存储子系统发送 的复制的备份数据直接覆盖最先存储在该存储子系统中的备份数据。通常情况 下, 最先存储的备份数据已经复制到下一个最近邻的存储子系统中, 没有存储 的意义, 因此可以被覆盖。 然而, 存储子系统中的备份数据, 如果还没有复制 到下一个存储子系统, 则必须先完成复制才允许被覆盖。 采用上述方案, 可以 保证各存储子系统中保存的都是最新的备份数据,有效地保证存储的备份数据 的安全性。  Specifically, when one of the first storage subsystem 11 to the N-1 storage subsystem has no free storage space, the storage subsystem will receive the replication sent by the last nearest neighbor storage subsystem. The backup data directly covers the backup data that was first stored in the storage subsystem. Normally, the first backup data stored has been copied to the next nearest neighbor storage subsystem, meaning no storage, so it can be overwritten. However, the backup data in the storage subsystem, if it has not been copied to the next storage subsystem, must be replicated before it is allowed to be overwritten. With the above solution, it is ensured that the latest backup data is stored in each storage subsystem, and the security of the stored backup data is effectively ensured.
当 i=N-l时,第 i+1个存储子系统接收并緩存第 i个存储子系统发送的复制的 备份数据; 当第 i+1个存储子系统没有空闲存储空间时, 第 i+1个存储子系统将 所述复制的所述备份数据覆盖第 i+1个存储子系统中已经过期的备份数据。 When i=N1, the i+1th storage subsystem receives and caches the copy sent by the i th storage subsystem Backing up data; when the i+1th storage subsystem has no free storage space, the i+1th storage subsystem overwrites the duplicated backup data in the i+1th storage subsystem .
具体地, 当第 N个存储子系统 13没有空闲存储空间时, 将接收到的第 N-1 个存储子系统发送的复制的备份数据直接覆盖第 N个存储子系统 13中已经过 期的备份数据。这里所述的已经过期的备份数据指的是超过备份存储时间的备 份数据, 备份存储系统的备份数据都有一定的备份存储时间, 比如为三个月、 半年或者一年, 当超过这个备份存储时间,对应的这些备份数据就再也没有使 用价值了, 没有再保存的必要性。 这些备份数据称之为已经过期的备份数据。 采用此技术方案, 可以保证存储在备份存储系统中所有备份数据的安全性。  Specifically, when the Nth storage subsystem 13 has no free storage space, the copied backup data sent by the received N-1th storage subsystem directly overwrites the backup data that has expired in the Nth storage subsystem 13. . The expired backup data described here refers to backup data that exceeds the backup storage time. The backup data of the backup storage system has a certain backup storage time, such as three months, six months, or one year, when the backup storage is exceeded. At the time, the corresponding backup data is no longer useful, and there is no need to save it. These backup data are called backup data that has expired. With this technical solution, the security of all backup data stored in the backup storage system can be guaranteed.
图 3为本发明实施例五提供的数据恢复方法的流程图; 本实施在上述实施 例三的基石出上,详细描述上述实施例三的备份存储系统实现数据恢复的技术方 案。 如图 3所示, 本实施例的数据恢复方法, 具体可以包括以下步骤:  FIG. 3 is a flowchart of a data recovery method according to Embodiment 5 of the present invention. The present embodiment provides a technical solution for implementing data recovery in the backup storage system of Embodiment 3 in detail in the foregoing embodiment. As shown in FIG. 3, the data recovery method in this embodiment may specifically include the following steps:
步骤 200、第 1个存储子系统接收备份服务器发送的数据请求, 当所述数据 请求对应的备份数据緩存在所述第 1个存储子系统中时, 将所述备份数据返回 给所述备份服务器;  Step 200: The first storage subsystem receives a data request sent by the backup server, and returns the backup data to the backup server when the backup data corresponding to the data request is cached in the first storage subsystem. ;
具体地, 第 1个存储子系统 11用于接收备份服务器发送的数据请求。 当所 述数据请求对应的备份数据緩存在第 1个存储子系统中时,第 1个存储子系统 11 将所述备份数据返回给备份服务器。  Specifically, the first storage subsystem 11 is configured to receive a data request sent by the backup server. When the backup data corresponding to the data request is cached in the first storage subsystem, the first storage subsystem 11 returns the backup data to the backup server.
根据上述实施例二所述的本发明实施例的存储方案, 存储数据时, 在 N个 存储子系统中是从前向后依次緩存。 但是由于存储子系统的容量是依次递增 的。 某一存储子系统没有存储空间时,是采用覆盖最先存储的备份数据的方式 以保证存储的备份数据的安全性。 由此可知, 在 N个存储子系统越靠前的存储 子系统, 容量较小, 一般都是保存最新的一些备份数据, 因为存储时间稍长的 一些备份数据已经被覆盖。 而越靠后的存储子系统, 容量越大, 覆盖备份数据 的几率较小, 一般保存的备份数据比较多。 所以当进行数据恢复的时候, 当恢 复请求对应的备份数据是最新的备份数据时, 可能在第 1个存储子系统 11中便 能找到, 然后将恢复请求对应的备份数据返回给备份服务器即可, 当恢复请求 对应的备份数据不在第 1个存储子系统 11时, 可能该备份数据是很久以前存储 的, 即存储时间较长, 此时就需要执行下面的步骤 201和步骤 202以恢复数据。  According to the storage scheme of the embodiment of the present invention described in the second embodiment, when data is stored, it is sequentially cached from front to back in the N storage subsystems. However, the capacity of the storage subsystem is incremented. When a storage subsystem has no storage space, it uses the backup data stored first to ensure the security of the stored backup data. It can be seen that the storage subsystem with the N storage subsystems has a smaller capacity, and generally stores the latest backup data, because some backup data with a longer storage time has been overwritten. The lower the storage subsystem, the larger the capacity, the smaller the probability of overwriting the backup data, and the more commonly saved backup data. Therefore, when data recovery is performed, when the backup data corresponding to the recovery request is the latest backup data, it may be found in the first storage subsystem 11, and then the backup data corresponding to the recovery request may be returned to the backup server. When the backup data corresponding to the recovery request is not in the first storage subsystem 11, the backup data may be stored long ago, that is, the storage time is long. At this time, the following steps 201 and 202 are performed to recover the data.
步骤 201、当所述数据请求对应的备份数据没有緩存在所述第 1个存储子系 统中时, 将 i赋值为 1 ; Step 201: When the backup data corresponding to the data request is not cached in the first storage subsystem In the system, assign i to 1;
其中: l i N-l , N为存储子系统的个数, N > 2;  Where: l i N-l , N is the number of storage subsystems, N > 2;
步骤 202、 第 i+1个存储子系统接收第 i个存储子系统发送的所述数据请求, 当所述数据请求对应的备份数据緩存在所述第 i+1个存储子系统中时, 将所述 备份数据返回给所述备份服务器, 结束; 当所述数据请求对应的备份数据没有 緩存在第 i+1个存储子系统时, 将 i赋值为 i+1 ; 重复执行该步骤 202, 直至 i=N, 返回错误信息, 结束。  Step 202: The i+1th storage subsystem receives the data request sent by the i th storage subsystem, and when the backup data corresponding to the data request is cached in the i+1th storage subsystem, Returning the backup data to the backup server, ending; when the backup data corresponding to the data request is not cached in the i+1th storage subsystem, assigning i to i+1; repeating step 202 until i=N, returns an error message, and ends.
具体地, 步骤 201和步骤 202用以实现当第 1个存储子系统 11中没有恢复请 求对应的备份数据时, 先将 i赋值为 1 , 以开始执行循环将数据请求一级级向后 发送, 直到找到数据请求对应的备份数据或者直到找到第 N个存储子系统仍没 有找到。 这里的 i与上述实施例三相同, 用 N表示存储子系统的个数, N > 2; i 表示其中某一个存储子系统的标号, 其中 1 N-1。  Specifically, in step 201 and step 202, when the backup data corresponding to the recovery request is not in the first storage subsystem 11, the value of i is first set to 1, and the execution of the loop is performed to send the data request to the first level. Until the backup data corresponding to the data request is found or until the Nth storage subsystem is found, it is still not found. Here, i is the same as Embodiment 3 above, and N is used to indicate the number of storage subsystems, N > 2; i represents the label of one of the storage subsystems, where 1 N-1.
当 i赋值为 1之后, 第 2个存储子系统 12接收第 1个存储子系统 11发送的数据 请求, 当数据请求对应的备份数据緩存在所述第 2个存储子系统 12中时, 将备 份数据返回给备份服务器。 这里的备份数据返回过程具体是由第 2个存储子系 统 12将所述备份数据返回给第 1个存储子系统 11 ,再由第 1个存储子系统 11返回 给备份服务器。  After the value of i is set to 1, the second storage subsystem 12 receives the data request sent by the first storage subsystem 11, and when the backup data corresponding to the data request is cached in the second storage subsystem 12, the backup is performed. The data is returned to the backup server. Here, the backup data return process is specifically performed by the second storage subsystem 12 to return the backup data to the first storage subsystem 11, and then returned to the backup server by the first storage subsystem 11.
若第 2个存储子系统 12中仍没有緩存数据请求对应的备份数据, 将 i递增 1 , 继续执行 "第 i+1个存储子系统接收第 i个存储子系统发送的所述数据请求, 当 所述数据请求对应的备份数据緩存在所述第 i+1个存储子系统中时, 将所述备 份数据返回给所述备份服务器"; 返回备份数据的过程同前述实施例三情况类 似, 一级级的向前返回, 直到将备份数据返回给第 1个存储子系统 11 , 再由第 1 个存储子系统 11返回给备份服务器。 如果第 i+1个存储子系统仍然没有数据请 求对应的备份数据, 继续执行 i递增 1 , 依次类推, 继续向后一个最近邻的存储 子系统发送数据请求,直到由某个存储有数据请求对应的备份数据的目标备份 存储子系统接收到数据请求,找到要恢复的备份数据。 然后该目标备份存储系 统再将备份数据返回给备份服务器,返回备份数据的过程同前述实施例三情况 类似, 一级级的向前返回, 直到将备份数据返回给第 1个存储子系统 11 , 再由 第 1个存储子系统 11返回给备份服务器。  If there is still no backup data corresponding to the cache data request in the second storage subsystem 12, increment i by 1, and continue to execute the "i+1th storage subsystem receives the data request sent by the i th storage subsystem. When the backup data corresponding to the data request is cached in the i+1th storage subsystem, the backup data is returned to the backup server; and the process of returning backup data is similar to the foregoing embodiment 3, The level returns forward until the backup data is returned to the first storage subsystem 11, and then returned to the backup server by the first storage subsystem 11. If the i+1th storage subsystem still does not have the corresponding backup data of the data request, continue to perform i incrementing by 1, and so on, and continue to send a data request to the next nearest neighbor storage subsystem until a data request is corresponding to the storage. The target backup storage subsystem of the backup data receives the data request and finds the backup data to be restored. Then, the target backup storage system returns the backup data to the backup server, and the process of returning the backup data is similar to the foregoing embodiment 3, and the first level returns forward until the backup data is returned to the first storage subsystem 11. It is returned to the backup server by the first storage subsystem 11.
若直到由第 N-1个存储子系统将所述数据请求发送给第 N个存储子系统 13 , 但是第 N个存储子系统 13仍没有存储所述数据请求对应的备份数据时, 此 时返回错误信息给所述备份服务器。错误信息的返回同上述备份数据的返回一 样, 也是一级级的返回, 最后由第 1个存储子系统 11返回给备份服务器, 告诉 备份服务器找不到与恢复请求对应的数据。 If the data request is not sent to the Nth storage subsystem by the N-1th storage subsystem 13 . However, when the Nth storage subsystem 13 still does not store the backup data corresponding to the data request, an error message is returned to the backup server. The return of the error message is the same as the return of the backup data described above, and is also a return of the first level, and finally returned by the first storage subsystem 11 to the backup server, telling the backup server that the data corresponding to the recovery request cannot be found.
本实施例的数据恢复方法的实现机制与上述实施例三相同,详细亦可参照 上述实施例三的描述。  The implementation mechanism of the data recovery method in this embodiment is the same as that in the foregoing embodiment 3. For details, refer to the description of the third embodiment.
本发明实施例的数据恢复方法, 通过采用第 1个存储子系统 11与备份服务 器相通信,接收服务器发送的数据请求或者向服务器返回请求的备份数据, 能 够支持备份数据的高效率恢复, 满足现有业务系统的需求。  The data recovery method of the embodiment of the present invention can support the high-efficiency recovery of the backup data by using the first storage subsystem 11 to communicate with the backup server, receiving the data request sent by the server or returning the requested backup data to the server. There is a need for a business system.
图 4为本发明实施例六提供的数据恢复方法的流程图; 本实施例的数据恢 复方法与上述实施例五的数据恢复方法不同的是,本实施例中采用返回指针的 形式。 如图 4所示, 本实施例的数据恢复方法, 具体可以包括如下步骤:  4 is a flowchart of a data recovery method according to Embodiment 6 of the present invention. The data recovery method of this embodiment is different from the data recovery method of the foregoing Embodiment 5 in that the return pointer is used in this embodiment. As shown in FIG. 4, the data recovery method in this embodiment may specifically include the following steps:
步骤 300、第 1个存储子系统接收备份服务器发送的数据请求, 当所述数据 请求对应的备份数据緩存在所述第 1个存储子系统中时, 将所述备份数据的地 址返回给所述备份服务器;  Step 300: The first storage subsystem receives a data request sent by the backup server, and when the backup data corresponding to the data request is cached in the first storage subsystem, returning the address of the backup data to the Backup server
具体地, 第 1个存储子系统接收到备份服务器发送的数据请求之后, 判断 该数据请求对应的备份数据緩存在第 1个存储子系统中时,第 1个存储子系统将 该备份数据的地址返回给备份服务器, 以供备份服务器根据该地址恢复数据。  Specifically, after the first storage subsystem receives the data request sent by the backup server, and determines that the backup data corresponding to the data request is cached in the first storage subsystem, the first storage subsystem addresses the backup data. Return to the backup server for the backup server to recover data based on this address.
需要说明的是, 这里也可以是返回第 1个存储子系统对应的指针给备份服 务器。 由备份服务器根据返回指针建立的通道恢复数据。  It should be noted that, here, the pointer corresponding to the first storage subsystem may be returned to the backup server. The data is restored by the backup server based on the channel established by the return pointer.
步骤 301、当所述数据请求对应的备份数据没有緩存在所述第 1个存储子系 统中时, 将 i赋值为 1 ;  Step 301: When the backup data corresponding to the data request is not cached in the first storage subsystem, assign i to 1;
其中 l i N-l , N为存储子系统的个数, N > 2;  Where l i N-l , N is the number of storage subsystems, N > 2;
步骤 302、 第 i+1个存储子系统接收第 i个存储子系统发送的所述数据请求, 当所述数据请求对应的备份数据緩存在所述第 i+1个存储子系统中时, 将所述 数据请求对应的备份数据的地址返回给所述备份服务器,以供所述备份服务器 恢复数据, 结束; 当所述数据请求对应的备份数据没有緩存在第 i+1个存储子 系统时, 将 i赋值为 i+1 , 重复执行本步骤 302; 直至 i=N, 返回错误信息, 结束。  Step 302: The i+1th storage subsystem receives the data request sent by the i th storage subsystem, and when the backup data corresponding to the data request is cached in the i+1th storage subsystem, The address of the backup data corresponding to the data request is returned to the backup server, for the backup server to recover data, and ends; when the backup data corresponding to the data request is not cached in the i+1th storage subsystem, Assign i to i+1 and repeat step 302; until i=N, return an error message and end.
具体地, 步骤 301和步骤 302与上述实施例中步骤 201和步骤 202的不同在 于,假如恢复请求对应的备份数据存储在一目标存储子系统中, 当找到恢复请 求对应的备份数据时,返回该数据请求对应的备份数据地址给备份服务器。 同 前述备份数据的返回类似, 这里的地址返回也是一级级的返回。每一存储子系 统接收后一个最近邻存储子系统发送的地址并緩存该地址,然后将緩存后一个 最近邻存储子系统发送的地址的地址再发送至前一个最近邻存储子系统;直到 第 1个存储子系统将用于存储第 2个存储子系统发送的地址的地址返回给备份 服务器。 Specifically, the steps 301 and 302 are different from the steps 201 and 202 in the foregoing embodiment, in which, if the backup data corresponding to the recovery request is stored in a target storage subsystem, when the recovery is found, When the corresponding backup data is obtained, the backup data address corresponding to the data request is returned to the backup server. Similar to the return of the aforementioned backup data, the address return here is also a level one return. Each storage subsystem receives the address sent by the nearest nearest neighbor storage subsystem and caches the address, and then sends the address of the address sent by the cached nearest neighbor storage subsystem to the previous nearest neighbor storage subsystem; until the first The storage subsystem returns the address used to store the address sent by the second storage subsystem to the backup server.
从存储数据请求对应的备份数据的目标存储子系统到备份服务器之间,根 据地址的返回形成一条通道。备份服务器根据接收到的地址,按照地址返回过 程中形成的通道, 便可以恢复出数据请求对应的备份数据。  From the target storage subsystem that stores the corresponding backup data to the backup server, a channel is formed according to the return of the address. The backup server returns the backup data corresponding to the data request according to the received address and returns the channel formed by the process.
若直到由第 N-1个存储子系统将所述数据请求发送给第 N个存储子系统 If the data request is not sent to the Nth storage subsystem by the N-1th storage subsystem
13 , 但是第 N个存储子系统 13仍没有存储所述数据请求对应的备份数据时, 此 时第 N个存储子系统返回错误信息给所述备份服务器。 错误信息的返回同上述 实施例三相同, 在此不再贅述。 13 , but the Nth storage subsystem 13 still does not store the backup data corresponding to the data request, at which time the Nth storage subsystem returns an error message to the backup server. The return of the error information is the same as that of the third embodiment described above, and details are not described herein again.
本实施例的数据恢复方法的实现机制与上述实施例三相同,详细亦可参照 上述实施例三的描述。  The implementation mechanism of the data recovery method in this embodiment is the same as that in the foregoing embodiment 3. For details, refer to the description of the third embodiment.
本实施例的数据恢复方法,能够快速有效地找到备份服务器请求恢复的数 据,并将存储有恢复请求对应的备份数据的目标存储子系统对应的指针返回给 备份服务器,即可直接从存储有恢复请求对应的备份数据的目标存储子系统中 恢复数据; 而不需要将恢复请求对应的备份数据逐级复制并返回, 能够很好地 满足现有业务系统的数据恢复需求。  The data recovery method of the embodiment can quickly and efficiently find the data requested by the backup server, and return the pointer corresponding to the target storage subsystem of the backup data corresponding to the recovery request to the backup server, thereby directly recovering from the storage. The data is restored in the target storage subsystem of the corresponding backup data; the backup data corresponding to the recovery request does not need to be copied and returned step by step, which can well meet the data recovery requirements of the existing service system.
图 5为本发明实施例七提供的备份系统结构图; 如图 5所示, 本实施例的备 份系统包括备份客户端 20、 备份服务器 21和备份存储存储系统 22。  FIG. 5 is a structural diagram of a backup system according to Embodiment 7 of the present invention; as shown in FIG. 5, the backup system of this embodiment includes a backup client 20, a backup server 21, and a backup storage storage system 22.
其中: 备份服务器 21用于接收客户端 20发送的备份数据, 并将所述备份数 据存储在备份存储系统 22中, 以完成备份数据的存储操作; 和 /或备份服务器 21根据用户的数据恢复需求, 向备份存储系统 22发送数据请求, 以从备份存储 系统 22中获取与所述数据请求对应的备份数据,并将所述对应的备份数据发送 至客户端 20, 以完成备份数据的恢复操作。  The backup server 21 is configured to receive the backup data sent by the client 20, and store the backup data in the backup storage system 22 to complete the storage operation of the backup data; and/or the backup server 21 restores the request according to the data of the user. And sending a data request to the backup storage system 22 to obtain backup data corresponding to the data request from the backup storage system 22, and send the corresponding backup data to the client 20 to complete the recovery operation of the backup data.
具体地, 当进行写操作即备份数据时,备份客户端 20从业务系统中读取数 据, 并将读取到的数据发送给备份服务器 21。备份服务器 21接收到客户端发送 的数据之后, 将数据存储在备份存储系统 22中, 完成存储备份数据操作。 当进 行读操作即恢复数据时,备份服务器 21根据用户的恢复需求, 向备份存储系统 22发送数据请求。 备份存储系统 22接收到备份服务器 21发送的数据请求之后, 在各存储子系统中进行数据恢复操作, 以获取与恢复请求对应的备份数据。 然 后备份存储系统 22将获取的备份数据返回给备份服务器 21 ,备份服务器 21再将 备份数据发送至客户端 20。 最后由备份客户端 20将备份数据写入业务系统中, 完成恢复备份数据的操作。 Specifically, when a write operation, that is, backup data is performed, the backup client 20 reads data from the business system, and transmits the read data to the backup server 21. After receiving the data sent by the client, the backup server 21 stores the data in the backup storage system 22 to complete the operation of storing the backup data. When When the row read operation is to restore the data, the backup server 21 sends a data request to the backup storage system 22 according to the user's recovery request. After receiving the data request sent by the backup server 21, the backup storage system 22 performs a data recovery operation in each storage subsystem to acquire backup data corresponding to the recovery request. The backup storage system 22 then returns the acquired backup data to the backup server 21, which in turn sends the backup data to the client 20. Finally, the backup client 20 writes the backup data into the service system to complete the operation of restoring the backup data.
本实施例的备份存储系统 22可以采用上述实施一至实施例三中任一所述 的备份存储系统,可以采用上述实施例四所述的数据的备份方法实现数据的备 份, 还可采用实施例五和实施例六所述的数据的恢复方法实现备份数据的恢 复。 详细可以参照上述实施例一至实施例六的相关描述, 在此不再贅述。  The backup storage system 22 of the present embodiment may be configured with the backup storage system of any one of the foregoing implementations to the third embodiment, and the data backup method may be implemented by using the data backup method described in the fourth embodiment. The data recovery method described in Embodiment 6 implements recovery of backup data. For details, refer to the related descriptions of the first embodiment to the sixth embodiment, and details are not described herein again.
本实施例的备份系统,通过采用上述备份存储系统, 能够在不显著增加系 统成本及能耗的基石出上, 有效地提高备份系统的性能, 缩短备份窗口; 满足业 务系统的需求。  The backup system of this embodiment can effectively improve the performance of the backup system, shorten the backup window, and meet the requirements of the service system by adopting the above backup storage system, without significantly increasing the system cost and energy consumption.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到本发明 可借助软件加必需的硬件平台的方式来实现, 当然也可以全部通过硬件来实 施, 但很多情况下前者是更佳的实施方式。基于这样的理解, 本发明的技术方 案对背景技术做出贡献的全部或者部分可以以软件产品的形式体现出来,该计 算机软件产品可以存储在存储介质中, 如 ROM/RAM、 磁碟、 光盘等, 包括若 干指令用以使得一台计算机设备(可以是个人计算机, 服务器, 或者网络设备 等)执行本发明各个实施例或者实施例的某些部分所述的方法。 最后应说明的是:以上实施例仅用以说明本发明的技术方案而非对其进行 限制,尽管参照较佳实施例对本发明进行了详细的说明, 本领域的普通技术人 员应当理解: 其依然可以对本发明的技术方案进行修改或者等同替换, 而这些 修改或者等同替换亦不能使修改后的技术方案脱离本发明技术方案的精神和 范围。  Through the description of the above embodiments, those skilled in the art can clearly understand that the present invention can be implemented by means of software plus a necessary hardware platform, and of course, all can be implemented by hardware, but in many cases, the former is better. Implementation. Based on such understanding, all or part of the technical solution of the present invention contributing to the background art may be embodied in the form of a software product, which may be stored in a storage medium such as a ROM/RAM, a magnetic disk, an optical disk, or the like. A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform the methods described in various embodiments of the present invention or portions of the embodiments. It should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention and are not intended to be limiting, although the present invention will be described in detail with reference to the preferred embodiments. The technical solutions of the present invention may be modified or equivalently substituted, and the modified technical solutions may not deviate from the spirit and scope of the technical solutions of the present invention.

Claims

权 利 要 求 Rights request
1、 一种备份存储系统, 其特征在于, 包括: A backup storage system, comprising:
N个存储子系统, 其中, 第 i个存储子系统与第 i+1个存储子系统相连接, 第 i个存储子系统的性能高于第 i+ 1个存储子系统,第 i个存储子系统的容量小于 第 i+1个存储子系统, 第 i个存储子系统的单位容量成本高于第 i+1个存储子系 统, N > 2, K i < N-l , N和 i均为正整数。  N storage subsystems, wherein the i-th storage subsystem is connected to the i+1th storage subsystem, and the performance of the i-th storage subsystem is higher than the i+1 storage subsystem, the i-th storage subsystem The capacity of the i-th storage subsystem is lower than that of the i+1th storage subsystem, N > 2, K i < Nl , and N and i are positive integers.
2、 根据权利要求 1所述的备份存储系统, 其特征在于:  2. The backup storage system of claim 1 wherein:
第 1个存储子系统, 用于接收并緩存备份服务器发送的备份数据; 当 i+l<N时, 第 i+1个存储子系统, 用于接收并緩存第 i个存储子系统发送 的复制的备份数据;  The first storage subsystem is configured to receive and cache backup data sent by the backup server; when i+l<N, the i+1th storage subsystem is configured to receive and cache the replication sent by the i th storage subsystem Backup data;
当 i+l=N时, 第 i+1个存储子系统, 用于接收并存储第 i个存储子系统发送 的复制的备份数据。  When i+l=N, the i+1th storage subsystem is configured to receive and store the copied backup data sent by the i th storage subsystem.
3、 根据权利要求 2所述的备份存储系统, 其特征在于,  3. The backup storage system of claim 2, wherein
当 l i N-2, N > 3时, 所述第 i+1个存储子系统, 具体用于接收第 i个 存储子系统发送的复制的备份数据,将所述复制的备份数据覆盖最先存储在所 述第 i+1个存储子系统中的备份数据;  When li N-2, N > 3, the i+1th storage subsystem is specifically configured to receive the copied backup data sent by the i th storage subsystem, and overwrite the duplicate backup data first. Backup data in the i+1th storage subsystem;
当 i=N-l时, 所述第 i+1个存储子系统,具体用于接收第 i个存储子系统发送 的复制的备份数据 , 将所述复制的备份数据覆盖所述第 i+1个存储子系统中已 经过期的备份数据。  When i=N1, the i+1th storage subsystem is specifically configured to receive the copied backup data sent by the i th storage subsystem, and overwrite the duplicated backup data with the i+1th storage. Backup data that has expired in the subsystem.
4、 根据权利要求 2或 3所述的备份存储系统, 其特征在于,  4. The backup storage system according to claim 2 or 3, characterized in that
所述第 1个存储子系统, 用于接收所述备份服务器发送的数据请求, 当所 述数据请求对应的备份数据緩存在所述第 1个存储子系统中时, 将所述备份数 据返回给所述备份服务器;  The first storage subsystem is configured to receive a data request sent by the backup server, and when the backup data corresponding to the data request is cached in the first storage subsystem, return the backup data to The backup server;
所述第 i+1个存储子系统, 用于当所述数据请求对应的备份数据没有緩存 在所述第 1个存储子系统中时, 接收第 i个存储子系统发送的所述数据请求, 当 所述数据请求对应的备份数据緩存在所述第 i+1个存储子系统中时, 将所述备 份数据返回给所述备份服务器或者将所述第 i+1个存储子系统存储所述备份数 据的地址返回给所述备份服务器;  The i+1th storage subsystem is configured to receive the data request sent by the i th storage subsystem when the backup data corresponding to the data request is not cached in the first storage subsystem, Returning the backup data to the backup server or storing the (i+1)th storage subsystem when the backup data corresponding to the data request is cached in the (i+1)th storage subsystem The address of the backup data is returned to the backup server;
第 N个存储子系统,用于当所述第 N个存储子系统接收第 N-1个存储子系统 发送的所述数据请求, 当所述数据请求对应的备份数据緩存在所述第 N个存储 子系统中时, 将所述备份数据返回给所述备份服务器或者将所述第 N个存储子 系统存储所述备份数据的地址返回给所述备份服务器。 An Nth storage subsystem, configured to receive the N-1th storage subsystem when the Nth storage subsystem receives Sending the data request, when the backup data corresponding to the data request is cached in the Nth storage subsystem, returning the backup data to the backup server or the Nth storage subsystem The address storing the backup data is returned to the backup server.
5、 根据权利要求 4所述的备份存储系统, 其特征在于,  5. The backup storage system of claim 4, wherein
当所述第 N个存储子系统不存在与所述数据请求对应的备份数据时, 返回 错误信息给所述备份服务器。  When the Nth storage subsystem does not have backup data corresponding to the data request, an error message is returned to the backup server.
6、一种备份系统, 包括备份客户端、备份服务器以及如权利要求 5所述的 备份存储系统;  6. A backup system comprising a backup client, a backup server, and a backup storage system according to claim 5;
所述备份服务器用于接收所述客户端发送的备份数据,并将所述备份数据 存储在所述备份存储系统中, 以完成备份数据的存储操作; 和 /或根据用户的 恢复需求, 向所述备份存储系统发送数据请求, 以从所述备份存储系统获取与 所述数据请求对应的备份数据; 并将所述对应的备份数据发送至所述客户端, 以完成备份数据的恢复操作。  The backup server is configured to receive backup data sent by the client, and store the backup data in the backup storage system to complete a storage operation of backup data; and/or according to a recovery requirement of the user The backup storage system sends a data request to obtain backup data corresponding to the data request from the backup storage system; and sends the corresponding backup data to the client to complete the recovery operation of the backup data.
7、根据权利要求 6所述的备份存储系统, 其特征在于, 所述备份系统还包 括:  The backup storage system according to claim 6, wherein the backup system further comprises:
控制器,用于控制所述备份存储系统中各相邻的存储子系统之间备份数据 的发送与接收。  And a controller, configured to control sending and receiving of backup data between adjacent storage subsystems in the backup storage system.
8、 一种数据备份方法, 其特征在于, 包括:  8. A data backup method, comprising:
第 1个存储子系统接收并緩存备份服务器发送的备份数据; 将复制的备份 数据发送给第 2个存储子系统;  The first storage subsystem receives and caches the backup data sent by the backup server; sends the copied backup data to the second storage subsystem;
从 i=l开始,第 i+1个存储子系统接收并緩存第 i个存储子系统发送的复制的 备份数据, 重复执行该步骤, 直至 i=N-l , 第 N个存储子系统接收并存储第 N-1 个存储子系统发送的复制的备份数据, N为存储子系统的个数, N > 2, K i < N-1 , N和 i均为正整数;  Starting from i=l, the i+1th storage subsystem receives and caches the copied backup data sent by the i th storage subsystem, and repeats the step until i=N1, and the Nth storage subsystem receives and stores the first The replicated backup data sent by N-1 storage subsystems, N is the number of storage subsystems, N > 2, K i < N-1 , N and i are positive integers;
其中, 第 i个存储子系统的性能高于第 i+1个存储子系统, 第 i个存储子系统 的容量小于第 i+1个存储子系统, 第 i个存储子系统的单位容量成本高于第 i+1 个存储子系统。  The performance of the i-th storage subsystem is higher than that of the i+1th storage subsystem, and the capacity of the i-th storage subsystem is smaller than the i+1th storage subsystem, and the unit capacity of the i-th storage subsystem is high. On the i+1th storage subsystem.
9、 根据权利要求 8所述的数据备份方法, 其特征在于, 第 i+1个存储子系 统接收并緩存第 i个存储子系统发送的复制的备份数据包括:  The data backup method according to claim 8, wherein the i+1th storage sub-system receives and caches the copied backup data sent by the i-th storage subsystem, including:
当 l i N-2, N > 3时, 第 i+1个存储子系统接收第 i个存储子系统发送 的复制的备份数据; 当第 i+1个存储子系统没有空闲存储空间时, 第 i+1个存储 子系统将所述复制的所述备份数据覆盖最先存储在所述第 i+1个存储子系统中 的备份数据; When li N-2, N > 3, the i+1th storage subsystem receives the ith storage subsystem to send The copied backup data; when the i+1th storage subsystem has no free storage space, the i+1th storage subsystem stores the copied backup data coverage first in the i+1th Backup data in the storage subsystem;
当 i=N- 1时,第 i+1个存储子系统接收并緩存第 i个存储子系统发送的复制的 备份数据; 当第 i+1个存储子系统没有空闲存储空间时, 第 i+1个存储子系统将 所述复制的所述备份数据覆盖第 i+1个存储子系统中已经过期的备份数据。  When i=N-1, the i+1th storage subsystem receives and caches the copied backup data sent by the i th storage subsystem; when the i+1th storage subsystem has no free storage space, the i+ A storage subsystem overwrites the duplicated data in the i+1th storage subsystem that has expired.
10、 一种数据恢复方法, 其特征在于, 包括:  10. A data recovery method, comprising:
第 1个存储子系统接收备份服务器发送的数据请求, 当所述数据请求对应 的备份数据緩存在所述第 1个存储子系统中时, 将所述备份数据返回给所述备 份服务器;  Receiving, by the first storage subsystem, a data request sent by the backup server, and returning the backup data to the backup server when the backup data corresponding to the data request is cached in the first storage subsystem;
当所述数据请求对应的备份数据没有緩存在所述第 1个存储子系统中时, 将 i赋值为 1;  When the backup data corresponding to the data request is not cached in the first storage subsystem, assign i to 1;
第 i+1个存储子系统接收第 i个存储子系统发送的所述数据请求, 当所述数 据请求对应的备份数据緩存在所述第 i+1个存储子系统中时, 将备份数据的信 息返回给所述备份服务器, 以供所述备份服务器恢复数据; 当所述数据请求对 应的备份数据没有緩存在第 i+1个存储子系统时, 将 i更新为 i+1 , 重复执行该步 骤, 直至 i=N, 返回错误信息; 其中 l i N-l , N为存储子系统的个数, N > 2。  The i+1th storage subsystem receives the data request sent by the i th storage subsystem, and when the backup data corresponding to the data request is cached in the i+1th storage subsystem, the data is backed up. The information is returned to the backup server for the backup server to recover data; when the backup data corresponding to the data request is not cached in the i+1th storage subsystem, i is updated to i+1, and the execution is repeated. Steps, until i=N, return an error message; where li Nl , N is the number of storage subsystems, N > 2.
11、 根据权利要求 10所述的数据恢复方法, 其特征在于, 所述备份数据的 信息包括: 所述备份数据或所述备份数据的地址。  The data recovery method according to claim 10, wherein the information of the backup data comprises: an address of the backup data or the backup data.
PCT/CN2011/071070 2010-02-24 2011-02-18 Backup storage system, backup system, data backup method and recovery method WO2011103791A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010114868.6 2010-02-24
CN 201010114868 CN101794246B (en) 2010-02-24 2010-02-24 Backup storage system, backup system, data backup method and recovery method

Publications (1)

Publication Number Publication Date
WO2011103791A1 true WO2011103791A1 (en) 2011-09-01

Family

ID=42586956

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/071070 WO2011103791A1 (en) 2010-02-24 2011-02-18 Backup storage system, backup system, data backup method and recovery method

Country Status (2)

Country Link
CN (1) CN101794246B (en)
WO (1) WO2011103791A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101794246B (en) * 2010-02-24 2013-03-20 成都市华为赛门铁克科技有限公司 Backup storage system, backup system, data backup method and recovery method
CN102622286B (en) * 2012-02-23 2016-02-03 浪潮(北京)电子信息产业有限公司 A kind of raid-array bitmap backup method and system
CN103853837B (en) * 2014-03-17 2017-07-28 上海爱数信息技术股份有限公司 Oracle does not stop the table level back-up restoring method of Production database automatically
CN108196979B (en) * 2017-12-28 2021-07-09 北京星选科技有限公司 Data backup method and device
CN109460438B (en) * 2018-09-26 2024-04-12 中国平安人寿保险股份有限公司 Message data storage method, device, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101008918A (en) * 2006-01-25 2007-08-01 株式会社日立制作所 Storage system and data restoration method thereof
WO2009015310A1 (en) * 2007-07-25 2009-01-29 Agiga Tech Inc Variable partitioning in a hybrid memory subsystem
CN101587453A (en) * 2009-06-18 2009-11-25 成都市华为赛门铁克科技有限公司 Data backup processing method, data memory node apparatus and data memory apparatus
CN101794246A (en) * 2010-02-24 2010-08-04 成都市华为赛门铁克科技有限公司 Backup storage system, backup system, data backup method and recovery method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101008918A (en) * 2006-01-25 2007-08-01 株式会社日立制作所 Storage system and data restoration method thereof
WO2009015310A1 (en) * 2007-07-25 2009-01-29 Agiga Tech Inc Variable partitioning in a hybrid memory subsystem
CN101587453A (en) * 2009-06-18 2009-11-25 成都市华为赛门铁克科技有限公司 Data backup processing method, data memory node apparatus and data memory apparatus
CN101794246A (en) * 2010-02-24 2010-08-04 成都市华为赛门铁克科技有限公司 Backup storage system, backup system, data backup method and recovery method

Also Published As

Publication number Publication date
CN101794246B (en) 2013-03-20
CN101794246A (en) 2010-08-04

Similar Documents

Publication Publication Date Title
US10965753B2 (en) Interconnect delivery process
US7975168B2 (en) Storage system executing parallel correction write
US8719520B1 (en) System and method for data migration between high-performance computing architectures and data storage devices with increased data reliability and integrity
US9135119B1 (en) System and method for data management
US8468313B2 (en) Asynchronous replication with write concurrency grouping
EP2622487B1 (en) Host based write ordering for asynchronous replication
CN111587428B (en) Metadata journaling in distributed storage systems
US8046548B1 (en) Maintaining data consistency in mirrored cluster storage systems using bitmap write-intent logging
US9081691B1 (en) Techniques for caching data using a volatile memory cache and solid state drive
US7228381B2 (en) Storage system using fast storage device for storing redundant data
US7076605B1 (en) Method and apparatus for writing data to a storage device
US7487311B2 (en) System and method for asynchronous backup of virtual disks in a distributed storage array
US20110055471A1 (en) Apparatus, system, and method for improved data deduplication
JP2002323959A (en) System and method for non-volatile write cache based on log of magnetic disk controller
US11301324B2 (en) Method and apparatus for consistent and highly available data storage using local and fabric attached non-volatile memory storage devices
WO2011103791A1 (en) Backup storage system, backup system, data backup method and recovery method
JP2001188710A (en) Method and system for performing access to data set and product
CN107422989B (en) Server SAN system multi-copy reading method and storage system
US20090249003A1 (en) Method and system for multiplexing concatenated storage disk arrays to form a rules-based array of disks
JP2005276196A (en) System and method for performing drive recovery subsequent to drive failure
JP2008016024A (en) Dynamic adaptive flushing of cached data
WO2022033269A1 (en) Data processing method, device and system
US7711913B1 (en) System and method for backing up extended copy commands
US11221928B2 (en) Methods for cache rewarming in a failover domain and devices thereof
JP6976237B2 (en) Storage system, data management method, and data management program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11746833

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: COMMUNICATION NOT DELIVERED. NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 31.10.2012)

122 Ep: pct application non-entry in european phase

Ref document number: 11746833

Country of ref document: EP

Kind code of ref document: A1