TWI509426B - System for achieving non-interruptive data reconstruction - Google Patents

System for achieving non-interruptive data reconstruction Download PDF

Info

Publication number
TWI509426B
TWI509426B TW103132081A TW103132081A TWI509426B TW I509426 B TWI509426 B TW I509426B TW 103132081 A TW103132081 A TW 103132081A TW 103132081 A TW103132081 A TW 103132081A TW I509426 B TWI509426 B TW I509426B
Authority
TW
Taiwan
Prior art keywords
storage device
data
source
snapshot
target storage
Prior art date
Application number
TW103132081A
Other languages
Chinese (zh)
Other versions
TW201612766A (en
Inventor
Wen Shyen Chen
sheng wei Chen
Bernard Jay Wu
Original Assignee
Prophetstor Data Services Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Prophetstor Data Services Inc filed Critical Prophetstor Data Services Inc
Priority to TW103132081A priority Critical patent/TWI509426B/en
Application granted granted Critical
Publication of TWI509426B publication Critical patent/TWI509426B/en
Publication of TW201612766A publication Critical patent/TW201612766A/en

Links

Landscapes

  • Time Recorders, Dirve Recorders, Access Control (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Description

用於達成無干擾性資料重建的系統System for achieving interference-free data reconstruction

本發明關於一種用於達成資料重建的系統,特別是關於一種供雲端主機,用於達成無干擾性資料重建的系統。當該系統運行時,該主機能平順的運轉,而存取的資料已由一儲存裝置被複製或移動至另一儲存裝置中。The present invention relates to a system for achieving data reconstruction, and more particularly to a system for a cloud host for achieving interference-free data reconstruction. When the system is running, the host can operate smoothly, and the accessed data has been copied or moved to another storage device by a storage device.

對雲端服務而言,通常需要有幾個重要的元件,例如,一個應用伺服器(主機),及許多的儲存設備來運作。該儲存設備可包含一些硬碟並形成一磁碟陣列,一些邏輯單元號碼設備被指定於磁碟陣列中,或該儲存設備可包含一些硬碟及儲存許多物件儲存資料。為了防止該儲存設備老化及部分毀損、為了更好的性能而升級為更快的儲存設備,或為了成本因素而降級成多個更符合成本效益的儲存設備,在一段時間使用後,一儲存設備(或邏輯單元號碼設備)中的儲存資料應被複製或移動到另一個儲存設備(或邏輯單元號碼設備)中。資料的重建可以是一種資料備份和資料遷移的方法, 該些儲存設備可散置於不同的位置。資料傳輸可經由區域網路或廣域網路來達成。For cloud services, there are usually several important components, such as an application server (host) and many storage devices to operate. The storage device may include some hard disks and form a disk array, some logical unit number devices are assigned to the disk array, or the storage device may include some hard disks and store a lot of object storage materials. In order to prevent the storage device from aging and partial damage, upgrade to a faster storage device for better performance, or downgrade to multiple more cost-effective storage devices for cost reasons, after a period of use, a storage device The stored data in the (or logical unit number device) should be copied or moved to another storage device (or logical unit number device). Data reconstruction can be a method of data backup and data migration. The storage devices can be placed in different locations. Data transmission can be achieved via a regional or wide area network.

傳統上,有三種方法用來實現資料複製或移動(重建),即離線方法、鏡像方法與雙寫入及快照影像複製方法。對於離線方法,請參閱第1圖。三個步驟應用於離線方法中。步驟1,一應用伺服器1存取一來源儲存設備2中的資料,這存取運作由使用者提出。在步驟2中,一資料複製伺服器4負責資料重建,由來源儲存設備2到一目標儲存設備3中。同時,應用伺服器1關閉不提供服務。當目標儲存設備3中的資料重建完成於步驟3時,應用伺服器1開機並連結到目標儲存設備3。從而,應用伺服器1提供的服務能繼續下去。雖然離線方法能簡單地實現資料重建,但它具有服務中斷的問題。對繁忙的雲端服務來說是不能接受的。Traditionally, there are three methods for data copying or moving (rebuilding), namely offline methods, mirroring methods, and double-write and snapshot image copying methods. For the offline method, please refer to Figure 1. The three steps are applied to the offline method. Step 1. An application server 1 accesses data in a source storage device 2, and the access operation is proposed by the user. In step 2, a data replication server 4 is responsible for data reconstruction from the source storage device 2 to a target storage device 3. At the same time, the application server 1 is turned off and no service is provided. When the data reconstruction in the target storage device 3 is completed in step 3, the application server 1 is powered on and connected to the target storage device 3. Thus, the service provided by the application server 1 can continue. Although the offline method can easily implement data reconstruction, it has the problem of service interruption. It is unacceptable for busy cloud services.

鏡像方法說明於第2圖中,也包含三個主要步驟。步驟1描述一應用伺服器1連結並存取一來源儲存設備2,而該來源儲存設備2中的資料即將重建於一目標儲存設備3中。在步驟2,來源儲存設備2開始鏡像服務並複製其包含的資料到目標儲存設備3中。在步驟2完成後,來源儲存設備2與目標儲存設備3交換角色,應用伺服器1連結並存取目標儲存設備3(現在,它變成了來源儲存設備)如第2圖之步驟3所示。The mirroring method is illustrated in Figure 2 and also contains three main steps. Step 1 describes an application server 1 linking and accessing a source storage device 2, and the data in the source storage device 2 is about to be reconstructed in a target storage device 3. In step 2, the source storage device 2 starts mirroring the service and copies the data it contains into the target storage device 3. After the completion of step 2, the source storage device 2 exchanges roles with the target storage device 3, and the application server 1 connects and accesses the target storage device 3 (now it becomes the source storage device) as shown in step 3 of FIG.

不同於離線方法,鏡像方法中使用的儲存設備必須由相同的供應商所提供,或由單一儲存的虛擬機器監視器 (Hypervisor)運行所提供。鏡像功能必須同時為所有儲存設備所具備。如果目標儲存設備3的效率比來源儲存設備2的效率差,應用伺服器1將面臨服務中斷。同時,額外的儲存設備鏡像同步將影響來源儲存設備2的效能。最重要的是,應用伺服器1需要重開機以切換來源儲存設備的識別特性為目標儲存設備的識別特性,這又造成了另一種干擾。Unlike offline methods, the storage devices used in the mirroring method must be provided by the same vendor or by a single stored virtual machine monitor. (Hypervisor) is provided by the run. The mirroring feature must be available for all storage devices at the same time. If the efficiency of the target storage device 3 is inferior to that of the source storage device 2, the application server 1 will face a service interruption. At the same time, additional storage device image synchronization will affect the performance of the source storage device 2. Most importantly, the application server 1 needs to be rebooted to switch the identification characteristics of the source storage device to the identification characteristics of the target storage device, which in turn causes another interference.

雙寫入及快照影像複製方法說明於第3圖中。一外接頻內資料移動服務單元9連結到一應用伺服器8、一來源儲存設備10及一目標儲存設備11。需要一個計劃內停機時間,以用於實現資料移動服務單元9的頻內基礎設施。資料移動服務單元9發出一快照命令給來源儲存設備10,並開始複製快照影像至目標儲存設備11中。”雙寫入”係因在快照時間點後,資料移動服務單元9執行所有寫入命令到來源儲存設備10與目標儲存設備11中。最終,當快照影像複製完成且所有新的寫入都完成時,來源儲存設備10與目標儲存設備11交換角色。The double write and snapshot image copy methods are illustrated in Figure 3. An external in-band data mobile service unit 9 is coupled to an application server 8, a source storage device 10, and a target storage device 11. A planned downtime is required for implementing the intra-frequency infrastructure of the data movement service unit 9. The data movement service unit 9 issues a snapshot command to the source storage device 10 and begins copying the snapshot image to the target storage device 11. The "double write" is because the data movement service unit 9 executes all write commands to the source storage device 10 and the target storage device 11 after the snapshot time point. Finally, when the snapshot image copy is complete and all new writes are completed, the source storage device 10 exchanges roles with the target storage device 11.

除了該頻內基礎設施造成資料重建時服務受到干擾或中斷的瓶頸,因為來源儲存設備10需要同時進行讀寫運作一段時間,直到快照影像複製及新寫入命令運作兩者皆完成,雙寫入及快照影像複製方法也使得應用伺服器8面臨服務中斷。故雙寫入及快照影像複製方法仍不便於資料複製或移動。In addition to the bottleneck of service interruption or interruption when the intra-frequency infrastructure causes data reconstruction, the source storage device 10 needs to perform both reading and writing operations for a period of time until both snapshot image copying and new write command operations are completed, double writing And the snapshot image copying method also causes the application server 8 to face a service interruption. Therefore, the double write and snapshot image copy methods are still inconvenient for data copying or moving.

因此,一種用於達成無干擾性資料重建的系統,特別是應用於雲端服務的伺服器之儲存設備,極為所需。Therefore, a system for achieving interference-free data reconstruction, particularly a storage device for a server serving a cloud service, is highly desirable.

本段文字提取和編譯本發明的某些特點。其他特點將被揭露於後續段落中。其目的在涵蓋附加的申請專利範圍之精神和範圍中,各式的修改和類似的排列。This paragraph of text extracts and compiles certain features of the present invention. Other features will be revealed in subsequent paragraphs. The intention is to cover various modifications and similar arrangements in the spirit and scope of the appended claims.

如上所述,現有用於資料移動或複製的系統是具干擾性的,因此需要一種用於達成無干擾性資料重建的系統。依照本發明的一種態樣,一種用於達成無干擾性資料重建的系統包含:一來源儲存設備,包含一來源資料;一目標儲存設備;一伺服器,用以存取在該來源儲存設備中的來源資料與儲存於該目標儲存設備中的任何資料,安裝一代理模組與一路徑輸入輸出模組,其中該代理模組持續蒐集一性能參數值,該性能參數值表達伺服器對來源資料存取狀態,並在該來源儲存設備快照之後,記錄來源儲存設備中改變的區塊到一位元圖表中,該位元圖表由該路徑輸入輸出模組所產生;該路徑輸入輸出模組將連結到該伺服器之來源儲存設備的儲存設備名稱脫鉤,並耦接至該目標儲存設備的儲存設備名稱,以便達成該伺服器連接路徑,從來源儲存設備到目標儲存設備的切換;一流量模型單元,接收該性能參數值,並能依照該接收的性能參數值,在未來一特定時間點提供一資料存取流量狀態,以依照一定義策略與該資料存取流量狀態,提供 一行動時間表;及至少一資料移動服務單元,用以快照一部分或整個來源儲存設備、複製該部分或整個來源儲存設備的一快照到該目標儲存設備,及依照該行動時間表,將相較於前一次快照,該部分或整個來源儲存設備的一新快照中的改變,修改到該目標儲存設備中的複製的快照,並控制該代理模組的運作。As noted above, existing systems for data movement or replication are disruptive and therefore require a system for achieving interference-free data reconstruction. According to one aspect of the present invention, a system for achieving interference-free data reconstruction includes: a source storage device including a source material; a target storage device; and a server for accessing the source storage device The source data and any data stored in the target storage device are installed with a proxy module and a path input and output module, wherein the proxy module continuously collects a performance parameter value, and the performance parameter value expresses the server source data. Accessing the status, and after the source storage device snapshot, recording the changed block in the source storage device into a meta-chart generated by the path input and output module; the path input and output module will The storage device name of the source storage device connected to the server is decoupled and coupled to the storage device name of the target storage device to achieve the server connection path, switching from the source storage device to the target storage device; a traffic model Unit, receiving the performance parameter value, and according to the received performance parameter value, at a specific time point in the future A data access traffic state to define a strategy in accordance with the state to access traffic information, provided An action schedule; and at least one data movement service unit for snapshotting a portion or the entire source storage device, copying a snapshot of the portion or the entire source storage device to the target storage device, and comparing the action schedule In the previous snapshot, the change in a new snapshot of the part or the entire source storage device is modified to the copied snapshot in the target storage device and controls the operation of the proxy module.

該代理模組與該流量模型單元與至少一資料移動服務單元通訊;該位元圖表對來源儲存設備中的每一區塊包含值為0或1的一位元;0代表對應區塊中沒有改變而1代表二連續快照間對應區塊中內容有改變;當該位元圖表中1的數量小於或等於一門檻值時,執行切換。The proxy module communicates with the traffic model unit and at least one data movement service unit; the bit map includes one bit with a value of 0 or 1 for each block in the source storage device; 0 represents no corresponding block Change and 1 represents a change in the content of the corresponding block between two consecutive snapshots; when the number of 1 in the bit chart is less than or equal to a threshold, switching is performed.

依照本案構想,該來源儲存設備為硬碟、固態硬碟、隨機存取記憶體,或前述裝置的混合,或為邏輯單元號碼設備,或為一物件儲存資料。同樣地,該目標儲存設備為硬碟、固態硬碟、隨機存取記憶體,或前述裝置的混合,或為邏輯單元號碼設備,或為一物件儲存資料。According to the present invention, the source storage device is a hard disk, a solid state hard disk, a random access memory, or a mixture of the foregoing devices, or a logical unit number device, or an object storage material. Similarly, the target storage device is a hard disk, a solid state hard disk, a random access memory, or a mixture of the foregoing devices, or a logical unit number device, or an object storage material.

該代理模組為軟體、硬體或韌體。同樣地,該路徑輸入輸出模組為軟體、硬體或韌體。The proxy module is software, hardware or firmware. Similarly, the path input/output module is software, hardware or firmware.

依照本發明,該性能參數為每秒輸入輸出操作次數、延遲時間,及/或流通量。該行動時間表指定快照該部分或整個來源儲存設備的時間、複製該部分或整個來源儲存設備的快照到該目標儲存設備、將相較於前一次快照,該部分或 整個來源儲存設備的一新快照中的改變,修改到該目標儲存設備中的複製的快照,及當該來源儲存設備不會有或少量資料存取時,執行切換。該少量資料存取的情況定義於一服務層級協議、一服務品質需求,或基於該來源儲存設備或目標儲存設備規範的定義策略中。According to the invention, the performance parameter is the number of input and output operations per second, the delay time, and/or the throughput. The action schedule specifies the time to snapshot the portion or the entire source storage device, copying a snapshot of the portion or the entire source storage device to the target storage device, compared to the previous snapshot, the portion or The change in a new snapshot of the entire source storage device is modified to the copied snapshot in the target storage device, and the switch is performed when the source storage device does not have or has a small amount of data access. The small amount of data access is defined in a service level protocol, a quality of service requirement, or a defined policy based on the source storage device or target storage device specification.

該定義策略定義一遲滯時間,一旦該行動無法滿足一服務層級協議或一服務品質需求中的需求時,停止複製該部分或整個來源儲存設備的快照到該目標儲存設備,及將相較於前一次快照,該部分或整個來源儲存設備的一新快照中的改變,停止修改到該目標儲存設備中的複製的快照。該定義策略也可定義一等待時間,但該行動無法滿足一服務層級協議或一服務品質需求中的需求時,重試複製該部分或整個來源儲存設備的快照到該目標儲存設備,或將相較於前一次快照,該部分或整個來源儲存設備的一新快照中的改變,重試修改到該目標儲存設備中的複製的快照。該儲存設備名稱包含該來源儲存設備或目標儲存設備的型號名稱、型號,及/或序號。The definition policy defines a lag time, and once the action fails to meet the requirements in a service level agreement or a service quality requirement, the copying of the snapshot of the part or the entire source storage device to the target storage device is stopped, and will be compared to the previous A snapshot, a change in a new snapshot of the portion or the entire source storage device, stops modifying the copied snapshot to the target storage device. The definition policy may also define a wait time, but when the action fails to meet the requirements in a service level agreement or a service quality requirement, retry copying the snapshot of the part or the entire source storage device to the target storage device, or A modified snapshot to a copy of the target storage device is compared to the previous snapshot, a change in a new snapshot of the portion or the entire source storage device. The storage device name includes the model name, model number, and/or serial number of the source storage device or the target storage device.

為了運作該系統,一程序包含步驟:A.該流量模型單元提供該行動時間表;B.該資料移動服務單元指示該來源儲存設備快照一部分或整個來源儲存設備,且該代理模組紀錄該來源儲存設備中改變的區塊到該位元圖表中,該位元圖表初始時皆為0;C.該資料移動服務單元依照該行動時間表, 依序複製該快照到目標儲存設備中相同的區塊;D.該資料移動服務單元將該部分或整個來源儲存設備快照的改變,修改到目標儲存設備中的複製的快照;E.重複步驟B至步驟D,直到位元圖表中1的數量小於或等於該門檻值;及F.該資料移動服務單元由該來源儲存設備至該目標儲存設備,執行該伺服器連接路徑的切換,及/或依照使用者需求,重新連接至該來源儲存設備。此處,門檻值為0。In order to operate the system, a program includes the steps of: A. the traffic model unit provides the action schedule; B. the data move service unit indicates a portion of the source storage device snapshot or the entire source storage device, and the proxy module records the source The changed block in the storage device is in the bit chart, and the bit chart is initially 0; C. The data mobile service unit is in accordance with the action schedule. Copying the snapshot to the same block in the target storage device in sequence; D. the data mobile service unit modifies the change of the partial or entire source storage device snapshot to the copied snapshot in the target storage device; E. repeat step B Go to step D until the number of 1 in the bitmap is less than or equal to the threshold; and F. the data movement service unit performs the switching of the server connection path from the source storage device to the target storage device, and/or Reconnect to the source storage device as required by the user. Here, the threshold is 0.

上述的程序進一步包括一步驟B1於步驟B與步驟C間,及一步驟G於步驟F之後:B1.如果步驟B到步驟D已被重複至少一次、位元圖表中1的數量小於該門檻值,且該資料存取流量狀態預期在不久將來的一段時間內是低的,以便該資料移動服務單元計算得知快照的複製能於一服務層級協議需求的時間內完成,該資料移動服務單元發動該代理模組以保留所有到該來源儲存設備的資料寫入命令;及G.該資料移動服務單元指示該代理模組釋放所有資料寫入給該目標儲存設備及/或該來源儲存設備。The above procedure further includes a step B1 between step B and step C, and a step G after step F: B1. If step B to step D have been repeated at least once, the number of 1 in the bit chart is less than the threshold value. And the data access traffic status is expected to be low for a period of time in the near future, so that the data mobile service unit calculates that the copy of the snapshot can be completed within a time required by a service level protocol, and the data mobile service unit starts The proxy module retains all data write commands to the source storage device; and G. the data move service unit instructs the proxy module to release all data to the target storage device and/or the source storage device.

該程序能進一步包含一步驟B2於步驟B與步驟C間,及一步驟E1於步驟E與步驟F間:B2.如果步驟B到步驟D已被重複至少一次,位元圖表中1的數量小於該門檻值,且該資料存取流量狀態預期在不久將來的一段時間內是低的,以便該資料移動服務單元計算得知快照的複製能於一服務層級協議需求的時間內完成,該代理模組通知該資料移動服務單元關 於到該來源儲存設備的新的區塊寫入命令;及E1.該資料移動服務單元執行該新的區塊寫入命令到目標儲存設備中對應的區塊。該門檻值可為任何小於10的整數。The program can further include a step B2 between step B and step C, and a step E1 between step E and step F: B2. If step B to step D have been repeated at least once, the number of 1 in the bit map is less than The threshold value, and the data access traffic status is expected to be low for a period of time in the near future, so that the data movement service unit calculates that the copy of the snapshot can be completed within a time required by a service level protocol, the proxy mode The group informs the data mobile service unit to close And a new block write command to the source storage device; and E1. the data move service unit executes the new block write command to a corresponding block in the target storage device. The threshold can be any integer less than 10.

藉由流量模型單元的協助,能預測一段時間內會有低量的存取需求,資料重建(複製或移動)便能在該段時間內進行。從而,資料重建能在系統所提供的服務沒有受到干擾及中斷下完成,即稱之為達成無干擾性資料重建。同時,無論該來源儲存設備或目標儲存設備是出自同樣或不同的製造商,或儲存設備是實體設備或邏輯單元號碼設備,甚或是物件儲存資料,本系統皆能應用之。With the help of the traffic model unit, it is possible to predict a low amount of access demand for a period of time, and data reconstruction (copying or moving) can be performed during that time. Therefore, data reconstruction can be completed without any interference or interruption of the services provided by the system, that is, to achieve non-interfering data reconstruction. At the same time, the system can be applied regardless of whether the source storage device or the target storage device is from the same or different manufacturer, or the storage device is a physical device or a logical unit number device, or even an object storage material.

1‧‧‧應用伺服器1‧‧‧Application Server

2‧‧‧來源儲存設備2‧‧‧Source storage equipment

3‧‧‧目標儲存設備3‧‧‧Target storage equipment

4‧‧‧資料複製伺服器4‧‧‧Data Copy Server

8‧‧‧應用伺服器8‧‧‧Application Server

9‧‧‧資料移動服務單元9‧‧‧Data Mobile Service Unit

10‧‧‧來源儲存設備10‧‧‧Source storage equipment

11‧‧‧目標儲存設備11‧‧‧Target storage equipment

100‧‧‧系統100‧‧‧ system

110‧‧‧來源儲存設備110‧‧‧Source storage equipment

120‧‧‧目標儲存設備120‧‧‧Target storage equipment

130‧‧‧伺服器130‧‧‧Server

140‧‧‧流量模型單元140‧‧‧Flow model unit

150‧‧‧資料移動服務單元150‧‧‧Data Mobile Service Unit

151‧‧‧輔助資料移動服務單元151‧‧‧Auxiliary Data Mobile Service Unit

第1圖顯示由一離線方法運作的一系統。Figure 1 shows a system that operates by an offline method.

第2圖顯示由一鏡像方法運作的一系統。Figure 2 shows a system that operates by a mirroring method.

第3圖顯示由一雙寫入及快照影像複製方法運作的一系統。Figure 3 shows a system that operates by a pair of write and snapshot image copy methods.

第4圖顯示依照本發明,一種用於達成無干擾性資料重建的系統。Figure 4 shows a system for achieving interference-free data reconstruction in accordance with the present invention.

第5圖說明使用於本發明中的一位元圖表。Figure 5 illustrates a one-dimensional chart used in the present invention.

第6圖為用於運作該系統的程序之流程圖。Figure 6 is a flow chart of the procedure for operating the system.

第7圖顯示依照本發明,另一種用於達成無干擾性資料重建的系統。Figure 7 shows another system for achieving interference-free data reconstruction in accordance with the present invention.

第8圖說明如何於一來源儲存設備及一目標儲存設備間複製資料。Figure 8 illustrates how data is copied between a source storage device and a target storage device.

本發明將藉由參照下列的實施例而更具體地描述。The invention will be more specifically described by reference to the following examples.

請參閱第4圖至第6圖,本發明的一實施例揭露於此。能達成無干擾性資料重建的一系統100顯示於第4圖中。名詞"重建"意味一儲存設備內的資料可被複製或移動到另一儲存設備中。如果資料是移動到另一儲存設備,在重建完成後,原始來源儲存設備中的資料將不會被存取或使用。資料重建可以是無干擾性地完成,以便存取該原始儲存設備的一或多個伺服器將不會中斷運作,其提供的服務可以保持不變。依照本發明,系統100至少包含一來源儲存設備110、一目標儲存設備120、一伺服器130、一流量模型單元140及一資料移動服務單元150。上述設備的功能和操作過程將在下文中詳細地說明。Referring to Figures 4 through 6, an embodiment of the present invention is disclosed herein. A system 100 that achieves interference-free data reconstruction is shown in FIG. The term "reconstruction" means that data in a storage device can be copied or moved to another storage device. If the data is moved to another storage device, the data in the original source storage device will not be accessed or used after the rebuild is completed. Data reconstruction can be done without interference so that one or more servers accessing the original storage device will not be interrupted, and the services provided may remain unchanged. According to the present invention, the system 100 includes at least one source storage device 110, a target storage device 120, a server 130, a traffic model unit 140, and a data movement service unit 150. The function and operation of the above device will be explained in detail below.

使用於本實施例中的來源儲存設備110與目標儲存設備120都是硬碟。來源儲存設備110與目標儲存設備120的特性,諸如容量、轉速、功耗等,可以是相同、部分相同或完全不同。也就是說,該二儲存設備可以來自相同或者是不同的製造商,而不像鏡像方法要求兩個儲存設備必須具備一致的規格。實作上,來源儲存設備110或目標儲存設備120可以是固態硬碟、隨機存取記憶體,或硬碟、固態硬碟與隨機存 取記憶體其中至少二個的混合組成。來源儲存設備110或目標儲存設備120也能是邏輯單元號碼設備,或用於資料儲存目的的物件儲存資料。The source storage device 110 and the target storage device 120 used in this embodiment are both hard disks. The characteristics of the source storage device 110 and the target storage device 120, such as capacity, rotational speed, power consumption, etc., may be the same, partially identical, or completely different. That is to say, the two storage devices can be from the same or different manufacturers, unlike the mirroring method, which requires two storage devices to have consistent specifications. In practice, the source storage device 110 or the target storage device 120 may be a solid state hard disk, a random access memory, or a hard disk, a solid state hard disk, and a random storage. Take a mixture of at least two of the memory. The source storage device 110 or the target storage device 120 can also be a logical unit number device, or an object storage material for data storage purposes.

來源儲存設備110與目標儲存設備120中實體儲存設備的數量並不限定於一個。例如在另一個實施例中,來源儲存設備110可包含數個硬碟,該些硬碟形成了一磁碟陣列,而目標儲存設備120仍是一個硬碟,具有等同於該磁碟陣列的儲存設備容量。來源儲存設備110可包含許多的資料或檔案,等待複製或移動。為了對本發明有完整的理解,僅將一來源資料儲存於來源儲存設備110中,用於說明。The number of physical storage devices in the source storage device 110 and the target storage device 120 is not limited to one. For example, in another embodiment, the source storage device 110 can include a plurality of hard disks that form a disk array, and the target storage device 120 is still a hard disk having storage equivalent to the disk array. Equipment capacity. The source storage device 110 can contain a number of materials or files waiting to be copied or moved. For a complete understanding of the present invention, only one source of material is stored in source storage device 110 for illustration.

伺服器130被用來存取來源儲存設備110中的來源資料。因此,伺服器130能提供一特定服務,例如,音樂串流。伺服器130也能存取儲存於該目標儲存設備中的任何資料120。但是應當強調的是,在這實施例中,伺服器130與來源儲存設備110間的連結,或伺服器130與目標儲存設備120間的連結可以是直接連結、經由區域網絡連接,或經由廣域網路,諸如網際網路,所連接。伺服器130可提供該服務給區域網路或廣域網路上有限的使用群。伺服器130已裝設了一代理模組與一路徑輸入輸出模組(未繪示)。該代理模組或路徑輸入輸出模組可以軟體、硬體或韌體形態存在,而執行其功能。在本實施例中,代理模組與路徑輸入輸出模組都是安裝於伺服器130上的軟體。The server 130 is used to access source material in the source storage device 110. Therefore, the server 130 can provide a specific service, such as a music stream. The server 130 can also access any of the data 120 stored in the target storage device. However, it should be emphasized that in this embodiment, the connection between the server 130 and the source storage device 110, or the connection between the server 130 and the target storage device 120 may be directly connected, connected via a regional network, or via a wide area network. , such as the Internet, connected. Server 130 can provide this service to a limited group of users on a regional or wide area network. The server 130 has a proxy module and a path input and output module (not shown). The proxy module or path input/output module can exist in a soft, hardware or firmware form to perform its functions. In this embodiment, the proxy module and the path input/output module are both software installed on the server 130.

關於功能,代理模組能持續蒐集一性能參數值,該性能參數值表達伺服器130對來源資料的存取狀態。此處,性能參數是每秒輸入輸出操作次數。這些使用者存取的歷史資料將通過伺服器130,被發送到流量模型單元140中做進一步分析。還有其它可獲得的性能參數用於資料存取分析,例如,延遲時間及/或流通量可以被使用。只要該數值可以被流量模型單元140所接受,本發明並不限制蒐集特定的性能參數值,這點將會詳述於後。代理模組能在該來源儲存設備110快照之後,記錄來源儲存設備110中改變的區塊到一位元圖表中,該位元圖表由該路徑輸入輸出模組所產生。因為系統100被用來進行資料複製與移動,快照是主要用於達成該目標的方法,代理模組使用的位元圖表儲存於伺服器130中。如何追蹤與控制來源儲存設備110快照與目標儲存設備120複製快照間,因為快照後寫入命令而產生的差異是很重要的。Regarding the function, the proxy module can continuously collect a performance parameter value that expresses the access status of the server 130 to the source material. Here, the performance parameter is the number of input and output operations per second. The historical data accessed by these users will be sent to the traffic model unit 140 via the server 130 for further analysis. There are other available performance parameters for data access analysis, for example, delay time and/or throughput can be used. As long as the value can be accepted by the traffic model unit 140, the present invention does not limit the collection of specific performance parameter values, as will be described in detail later. The proxy module can record the changed block in the source storage device 110 to a one-bit graph after the snapshot of the source storage device 110. The bitmap is generated by the path input and output module. Because system 100 is used for data copying and movement, snapshots are primarily used to achieve this goal, and the bitmaps used by the proxy modules are stored in server 130. How to track and control the difference between the source storage device 110 snapshot and the target storage device 120 copy snapshot, because the write command after the snapshot is important.

請參閱第5圖。對來源儲存設備110中每一區塊,位元圖表包含值為0或1的一位元。0代表對應區塊中沒有改變而1代表二連續快照間對應區塊中內容有改變。例如,當快照發生於T0 時,因為資料結構沒有變動,位元圖表陣列中所有對應該24區塊的值都是0。實際上,該資料結構僅包含來源資料。該來源資料還未被覆寫,故快照不變。當時間來到T1 ,執行了另一個快照。此時發現來源資料已被覆寫,某些區塊改變了。有4個改變的區塊,改變部分為C0 →C1 、R0 →R1 、S0 →S1 , 及Z0 →Z1 。陣列中對應的位元變為1,而其它位元仍是0。當時間來到T2 ,來源儲存設備進行第三次快照。來源資料又被覆寫一次,且仍有某些區塊被改變。T1 與T2 間的改變為F0 →F2 及R1 →R2 。很明顯區塊R2 改變兩次而區塊F2 僅改變一次。然而,無論因為寫入命令造成多少次的區塊改變,相對應的位元皆將由0變成1。最終,於T3 時,連續快照間沒有任何改變,位元圖表所有的位元都又恢復0。此即位元圖表運作的方式。Please refer to Figure 5. For each block in source storage device 110, the bit map contains a single bit with a value of 0 or 1. 0 means that there is no change in the corresponding block and 1 means that the content in the corresponding block between two consecutive snapshots has changed. For example, when a snapshot occurs at T 0 , the value of all corresponding 24 blocks in the bitmap array is 0 because there is no change in the data structure. In fact, the data structure contains only source material. The source data has not been overwritten, so the snapshot remains unchanged. When the time comes to T 1 , another snapshot is taken. At this point, the source data has been overwritten and some of the blocks have changed. There are 4 changed blocks, and the changed parts are C 0 → C 1 , R 0 → R 1 , S 0 → S 1 , and Z 0 → Z 1 . The corresponding bit in the array becomes 1 while the other bits are still 0. When the time comes to T 2 , the source storage device takes a third snapshot. Source data is overwritten again and some blocks are still being changed. The change between T 1 and T 2 is F 0 → F 2 and R 1 → R 2 . It is obvious that the block R 2 changes twice and the block F 2 changes only once. However, no matter how many blocks change due to a write command, the corresponding bit will change from 0 to 1. Finally, at 3:00 T, there is no change between successive snapshots, all the bits are bit charts resumed 0. This is how the bit chart works.

路徑輸入輸出模組將連結到伺服器130之來源儲存設備110的儲存設備名稱脫鉤,並耦接至該目標儲存設備120的儲存設備名稱。路徑輸入輸出模組這樣運作以便達成伺服器130連接路徑,從來源儲存設備110到目標儲存設備120的切換。路徑輸入輸出模組控制伺服器130存取的路徑。儲存設備名稱包含來源儲存設備110或目標儲存設備120的型號名稱、型號,及/或序號,它能包含關於來源儲存設備110或目標儲存設備120規格的其它資訊。當路徑輸入輸出模組將來源儲存設備110的儲存設備名稱脫鉤時,伺服器130不能與來源儲存設備110連接及對其進行存取。當路徑輸入輸出模組耦接至目標儲存設備120的儲存設備名稱時,伺服器130連接至目標儲存設備120,兩者之間的存取是可行的。在脫鉤與耦接的動作之後,路徑輸入輸出模組切換伺服器130的存取路徑。The path input and output module decouples the storage device name of the source storage device 110 connected to the server 130 and is coupled to the storage device name of the target storage device 120. The path input and output module operates in such a manner as to achieve a switch of the server 130 connection path from the source storage device 110 to the target storage device 120. The path input and output module controls the path accessed by the server 130. The storage device name includes the model name, model number, and/or serial number of the source storage device 110 or the target storage device 120, which can contain other information regarding the specifications of the source storage device 110 or the target storage device 120. When the path input and output module decouples the storage device name of the source storage device 110, the server 130 cannot connect to and access the source storage device 110. When the path input and output module is coupled to the storage device name of the target storage device 120, the server 130 is connected to the target storage device 120, and access between the two is feasible. After the action of decoupling and coupling, the path input/output module switches the access path of the server 130.

流量模型單元140能從代理模組接收每秒輸入輸出操作次數(性能參數)的值,它能依照該接收的每秒輸入輸 出操作次數值,提供在未來一特定時間點一資料存取流量狀態。該流量狀態是一種實體資料,證實伺服器130是如何存取來源儲存設備110。例如,該流量狀態可以是5分鐘後任何時間的每秒輸入輸出操作次數。然而,流量狀態必須由過去歷史中累積的每秒輸入輸出操作次數蒐集得來,並且經過分析。提供這樣服務的任何適合的方法、演算法或模組都可以應用之,最好使用由相同發明人於美國專利申請案第14/290,533號中所提供的一種儲存流量模型,可由該申請案獲得相同技術之共同參考。因此,流量模型單元140能依照一定義策略與資料存取流量狀態,提供一行動時間表。The traffic model unit 140 can receive the value of the number of input/output operations per second (performance parameter) from the proxy module, and can input and output according to the received input per second. The value of the operation times provides a data access flow status at a specific time in the future. The traffic status is an entity material that confirms how the server 130 accesses the source storage device 110. For example, the traffic status can be the number of input/output operations per second at any time after 5 minutes. However, the traffic status must be gathered from the number of input/output operations per second accumulated in past history and analyzed. Any suitable method, algorithm or module for providing such a service can be applied, preferably using a stored flow model provided by the same inventor in U.S. Patent Application Serial No. 14/290,533, which is incorporated herein by reference. A common reference for the same technology. Therefore, the traffic model unit 140 can provide an action schedule according to a defined policy and data access traffic status.

此處,行動時間表是一組的指令,用來在最佳的時間點作業系統100。例如,行動時間表指定快照該部分或整個來源儲存設備110、複製該部分或整個來源儲存設備110的快照到目標儲存設備120、將相較於前一次快照,該部分或整個來源儲存設備110的一新快照中的改變,修改到該目標儲存設備120中的複製的快照,及當該來源儲存設備120不會有或少量資料存取時,執行切換的時間。前述少量資料存取的情況定義於一服務層級協議、一服務品質需求,或基於該來源儲存設備110或目標儲存設備120規範的定義策略中。需要強調的是整個行動時間表是基於未來流量狀態的預估之上,行動時間表會由定義策略修改,並當未預期的狀況發生時,再進行評估。Here, the action schedule is a set of instructions for operating the system 100 at the optimal point in time. For example, the action schedule specifies the snapshot of the portion or the entire source storage device 110, copying the snapshot of the portion or the entire source storage device 110 to the target storage device 120, as compared to the previous snapshot, the portion or the entire source storage device 110 The change in a new snapshot, the copied snapshot to the target storage device 120, and the time at which the switch is performed when the source storage device 120 does not have or accesses a small amount of data. The foregoing small amount of data access is defined in a service level protocol, a quality of service requirement, or in a defined policy based on the source storage device 110 or the target storage device 120 specification. It is important to emphasize that the entire action schedule is based on an estimate of future traffic status, and the action schedule is modified by the defined strategy and evaluated when an unexpected condition occurs.

定義策略不同於服務層級協議或服務品質需求,後二者在系統100上線前由系統100的製造商與擁有該系統100的服務提供者簽署且同意。定義策略設定詳細的系統100的運作,以針對服務層級協議或服務品質需求,實現服務內容。特別是定義策略定義了一遲滯時間,一旦行動無法滿足一服務層級協議或一服務品質需求中的需求時,停止複製該部分或整個來源儲存設備110的快照到該目標儲存設備120,及將相較於前一次快照,該部分或整個來源儲存設備110的一新快照中的改變,停止修改到該目標儲存設備120中的複製的快照。例如,如果該服務層級協議要求伺服器130回應一使用者需求不得超過2秒,當快照複製的時間超過5秒的遲滯時間,而在此情況下可能會導致回應時間比2秒來得長,那整個複製運作就停止。快照的複製將被延後30秒,或直到伺服器130的存取預期低於某一每秒輸入輸出操作次數,比如10,000Mb/s,而這不會延遲快照複製的運作。在另一例子中,如果在二快照間有某些需要被一些接收寫入命令更新的區塊,定義策略將要求最短時間,比如3秒,來處理該行動,以免服務層級協議或服務品質需求中的要求不能被滿足。"5秒"的遲滯時間、"30秒"的等待,及"3秒"的最短時間是定義策略的某些態樣。在資料複製或移動服務完成前,來源儲存設備110的快照可能有很多次,定義策略對每一次而言都適用。The definition policy is different from the service level agreement or service quality requirement, and the latter two are signed and agreed by the manufacturer of the system 100 and the service provider owning the system 100 before the system 100 goes online. The definition policy sets up the operation of the detailed system 100 to implement service content for service level agreements or service quality requirements. In particular, the definition policy defines a lag time, and once the action fails to meet the requirements in a service level agreement or a service quality requirement, the copying of the snapshot of the part or the entire source storage device 110 to the target storage device 120 is stopped, and the phase is The change to a new snapshot of the portion or of the entire source storage device 110, compared to the previous snapshot, stops modifying the copied snapshot into the target storage device 120. For example, if the service level agreement requires the server 130 to respond to a user request for no more than 2 seconds, when the snapshot copy time exceeds 5 seconds of lag time, and in this case, the response time may be longer than 2 seconds, then The entire copy operation stops. The copy of the snapshot will be delayed by 30 seconds, or until the access expected by the server 130 is lower than the number of input/output operations per second, such as 10,000 Mb/s, which does not delay the operation of the snapshot copy. In another example, if there are some blocks between the two snapshots that need to be updated by some receive write commands, the definition policy will require the shortest time, such as 3 seconds, to process the action to avoid service level agreements or quality of service requirements. The requirements in can not be met. The "5 second" lag time, the "30 second" wait, and the "3 second" minimum time are some aspects of defining the strategy. The snapshot of the source storage device 110 may be many times before the data copy or mobile service is completed, and the definition policy is applicable for each time.

由上述說明可知,定義策略也定義一等待時間,一旦該行動無法滿足一服務層級協議或一服務品質需求中的需求時,重試複製該部分或整個來源儲存設備110的快照到該目標儲存設備120,或將相較於前一次快照,該部分或整個來源儲存設備110的一新快照中的改變,重試修改到該目標儲存設備120中的複製的快照。例如,"30秒"的等待。As can be seen from the above description, the definition policy also defines a waiting time. Once the action cannot meet the requirements in a service level agreement or a service quality requirement, retry copying the snapshot of the part or the entire source storage device 110 to the target storage device. 120. Depending on the previous snapshot, the change in a new snapshot of the portion or the entire source storage device 110, the modified snapshot to the target storage device 120 is retried. For example, a "30 second" wait.

資料移動服務單元150能存取來源儲存設備110與目標儲存設備120,及與伺服器130和流量模型單元140通訊,因此它負責快照一部分或整個來源儲存設備110,快照一部分或整個來源儲存設備110、複製該部分或整個來源儲存設備110的快照到該目標儲存設備120,及依照該行動時間表,將相較於前一次快照,該部分或整個來源儲存設備110的一新快照中的改變,修改到該目標儲存設備120中的複製的快照。它也能控制該代理模組的運作。要注意的是控制功能能完成是因為代理模組能與流量模型單元140及資料移動服務單元150通訊之故。從另外一個觀點看來,代理模組是伺服器130的窗口,用來接收來自資料移動服務單元150的控制指令。對負載分佈與平衡而言,系統中可以有超過一個以上的資料移動服務單元150,這將在之後另一實施例中詳細說明。The data movement service unit 150 can access the source storage device 110 and the target storage device 120, and communicate with the server 130 and the traffic model unit 140, so it is responsible for snapshotting a portion or the entire source storage device 110, a snapshot portion or the entire source storage device 110. Copying a snapshot of the portion or the entire source storage device 110 to the target storage device 120, and in accordance with the action schedule, a change in a new snapshot of the portion or the entire source storage device 110 compared to the previous snapshot, The copied snapshot to the target storage device 120 is modified. It also controls the operation of the proxy module. It should be noted that the control function can be completed because the proxy module can communicate with the traffic model unit 140 and the data movement service unit 150. From another point of view, the proxy module is a window of the server 130 for receiving control commands from the data movement service unit 150. For load distribution and balancing, there may be more than one data movement service unit 150 in the system, as will be described in detail in another embodiment that follows.

現在,作業系統100的程序說明如下。該程序包括幾個步驟,請參閱第6圖,第6圖是該程序的流程圖。在資料移動前,流量模型單元140持續蒐集每秒輸入輸出操作次數值, 該值表達伺服器130對來源資料的存取狀態。基於該蒐集的值,流量模型單元140能提供行動時間表(S01)。當資料移動服務開始時,資料移動服務單元150指示來源儲存設備110去快照整個來源儲存設備110,它也指示代理模組去紀錄來源儲存設備110中改變的區塊到伺服器130中的位元圖表(S02)。如上所述,因在二快照間到來源儲存設備110的寫入命令而造成區塊改變,位元圖表包含的所有位元都是0,以對應初始階段所有的區塊。當資料移動服務單元150確認一快照產生了,它就依照行動時間表,將該快照依序複製到目標儲存設備120中相同的區塊上(S03)。在步驟S03中,資料移動服務單元150將遵循來自流量模型單元140的行動時間表,以在來源儲存設備110預期無存取或低存取情況當下,執行快照複製。從而,由伺服器130存取所造成的服務中斷可以避免。因為快照複製持續進行,僅某些因寫入命令導致改變的區塊,在目標儲存設備120中整個複製的快照鮮少全部寫入。移動服務單元150僅修改整個來源儲存設備110快照的改變部分,到目標儲存設備120中複製的快照(S04)。如第5圖所示,改變區塊的數量可能在下一個快照中減少。藉重複步驟S02到步驟S04,可以達成快照複製時間上的收斂。這些步驟的重複將持續,直到位元圖表中1的數量小於或等於該門檻值(S05)。最好,該門檻值是0,這意味在來源儲存設備110連續快照間沒有區塊改變。接著,資料移動服務單元150進行伺服器130連接路徑的切換, 從來源儲存設備110到目標儲存設備120,或重新連接至該來源儲存設備110(S06),何種做法,取決於使用者的需求。The program of the operating system 100 will now be described as follows. The program consists of several steps, please refer to Figure 6, which is a flow chart of the program. Before the data movement, the traffic model unit 140 continuously collects the value of the number of input and output operations per second. This value expresses the access status of the source data by the server 130. Based on the collected values, the traffic model unit 140 can provide an action schedule (S01). When the data movement service starts, the data movement service unit 150 instructs the source storage device 110 to snapshot the entire source storage device 110, which also instructs the agent module to record the changed blocks in the source storage device 110 to the bits in the server 130. Chart (S02). As described above, since the block change is caused by the write command to the source storage device 110 between the two snapshots, all the bits contained in the bit map are 0 to correspond to all the blocks in the initial stage. When the data movement service unit 150 confirms that a snapshot has been generated, it copies the snapshot to the same block in the target storage device 120 in order according to the action schedule (S03). In step S03, the material movement service unit 150 will follow the action schedule from the traffic model unit 140 to perform snapshot copying when the source storage device 110 anticipates no access or low access. Thus, the service interruption caused by the access by the server 130 can be avoided. Because the snapshot copying continues, only some of the blocks that have changed due to the write command, the entire replicated snapshot in the target storage device 120 is rarely written. The mobile service unit 150 modifies only the changed portion of the entire source storage device 110 snapshot to the snapshot copied in the target storage device 120 (S04). As shown in Figure 5, the number of changed blocks may be reduced in the next snapshot. By repeating step S02 to step S04, convergence on the snapshot copy time can be achieved. The repetition of these steps will continue until the number of 1s in the bit map is less than or equal to the threshold value (S05). Preferably, the threshold is zero, which means that there is no block change between successive snapshots of the source storage device 110. Next, the data movement service unit 150 performs switching of the connection path of the server 130. From the source storage device 110 to the target storage device 120, or reconnected to the source storage device 110 (S06), depending on the needs of the user.

詳細地說,步驟S06包含一切換的子步驟,該子步驟停止伺服器130與來源儲存設備110間的輸出輸入。在切換後,資料移動服務單元150將經由代理模組通知路徑輸入輸出模組,改變伺服器130的連接路徑,由來源儲存設備110成為目標儲存設備120。從而,目標儲存設備120變成一新來源儲存設備。或者,資料移動服務單元150將經由代理模組通知路徑輸入輸出模組,重新連接伺服器130的路徑到來源儲存設備110,何種做法,取決於使用者的需求。至於切換的時點,對來源儲存設備110與目標儲存設備120來說都相同。在資料複製過程中,當存取量大時,因為有流量模型單元140,伺服器130不用進行複製快照與切換,伺服器130提供的服務不受影響,可達成無干擾性資料重建(複製或移動)。In detail, step S06 includes a sub-step of switching that stops the output input between the server 130 and the source storage device 110. After the handover, the data movement service unit 150 notifies the path input/output module via the proxy module, changes the connection path of the server 130, and becomes the target storage device 120 by the source storage device 110. Thus, the target storage device 120 becomes a new source storage device. Alternatively, the data movement service unit 150 will notify the path input/output module via the proxy module to reconnect the path of the server 130 to the source storage device 110, depending on the needs of the user. As for the timing of the handover, the same is true for the source storage device 110 and the target storage device 120. In the data copying process, when the access amount is large, because there is the traffic model unit 140, the server 130 does not need to perform copy snapshot and switching, and the service provided by the server 130 is not affected, and the interference-free data reconstruction can be achieved (replication or mobile).

對以上的說明,很明顯地,用於切換的方法是等待二個連續快照間沒有區塊改變。正如人們所知,連續二連續快照沒有寫入命令的情況純屬理想。因此,需要一個更切實際的切換方法或運作程序,本發明提供其它的解決方案。For the above description, it is obvious that the method for switching is to wait for no block change between two consecutive snapshots. As is known, the fact that two consecutive snapshots have no write commands is ideal. Therefore, a more practical switching method or operating procedure is needed, and the present invention provides other solutions.

如果步驟S02到步驟S04已被執行至少一次,位元圖表中1的數量小於門檻值,且資料存取流量狀態在不久的將來裡的一段時間預期是低的,以便資料移動服務單元150能計算得知快照的複製能於一服務層級協議需求的時間內完成,資 料移動服務單元150發動代理模組,以保留所有到該來源儲存設備110的資料寫入命令。取決於使用者的需求,資料移動服務單元150在步驟S06後指示代理模組釋放所有資料寫入命令給目標儲存設備120或來源儲存設備110。最好該門檻值為任何小於10的整數,例如5。在切換之前,所有資料寫入依序完成。If step S02 to step S04 have been executed at least once, the number of 1 in the bit map is less than the threshold, and the data access flow state is expected to be low for a period of time in the near future, so that the data movement service unit 150 can calculate Know that the copy of the snapshot can be completed within the time required by a service level agreement. The mobile service unit 150 activates the proxy module to retain all data write commands to the source storage device 110. Depending on the needs of the user, the data movement service unit 150 instructs the agent module to release all data write commands to the target storage device 120 or the source storage device 110 after step S06. Preferably, the threshold is any integer less than 10, such as 5. All data is written in sequence before switching.

如果在步驟S02到步驟S04已重複至少一次之相同前提下,位元圖表中1的數量小於門檻值,且資料存取流量狀態在不久的將來裡的一段時間預期是低的,以便資料移動服務單元150能計算得知快照的複製能於一服務層級協議需求的時間內完成,代理模組能通知資料移動服務單元150,關於到該來源儲存設備110的新的區塊寫入命令。在二連續快照間改變的區塊修改完成後,資料移動服務單元150進行新的區塊寫入命令到目標儲存設備120中的對應的區塊。接著,執行步驟S06。同樣地,門檻值為任何小於10的整數。從而,在切換之前,所有資料寫入依序完成。If the number of 1 in the bit chart is less than the threshold value, the number of 1 in the bit chart is expected to be low for a period of time in the near future, so that the data movement service is available, in the same premise that step S02 to step S04 have been repeated at least once. The unit 150 can calculate that the copy of the snapshot can be completed within a time required by a service level protocol, and the proxy module can notify the data movement service unit 150 of a new block write command to the source storage device 110. After the block modification changed between the two consecutive snapshots is completed, the material movement service unit 150 performs a new block write command to the corresponding block in the target storage device 120. Next, step S06 is performed. Similarly, the threshold is any integer less than 10. Thus, all data writes are completed in sequence before switching.

此外,如果服務層級協議要求較短的回應時間或服務商想加速快照複製的運作,可使用多於一個的資料移動服務單元。請見第7圖。一輔助資料移動服務單元151連接到系統100中。如上所述,快照能完成用於一部分的來源儲存設備110,而不是整個來源儲存設備110。請同時參閱第8圖。如果將來源儲存設備110分做兩部份,磁軌1到磁軌100及磁軌101 到磁軌200。目標儲存設備120也分做兩部份,磁軌1到磁軌100及磁軌101到磁軌200。資料移動服務單元150負責複製或移動來源儲存設備110磁軌1到磁軌100區塊中的來源資料到目標儲存設備120的磁軌1到磁軌100。輔助資料移動服務單元151負責複製或移動來源儲存設備110磁軌101到磁軌200區塊中的來源資料到目標儲存設備120的磁軌101到磁軌200。傳統上,來源儲存設備110的磁軌1到磁軌100定義為一邏輯單元號碼設備,來源儲存設備110的磁軌101到磁軌200定義為另一邏輯單元號碼設備。從而,本發明的應用不限於二實體儲存設備間的資料複製或資料移動。邏輯儲存設備也能應用之。In addition, if the service level agreement requires a shorter response time or the service provider wants to speed up the operation of the snapshot copy, more than one data move service unit can be used. See Figure 7. An auxiliary material mobile service unit 151 is connected to the system 100. As noted above, the snapshot can be completed for a portion of the source storage device 110, rather than the entire source storage device 110. Please also refer to Figure 8. If the source storage device 110 is divided into two parts, the track 1 to the track 100 and the track 101 Go to track 200. The target storage device 120 is also divided into two parts, a track 1 to a track 100 and a track 101 to a track 200. The data movement service unit 150 is responsible for copying or moving the source material from the track 1 to the track 100 block of the source storage device 110 to the track 1 to the track 100 of the target storage device 120. The auxiliary material mobile service unit 151 is responsible for copying or moving the source material from the source storage device 110 track 101 to the track 200 block to the track 101 of the target storage device 120 to the track 200. Traditionally, track 1 to track 100 of source storage device 110 is defined as a logical unit number device, and track 101 to track 200 of source storage device 110 is defined as another logical unit number device. Thus, the application of the present invention is not limited to data copying or data movement between two physical storage devices. Logical storage devices can also be applied.

依照上述的實施例,應注意的是如果伺服器130在代理模組與路徑輸入輸出模組安裝前,已具有該些模組的某些功能,這些功能應被保留而其它伺服器130沒有的功能則由代理模組與路徑輸入輸出模組來實現。例如,路徑輸入輸出模組能利用伺服器130作業系統中現有的路徑輸入輸出元件,它僅提供供資料移動或複製服務所需的修改及插入功能。同樣地,如果伺服器130或來源與目標儲存設備的作業系統具有相近的複製服務,資料移動服務單元150將利用這本機上的資料複製服務。此外,基於服務商所訂的定義策略,在預期低存取的時間內,可以達成積極的快照複製或最終切換。換句話說,定義策略是一種控制方式,決定資料複製或移動能多有效地完成。資料移動服務單元150在存取量低時,能進一步獲 得來源儲存設備110與目標儲存設備120間的快照複製速度基準。該基準能被送到流量模型單元140以創建行動時間表,並計算每一快照複製的預期完成時間。According to the above embodiment, it should be noted that if the server 130 has some functions of the modules before the proxy module and the path input/output module are installed, these functions should be retained while other servers 130 do not. The function is implemented by the proxy module and the path input and output module. For example, the path input and output module can utilize the existing path input and output elements in the server 130 operating system, which provides only the modification and insertion functions required for data movement or copy services. Similarly, if the server 130 or source has a similar copy service to the operating system of the target storage device, the data movement service unit 150 will utilize the data copy service on the local machine. In addition, based on the defined policy set by the service provider, a positive snapshot copy or a final switch can be achieved during the expected low access time. In other words, defining a strategy is a way of controlling how efficiently data replication or movement can be done. The data movement service unit 150 can further obtain when the access amount is low A snapshot copy speed reference between the source storage device 110 and the target storage device 120. The baseline can be sent to the traffic model unit 140 to create an action schedule and calculate the expected completion time for each snapshot copy.

雖然本發明已以實施方式揭露如上,然其並非用以限定本發明,任何所屬技術領域中具有通常知識者,在不脫離本發明之精神和範圍內,當可作些許之更動與潤飾,因此本發明之保護範圍當視後附之申請專利範圍所界定者為準。Although the present invention has been disclosed in the above embodiments, it is not intended to limit the invention, and those skilled in the art can make some modifications and refinements without departing from the spirit and scope of the invention. The scope of the invention is defined by the scope of the appended claims.

100‧‧‧系統100‧‧‧ system

110‧‧‧來源儲存設備110‧‧‧Source storage equipment

120‧‧‧目標儲存設備120‧‧‧Target storage equipment

130‧‧‧伺服器130‧‧‧Server

140‧‧‧流量模型單元140‧‧‧Flow model unit

150‧‧‧資料移動服務單元150‧‧‧Data Mobile Service Unit

Claims (17)

一種用於達成無干擾性資料重建的系統,包含:一來源儲存設備,包含一來源資料;一目標儲存設備;一伺服器,用以存取在該來源儲存設備中的來源資料與儲存於該目標儲存設備中的任何資料,安裝一代理模組與一路徑輸入輸出模組,其中該代理模組持續蒐集一性能參數值,該性能參數值表達伺服器對來源資料的存取狀態,並在對該來源儲存設備進行快照之後,將來源儲存設備中改變的區塊記錄到一位元圖表中,該位元圖表由該路徑輸入輸出模組所產生;該路徑輸入輸出模組將連結到該伺服器之來源儲存設備的儲存設備名稱脫鉤,並耦接至該目標儲存設備的儲存設備名稱,以便達成該伺服器從來源儲存設備到目標儲存設備連接路徑的切換;一流量模型單元,接收該性能參數值,並能依照該接收的性能參數值,在未來一特定時間點提供一資料存取流量狀態,以依照一定義策略與該資料存取流量狀態,提供一行動時間表;及至少一資料移動服務單元,用以對一部分或整個來源儲存設備中的該來源資料進行快照、將快照範圍內的來源資料複製到該目標儲存設備,及依照該行動時間表, 對該部分或整個來源儲存設備中的該來源資料進行重新的快照,若重新擷取的快照與前一次擷取的快照之間有差異性,將兩者的差異性修改到前一次快照時所複製到該目標儲存設備中的來源資料,並控制該代理模組的運作;其中該代理模組與該流量模型單元與至少一資料移動服務單元通訊;該位元圖表對來源儲存設備中的每一區塊包含值為0或1的一位元;0代表對應區塊中沒有改變而1代表二連續快照間對應區塊中內容有改變;當該位元圖表中1的數量小於或等於一門檻值時,執行切換。 A system for achieving non-interfering data reconstruction, comprising: a source storage device, including a source data; a target storage device; and a server for accessing source data stored in the source storage device and stored in the source Any data in the target storage device, a proxy module and a path input and output module are installed, wherein the proxy module continuously collects a performance parameter value, the performance parameter value expresses an access status of the server to the source data, and After taking a snapshot of the source storage device, the changed block in the source storage device is recorded into a one-dimensional chart, and the bit chart is generated by the path input and output module; the path input and output module is connected to the The storage device name of the source storage device of the server is decoupled and coupled to the storage device name of the target storage device, so as to achieve the switching of the server from the source storage device to the target storage device connection path; a traffic model unit receives the Performance parameter value, and can provide a data access at a specific time point in the future according to the received performance parameter value a quantity status to provide an action schedule in accordance with a defined policy and the data access status; and at least one data movement service unit for taking snapshots of the source material in a portion or the entire source storage device The source data in the copy is copied to the target storage device, and according to the action schedule, Re-snapping the source material in the part or the entire source storage device. If there is a difference between the re-taken snapshot and the previous snapshot, the difference between the two is modified to the previous snapshot. Copying source data to the target storage device and controlling operation of the proxy module; wherein the proxy module communicates with the traffic model unit and at least one data movement service unit; the bit map is for each of the source storage devices A block contains a bit with a value of 0 or 1; 0 means no change in the corresponding block and 1 means change in the corresponding block between two consecutive snapshots; when the number of 1 in the bit chart is less than or equal to one When the threshold is exceeded, a switch is performed. 如申請專利範圍第1項所述之系統,其中該來源儲存設備為硬碟、固態硬碟、隨機存取記憶體,或前述裝置的混合,或為邏輯單元號碼設備,或為一物件儲存資料。 The system of claim 1, wherein the source storage device is a hard disk, a solid state hard disk, a random access memory, or a mixture of the foregoing devices, or a logical unit number device, or an object storage material. . 如申請專利範圍第1項所述之系統,其中該目標儲存設備為硬碟、固態硬碟、隨機存取記憶體,或前述裝置的混合,或為邏輯單元號碼設備,或為一物件儲存資料。 The system of claim 1, wherein the target storage device is a hard disk, a solid state hard disk, a random access memory, or a mixture of the foregoing devices, or a logical unit number device, or an object storage data. . 如申請專利範圍第1項所述之系統,其中該代理模組為軟體、硬體或韌體。 The system of claim 1, wherein the proxy module is a soft body, a hardware or a firmware. 如申請專利範圍第1項所述之系統,其中該路徑輸入輸出模組為軟體、硬體或韌體。 The system of claim 1, wherein the path input/output module is a software, a hardware or a firmware. 如申請專利範圍第1項所述之系統,其中該性能參數為每秒輸入輸出操作次數、延遲時間,及/或流通量。 The system of claim 1, wherein the performance parameter is an input/output operation number per second, a delay time, and/or a throughput. 如申請專利範圍第1項所述之系統,其中該行動時間表指定對該部分或整個來源儲存設備中的該來源資料進行快照的時間、將快照範圍內的來源資料複製到該目標儲存設備的時間、對該部分或整個來源儲存設備中的該來源資料進行重新快照的時間、將重新擷取的快照與前一次擷取的快照進行比對並將兩者的差異性修改到前一次快照時所複製到該目標儲存設備中的來源資料的時間,及當該來源儲存設備預期沒有資料存取或資料存取的量很少時,執行切換的時間。 The system of claim 1, wherein the action schedule specifies a time at which the source material in the portion or the entire source storage device is snapshotted, and the source data in the snapshot range is copied to the target storage device. Time, the time to re-snap the source material in the part or the entire source storage device, compare the retaked snapshot with the previously taken snapshot, and modify the difference between the two to the previous snapshot The time at which the source data is copied to the target storage device and when the source storage device is expected to have no data access or data access is small. 如申請專利範圍第7項所述之系統,其中該少量資料存取的情況定義於一服務層級協議、一服務品質需求,或基於該來源儲存設備或目標儲存設備規範的定義策略中。 The system of claim 7, wherein the small amount of data access is defined in a service level agreement, a quality of service requirement, or a defined policy based on the source storage device or the target storage device specification. 如申請專利範圍第1項所述之系統,其中該定義策略定義一遲滯時間,一旦該行動無法滿足一服務層級協議或一服務品質需求中的需求時,停止將快照範圍內該部分或整個來源儲存設備中的該來源資料複製到該目標儲存設備,及停止將重新擷取的快照與前一次擷取的快照之差異性修改到前一次快照時所複製到該目標儲存設備中的來源資料。 The system of claim 1, wherein the definition policy defines a lag time, and if the action fails to meet a requirement in a service level agreement or a service quality requirement, stopping the part or the entire source within the snapshot range The source data in the storage device is copied to the target storage device, and the difference between the retaken snapshot and the previous snapshot is stopped to be changed to the source data copied to the target storage device in the previous snapshot. 如申請專利範圍第1項所述之系統,其中該定義策略定義一等待時間,一旦該行動無法滿足一服務層級協議或一服務品質需求中的需求時,重試將快照範圍內該部分或整個 來源儲存設備中的該來源資料複製到該目標儲存設備,或重試將重新擷取的快照與前一次擷取的快照之差異性修改到前一次快照時所複製到該目標儲存設備中的來源資料。 The system of claim 1, wherein the definition policy defines a waiting time, and if the action fails to meet a requirement in a service level agreement or a service quality requirement, retrying the part or the whole of the snapshot range Copy the source material from the source storage device to the target storage device, or retry to modify the difference between the retaken snapshot and the previous snapshot to the source copied to the target storage device from the previous snapshot. data. 如申請專利範圍第1項所述之系統,其中該儲存設備名稱包含該來源儲存設備或目標儲存設備的型號名稱、型號,及/或序號。 The system of claim 1, wherein the storage device name includes a model name, a model number, and/or a serial number of the source storage device or the target storage device. 如申請專利範圍第1項所述之系統,其中用以運作該系統的一程序包含步驟:A.該流量模型單元提供該行動時間表;B.該資料移動服務單元指示該來源儲存設備快照一部分或整個來源儲存設備,及該代理模組紀錄該來源儲存設備中改變的區塊到該位元圖表中,該位元圖表初始時皆為0;C.該資料移動服務單元依照該行動時間表,依序複製該快照到目標儲存設備中相同的區塊;D.該資料移動服務單元將該部分或整個來源儲存設備快照的改變,修改到目標儲存設備中的複製的快照;E.重複步驟B至步驟D,直到位元圖表中1的數量小於或等於該門檻值;及F.該資料移動服務單元由該來源儲存設備至該目標儲存設備,執行該伺服器連接路徑的切換,及/或重新 連接至該來源儲存設備。 The system of claim 1, wherein the program for operating the system comprises the steps of: A. the traffic model unit provides the action schedule; B. the data move service unit indicates a portion of the source storage device snapshot Or the entire source storage device, and the agent module records the changed block in the source storage device into the bit chart, the bit chart is initially 0; C. the data mobile service unit is in accordance with the action schedule Copying the snapshot to the same block in the target storage device in sequence; D. the data mobile service unit modifies the change of the partial or entire source storage device snapshot to the copied snapshot in the target storage device; E. repeating the steps B to step D until the number of 1 in the bit map is less than or equal to the threshold value; and F. the data movement service unit from the source storage device to the target storage device performs switching of the server connection path, and/ Or re Connect to the source storage device. 如申請專利範圍第12項所述之系統,其中該門檻值為0。 The system of claim 12, wherein the threshold is zero. 如申請專利範圍第12項所述之系統,其中該程序進一步包含一步驟B1於步驟B與步驟C間,及一步驟G於步驟F之後:B1.如果步驟B到步驟D已被重複至少一次、位元圖表中1的數量小於該門檻值,且該資料存取流量狀態預期在不久將來的一段時間內是低的,以便該資料移動服務單元計算得知快照的複製能於一服務層級協議需求的時間內完成,該資料移動服務單元發動該代理模組以保留所有到該來源儲存設備的資料寫入命令;及G.該資料移動服務單元指示該代理模組釋放所有資料寫入給該目標儲存設備及/或該來源儲存設備。 The system of claim 12, wherein the program further comprises a step B1 between step B and step C, and a step G after step F: B1. if step B to step D have been repeated at least once The number of 1 in the bit chart is less than the threshold, and the data access traffic state is expected to be low for a period of time in the near future, so that the data mobile service unit calculates that the copy of the snapshot can be in a service level protocol. After the required time is completed, the data mobile service unit activates the proxy module to retain all data write commands to the source storage device; and G. the data mobile service unit instructs the proxy module to release all data to be written to the Target storage device and/or the source storage device. 如申請專利範圍第14項所述之系統,其中該門檻值為任何小於10的正整數。 The system of claim 14, wherein the threshold is any positive integer less than 10. 如申請專利範圍第12項所述之系統,其中該程序進一步包含一步驟B2於步驟B與步驟C間,及一步驟E1於步驟E與步驟F間:B2.如果步驟B到步驟D已被重複至少一次,位元圖表中1的數量小於該門檻值,且該資料存取流量狀態預期在不久將來的一段時間內是低的,以便該資料移動服務單元計算得知快照的複製能於一服務層級協議需求的時間內完成,該代理模組通知該資料移動服務單元關於到該來源儲存 設備的新的區塊寫入命令;及E1.該資料移動服務單元執行該新的區塊寫入命令到目標儲存設備中對應的區塊。 The system of claim 12, wherein the program further comprises a step B2 between step B and step C, and a step E1 between step E and step F: B2. if step B to step D have been Repeating at least once, the number of 1 in the bit chart is less than the threshold, and the data access traffic state is expected to be low for a period of time in the near future, so that the data mobile service unit calculates that the copy of the snapshot can be The time required by the service level agreement is completed, the agent module notifies the data mobile service unit about storing to the source a new block write command for the device; and E1. The data move service unit executes the new block write command to the corresponding block in the target storage device. 如申請專利範圍第16項所述之系統,其中該門檻值為任何小於10的正整數。 The system of claim 16, wherein the threshold is any positive integer less than 10.
TW103132081A 2014-09-17 2014-09-17 System for achieving non-interruptive data reconstruction TWI509426B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW103132081A TWI509426B (en) 2014-09-17 2014-09-17 System for achieving non-interruptive data reconstruction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW103132081A TWI509426B (en) 2014-09-17 2014-09-17 System for achieving non-interruptive data reconstruction

Publications (2)

Publication Number Publication Date
TWI509426B true TWI509426B (en) 2015-11-21
TW201612766A TW201612766A (en) 2016-04-01

Family

ID=55220161

Family Applications (1)

Application Number Title Priority Date Filing Date
TW103132081A TWI509426B (en) 2014-09-17 2014-09-17 System for achieving non-interruptive data reconstruction

Country Status (1)

Country Link
TW (1) TWI509426B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI584131B (en) * 2015-12-14 2017-05-21 財團法人工業技術研究院 Server backup method and backup system thereof
CN108958880A (en) * 2017-05-24 2018-12-07 财团法人工业技术研究院 Data processing method, data processing system and computer program product

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130335594A1 (en) * 2012-06-18 2013-12-19 Microsoft Corporation Enhancing captured data
TW201407375A (en) * 2012-08-15 2014-02-16 Alibaba Group Services Ltd Virtual Machine Snapshot Backup Based on Multilayer De-duplication
TW201416881A (en) * 2013-12-09 2014-05-01 Annethen Technology Co Ltd Operation method of distributed memory disk cluster storage system
US20140189688A1 (en) * 2012-12-28 2014-07-03 Hon Hai Precision Industry Co., Ltd. Computing device and computing method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130335594A1 (en) * 2012-06-18 2013-12-19 Microsoft Corporation Enhancing captured data
TW201407375A (en) * 2012-08-15 2014-02-16 Alibaba Group Services Ltd Virtual Machine Snapshot Backup Based on Multilayer De-duplication
US20140189688A1 (en) * 2012-12-28 2014-07-03 Hon Hai Precision Industry Co., Ltd. Computing device and computing method
TW201416881A (en) * 2013-12-09 2014-05-01 Annethen Technology Co Ltd Operation method of distributed memory disk cluster storage system

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI584131B (en) * 2015-12-14 2017-05-21 財團法人工業技術研究院 Server backup method and backup system thereof
US9778997B2 (en) 2015-12-14 2017-10-03 Industrial Technology Research Institute Server backup method and backup system using the method
CN108958880A (en) * 2017-05-24 2018-12-07 财团法人工业技术研究院 Data processing method, data processing system and computer program product

Also Published As

Publication number Publication date
TW201612766A (en) 2016-04-01

Similar Documents

Publication Publication Date Title
US10853139B2 (en) Dynamic workload management based on predictive modeling and recommendation engine for storage systems
US9400611B1 (en) Data migration in cluster environment using host copy and changed block tracking
US8335771B1 (en) Storage array snapshots for logged access replication in a continuous data protection system
US7415488B1 (en) System and method for redundant storage consistency recovery
US7467259B2 (en) System and method to protect data stored in a storage system
US8332687B1 (en) Splitter used in a continuous data protection environment
CA3054040A1 (en) Synchronously replicating datasets and other managed objects to cloud-based storage systems
US20150355862A1 (en) Transparent array migration
US20140115390A1 (en) Power failure management in components of storage area network
US7617259B1 (en) System and method for managing redundant storage consistency at a file system level
JP6064608B2 (en) Storage device, backup program, and backup method
JP2009048497A (en) Storage system having function of changing data storage method using a pair of logical volumes
JP2004259079A (en) Data processing system
US20070294568A1 (en) Storage system and method of managing data using the same
JP4693540B2 (en) Database reconfiguration device and database reconfiguration program
JP2022539981A (en) Storage device mirroring method, device, program
US20080059750A1 (en) Computer system and performance tuning method
US20090037655A1 (en) System and Method for Data Storage and Backup
US7734591B1 (en) Coherent device to device data replication
TWI509426B (en) System for achieving non-interruptive data reconstruction
US20120054739A1 (en) Method and apparatus for deployment of storage functions on computers having virtual machines
US20090177916A1 (en) Storage system, controller of storage system, control method of storage system
US20230353635A1 (en) Replication Utilizing Cloud-Based Storage Systems
US10853203B2 (en) Storage aggregate restoration
US9619493B2 (en) System for achieving non-interruptive data reconstruction