WO2011138886A1 - Fault-tolerant computer system and data access method - Google Patents

Fault-tolerant computer system and data access method

Info

Publication number
WO2011138886A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
processing
node
user
processing nodes
Prior art date
Application number
PCT/JP2011/058576
Other languages
English (en)
Japanese (ja)
Inventor
山本 秀典
加藤 博光
足達 芳昭
Original Assignee
株式会社日立製作所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社日立製作所

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2094 Redundant storage or storage space
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/1658 Data re-synchronization of a redundant component, or initial sync of replacement, additional or spare unit
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/18 Error detection or correction of the data by redundancy in hardware using passive fault-masking of the redundant circuits

Definitions

  • The present invention relates to a fault-tolerant computer system that is composed of a plurality of independent nodes interconnected via a network and in which the same processing is executed in parallel on each of the nodes constituting the system.
  • In particular, the present invention relates to a tool used when a user performs work such as debugging and testing an application program running on a fault-tolerant computer system.
  • Patent Documents 1 to 3 disclose various techniques related to data access processing for a plurality of devices.
  • Patent Document 1 discloses a remote editor support apparatus and method in which, in order to edit a file on a remote workstation, the file is transferred to and stored on the local workstation, edited locally, and then transferred back to the original remote workstation and stored there.
  • Patent Document 2 discloses a method of using data between multiplexed computers in which an area for storing the own computer's data and an area for storing the partner computer's data are prepared in each multiplexed computer and, when copying, using data and mode information (control / standby / ...), the modes of the own computer and the partner computer are compared and the data in the mode designated in advance is adopted.
  • Patent Document 3 discloses a distributed file sharing system and its file access control method for a distributed file system made up of a plurality of terminal devices, each of which manages files that collectively handle data under directories indicating positions in a hierarchical structure. Index information, including access authority information, is shared between the terminals and referred to during search and access in order to minimize processing time.
  • JP-A-9-231155; Japanese Patent Laid-Open No. 2003-162441; JP 2002-132530 A
  • In a fault-tolerant computer that is composed of a plurality of independent nodes interconnected via a network and in which the same processing is executed in parallel on each of the constituent nodes, a plurality of parallel processes are performed to guarantee the operation of the system.
  • The data that applications use for processing must therefore always have the same contents on every node. For this reason, when the user edits this data, care is needed not to introduce differences in contents between nodes.
  • With the technique of Patent Document 1, referring to and editing the same data on the plurality of processing nodes constituting the fault-tolerant computer system requires operating each node and accessing the data individually. This increases the user's workload and can also cause human error.
  • With the technique of Patent Document 2, referring to and editing the same data on the plurality of processing nodes constituting a fault-tolerant computer system requires individually accessing and operating a directory mapped to each node. This increases the user's workload and can also cause human error.
  • data is identified between computers by duplication regardless of data contents, but when executed between processing nodes constituting a fault-tolerant computer system, it differs from data that should have the same contents.
  • An object of the present invention is to provide a data access method in a fault tolerant computer system for editing and the like.
  • The present invention has the following configuration. It is a method in a fault-tolerant computer system that includes a plurality of processing nodes connected via a network and a maintenance node that acquires, edits, and updates data in the plurality of processing nodes, the plurality of processing nodes executing the same processing. When the user refers to or edits the same data held by the plurality of processing nodes, the method comprises the following steps.
  • A step in which the maintenance node acquires the same data from each of the plurality of processing nodes via the network and saves it; a step in which the maintenance node compares the contents of the same data acquired from the plurality of processing nodes and displays the contents to the user via the user terminal based on the comparison result; a step in which the user edits, on the maintenance node via the user terminal, the data acquired from one processing node, namely a representative processing node defined in advance or an operating processing node that matches a condition specified in advance; and a step in which the maintenance node transfers the edited data to the plurality of processing nodes and overwrites and updates the data there.
  • As a result, the user's operation load is reduced and work efficiency is improved, regardless of the complexity of the configuration of the system, software, and so on.
  • FIG. 1 shows the module structure of the processing node which is contained in a fault tolerant computer system and performs the process with respect to the request
  • the maintenance node that is included in the fault-tolerant computer system and performs processing such as data acquisition from the processing node, presentation to the user, editing, overwrite update of the edited data to the processing node, etc.
  • It is a flowchart which shows the flow of the process displayed on a user.
  • It is a flowchart which shows the flow of the process which overwrites and updates.
  • FIG. 1 is a diagram showing an outline of a data access method in a fault tolerant computer system.
  • The main components are: a maintenance node 0211 that is part of the fault-tolerant computer system 0201 and performs processing such as acquiring data from the processing nodes 0212, presenting it to the user, editing it, and overwriting the processing nodes 0212 with the edited data; a plurality of processing nodes 0212 that execute processing in response to external requests in parallel; and a user terminal 0202.
  • the plurality of processing nodes 0212 execute the same processing at the same timing using the same data 0101 held in each node.
  • the contents of data used by the application for processing must be always the same among a plurality of nodes executed in parallel.
  • the data is collected from each processing node 0212 to the maintenance node 0211 (0121).
  • a copy of the corresponding data 0101 is generated at each processing node 0212, transmitted to the maintenance node 0211, and the data is stored on the maintenance node 0211 (0102).
  • the maintenance node 0211 compares the contents of the collected data 0102 (0122), and displays the data contents on the screen 0111 of the user terminal 0202 based on the contents comparison result (0112).
  • If the contents of the data from all the processing nodes 0212 match as a result of the comparison, only the contents of the data acquired from one processing node 0212 are displayed, since they are identical on all processing nodes. That node is either a representative processing node 0212 defined in advance or an operating processing node that matches a condition specified in advance. If the data contents from at least one processing node 0212 differ as a result of the comparison, the data contents of the portions that differ between the processing nodes 0212 are displayed.
  • When the user edits the data 0101 on the processing nodes 0212 via the user terminal 0202, one item is selected (0103) from the data 0102 collected from each processing node 0212 in 0121, and the user is allowed to edit the data 0103 using an editing tool.
  • the edited data 0103 is distributed from the maintenance node 0211 to each processing node 0212, and the contents of the corresponding data 0101 in each processing node 0212 are overwritten and updated (0123).
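The collect, compare, edit, and distribute cycle of FIG. 1 (0121 to 0123) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration only: the processing nodes are modeled as in-memory dictionaries, the path and file contents are hypothetical, and "node1" is arbitrarily taken as the representative node.

```python
def collect(nodes, path):
    """0121: copy the requested data from every processing node."""
    return {name: store[path] for name, store in nodes.items()}


def contents_match(collected):
    """0122: True only when every node returned identical contents."""
    return len(set(collected.values())) == 1


def distribute(nodes, path, edited):
    """0123: overwrite the corresponding data on every processing node."""
    for store in nodes.values():
        store[path] = edited


# Hypothetical stand-ins for three processing nodes (not from the source).
PATH = "/app/settings.conf"
nodes = {
    "node1": {PATH: "timeout=30\n"},
    "node2": {PATH: "timeout=30\n"},
    "node3": {PATH: "timeout=45\n"},
}

collected = collect(nodes, PATH)        # copies held on the maintenance node
matched = contents_match(collected)     # node3 differs, so this is False
# The user edits one selected copy (0103); node1 is assumed to be the source.
edited = collected["node1"].replace("timeout=30", "timeout=60")
distribute(nodes, PATH, edited)         # all nodes now hold the edited data
```

Editing a single copy and distributing it is what keeps the nodes from drifting apart: the user never touches the per-node data directly.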
  • FIG. 2 is a diagram showing an overview of a fault-tolerant computer system that is composed of a plurality of independent nodes interconnected via a network (communication medium) and in which the same processing is executed in parallel on each of the constituent nodes.
  • The main components of the fault-tolerant computer system 0201, which are interconnected via the LAN 0214, are: a maintenance node 0211 that performs processing such as acquiring data from the processing nodes 0212, presenting it to the user, editing it, and overwriting the processing nodes 0212 with the edited data; two or more processing nodes 0212; and a gateway server 0213 that connects to the wide area network 0203 and relays communication with external systems.
  • a user terminal 0202 connected to the maintenance node 0211 via the external network 0205 is used for information presentation for users, various operations, and the like.
  • The fault-tolerant computer system 0201 receives a request from the external system 0204, with which it can communicate via the wide area network 0203, performs processing on the request, and provides a service by returning the result as a response to the external system 0204.
  • the input message 0231 is received as a request from the external system 0204, and an output message 0241 storing the processing result for the request is transmitted to the external system 0204.
  • The gateway server 0213, having received the input message 0231 as a request from the external system 0204 via the wide area network 0203, forwards it as an input message 0232 to all the processing nodes 0212 in the system via the LAN 0214. Here, the input message 0232 is broadcast so that all the processing nodes 0212 receive the message at almost the same time and can start processing it. Each processing node 0212 that has received the input message 0232 executes processing for the input message 0232 and transmits an output message 0242 storing the processing result to the gateway server 0213 via the LAN 0214.
  • the gateway server 0213 that has received the output message 0242 from each processing node 0212 creates an output message 0241 as a response to the requesting external system 0204, and transmits the output message 0241 to the external system 0204.
  • The gateway server 0213 compares and collates the data contents of the one or more output messages 0242 received from the processing nodes 0212, determines which are correct, and sends the correct message data to the requesting external system 0204 as the output message 0241.
  • In the comparison, collation, and correctness determination of the data contents of the output messages 0242, the output messages 0242 whose data contents match in the largest number are regarded as the correct message data, and one of these correct output messages 0242 is sent to the external system 0204 as the output message 0241.
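The correctness determination at the gateway server amounts to a majority vote over the output messages 0242. A minimal sketch follows, assuming each output message can be treated as an opaque byte string. The function name `select_output` and the sample payloads are hypothetical, and tie handling is not specified in the text above; in this sketch a tie resolves to whichever payload was received first.

```python
from collections import Counter


def select_output(messages):
    """Return the payload shared by the largest number of output messages."""
    if not messages:
        raise ValueError("no output messages received")
    # most_common(1) yields the (payload, count) pair with the highest count.
    payload, _count = Counter(messages).most_common(1)[0]
    return payload


# Hypothetical output messages from three processing nodes; one is faulty.
outputs = [b"result=42", b"result=42", b"result=41"]
chosen = select_output(outputs)  # the payload two of the three nodes agree on
```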
  • the maintenance node 0211 does not execute any online processing executed in response to the request from the external system 0204.
  • Instead, when a user who has logged in to the maintenance node 0211 from the user terminal 0202 refers to or edits data in the processing nodes 0212, the maintenance node 0211 acquires the data from each processing node 0212 and stores it, transfers the edited data to each processing node 0212, performs the overwrite update, and so on.
  • the main hardware configuration of the maintenance node 0211 includes a processing device (CPU) 0221, a storage device (memory, hard disk) 0222, and a communication device 0223.
  • The storage device 0222 stores: a software program for acquiring data from the processing nodes 0212 and storing it in a designated area in the storage device 0222; a software program for comparing the contents of the data acquired from the processing nodes 0212 and displaying a screen that presents it to the user; a software program for editing the data acquired from the processing nodes 0212 and stored; a software program for transferring the data edited by the user to the processing nodes 0212 for overwrite update; a software program for communication between the maintenance node 0211 and the processing nodes 0212 via the LAN 0214; and the like. These software programs are executed by the processing device 0221.
  • The communication device 0223 performs communication processing when data is acquired from the processing nodes 0212, communication processing for the input and output of screens for logins from the user terminal 0202, and the like.
  • the processing device (CPU) 0221 reads a program necessary for processing from the storage device 0222 such as a hard disk and executes each processing. It also controls operations of the storage device 0222 and the communication device 0223. In addition, the processing of the maintenance node 0211 in the flowcharts and sequence diagrams described in the drawings to be described later is assumed to be executed by the processing device (CPU) 0221 unless otherwise specified.
  • the main hardware configuration of the processing node 0212 includes a processing device (CPU) 0224, a storage device (memory, hard disk) 0225, and a communication device 0226.
  • The storage device 0225 stores: the data necessary for executing the service that the fault-tolerant computer system 0201 provides to the external system 0204; a user program that processes requests from the external system 0204 for that service; a software program for communication with the gateway server 0213 and between the processing node 0212 and the maintenance node 0211; and the like. These software programs are executed by the processing device 0224.
  • the communication device 0226 receives the input message 0232 from the gateway server 0213 and performs communication processing for transmitting the output message 0242 to the gateway server 0213.
  • processing device (CPU) 0224 reads a program necessary for processing from the storage device 0225 such as a hard disk, executes each processing, and controls operations of the storage device 0225 and the communication device 0226.
  • processing of the processing node 0212 in the flowcharts and sequence diagrams described in the drawings to be described later is assumed to be executed by the processing device (CPU) 0224 unless otherwise specified.
  • the main hardware configuration of the user terminal 0202 includes a processing device (CPU), a storage device (hard disk), and a communication device.
  • The storage device stores a software program for logging in to the maintenance node 0211, performing command operations, displaying screens, and the like, a software program for communication between the user terminal 0202 and the maintenance node 0211, and the like. These software programs are executed by the processing device.
  • The communication device performs communication processing for the input and output of screens for logging in to the maintenance node 0211.
  • The processing device reads the programs necessary for processing from the storage device, such as a hard disk, and executes each process. It also controls the operation of the storage device and the communication device. The processing of the user terminal 0202 in the flowcharts and sequence diagrams described later is executed by the processing device (CPU) unless otherwise specified.
  • FIG. 3 is a diagram showing an overall processing sequence between a plurality of nodes constituting the fault tolerant computer system when the data access method in the fault tolerant computer system is executed.
  • Main components are a user terminal 0202, a maintenance node 0211, a processing node 1 (0212a), a processing node 2 (0212b), and a processing node 3 (0212c) that constitute the fault-tolerant computer system 0201.
  • the user terminal 0202 logs in to the maintenance node 0211 by a user operation.
  • data to be accessed by the user is designated by the user's operation.
  • Processing node 1 (0212a) extracts the corresponding data in 0321 and transmits it to the maintenance node 0211.
  • Processing node 2 (0212b) extracts the corresponding data in 0331 and transmits it to the maintenance node 0211.
  • When the corresponding data designated by the user is requested from processing node 3 (0212c), processing node 3 (0212c) extracts the corresponding data in 0341 and transmits it to the maintenance node 0211. In the same way, the processing of 0311 to 0313 is performed for all the processing nodes 0212 constituting the fault-tolerant computer system 0201, so that the corresponding data designated by the user is acquired from all the processing nodes 0212.
  • The data acquired in 0311 to 0313 is stored in a storage area (hard disk or the like) in the maintenance node 0211.
  • The contents of the data acquired from the processing nodes 0212 in 0311 to 0313 are compared.
  • the data content is displayed on the user terminal 0202. Details of the processing and display method in 0315 and 0316 will be described with reference to FIGS.
  • the user terminal 0202 starts data editing for the maintenance node 0211 by a user operation.
  • the editing tool is activated at 0317 and the corresponding data to be edited is opened. Thereafter, data editing work by the user is performed.
  • data editing for the maintenance node 0211 is terminated by the user's operation.
  • The processing nodes 0212 to be updated with the corresponding data edited by the user (all processing nodes or individual processing nodes) are determined.
  • the data edited by the user is distributed to the processing node 0212 to be updated determined in 0318.
  • FIG. 4 is a diagram showing the module configuration of the maintenance node, which is included in the fault-tolerant computer system and performs processing such as acquiring data from the processing nodes, presenting it to the user, editing it, and overwriting the processing nodes with the edited data.
  • The maintenance node 0211 includes: a data access management unit 0401 that performs processing such as acquiring data from the processing nodes 0212, presenting it to the user, managing editing by the user, and overwriting the processing nodes 0212 with the edited data; a hard disk 0402 that stores the data acquired from the processing nodes 0212; and an editing tool 0403 with which the user edits the data.
  • the data access management unit 0401 is a software program executed by the processing device (CPU) 0221.
  • The main components of the data access management unit 0401 are: a data acquisition / storage unit 0411 that, based on the designation from the user via the screen display unit 0415, acquires data from the processing nodes 0212 via the data communication unit 0417 and stores it on the hard disk 0402; a data content comparison unit 0412 that compares the contents of the data that the data acquisition / storage unit 0411 acquired from the plurality of processing nodes 0212; a data editing management unit 0413 that, based on the designation from the user via the screen display unit 0415, monitors the activation and termination of the editing tool 0403; a data distribution / update unit 0414 that distributes the data edited by the user with the editing tool 0403 to the processing nodes 0212 and overwrites the corresponding data; and a screen display unit 0415 that accepts remote connections from the user terminal 0202 via the communication medium 0205 and provides a command input screen, the result output screen of the data content comparison unit 0412, the screen of the editing tool 0403, and the like.
  • the profile information 0421 defines, for each data held by the processing node 0212, data attribute information, access authority, presence / absence of possession at other processing nodes, necessity / unnecessity of data content matching between processing nodes, and the like.
  • the data content comparison unit 0412, the data distribution / update unit 0414, and the like refer to these in each process. Details of the profile information 0421 will be described in FIG.
  • FIG. 5 is a diagram showing a module configuration of a processing node included in the fault-tolerant computer system and executing processing for an external request.
  • The processing node 0212 includes: a data access management unit 0501 that performs processing such as acquiring designated data and transmitting it to the maintenance node 0211; a user program 0502 that performs various processing in response to requests from the external system 0204; middleware 0503; and a hard disk 0504 and a memory 0505, which store the data referenced and updated by the user program 0502 and the middleware 0503.
  • the data access management unit 0501 is a software program executed by the processing device (CPU) 0224.
  • The main components of the data access management unit 0501 are: a data management unit 0511 that receives a data acquisition request from the maintenance node 0211 via the data communication unit 0513, extracts the relevant data from the hard disk 0504 or the memory 0505, and sends it to the maintenance node 0211, and that receives a data overwrite update request from the maintenance node 0211 and overwrites the corresponding data; a data update monitoring unit 0512 that monitors updates, by programs or the like in the processing node 0212, to the original of the data that the data management unit transmitted to the maintenance node 0211; and a data communication unit 0513 that communicates with the maintenance node 0211 and other nodes in the system via the communication medium 0214.
  • FIG. 6 is a flowchart showing the flow of processing in which the maintenance node, which is included in the fault-tolerant computer system and performs processing such as acquiring data from the processing nodes, presenting it to the user, editing it, and overwriting the processing nodes with the edited data, displays the data acquired from the processing nodes to the user.
  • a data path of data to be accessed is acquired from an operation by the user via the user terminal 0202.
  • data corresponding to the data path acquired in 0601 is requested to the processing node 0212 and acquired.
  • the data acquired from the processing node 0212 in 0602 is stored in a storage area (such as a hard disk) in the maintenance node 0211.
  • the processing of 0602 and 0603 is repeated. If the data acquisition from all the processing nodes 0212 has been completed in 0604, the contents of the same data acquired from the processing nodes 0212 in 0602 to 0604 are compared in 0605.
  • a general tool such as “diff” may be used to compare the data contents.
  • If the contents match, the profile information of the corresponding data is referred to in 0607.
  • If the "content match" item of the profile information of the corresponding data indicates that the contents should not match between the processing nodes 0212 (NO), a warning is displayed to the user indicating that the contents match even though they should not.
  • The contents of the data acquired from one processing node 0212 are then displayed, since they match on all processing nodes. That node is either a representative processing node 0212 defined in advance or an operating processing node that matches a condition specified in advance.
  • If the contents do not match, the profile information of the corresponding data is referred to in 0611.
  • If the "content match" item of the profile information of the corresponding data indicates that the contents need to match between the processing nodes 0212 (YES), a warning is displayed to the user in 0613 indicating that the contents do not match even though they need to.
  • the data contents of a portion having a difference between the processing nodes 0212 are displayed. Details of the display method here are shown in FIG. 10B.
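The display decision of FIG. 6 (compare the collected contents, consult the "content match" item of the profile, then show either one copy or only the differing portions) can be sketched in Python. The text above mentions a general tool such as "diff"; this sketch substitutes the standard-library `difflib.unified_diff`, which is an assumption rather than the tool used by the system. The function name `present`, the warning strings, and the sample data are hypothetical.

```python
import difflib


def present(collected, content_match_required, representative):
    """Return (warning, lines_to_display) for data collected from the nodes."""
    contents = list(collected.values())
    if all(c == contents[0] for c in contents):
        # Contents match: warn only if the profile says they should not.
        warning = None if content_match_required else (
            "contents match although the profile says they should not")
        return warning, collected[representative].splitlines()

    # Contents differ: warn only if the profile says they need to match.
    warning = ("contents differ although the profile says they need to match"
               if content_match_required else None)
    base = collected[representative].splitlines()
    lines = []
    for node, text in collected.items():
        if node == representative:
            continue
        # unified_diff yields nothing for nodes identical to the representative.
        lines.extend(difflib.unified_diff(
            base, text.splitlines(),
            fromfile=representative, tofile=node, lineterm=""))
    return warning, lines


# Hypothetical data collected from three processing nodes.
collected = {"node1": "a=1\nb=2", "node2": "a=1\nb=2", "node3": "a=1\nb=3"}
warning, lines = present(collected, content_match_required=True,
                         representative="node1")
```

Diffing every node against one representative keeps the output proportional to the number of divergent nodes, which fits the goal of showing only the portions that differ.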
  • FIG. 7 is a flowchart showing the flow of processing in which the maintenance node, which is included in the fault-tolerant computer system and performs processing such as acquiring data from the processing nodes, presenting it to the user, editing it, and overwriting the processing nodes with the edited data, overwrites and updates the processing nodes with the data edited by the user.
  • an instruction to start editing of data is received from an operation by the user via the user terminal 0202.
  • If the contents of the data from all the processing nodes 0212 match, then in 0703 the data acquired from one processing node 0212 is determined as the data to be edited. That node is either a representative processing node 0212 defined in advance or an operating processing node that matches a condition specified in advance.
  • If, as a result of comparing the data contents acquired from the processing nodes 0212, the data contents from at least one processing node 0212 differ, then in 0704 the data acquired from the processing node 0212 selected by the user is determined as the data to be edited (here, a screen is displayed that lets the user select the processing node to use as the data source).
  • the editing tool is activated and the data determined in 0703 or 0704 is opened.
  • The editing tool is defined in advance for each type of data to be edited. For example, a text editor is used for a text file.
  • In 0706, termination of the editing tool started in 0705 is monitored. If termination is not detected and an update of the corresponding data is detected on the processing node 0212 side in 0707, the updated data is transmitted to the maintenance node 0211.
  • The user is then presented with a choice of whether to replace the data being edited with the data updated on the processing node 0212, and the replacement is executed or skipped according to the user's selection. Thereafter, the processing returns to 0706. If the processing node 0212 has not detected an update of the corresponding data in 0707, the processing also returns to 0706. If termination of the editing tool is detected in 0706, then in 0709 the user is asked to select the processing nodes 0212 that are the overwrite update destinations of the edited data.
  • The profile information of the corresponding data is referred to in 0710.
  • If the "content match" item of the profile information of the corresponding data indicates that the contents need to match between the processing nodes 0212 (YES), the data edited by the user is sent to all the processing nodes 0212 and overwrites the corresponding data there.
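The choice of overwrite destinations at the end of FIG. 7 can be sketched as a small function. The text above only states what happens when "content match" is YES (every processing node is updated). The NO branch in this sketch, which updates only the nodes the user selected in 0709, is an assumption, and the function and node names are hypothetical.

```python
def update_targets(all_nodes, user_selected, content_match_required):
    """Return the processing nodes that receive the edited data."""
    if content_match_required:
        # "content match" is YES: every node must hold the same contents.
        return list(all_nodes)
    # Assumed behavior for NO: honor the user's selection from 0709.
    return [node for node in all_nodes if node in user_selected]


all_nodes = ["node1", "node2", "node3"]
targets_yes = update_targets(all_nodes, {"node2"}, content_match_required=True)
targets_no = update_targets(all_nodes, {"node2"}, content_match_required=False)
```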
  • FIG. 8 is a diagram showing the format of the profile information that is managed by the maintenance node, which is included in the fault-tolerant computer system and performs processing such as acquiring data from the processing nodes, presenting it to the user, editing it, and overwriting the processing nodes with the edited data. The profile information defines data attribute information, access authority, whether the data is also held by other processing nodes, whether the data contents need to match between processing nodes, and the like.
  • Main components of profile information are a type 0801, a data path 0802, an access authority 0803, an access content 0804, another node possession 0805, and a content match 0806. Setting values corresponding to these items are specified for each data.
  • the type 0801 describes the type of data.
  • the data path 0802 describes information for specifying the location of data. For example, if the type 0801 is a file, the file path is described, and if the type 0801 is memory data, the address and size are described.
  • the access authority 0803 describes the authority of the user who is permitted to access the corresponding data.
  • the access content 0804 describes the access content that can be executed by the user permitted by the access authority 0803. In the example of the figure, “R” (reading is possible), “W” (writing is possible), and the like are described.
  • The other node possession 0805 describes whether or not the relevant data is also held by other processing nodes 0212 (YES / NO).
  • The content match 0806 describes whether or not the data contents need to match between the processing nodes 0212 (YES / NO) when the corresponding data is held by other processing nodes 0212.
  • The profile information is created for each processing node 0212 in the form of a file or the like, and is stored and managed by the maintenance node 0211.
  • The profile information is referred to when executing processing such as comparing and displaying the contents of the same data acquired from a plurality of processing nodes 0212 and overwriting a plurality of processing nodes 0212 with the data edited by the user.
  • the profile information is created in advance by the administrator or user of the fault tolerant computer system and is updated as appropriate.
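One way to picture a profile information entry is as a record with one field per item 0801 to 0806. The sketch below is illustrative only: the field names, the encoding of access content as a string of "R" and "W" flags, the helper `can_write`, and all sample values are assumptions, not the format defined in FIG. 8.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProfileEntry:
    type: str                    # 0801: kind of data, e.g. file or memory data
    data_path: str               # 0802: file path, or address and size
    access_authority: str        # 0803: user authority allowed to access
    access_content: str          # 0804: permitted access, e.g. "R" or "RW"
    other_node_possession: bool  # 0805: also held by other processing nodes
    content_match: bool          # 0806: contents need to match across nodes

    def can_write(self, authority: str) -> bool:
        """True when the given authority may overwrite this data."""
        return authority == self.access_authority and "W" in self.access_content


# Hypothetical entry for a configuration file that must be identical everywhere.
entry = ProfileEntry(
    type="file",
    data_path="/app/settings.conf",
    access_authority="admin",
    access_content="RW",
    other_node_possession=True,
    content_match=True,
)
```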
  • FIG. 9 is a flowchart showing the flow of processing in which a processing node, which is included in the fault-tolerant computer system and executes processing in response to external requests, performs data acquisition, update monitoring of the data being edited, data overwrite update, and the like.
  • A data acquisition request is received from the maintenance node 0211.
  • The data corresponding to the request in 0901 is acquired in the processing node 0212 and transmitted to the maintenance node 0211.
  • The corresponding data acquired in 0902 is duplicated, and the duplicate is stored in a temporary storage area (a temporary-use directory on the hard disk, etc.).
  • Monitoring is started for the data requested by the maintenance node 0211, covering updates by programs in the processing node 0212 and updates of the corresponding data by other users.
  • The updated data is transmitted to the maintenance node 0211 together with an update notification in 0906. Thereafter, the processing returns to 0905.
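The acquire, duplicate, and monitor flow on the processing node can be sketched as follows. This is an illustrative model under stated assumptions, not the patented implementation: the class `ProcessingNodeSketch` and its methods are hypothetical, in-memory dictionaries stand in for the node's files and its temporary storage area, and update detection is modeled as a simple comparison against the stored duplicate.

```python
from typing import Dict, Optional, Tuple


class ProcessingNodeSketch:
    """Hypothetical model of one processing node's side of the flow."""

    def __init__(self, name: str, data: Dict[str, bytes]) -> None:
        self.name = name
        self.data = dict(data)                  # live data on the node
        self.snapshots: Dict[str, bytes] = {}   # stands in for the temporary storage area

    def handle_acquire(self, path: str) -> bytes:
        """Acquire the requested data, keep a duplicate, and return the
        content that would be sent to the maintenance node."""
        content = self.data[path]
        self.snapshots[path] = content
        return content

    def poll_update(self, path: str) -> Optional[Tuple[str, bytes]]:
        """Return an update notification with the new content if the
        monitored data changed since the last duplicate, else None."""
        if path not in self.snapshots:
            return None  # data was never requested, so it is not monitored
        current = self.data[path]
        if current == self.snapshots[path]:
            return None
        self.snapshots[path] = current  # refresh the duplicate
        return ("update", current)
```

Calling `poll_update` repeatedly corresponds to the loop back to the monitoring step; a real node would more likely react to file-system or program events than poll.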
  • FIG. 10A is a diagram showing an example of the screen that presents the data acquired from the processing nodes to the user when the data contents match between the processing nodes.
  • The screen display 1001 is shown when the contents of the data from all the processing nodes 0212 match, as a result of comparing the contents of the same data acquired from the plurality of processing nodes 0212 shown in FIG.
  • The data on the processing node 0212 is displayed as the output result of the command operation designated by the user.
  • Attribute information (type and name) of the acquired data and the content comparison result (in the example of the figure, the data from all the processing nodes 0212 match) are displayed.
  • The contents of the acquired data are displayed as they are.
  • FIG. 10B is a diagram showing an example of the screen that presents the data acquired from the processing nodes to the user when the data contents differ between the processing nodes.
  • The screen display 1002 is shown when the data contents from at least one processing node 0212 differ, as a result of comparing the contents of the same data acquired from the plurality of processing nodes 0212 shown in FIG.
  • The data on the processing node 0212 is displayed as the output result of the command operation designated by the user.
  • Attribute information (type and name) of the acquired data and the content comparison result (in the example of the figure, the data from one processing node 0212 differs) are displayed.
  • The portions of the data that differ between the one processing node 0212 having the difference and the other processing nodes 0212 are displayed in a comparison format.
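The comparison behind the two screen displays can be illustrated with a short sketch. It assumes that the content held by the largest number of nodes serves as the reference and that differences are rendered as a unified diff; the source states only that the differing portions are shown in a comparison format, so both choices, and the function name `compare_node_data`, are hypothetical.

```python
import difflib
from collections import Counter
from typing import Dict, List


def compare_node_data(contents: Dict[str, str]) -> Dict[str, List[str]]:
    """Map each differing node to its diff against the reference content.

    An empty result corresponds to the "all nodes match" display; a
    non-empty result corresponds to the "difference found" display.
    """
    # Assumption: the most common content is treated as the reference.
    reference, _ = Counter(contents.values()).most_common(1)[0]
    diffs: Dict[str, List[str]] = {}
    for node, text in contents.items():
        if text != reference:
            diffs[node] = list(difflib.unified_diff(
                reference.splitlines(), text.splitlines(),
                fromfile="reference", tofile=node, lineterm=""))
    return diffs


matching = compare_node_data({"a": "x=1", "b": "x=1", "c": "x=1"})
differing = compare_node_data({"a": "x=1\ny=2", "b": "x=1\ny=2", "c": "x=1\ny=3"})
```

Here `matching` is empty, while `differing` contains a single entry for node `c` whose diff marks `y=2` as removed and `y=3` as added.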
  • The data access method in the fault-tolerant computer system according to the present invention is particularly useful when the user performs work such as debugging and testing an application program running on the fault-tolerant computer system 0201.
  • With the method according to the prior art, the user had to access the processing nodes 0212 one at a time and update the same file with the same contents on each. The user also had to check whether the contents of the corresponding file were the same in all the processing nodes 0212. If the contents of the corresponding file differ between the processing nodes 0212, the application program may operate differently between the processing nodes 0212, the fault-tolerant computer system 0201 may enter an abnormal state, and testing and debugging may not be performed correctly.
  • With the present method, the user only has to open the file once and update its contents, and the contents in all the processing nodes 0212 are guaranteed to match. Furthermore, updating of node-specific information such as an IP address is restricted so that it does not become identical across all the processing nodes 0212. Therefore, even when the user updates a file, the application program does not operate differently between the processing nodes 0212 as described above.
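The single-edit overwrite and the restriction on node-specific data can be sketched as below. This assumes that the restriction is driven by the content match item of the profile information, which the source suggests but does not state outright; the function `overwrite_all` and the dictionary-based node stores are hypothetical stand-ins.

```python
from typing import Dict


def overwrite_all(nodes: Dict[str, Dict[str, str]], path: str,
                  edited: str, content_match: bool) -> bool:
    """Write the edited content to every node, or refuse the update.

    Assumption: data whose contents need not match between nodes
    (for example an IP address setting) is treated as node-specific,
    so a bulk overwrite is rejected and nothing is changed.
    """
    if not content_match:
        return False
    for store in nodes.values():
        store[path] = edited
    return True


nodes = {
    "node-a": {"app.conf": "x=1", "ip.conf": "10.0.0.1"},
    "node-b": {"app.conf": "x=0", "ip.conf": "10.0.0.2"},
}
shared_ok = overwrite_all(nodes, "app.conf", "x=2", content_match=True)
specific_ok = overwrite_all(nodes, "ip.conf", "10.0.0.9", content_match=False)
```

After these calls both nodes hold the same `app.conf`, while each keeps its own `ip.conf`.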

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)
  • Hardware Redundancy (AREA)

Abstract

In a fault-tolerant computer system, a method is provided for a user to execute commands or edit data on a node constituting the system without being aware of the system configuration or the arrangement of the data. When the user needs to refer to or edit the same data held by a plurality of processing nodes constituting the fault-tolerant computer system, the applicable data is collected from each of the processing nodes in a maintenance node, its contents are compared, and the contents are displayed based on the comparison result. When the user edits the data, the maintenance node transmits the edited data to the plurality of processing nodes and overwrites and updates all the data at the same time.
PCT/JP2011/058576 2010-05-06 2011-04-05 Fault-tolerant computer system and data access method WO2011138886A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2010-106089 2010-05-06
JP2010106089A JP5331749B2 (ja) 2010-05-06 2010-05-06 Fault-tolerant computer system and data access method

Publications (1)

Publication Number Publication Date
WO2011138886A1 true WO2011138886A1 (fr) 2011-11-10

Family

ID=44903738

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2011/058576 WO2011138886A1 (fr) 2010-05-06 2011-04-05 Fault-tolerant computer system and data access method

Country Status (2)

Country Link
JP (1) JP5331749B2 (fr)
WO (1) WO2011138886A1 (fr)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08314744A (ja) * 1995-05-18 1996-11-29 Hitachi Ltd Fault-tolerant system
JPH09231155A (ja) * 1996-02-27 1997-09-05 Nippon Denki Joho Service Kk Remote editor support device and method

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH02240730A (ja) * 1989-03-14 1990-09-25 Nec Corp Information processing system
JP4070398B2 (ja) * 2000-10-25 2008-04-02 中部電力株式会社 Method for using data between multiplexed computers
JP4186456B2 (ja) * 2001-11-28 2008-11-26 沖電気工業株式会社 Distributed file sharing system and control method thereof

Also Published As

Publication number Publication date
JP2011237846A (ja) 2011-11-24
JP5331749B2 (ja) 2013-10-30

Similar Documents

Publication Publication Date Title
US9971823B2 (en) Dynamic replica failure detection and healing
US8069144B2 (en) System and methods for asynchronous synchronization
US7155462B1 (en) Method and apparatus enabling migration of clients to a specific version of a server-hosted application, where multiple software versions of the server-hosted application are installed on a network
JP5360978B2 (ja) File server and file operation notification method in a file server
JP4473153B2 (ja) Method, system, and program for checking and repairing a network configuration
US7716319B2 (en) Computer system and method for managing log information
US20150263909A1 (en) System and method for monitoring a large number of information processing devices in a communication network
JP5698429B2 (ja) Computer system for managing components, and method and computer program therefor
US20140379656A1 (en) System and Method for Maintaining a Cluster Setup
US20150081773A1 (en) High availability architecture for a cloud-based concurrent-access collaboration platform
US20080263082A1 (en) Recovery segment identification in a computing infrastructure
US20070088763A1 (en) Methods and systems for validating accessibility and currency of replicated data
JP2004227359A (ja) Policy-based operation management method for storage systems
US6820136B1 (en) System and method for replicating monitored registry keys
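JP2001188765A (ja) Technique for referencing fault information representing multiple related faults in a distributed computing environment
CN112491606A (zh) Method for automated deployment of a high-availability cluster for a business system based on Ansible
JP2015506112A (ja) Redirecting requests to a second location during a temporary outage
CN112035062B (zh) Migration method for local storage in cloud computing, computer device, and storage medium
JP5239072B2 (ja) Computer system for managing components, and method and computer program therefor
CN109445988A (zh) Heterogeneous disaster recovery method, device, system, server, and disaster recovery platform
JP5331749B2 (ja) Fault-tolerant computer system and data access method
JP2020119207A (ja) Database management service providing system
CN114760314A (zh) Server management method, apparatus, computer device, and storage medium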
US7644306B2 (en) Method and system for synchronous operation of an application by a purality of processing units
EP2126701A1 (fr) Data management in a data storage system using data sets

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11777405

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11777405

Country of ref document: EP

Kind code of ref document: A1