CN107480004A - Fault recovery method, device and computer equipment - Google Patents

Fault recovery method, device and computer equipment Download PDF

Info

Publication number
CN107480004A
CN107480004A CN201710626392.6A CN201710626392A CN107480004A CN 107480004 A CN107480004 A CN 107480004A CN 201710626392 A CN201710626392 A CN 201710626392A CN 107480004 A CN107480004 A CN 107480004A
Authority
CN
China
Prior art keywords
data base
local data
mentioned
network
main frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710626392.6A
Other languages
Chinese (zh)
Other versions
CN107480004B (en
Inventor
马云存
纪勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Neusoft Corp
Original Assignee
Neusoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Neusoft Corp filed Critical Neusoft Corp
Priority to CN201710626392.6A priority Critical patent/CN107480004B/en
Publication of CN107480004A publication Critical patent/CN107480004A/en
Application granted granted Critical
Publication of CN107480004B publication Critical patent/CN107480004B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3034Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a storage system, e.g. DASD based or network based
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application proposes a kind of fault recovery method, device and computer equipment, wherein, above-mentioned fault recovery method includes:The network-in-dialing situation of main frame where monitoring local data base;If the network-in-dialing of main frame where the local data base, obtain the running situation of the local data base;According to the running situation of the local data base, when the running status of the local data base occurs abnormal, the abnormality run by database High-Available Middleware to the local data base is recovered.The application can realize performs database recovery action automatically under various abnormalities, avoids manual operation, improves the promptness of fault recovery.

Description

Fault recovery method, device and computer equipment
Technical field
The application is related to Computer Applied Technology field, more particularly to a kind of fault recovery method, device and computer are set It is standby.
Background technology
Database High-Available Middleware (Pgpool II) has been realized in ORD (PostgreSQL) base The high-availability cluster of this active-standby switch pattern, but when event occurs for the main frame where above-mentioned database or above-mentioned database itself Barrier delays the network that machine or above-mentioned main frame are connected when interrupting, and Pgpool II can not recover database automatically.
In existing correlation technique, when running into said circumstances, it is necessary to manually participate in carrying out fault recovery, cause fault recovery Promptness it is poor.
The content of the invention
The application is intended to one of technical problem at least solving in correlation technique to a certain extent.
Therefore, first purpose of the application is to propose a kind of fault recovery method, to realize in various abnormalities The lower automatic database recovery that performs acts, and avoids manual operation, improves the promptness of fault recovery.
Second purpose of the application is to propose a kind of local fault recovery device.
The 3rd purpose of the application is to propose a kind of computer equipment.
The 4th purpose of the application is to propose a kind of non-transitorycomputer readable storage medium.
The 5th purpose of the application is to propose a kind of computer program product.
For the above-mentioned purpose, the application first aspect embodiment proposes a kind of fault recovery method, including:Monitoring is local The network-in-dialing situation of main frame where database;If the network-in-dialing of main frame where the local data base, described in acquisition The running situation of local data base;According to the running situation of the local data base, when the running status of the local data base When occurring abnormal, the abnormality run by database High-Available Middleware to the local data base is recovered.
In the fault recovery method of the embodiment of the present application, monitoring local data base where main frame network-in-dialing situation it Afterwards, if the network-in-dialing of main frame where above-mentioned local data base, the running situation of above-mentioned local data base is obtained, according to upper The running situation of local data base is stated, when the running status of above-mentioned local data base occurs abnormal, passes through database High Availabitity The abnormality that middleware is run to above-mentioned local data base is recovered, automatic under various abnormalities so as to realize Database recovery action is performed, manual operation is avoided, improves the promptness of fault recovery, and then data-base cluster can be improved Availability.
For the above-mentioned purpose, the application second aspect embodiment proposes a kind of local fault recovery device, including:Monitor mould Block, the network-in-dialing situation for main frame where monitoring local data base;Acquisition module, for determining institute when the monitoring modular Where stating local data base during the network-in-dialing of main frame, the running situation of the local data base is obtained;Recovery module, for root The running situation of the local data base obtained according to the acquisition module, when the running status appearance of the local data base is different Chang Shi, the abnormality run by database High-Available Middleware to the local data base are recovered.
In the local fault recovery device of the embodiment of the present application, the network-in-dialing of main frame where monitoring module monitors local data base After situation, if the network-in-dialing of main frame where above-mentioned local data base, acquisition module obtain above-mentioned local data base Running situation, then recovery module is according to the running situation of above-mentioned local data base, when the running status of above-mentioned local data base When occurring abnormal, the abnormality run by database High-Available Middleware to above-mentioned local data base is recovered, so as to Can realize under various abnormalities and to perform database recovery action automatically, avoid manual operation, improve fault recovery and Shi Xing, and then the availability of data-base cluster can be improved.
For the above-mentioned purpose, the application third aspect embodiment proposes a kind of computer equipment, including memory, processing Device and the computer program that can be run on the memory and on the processor is stored in, meter described in the computing device During calculation machine program, method as described above is realized.
To achieve these goals, the application fourth aspect embodiment proposes a kind of computer-readable storage of non-transitory Medium, is stored thereon with computer program, and the computer program realizes method as described above when being executed by processor.
To achieve these goals, the aspect embodiment of the application the 5th proposes a kind of computer program product, when described When instruction in computer program product is by computing device, method as described above is realized.
The aspect and advantage that the application adds will be set forth in part in the description, and will partly become from the following description Obtain substantially, or recognized by the practice of the application.
Brief description of the drawings
The above-mentioned and/or additional aspect of the application and advantage will become from the following description of the accompanying drawings of embodiments Substantially and it is readily appreciated that, wherein:
Fig. 1 is the flow chart of the application fault recovery method one embodiment;
Fig. 2 is the flow chart of another embodiment of the application fault recovery method;
Fig. 3 is the flow chart of the application fault recovery method further embodiment;
Fig. 4 is the flow chart of the application fault recovery method further embodiment;
Fig. 5 is the flow chart of the application fault recovery method further embodiment;
Fig. 6 is the flow chart of the application fault recovery method further embodiment;
Fig. 7 is the structural representation of the application local fault recovery device one embodiment;
Fig. 8 is the structural representation of another embodiment of the application local fault recovery device;
Fig. 9 is the structural representation of the application computer equipment one embodiment.
Embodiment
Embodiments herein is described below in detail, the example of the embodiment is shown in the drawings, wherein from beginning to end Same or similar label represents same or similar element or the element with same or like function.Below with reference to attached The embodiment of figure description is exemplary, it is intended to for explaining the application, and it is not intended that limitation to the application.
Fig. 1 is the flow chart of the application fault recovery method one embodiment, as shown in figure 1, above-mentioned fault recovery method It can include:
Step 101, the network-in-dialing situation of main frame where monitoring local data base.
Wherein, above-mentioned local data base can be ORD, such as PostgreSQL, but the present embodiment is simultaneously This is not limited only to, the present embodiment is not construed as limiting to the particular type of above-mentioned local data base.
Step 102, if the network-in-dialing of main frame where above-mentioned local data base, the fortune of above-mentioned local data base is obtained Market condition.
Step 103, according to the running situation of above-mentioned local data base, when the running status appearance of above-mentioned local data base is different Chang Shi, the abnormality run by database High-Available Middleware to above-mentioned local data base are recovered.
Wherein, above-mentioned database High-Available Middleware can be Pgpool II, but the present embodiment is not limited to that, this Embodiment is not construed as limiting to the particular type of above-mentioned database High-Available Middleware.
In above-mentioned fault recovery method, after the network-in-dialing situation of main frame where monitoring local data base, if above-mentioned The network-in-dialing of main frame where local data base, then the running situation of above-mentioned local data base is obtained, according to above-mentioned local data The running situation in storehouse, when the running status of above-mentioned local data base occurs abnormal, by database High-Available Middleware to upper The abnormality for stating local data base operation is recovered, and database is performed automatically under various abnormalities so as to realize Recovery action, manual operation is avoided, improve the promptness of fault recovery, and then the availability of data-base cluster can be improved.
Fig. 2 is the flow chart of another embodiment of the application fault recovery method, as shown in Fig. 2 real shown in the application Fig. 1 Apply in example, after step 101, can also include:
Step 201, if the network of main frame does not connect where above-mentioned local data base, to where above-mentioned local data base The network continuous monitoring pre-determined number of main frame.
Wherein, above-mentioned pre-determined number according to systematic function and/or can realize the sets itselfs such as demand in specific implementation, The present embodiment is not construed as limiting to the size of above-mentioned pre-determined number, for example, above-mentioned pre-determined number can be 10 times.
Step 202, after continuous monitoring pre-determined number, if the network-in-dialing of main frame where above-mentioned local data base, Then obtain the running situation of above-mentioned local data base.
Fig. 3 is the flow chart of the application fault recovery method further embodiment, as shown in figure 3, real shown in the application Fig. 1 Apply in example, step 103 can include:
Step 301, if above-mentioned local data base is in running status, judge above-mentioned local data base whether above-mentioned In the affiliated cluster of local data base.
If it is, terminate this flow;If not, i.e. above-mentioned local data base is not in collection belonging to above-mentioned local data base In group, then step 302 is performed.
Step 302, above-mentioned local data base is added into the above-mentioned affiliated cluster of local data base.
That is, in the present embodiment, if above-mentioned local data base causes due to breaking down or other reasonses State local data base and do not add the affiliated cluster of above-mentioned local data base, then can above-mentioned local data base be in running status it Afterwards, above-mentioned local data base is added into the above-mentioned affiliated cluster of local data base.
Fig. 4 is the flow chart of the application fault recovery method further embodiment, as shown in figure 4, real shown in the application Fig. 1 Applying a step 103 can include:
Step 401, if above-mentioned local data base is in off-duty state, at above-mentioned database High-Available Middleware When running status, the teledata in the affiliated cluster of above-mentioned local data base is judged by above-mentioned database High-Available Middleware Storehouse whether normal operation.
If it is, i.e. above-mentioned remote data base normal operation, then perform step 402;If not, i.e. above-mentioned remote data base Non- normal operation, then perform step 403.
Step 402, above-mentioned local data base is started by above-mentioned database High-Available Middleware.
Step 403, determine above-mentioned local data base in above-mentioned local data base by above-mentioned database High-Available Middleware Node type in affiliated cluster.
Step 404, if above-mentioned local data base is host node in the affiliated cluster of above-mentioned local data base, in startup State local data base.
Further, if above-mentioned local data base is not host node in the affiliated cluster of above-mentioned local data base, etc. Treat remote data base normal operation in the affiliated cluster of above-mentioned local data base and then by above-mentioned database High Availabitity among Part starts above-mentioned local data base.
That is, in the present embodiment, when above-mentioned local data base is in off-duty state, it is necessary to recover to start above-mentioned During ground database, can first determine remote data base in the affiliated cluster of above-mentioned local data base whether normal operation, if far Journey database normal operation, then can directly initiate local data base;And the abnormal fortune if remote data base also breaks down OK, this just illustrates that the database in the affiliated cluster of above-mentioned local data base breaks down, in order to ensure the synchronization of data, it is necessary to First start the host node in the affiliated cluster of above-mentioned local data base, it is therefore desirable to first determine above-mentioned local data base in above-mentioned local Whether it is host node in the affiliated cluster of database, if it is, local data base is directly initiated, and if above-mentioned local data base It is not host node in the affiliated cluster of above-mentioned local data base, then waits the teledata in the affiliated cluster of above-mentioned local data base Storehouse normal operation and then pass through above-mentioned database High-Available Middleware and start above-mentioned local data base.
Above-described embodiment can be realized when local data base is in off-duty state, start above-mentioned local data base, carry The high promptness of fault recovery, and then the availability of data-base cluster can be improved.
Fig. 5 is the flow chart of the application fault recovery method further embodiment, as shown in figure 5, real shown in the application Fig. 2 Apply in example, after step 201, can also include:
Step 501, after continuous monitoring pre-determined number, if the network-in-dialing of main frame where above-mentioned local data base, Then when above-mentioned local data base is in running status and above-mentioned database High-Available Middleware is in off-duty state, start Above-mentioned database High-Available Middleware.
Fig. 6 is the flow chart of the application fault recovery method further embodiment, as shown in fig. 6, real shown in the application Fig. 2 Apply in example, after step 201, can also include:
Step 601, after continuous monitoring pre-determined number, if the network of main frame does not connect where above-mentioned local data base Lead to, then above-mentioned local data base out of service and above-mentioned database High-Available Middleware.
The fault recovery method that the embodiment of the present application provides can be realized performs database automatically under various abnormalities Recovery action, manual operation is avoided, improve the promptness of fault recovery, and then the availability of data-base cluster can be improved.
Fig. 7 is the structural representation of the application local fault recovery device one embodiment, and the failure in the embodiment of the present application is extensive Apparatus for coating can realize the fault recovery method that the embodiment of the present application provides.As shown in fig. 7, above-mentioned local fault recovery device can wrap Include:Monitoring modular 71, acquisition module 72 and recovery module 73;
Wherein, monitoring modular 71, the network-in-dialing situation for main frame where monitoring local data base;Wherein, above-mentioned Ground database can be ORD, such as PostgreSQL, but the present embodiment is not limited to that, the present embodiment The particular type of above-mentioned local data base is not construed as limiting.
Acquisition module 72, during network-in-dialing for main frame where determining above-mentioned local data base when monitoring modular 71, obtain Take the running situation of above-mentioned local data base.
Recovery module 73, for the running situation of the above-mentioned local data base obtained according to acquisition module 72, when above-mentioned When the running status of ground database occurs abnormal, the exception run by database High-Available Middleware to above-mentioned local data base State is recovered.
Wherein, above-mentioned database High-Available Middleware can be Pgpool II, but the present embodiment is not limited to that, this Embodiment is not construed as limiting to the particular type of above-mentioned database High-Available Middleware.
In above-mentioned local fault recovery device, monitoring modular 71 monitor local data base where main frame network-in-dialing situation it Afterwards, if the network-in-dialing of main frame where above-mentioned local data base, acquisition module 72 obtain the operation of above-mentioned local data base Situation, recovery module 73 according to the running situation of above-mentioned local data base, when the running status of above-mentioned local data base occur it is different Chang Shi, the abnormality run by database High-Available Middleware to above-mentioned local data base is recovered, so as to reality It is automatic under present various abnormalities to perform database recovery action, manual operation is avoided, the promptness of fault recovery is improved, enters And the availability of data-base cluster can be improved.
Fig. 8 is the structural representation of another embodiment of the application local fault recovery device.
In the present embodiment, above-mentioned monitoring modular 71, the network of the main frame where above-mentioned local data base is determined is additionally operable to not During connection, to the network continuous monitoring pre-determined number of main frame where above-mentioned local data base;Wherein, above-mentioned pre-determined number can be According to systematic function and/or realize the sets itselfs such as demand during specific implementation, the present embodiment to the size of above-mentioned pre-determined number not It is construed as limiting, for example, above-mentioned pre-determined number can be 10 times.
Acquisition module 72, it is additionally operable to after the continuous monitoring pre-determined number of monitoring modular 71, when above-mentioned local data place In the network-in-dialing of main frame, the running situation of above-mentioned local data base is obtained.
Further, in the present embodiment, recovery module 73 can include:Judging submodule 731 and addition submodule 732;
Wherein, judging submodule 731, for when above-mentioned local data base is in running status, judging above-mentioned local number According to storehouse whether in the affiliated cluster of above-mentioned local data base;
Submodule 732 is added, will be upper for when above-mentioned local data base is not in the affiliated cluster of above-mentioned local data base State local data base and add the above-mentioned affiliated cluster of local data base.
That is, in the present embodiment, if above-mentioned local data base causes due to breaking down or other reasonses State local data base and do not add the affiliated cluster of above-mentioned local data base, then adding submodule 732 can be in above-mentioned local data base After running status, above-mentioned local data base is added into the above-mentioned affiliated cluster of local data base.
In the present embodiment, recovery module 73 can include:Judging submodule 731, start submodule 733 and determination sub-module 734;
Wherein, judging submodule 731, for being in off-duty state, and above-mentioned database when above-mentioned local data base When High-Available Middleware is in running status, collection belonging to above-mentioned local data base is judged by above-mentioned database High-Available Middleware Group in remote data base whether normal operation;
Start submodule 733, for when judging submodule 731 determines above-mentioned remote data base normal operation, by upper State database High-Available Middleware and start above-mentioned local data base;
Determination sub-module 734, for when judging submodule 731 determines the non-normal operation of above-mentioned remote data base, passing through Above-mentioned database High-Available Middleware determines node type of the above-mentioned local data base in the affiliated cluster of above-mentioned local data base;
Start submodule 733, be additionally operable to when determination sub-module 734 determines above-mentioned local data base in above-mentioned local data base When being host node in affiliated cluster, start above-mentioned local data base.
Further, if it is determined that submodule 734 determines above-mentioned local data base in the affiliated cluster of above-mentioned local data base In be not host node, then after waiting the remote data base normal operation in the affiliated cluster of above-mentioned local data base, start submodule Block 733 starts above-mentioned local data base by above-mentioned database High-Available Middleware again.
That is, in the present embodiment, when above-mentioned local data base is in off-duty state, it is necessary to recover to start above-mentioned During ground database, just whether judging submodule 731 can first determine remote data base in the affiliated cluster of above-mentioned local data base Often operation, if remote data base normal operation, local data base can be directly initiated by starting submodule 733;And if remote Journey database also breaks down non-normal operation, this database just illustrated in the affiliated cluster of above-mentioned local data base occur therefore Barrier, in order to ensure the synchronization of data, starting submodule 733 needs first to start the main section in the affiliated cluster of above-mentioned local data base Point, it is therefore desirable to determination sub-module 734 first determine above-mentioned local data base in the affiliated cluster of above-mentioned local data base whether be Host node, if it is, starting submodule 733 directly initiates local data base, and if above-mentioned local data base is at above-mentioned It is not host node in the affiliated cluster of ground database, then waits the remote data base in the affiliated cluster of above-mentioned local data base normally to transport After row, start submodule 733 and above-mentioned local data base is started by above-mentioned database High-Available Middleware again, so as to reality Now when local data base is in off-duty state, start above-mentioned local data base, improve the promptness of fault recovery, and then The availability of data-base cluster can be improved.
Further, recovery module 73, it is additionally operable to after the continuous monitoring pre-determined number of monitoring modular 71, when above-mentioned local The network-in-dialing of main frame, above-mentioned local data base are in running status and above-mentioned database High-Available Middleware where database During in off-duty state, start above-mentioned database High-Available Middleware.
Further, recovery module 73, it is additionally operable to after the continuous monitoring pre-determined number of monitoring modular 71, when above-mentioned local When the network of main frame does not connect where database, above-mentioned local data base out of service and above-mentioned database High-Available Middleware.
The local fault recovery device that the embodiment of the present application provides can be realized performs database automatically under various abnormalities Recovery action, manual operation is avoided, improve the promptness of fault recovery, and then the availability of data-base cluster can be improved.
Fig. 9 is the structural representation of the application computer equipment one embodiment, and above computer equipment can include depositing Reservoir, processor and it is stored in the computer program that can be run on above-mentioned memory and on above-mentioned processor, above-mentioned processor When performing above computer program, it is possible to achieve the fault recovery method that the embodiment of the present application provides.
Above computer equipment can be server, personal computer (Personal Computer;Hereinafter referred to as:PC)、 The intelligent terminals such as tablet personal computer, notebook computer or smart mobile phone, specific shape of the present embodiment to above computer equipment State is not construed as limiting.
Fig. 9 shows the block diagram suitable for being used for the exemplary computer device 12 for realizing the application embodiment.Fig. 9 is shown Computer equipment 12 be only an example, any restrictions should not be brought to the function and use range of the embodiment of the present application.
As shown in figure 9, computer equipment 12 is showed in the form of universal computing device.The component of computer equipment 12 can be with Including but not limited to:One or more processor or processing unit 16, system storage 28, connect different system component The bus 18 of (including system storage 28 and processing unit 16).
Bus 18 represents the one or more in a few class bus structures, including memory bus or Memory Controller, Peripheral bus, graphics acceleration port, processor or the local bus using any bus structures in a variety of bus structures.Lift For example, these architectures include but is not limited to industry standard architecture (Industry Standard Architecture;Hereinafter referred to as:ISA) bus, MCA (Micro Channel Architecture;Below Referred to as:MAC) bus, enhanced isa bus, VESA (Video Electronics Standards Association;Hereinafter referred to as:VESA) local bus and periphery component interconnection (Peripheral Component Interconnection;Hereinafter referred to as:PCI) bus.
Computer equipment 12 typically comprises various computing systems computer-readable recording medium.These media can be it is any can be by The usable medium that computer equipment 12 accesses, including volatibility and non-volatile media, moveable and immovable medium.
System storage 28 can include the computer system readable media of form of volatile memory, such as arbitrary access Memory (Random Access Memory;Hereinafter referred to as:RAM) 30 and/or cache memory 32.Computer equipment 12 It may further include other removable/nonremovable, volatile/non-volatile computer system storage mediums.Only conduct Citing, storage system 34 can be used for reading and writing immovable, non-volatile magnetic media, and (Fig. 9 do not show, commonly referred to as " hard disk Driver ").Although not shown in Fig. 9, it can provide for the magnetic to may move non-volatile magnetic disk (such as " floppy disk ") read-write Disk drive, and to removable anonvolatile optical disk (such as:Compact disc read-only memory (Compact Disc Read Only Memory;Hereinafter referred to as:CD-ROM), digital multi read-only optical disc (Digital Video Disc Read Only Memory;Hereinafter referred to as:DVD-ROM) or other optical mediums) read-write CD drive.In these cases, each driving Device can be connected by one or more data media interfaces with bus 18.Memory 28 can include at least one program and produce Product, the program product have one group of (for example, at least one) program module, and it is each that these program modules are configured to perform the application The function of embodiment.
Program/utility 40 with one group of (at least one) program module 42, such as memory 28 can be stored in In, such program module 42 includes --- but being not limited to --- operating system, one or more application program, other programs Module and routine data, the realization of network environment may be included in each or certain combination in these examples.Program mould Block 42 generally performs function and/or method in embodiments described herein.
Computer equipment 12 can also be with one or more external equipments 14 (such as keyboard, sensing equipment, display 24 Deng) communication, the equipment communication interacted with the computer equipment 12 can be also enabled a user to one or more, and/or with making Obtain any equipment that the computer equipment 12 can be communicated with one or more of the other computing device (such as network interface card, modulatedemodulate Adjust device etc.) communication.This communication can be carried out by input/output (I/O) interface 22.Also, computer equipment 12 may be used also To pass through network adapter 20 and one or more network (such as LAN (Local Area Network;Hereinafter referred to as: LAN), wide area network (Wide Area Network;Hereinafter referred to as:WAN) and/or public network, for example, internet) communication.Such as figure Shown in 9, network adapter 20 is communicated by bus 18 with other modules of computer equipment 12.It should be understood that although in Fig. 9 not Show, computer equipment 12 can be combined and use other hardware and/or software module, included but is not limited to:Microcode, equipment are driven Dynamic device, redundant processing unit, external disk drive array, RAID system, tape drive and data backup storage system etc..
Processing unit 16 is stored in program in system storage 28 by operation, so as to perform various function application and Data processing, such as realize the fault recovery method that the embodiment of the present application provides.
The embodiment of the present application also provides a kind of non-transitorycomputer readable storage medium, is stored thereon with computer journey Sequence, the fault recovery method that the embodiment of the present application provides can be realized when above computer program is executed by processor.
Above-mentioned non-transitorycomputer readable storage medium can use appointing for one or more computer-readable media Meaning combination.Computer-readable medium can be computer-readable signal media or computer-readable recording medium.Computer can Read storage medium and for example may be-but not limited to-the system of electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor, device Or device, or any combination above.The more specifically example (non exhaustive list) of computer-readable recording medium includes: Electrical connection, portable computer diskette, hard disk, random access memory (RAM), read-only storage with one or more wires Device (Read Only Memory;Hereinafter referred to as:ROM), erasable programmable read only memory (Erasable Programmable Read Only Memory;Hereinafter referred to as:EPROM) or flash memory, optical fiber, portable compact disc are read-only deposits Reservoir (CD-ROM), light storage device, magnetic memory device or above-mentioned any appropriate combination.In this document, computer Readable storage medium storing program for executing can be any includes or the tangible medium of storage program, the program can be commanded execution system, device Either device use or in connection.
Computer-readable signal media can include in a base band or as carrier wave a part propagation data-signal, Wherein carry computer-readable program code.The data-signal of this propagation can take various forms, including --- but It is not limited to --- electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be Any computer-readable medium beyond computer-readable recording medium, the computer-readable medium can send, propagate or Transmit for by instruction execution system, device either device use or program in connection.
The program code included on computer-readable medium can be transmitted with any appropriate medium, including --- but it is unlimited In --- wireless, electric wire, optical cable, RF etc., or above-mentioned any appropriate combination.
Can with one or more programming languages or its combination come write for perform the application operation computer Program code, described program design language include object oriented program language-such as Java, Smalltalk, C++, Also include conventional procedural programming language-such as " C " language or similar programming language.Program code can be with Fully perform, partly perform on the user computer on the user computer, the software kit independent as one performs, portion Divide and partly perform or performed completely on remote computer or server on the remote computer on the user computer. It is related in the situation of remote computer, remote computer can pass through the network of any kind --- including LAN (Local Area Network;Hereinafter referred to as:) or wide area network (Wide Area Network LAN;Hereinafter referred to as:WAN) it is connected to user Computer, or, it may be connected to outer computer (such as passing through Internet connection using ISP).
The embodiment of the present application also provides a kind of computer program product, when the instruction in the computer program product by When managing device execution, it is possible to achieve the fault recovery method that the embodiment of the present application provides.
In the description of this specification, reference term " one embodiment ", " some embodiments ", " example ", " specifically show The description of example " or " some examples " etc. means specific features, structure, material or the spy for combining the embodiment or example description Point is contained at least one embodiment or example of the application.In this manual, to the schematic representation of above-mentioned term not Identical embodiment or example must be directed to.Moreover, specific features, structure, material or the feature of description can be with office Combined in an appropriate manner in one or more embodiments or example.In addition, in the case of not conflicting, the skill of this area Art personnel can be tied the different embodiments or example and the feature of different embodiments or example described in this specification Close and combine.
In addition, term " first ", " second " are only used for describing purpose, and it is not intended that instruction or hint relative importance Or the implicit quantity for indicating indicated technical characteristic.Thus, define " first ", the feature of " second " can be expressed or Implicitly include at least one this feature.In the description of the present application, " multiple " are meant that at least two, such as two, three It is individual etc., unless otherwise specifically defined.
Any process or method described otherwise above description in flow chart or herein is construed as, and represents to include Module, fragment or the portion of the code of the executable instruction of one or more the step of being used to realize custom logic function or process Point, and the scope of the preferred embodiment of the application includes other realization, wherein can not press shown or discuss suitable Sequence, including according to involved function by it is basic simultaneously in the way of or in the opposite order, carry out perform function, this should be by the application Embodiment person of ordinary skill in the field understood.
Expression or logic and/or step described otherwise above herein in flow charts, for example, being considered use In the order list for the executable instruction for realizing logic function, may be embodied in any computer-readable medium, for Instruction execution system, device or equipment (such as computer based system including the system of processor or other can be held from instruction The system of row system, device or equipment instruction fetch and execute instruction) use, or combine these instruction execution systems, device or set It is standby and use.For the purpose of this specification, " computer-readable medium " can any can be included, store, communicate, propagate or pass Defeated program is for instruction execution system, device or equipment or the dress used with reference to these instruction execution systems, device or equipment Put.The more specifically example (non-exhaustive list) of computer-readable medium includes following:Electricity with one or more wiring Connecting portion (electronic installation), portable computer diskette box (magnetic device), random access memory (Random Access Memory;Hereinafter referred to as:RAM), read-only storage (Read Only Memory;Hereinafter referred to as:ROM), erasable editable Read memory (Erasable Programmable Read Only Memory;Hereinafter referred to as:EPROM) or flash memory, Fiber device, and portable optic disk read-only storage (Compact Disc Read Only Memory;Hereinafter referred to as:CD- ROM).In addition, computer-readable medium, which can even is that, to print the paper or other suitable media of described program thereon, because Can then to enter edlin, interpretation or suitable with other if necessary for example by carrying out optical scanner to paper or other media Mode is handled electronically to obtain described program, is then stored in computer storage.
It should be appreciated that each several part of the application can be realized with hardware, software, firmware or combinations thereof.Above-mentioned In embodiment, software that multiple steps or method can be performed in memory and by suitable instruction execution system with storage Or firmware is realized.Such as, if realized with hardware with another embodiment, following skill well known in the art can be used Any one of art or their combination are realized:With the logic gates for realizing logic function to data-signal from Logic circuit is dissipated, the application specific integrated circuit with suitable combinational logic gate circuit, programmable gate array (Programmable Gate Array;Hereinafter referred to as:PGA), field programmable gate array (Field Programmable Gate Array;Below Referred to as:FPGA) etc..
Those skilled in the art are appreciated that to realize all or part of step that above-described embodiment method carries Suddenly it is that by program the hardware of correlation can be instructed to complete, described program can be stored in a kind of computer-readable storage medium In matter, the program upon execution, including one or a combination set of the step of embodiment of the method.
In addition, each functional unit in each embodiment of the application can be integrated in a processing module, can also That unit is individually physically present, can also two or more units be integrated in a module.Above-mentioned integrated mould Block can both be realized in the form of hardware, can also be realized in the form of software function module.The integrated module is such as Fruit is realized in the form of software function module and as independent production marketing or in use, can also be stored in a computer In read/write memory medium.
Storage medium mentioned above can be read-only storage, disk or CD etc..Although have been shown and retouch above Embodiments herein is stated, it is to be understood that above-described embodiment is exemplary, it is impossible to be interpreted as the limit to the application System, one of ordinary skill in the art can be changed to above-described embodiment, change, replace and become within the scope of application Type.

Claims (10)

  1. A kind of 1. fault recovery method, it is characterised in that including:
    The network-in-dialing situation of main frame where monitoring local data base;
    If the network-in-dialing of main frame where the local data base, obtain the running situation of the local data base;
    According to the running situation of the local data base, when the running status of the local data base occurs abnormal, pass through number The abnormality run according to storehouse High-Available Middleware to the local data base is recovered.
  2. 2. according to the method for claim 1, it is characterised in that the network-in-dialing of main frame where the monitoring local data base After situation, in addition to:
    If the network of main frame does not connect where the local data base, the network of main frame where the local data base is connected Continuous monitoring pre-determined number;
    After continuous monitoring pre-determined number, if the network-in-dialing of main frame where the local data base, obtains described The running situation of ground database.
  3. 3. method according to claim 1 or 2, it is characterised in that the running situation according to the local data base, When the running status of the local data base occurs abnormal, the local data base is transported by database High-Available Middleware Capable abnormality, which carries out recovery, to be included:
    If the local data base is in running status, judge the local data base whether in the local data place Belong in cluster;
    If it is not, then the local data base is added into the affiliated cluster of local data base.
  4. 4. method according to claim 1 or 2, it is characterised in that the running situation according to the local data base, When the running status of the local data base occurs abnormal, the local data base is transported by database High-Available Middleware Capable abnormality, which carries out recovery, to be included:
    If the local data base is in off-duty state, when the database High-Available Middleware is in running status When, judge whether the remote data base in the affiliated cluster of the local data base is normal by the database High-Available Middleware Operation;
    If the remote data base normal operation, the local data is started by the database High-Available Middleware Storehouse;
    If the non-normal operation of remote data base, the local data is determined by the database High-Available Middleware Node type of the storehouse in the affiliated cluster of the local data base;
    If the local data base is host node in the affiliated cluster of the local data base, start the local data Storehouse.
  5. 5. according to the method for claim 2, it is characterised in that also include:
    After continuous monitoring pre-determined number, if the network-in-dialing of main frame where the local data base, when the local Database is in running status and when the database High-Available Middleware is in off-duty state, and it is high to start the database Middleware can be used.
  6. 6. according to the method for claim 2, it is characterised in that also include:
    It is out of service if the network of main frame does not connect where the local data base after continuous monitoring pre-determined number The local data base and the database High-Available Middleware.
  7. A kind of 7. local fault recovery device, it is characterised in that including:
    Monitoring modular, the network-in-dialing situation for main frame where monitoring local data base;
    Acquisition module, during network-in-dialing for main frame where determining the local data base when the monitoring modular, obtain institute State the running situation of local data base;
    Recovery module, for the running situation of the local data base obtained according to the acquisition module, when the local number When occurring abnormal according to the running status in storehouse, the abnormality run by database High-Available Middleware to the local data base Recovered.
  8. 8. a kind of computer equipment, it is characterised in that including memory, processor and be stored on the memory and can be in institute The computer program run on processor is stated, described in the computing device during computer program, is realized as in claim 1-6 Any described method.
  9. 9. a kind of non-transitorycomputer readable storage medium, is stored thereon with computer program, it is characterised in that the calculating The method as described in any in claim 1-6 is realized when machine program is executed by processor.
  10. 10. a kind of computer program product, when the instruction in the computer program product is by computing device, realize as weighed Profit requires any described method in 1-6.
CN201710626392.6A 2017-07-27 2017-07-27 Fault recovery method and device and computer equipment Active CN107480004B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710626392.6A CN107480004B (en) 2017-07-27 2017-07-27 Fault recovery method and device and computer equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710626392.6A CN107480004B (en) 2017-07-27 2017-07-27 Fault recovery method and device and computer equipment

Publications (2)

Publication Number Publication Date
CN107480004A true CN107480004A (en) 2017-12-15
CN107480004B CN107480004B (en) 2020-06-23

Family

ID=60597916

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710626392.6A Active CN107480004B (en) 2017-07-27 2017-07-27 Fault recovery method and device and computer equipment

Country Status (1)

Country Link
CN (1) CN107480004B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6389431B1 (en) * 1999-08-25 2002-05-14 Hewlett-Packard Company Message-efficient client transparency system and method therefor
CN104917827A (en) * 2015-05-26 2015-09-16 浪潮电子信息产业股份有限公司 Method for realizing oracle load balancing cluster
CN105224637A (en) * 2015-09-24 2016-01-06 珠海许继芝电网自动化有限公司 A kind of based on PostgreSQL database active and standby/the comprehensive method of cluster application

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6389431B1 (en) * 1999-08-25 2002-05-14 Hewlett-Packard Company Message-efficient client transparency system and method therefor
CN104917827A (en) * 2015-05-26 2015-09-16 浪潮电子信息产业股份有限公司 Method for realizing oracle load balancing cluster
CN105224637A (en) * 2015-09-24 2016-01-06 珠海许继芝电网自动化有限公司 A kind of based on PostgreSQL database active and standby/the comprehensive method of cluster application

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
苏乐: ""事务中间件在高可用性数据库系统中的应用"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Also Published As

Publication number Publication date
CN107480004B (en) 2020-06-23

Similar Documents

Publication Publication Date Title
US9092022B2 (en) Systems and methods for load balancing of modular information handling resources in a chassis
US8694693B2 (en) Methods and systems for providing user selection of associations between information handling resources and information handling systems in an integrated chassis
US20160275037A1 (en) System and Method for Providing Keyboard, Video, and Mouse Functionality
US9690745B2 (en) Methods and systems for removal of information handling resources in a shared input/output infrastructure
CN110417575A (en) Alarm method, device and the computer equipment of O&M monitor supervision platform
US20190012005A1 (en) Method and device for asynchronous touch and asynchronous display on dual-screen and computer readable storage medium
US8819779B2 (en) Methods and systems for managing multiple information handling systems with a virtual keyboard-video-mouse interface
CN110333875A (en) A kind of service routine update method, device, server and storage medium
US9928206B2 (en) Dedicated LAN interface per IPMI instance on a multiple baseboard management controller (BMC) system with single physical network interface
US20140304532A1 (en) Server systems having segregated power circuits for high availability applications
CN110049118A (en) Information push method, device, equipment and storage medium
CN107391295A (en) The processing method and processing device of application exception
US10630399B2 (en) Testing distributed applications that have an established exchange in an advanced message queuing protocol (AMQP) message broker
CN107423894A (en) The task measures and procedures for the examination and approval, device and computer equipment
CN110046726A (en) A kind of O&M is listed method, system, equipment and storage medium automatically
CN110297658A (en) Functional unit sharing method, device and computer equipment
US20130265328A1 (en) Methods and systems for providing video overlay for display coupled to integrated chassis housing a plurality of modular information handling systems
US20140359194A1 (en) Methods and systems for virtualization of storage services in an integrated chassis
CN108551481A (en) A kind of file uploading method, device, server and storage medium
CN109117399A (en) Reduce device, the system and method for chip selection
CN109347899A (en) The method of daily record data is written in distributed memory system
CN107480004A (en) Fault recovery method, device and computer equipment
CN109866786A (en) Power-control method, device and the traction control unit of rail traffic
CN107731154B (en) LED display screen data backup device and method and terminal equipment
CN109741430A (en) Animation instance creating method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant