CN110532130A - Software failure recovery method, equipment and computer readable storage medium - Google Patents

Software failure recovery method, equipment and computer readable storage medium

Info

Publication number
CN110532130A
Authority
CN
China
Prior art keywords
software
normal device
response
failure recovery
software package
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201810500927.XA
Other languages
Chinese (zh)
Inventor
林茂
易兴旺
李仙
颜学伦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp
Priority to CN201810500927.XA
Publication of CN110532130A
Legal status: Withdrawn

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14 Error detection or correction of the data by redundancy in operation
    • G06F11/1402 Saving, restoring, recovering or retrying
    • G06F11/1415 Saving, restoring, recovering or retrying at system level
    • G06F11/142 Reconfiguring to eliminate the error
    • G06F11/143 Reconfiguring to eliminate the error with loss of software functionality
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00 Data switching networks
    • H04L12/02 Details
    • H04L12/16 Arrangements for providing special services to substations
    • H04L12/18 Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0677 Localisation of faults

Abstract

The invention discloses a software failure recovery method. The method comprises: when faulty software is detected, broadcasting a software package acquisition request to normal devices in the network; receiving software package acquisition responses sent by the normal devices; pairing with one of the normal devices that sent a software package acquisition response, the normal device so paired being called the paired device; and downloading software from the paired device to perform failure recovery. The invention further provides a device and a computer-readable storage medium. The invention can identify software failures automatically, improving the efficiency of locating faulty devices and the reliability of failure recovery.

Description

Software failure recovery method, equipment and computer readable storage medium
Technical field
The present invention relates to a software failure recovery method, a device, and a computer-readable storage medium.
Background
In many industries, such as water, electricity, gas, and heat meter reading, devices deployed in a network are required, on the one hand, to be highly fault tolerant, with device software running stably over long periods. On the other hand, when a software failure does occur on a device, the device software is required to return to normal operation as quickly as possible. Because the devices are deployed in scattered locations, among other reasons, failure recovery is bound to be a time-consuming and labor-intensive process.
Device failure recovery consists mainly of three steps: locating the faulty device, transmitting the device software, and replacing the device software. Existing failure recovery techniques try to address problems in the recovery process from these three angles respectively: locating the faulty device quickly, improving software transmission efficiency, and ensuring the success rate of software replacement. However, these methods all share the following problems. They require manual involvement, so recovery is time-consuming, labor-intensive, and inefficient. Human error may cause recovery to fail or even damage the device beyond repair. And because the process cannot be automated, the fault must be located and the recovery process triggered by hand.
Summary of the invention
The main object of the present invention is to provide a software failure recovery method, a device, and a computer-readable storage medium, intended to solve the problem that software failure recovery cannot be automated.
To achieve the above object, the present invention provides a software failure recovery method, the method comprising:
when faulty software is detected, broadcasting a software package acquisition request to normal devices in the network;
receiving software package acquisition responses sent by the normal devices;
pairing with one of the normal devices that sent a software package acquisition response, wherein the paired normal device is called the paired device; and
downloading software from the paired device to perform failure recovery.
In addition, to achieve the above object, the present invention further provides a device comprising a processor and a memory;
the processor is configured to execute a software failure recovery program stored in the memory so as to implement the above method.
In addition, to achieve the above object, the present invention further provides a computer-readable storage medium storing one or more programs, the one or more programs being executable by one or more processors to implement the above method.
With the software failure recovery method, device, and computer-readable storage medium proposed by the present invention, when faulty software is detected, a software package acquisition request is broadcast to normal devices in the network; after the software package acquisition responses sent by the normal devices are received, pairing is performed with one of the normal devices that sent a response, the paired normal device being called the paired device; and software is downloaded from the paired device to perform failure recovery. The invention can identify software failures automatically, which improves the efficiency of locating faulty devices. Moreover, through the networked devices, software is obtained automatically from any normal device in the network, so the failure can be recovered as long as a normal device with a matching version exists in the network, which ensures the reliability of failure recovery.
Brief description of the drawings
Fig. 1 is a flow diagram of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 2 is a first sub-flow diagram of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 3 is a second sub-flow diagram of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 4 is a third sub-flow diagram of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 5 is another flow diagram of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 6 is a schematic diagram of the device hardware architecture provided by the second embodiment of the present invention;
Fig. 7 is a module diagram of the software failure recovery program in Fig. 6.
The realization of the object, the functional features, and the advantages of the present invention are further described below with reference to the embodiments and the accompanying drawings.
Detailed description of the embodiments
It should be understood that the specific embodiments described herein merely illustrate the present invention and are not intended to limit it.
In the following description, suffixes such as "module", "component", or "unit" used to denote elements serve only to facilitate the description of the present invention and have no specific meaning in themselves. Therefore, "module", "component", and "unit" may be used interchangeably.
First embodiment
Fig. 1 is a flow diagram of the software failure recovery method provided by the first embodiment of the present invention. As shown in Fig. 1, the software failure recovery method includes:
Step 110: when faulty software is detected, broadcast a software package acquisition request to normal devices in the network.
Specifically, the current running state of the software is checked and recorded periodically. If the software is found to have failed, the failed software is called the faulty software, and a software package acquisition request is broadcast to the other devices in the network, where the other devices in the network are called normal devices.
Optionally, a priority order may be set for the recorded running state; for example, whether the software is currently running normally, the current version information, the compatible version level, and so on are recorded first.
Optionally, the software package acquisition request includes the version information required by the current device.
Optionally, there may be one normal device or several; the present invention places no particular limitation on this.
Optionally, if the software is found to be running normally, detection waits for the next cycle, and the process repeats.
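The detection-and-broadcast logic of step 110 can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation described by the patent: the `SoftwareStatus` fields, the JSON message layout, and the `probe` and `broadcast` callables are hypothetical names introduced here, since the text does not specify a message format or a transport.

```python
import json
from dataclasses import dataclass
from typing import Callable


@dataclass
class SoftwareStatus:
    running: bool        # whether the software is currently running normally
    version: str         # current version information
    compat_level: int    # compatible version level


def build_package_request(device_id: str, status: SoftwareStatus) -> bytes:
    """Encode a software package acquisition request carrying the needed version info."""
    message = {
        "type": "PACKAGE_REQUEST",   # hypothetical message type name
        "device_id": device_id,
        "version": status.version,
        "compat_level": status.compat_level,
    }
    return json.dumps(message).encode("utf-8")


def detection_cycle(
    device_id: str,
    probe: Callable[[], SoftwareStatus],
    broadcast: Callable[[bytes], None],
) -> bool:
    """Run one detection cycle; return True if a request was broadcast."""
    status = probe()
    if status.running:
        return False  # software is healthy: wait for the next cycle
    broadcast(build_package_request(device_id, status))
    return True
```

A caller would invoke `detection_cycle` on a timer, passing a `broadcast` function bound to whatever network interface the device uses.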
Step 120: receive the software package acquisition responses sent by the normal devices.
Specifically, when a normal device in the network receives the software package acquisition request, it first checks whether its own version is compatible with the version information in the request. If the versions are compatible, the normal device acknowledges the software package acquisition request and returns a software package acquisition response; if the versions are incompatible, it returns no response.
When a normal device returns a response, the software package acquisition response it sent is received. Those skilled in the art will understand that, because normal devices respond at different speeds, the responses arrive at different times. A software package acquisition response is thus a response from a normal device that is compatible with the version information.
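The behavior of a normal device in step 120 might look like the sketch below. The compatibility rule is an assumption made purely for illustration (versions written as `major.minor.patch`, compatible when the major number matches); the patent only says that the normal device compares its own version with the requested one. The function names and message fields are likewise hypothetical.

```python
from typing import Optional


def is_compatible(own_version: str, requested_version: str) -> bool:
    # Assumed rule: "major.minor.patch" strings are compatible when the
    # major component is equal. The source does not define the rule.
    return own_version.split(".")[0] == requested_version.split(".")[0]


def handle_package_request(
    own_id: str, own_version: str, request: dict
) -> Optional[dict]:
    """Return a software package acquisition response, or None to stay silent."""
    if not is_compatible(own_version, request["version"]):
        return None  # incompatible version: no response is returned
    return {
        "type": "PACKAGE_RESPONSE",  # hypothetical message type name
        "device_id": own_id,
        "version": own_version,
    }
```

Staying silent on incompatibility means every response the faulty device receives already comes from a usable source, so no further filtering is needed on the receiving side.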
Step 130: pair with one of the normal devices that sent a software package acquisition response.
Specifically, after the software package acquisition responses are received, one of the normal devices is selected for pairing, and the paired normal device is called the paired device.
Step 140: download software from the paired device to perform failure recovery.
Specifically, after pairing with the normal device, a software transmission request is initiated and the paired device transmits the software program. Once the transmission is complete, the integrity and correctness of the software program are verified. If verification succeeds, the software is restarted; if verification fails, device pairing and file transmission are initiated again.
Optionally, as shown in Fig. 2, step 130 specifically includes:
Step 210: count the number of software package acquisition responses received and record their response times;
Step 220: when the number of responses reaches a preset quantity threshold, select the normal device with the shortest response time for pairing.
Specifically, the number of software package acquisition responses received and the response time of each are recorded. To avoid initiating network requests frequently and congesting the network, pairing is completed and the failure recovery process continues only when the number of responses received reaches a certain quantity threshold; otherwise pairing ends. That is, when the number of responses reaches the quantity threshold, the normal device with the shortest response time among all received software package acquisition responses is selected for pairing.
Optionally, when the number of software package acquisition responses does not reach the preset quantity threshold, the device waits and then re-broadcasts the software package acquisition request.
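Steps 210 and 220 amount to a threshold check followed by a minimum search, which can be sketched as below. The `PackageResponse` type and its field names are assumptions introduced for this example; the threshold is treated as "reached" when the count is greater than or equal to it, following the wording of step 220.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class PackageResponse:
    device_id: str
    response_time_ms: float  # time between broadcast and receipt of the response


def pair_by_response_time(
    responses: List[PackageResponse], quantity_threshold: int
) -> Optional[str]:
    """Return the device to pair with, or None if too few responses arrived."""
    if len(responses) < quantity_threshold:
        return None  # caller waits and re-broadcasts the request
    fastest = min(responses, key=lambda r: r.response_time_ms)
    return fastest.device_id
```

Returning `None` rather than raising keeps the "wait and re-broadcast" path an ordinary branch in the caller's loop.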
Optionally, as shown in Fig. 3, step 130 specifically includes:
Step 310: obtain the signal strength of each normal device that sent a software package acquisition response;
Step 320: select the normal device with the highest signal strength for pairing.
Specifically, during failure recovery, the signal strength of each normal device that sent a software package acquisition response is measured, and pairing is performed with the device whose signal strength is highest.
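The signal-strength variant of pairing reduces to a maximum search over the responders. In this sketch the strengths are assumed to be RSSI readings in dBm keyed by a hypothetical device identifier, so a value closer to zero means a stronger signal; the patent does not say how signal strength is measured or expressed.

```python
from typing import Dict, Optional


def pair_by_signal_strength(rssi_dbm_by_device: Dict[str, float]) -> Optional[str]:
    """Return the responder with the highest signal strength, or None if there is none."""
    if not rssi_dbm_by_device:
        return None
    # max over the keys, ranked by each device's measured strength
    return max(rssi_dbm_by_device, key=lambda dev: rssi_dbm_by_device[dev])
```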
Optionally, as shown in Fig. 4, step 140 specifically includes:
Step 410: send a software transmission request to the paired device;
Step 420: receive the software program transmitted by the paired device in response to the software transmission request;
Step 430: verify the software program;
Step 440: when the software program is verified successfully, replace the faulty software with the software program.
Specifically, after pairing is complete, a software transmission request is sent to the paired device so that the paired device sends the corresponding software program in response. The integrity and correctness of the software program are then verified. If verification succeeds, the verified software program replaces the faulty software. If the replacement succeeds, failure recovery succeeds; otherwise, the device pairs with a normal device again and downloads the software program again.
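Steps 410 to 440 can be illustrated with the sketch below. The patent does not say how integrity and correctness are checked, so a length check plus a SHA-256 digest is used here as one plausible choice; where the expected size and digest come from (for example, the paired device's response) is also an assumption. `fetch` is a hypothetical callable standing in for the software transmission request and the transfer itself.

```python
import hashlib
import os
from typing import Callable


def verify_program(data: bytes, expected_size: int, expected_sha256: str) -> bool:
    """Integrity check (length) and correctness check (digest); both are assumed schemes."""
    if len(data) != expected_size:
        return False
    return hashlib.sha256(data).hexdigest() == expected_sha256


def recover_from_paired_device(
    fetch: Callable[[], bytes],
    expected_size: int,
    expected_sha256: str,
    install_path: str,
) -> bool:
    """Download, verify, and install the program; return False if verification fails."""
    data = fetch()  # steps 410/420: request and receive the software program
    if not verify_program(data, expected_size, expected_sha256):
        return False  # step 430 failed: caller pairs again and re-downloads
    tmp_path = install_path + ".new"
    with open(tmp_path, "wb") as f:
        f.write(data)
    # Step 440: swap the verified program in for the faulty software.
    os.replace(tmp_path, install_path)
    return True
```

Writing to a temporary file and then calling `os.replace` means the faulty software is left untouched whenever verification fails, so a bad download cannot make the device worse off.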
Optionally, as shown in Fig. 5, after step 140 the software failure recovery method of this embodiment further includes:
Step 510: back up the software program.
Step 520: restart the software program.
Specifically, during software startup the received software program is backed up, and an attempt is then made to start the obtained software program. If startup succeeds, failure recovery succeeds; if startup fails, failure recovery fails and the device waits to initiate the next failure recovery.
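Steps 510 and 520 can be sketched as a backup followed by a start attempt. How the software is actually launched is device specific and not described in the patent, so `start` is a hypothetical callable supplied by the caller that returns whether startup succeeded; the file paths are likewise illustrative.

```python
import shutil
from typing import Callable


def backup_and_restart(
    program_path: str, backup_path: str, start: Callable[[], bool]
) -> bool:
    """Back up the received program, then try to start it; return True on success."""
    shutil.copy2(program_path, backup_path)  # step 510: keep a copy of the received program
    try:
        return start()                       # step 520: attempt to start the program
    except Exception:
        return False  # startup failed: caller waits for the next recovery attempt
```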
With the software failure recovery method provided by this embodiment, when faulty software is detected, a software package acquisition request is broadcast to normal devices in the network; after the software package acquisition responses sent by the normal devices are received, pairing is performed with one of the normal devices that sent a response, the paired normal device being called the paired device; and software is downloaded from the paired device to perform failure recovery. The invention can identify software failures automatically, which improves the efficiency of locating faulty devices. Moreover, through the networked devices, software is obtained automatically from any normal device in the network, so the failure can be recovered as long as a normal device with a matching version exists in the network, which ensures the reliability of failure recovery.
Second embodiment
Fig. 6 is a schematic diagram of a device hardware architecture provided by the second embodiment of the present invention. In Fig. 6, the device includes a memory 610, a processor 620, and a software failure recovery program 630 that is stored in the memory 610 and can run on the processor 620. In this embodiment, the software failure recovery program 630 comprises a series of computer program instructions stored in the memory 610; when these instructions are executed by the processor 620, the software failure recovery operations of the embodiments of the present invention can be carried out. In some embodiments, the software failure recovery program 630 may be divided into one or more modules according to the specific operations implemented by each part of the computer program instructions. As shown in Fig. 7, the software failure recovery program 630 includes a broadcast module 710, a receiving module 720, a matching module 730, a failure recovery module 740, a backup module 750, and a software starting module 760, wherein:
The broadcast module 710 is configured to broadcast a software package acquisition request to normal devices in the network when faulty software is detected.
Specifically, the current running state of the software is checked and recorded periodically. If the software is found to have failed, the failed software is called the faulty software, and the broadcast module 710 broadcasts a software package acquisition request to the other devices in the network, where the other devices in the network are called normal devices.
Optionally, a priority order may be set for the recorded running state; for example, whether the software is currently running normally, the current version information, the compatible version level, and so on are recorded first.
Optionally, the software package acquisition request includes the version information required by the current device.
Optionally, there may be one normal device or several; the present invention places no particular limitation on this.
Optionally, if the software is found to be running normally, detection waits for the next cycle, and the process repeats.
The receiving module 720 is configured to receive the software package acquisition responses sent by the normal devices.
Specifically, when a normal device in the network receives the software package acquisition request, it first checks whether its own version is compatible with the version information in the request. If the versions are compatible, the normal device acknowledges the software package acquisition request and returns a software package acquisition response; if the versions are incompatible, it returns no response.
When a normal device returns a response, the receiving module 720 receives the software package acquisition response it sent. Those skilled in the art will understand that, because normal devices respond at different speeds, the responses arrive at different times. A software package acquisition response is thus a response from a normal device that is compatible with the version information.
The matching module 730 is configured to pair with one of the normal devices that sent a software package acquisition response.
Specifically, after the software package acquisition responses are received, the matching module 730 selects one of the normal devices for pairing, and the paired normal device is called the paired device.
The failure recovery module 740 is configured to download software from the paired device to perform failure recovery.
Specifically, after pairing with the normal device, the failure recovery module 740 initiates a software transmission request and the paired device transmits the software program. Once the transmission is complete, the integrity and correctness of the software program are verified. If verification succeeds, the software is restarted; if verification fails, device pairing and file transmission are initiated again.
More specifically, the failure recovery module 740 is configured to:
send a software transmission request to the paired device;
receive the software program transmitted by the paired device in response to the software transmission request;
verify the software program; and
when the software program is verified successfully, replace the faulty software with the software program.
Specifically, after pairing is complete, a software transmission request is sent to the paired device so that the paired device sends the corresponding software program in response. The integrity and correctness of the software program are then verified. If verification succeeds, the verified software program replaces the faulty software. If the replacement succeeds, failure recovery succeeds; otherwise, the device pairs with a normal device again and downloads the software program again.
In an alternative embodiment, the matching module 730 may be specifically configured to:
count the number of software package acquisition responses received and record their response times; and
when the number of responses reaches a preset quantity threshold, select the normal device with the shortest response time for pairing.
Specifically, the matching module 730 records the number of software package acquisition responses received and the response time of each. To avoid initiating network requests frequently and congesting the network, pairing is completed and the failure recovery process continues only when the number of responses received reaches a certain quantity threshold; otherwise pairing ends. That is, when the number of responses reaches the quantity threshold, the normal device with the shortest response time among all received software package acquisition responses is selected for pairing.
Optionally, when the number of software package acquisition responses does not reach the preset quantity threshold, the device waits and then re-broadcasts the software package acquisition request.
In another alternative embodiment, the matching module 730 may also be specifically configured to:
obtain the signal strength of each normal device that sent a software package acquisition response; and
select the normal device with the highest signal strength for pairing.
Specifically, during failure recovery, the signal strength of each normal device that sent a software package acquisition response is measured, and pairing is performed with the device whose signal strength is highest.
The backup module 750 is configured to back up the software program.
The software starting module 760 is configured to restart the software program.
Specifically, during software startup the backup module 750 backs up the received software program, and an attempt is then made to start the obtained software program. If the software starting module 760 starts the program successfully, failure recovery succeeds; if startup fails, failure recovery fails and the device waits to initiate the next failure recovery.
With the device provided by this embodiment, when faulty software is detected, the broadcast module 710 broadcasts a software package acquisition request to normal devices in the network; after the receiving module 720 receives the software package acquisition responses sent by the normal devices, the matching module 730 pairs with one of the normal devices that sent a response, and the failure recovery module 740 downloads software from the paired device to perform failure recovery. The invention can identify software failures automatically, which improves the efficiency of locating faulty devices. Moreover, through the networked devices, software is obtained automatically from any normal device in the network, so the failure can be recovered as long as a normal device with a matching version exists in the network, which ensures the reliability of failure recovery.
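The way the six modules hand off to one another can be sketched as a single recovery pass. Each parameter below is a hypothetical callable standing in for the corresponding module (710 to 760); the signatures are assumptions chosen to keep the example self-contained and are not taken from the patent.

```python
from typing import Callable, List, Optional


def run_recovery(
    broadcast: Callable[[], None],                  # broadcast module 710
    receive: Callable[[], List[str]],               # receiving module 720
    pair: Callable[[List[str]], Optional[str]],     # matching module 730
    recover: Callable[[str], bool],                 # failure recovery module 740
    backup: Callable[[], None],                     # backup module 750
    start: Callable[[], bool],                      # software starting module 760
) -> bool:
    """Run one recovery pass; return True only if the new software started."""
    broadcast()
    responders = receive()
    paired_device = pair(responders)
    if paired_device is None:
        return False  # no pairing possible: wait for the next attempt
    if not recover(paired_device):
        return False  # download, verification, or replacement failed
    backup()
    return start()
```

Passing the modules in as callables keeps the control flow testable without any real network or file system.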
Third embodiment
An embodiment of the present invention further provides a computer-readable storage medium storing one or more programs. The computer-readable storage medium may include volatile memory, such as random access memory; it may also include non-volatile memory, such as read-only memory, flash memory, a hard disk, or a solid-state drive; and it may include a combination of the above kinds of memory. The one or more programs in the computer-readable storage medium can be executed by one or more processors to implement the software failure recovery method provided by the first embodiment above.
It should be noted that, in this document, the terms "include", "comprise", or any other variant thereof are intended to cover a non-exclusive inclusion, so that a process, method, article, or apparatus that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitation, an element introduced by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article, or apparatus that includes the element.
The serial numbers of the above embodiments of the invention are for description only and do not represent the relative merits of the embodiments.
From the above description of the embodiments, those skilled in the art will clearly understand that the methods of the above embodiments can be implemented by software plus a necessary general-purpose hardware platform, or of course by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes a number of instructions that cause a terminal (which may be a mobile phone, a computer, a server, an air conditioner, a network device, or the like) to execute the methods described in the embodiments of the present invention.
The embodiments of the present invention have been described above with reference to the accompanying drawings, but the present invention is not limited to the specific embodiments described, which are merely illustrative rather than restrictive. Inspired by the present invention, those of ordinary skill in the art can devise many other forms without departing from the purpose of the invention and the scope protected by the claims, all of which fall within the protection of the present invention.

Claims (10)

1. A software failure recovery method, characterized in that the method comprises:
when faulty software is detected, broadcasting a software package acquisition request to normal devices in the network;
receiving software package acquisition responses sent by the normal devices;
pairing with one of the normal devices that sent a software package acquisition response, wherein the paired normal device is called the paired device; and
downloading software from the paired device to perform failure recovery.
2. The software failure recovery method according to claim 1, characterized in that pairing with one of the normal devices that sent a software package acquisition response specifically comprises:
counting the number of software package acquisition responses received and recording their response times; and
when the number of responses reaches a preset quantity threshold, selecting the normal device with the shortest response time for pairing.
3. The software failure recovery method according to claim 2, characterized in that, when the number of responses does not reach the preset quantity threshold, the method waits and then re-broadcasts the software package acquisition request.
4. The software failure recovery method according to claim 1, characterized in that pairing with one of the normal devices that sent a software package acquisition response specifically comprises:
obtaining the signal strength of each normal device that sent a software package acquisition response; and
selecting the normal device with the highest signal strength for pairing.
5. The software failure recovery method according to claim 1, characterized in that downloading software from the paired device to perform failure recovery specifically comprises:
sending a software transmission request to the paired device;
receiving the software program transmitted by the paired device in response to the software transmission request; and
verifying the software program.
6. The software failure recovery method according to claim 5, characterized in that, when the software program is verified successfully, the method further comprises:
replacing the faulty software with the software program.
7. The software failure recovery method according to claim 6, characterized in that the method further comprises:
backing up the software program; and
restarting the software program.
8. The software failure recovery method according to claim 1, characterized in that the software package acquisition request includes version information so that the normal device can determine whether it is compatible with the version information; correspondingly, the software package acquisition response is a response from a normal device that is compatible with the version information.
9. A device, characterized in that the device comprises a processor and a memory;
the processor is configured to execute a software failure recovery program stored in the memory so as to implement the method according to any one of claims 1 to 8.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores one or more programs, the one or more programs being executable by one or more processors to implement the method according to any one of claims 1 to 8.
CN201810500927.XA 2018-05-23 2018-05-23 Software failure recovery method, equipment and computer readable storage medium Withdrawn CN110532130A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810500927.XA CN110532130A (en) 2018-05-23 2018-05-23 Software failure recovery method, equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810500927.XA CN110532130A (en) 2018-05-23 2018-05-23 Software failure recovery method, equipment and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN110532130A true CN110532130A (en) 2019-12-03

Family

ID=68657349

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810500927.XA Withdrawn CN110532130A (en) 2018-05-23 2018-05-23 Software failure recovery method, equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110532130A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1809816A (en) * 2003-06-30 2006-07-26 汤姆森许可贸易公司 Network equipment and a method for monitoring the start up of such equipment
CN101060427A (en) * 2006-04-19 2007-10-24 华为技术有限公司 A system and method for realizing the remote software updating

Similar Documents

Publication Publication Date Title
CN112291621B (en) System upgrading automation test method, device and test equipment
CN108429629A (en) Equipment fault restoration methods and device
CN104850475B (en) A kind of test method and device of equipment
CN104731701B (en) Test system and the method and apparatus for performing test assignment
CN110290016A (en) Equipment obstacle management method, apparatus, internet of things equipment and storage medium
CN111026581A (en) Application program repairing method, device, system, storage medium and electronic device
CN111585841B (en) Automatic test method and related device
CN102541739B (en) The method of testing of (SuSE) Linux OS and device
CN108121612A (en) A kind of collapse processing method and system based on linux kernel router
CN110659106B (en) Container state inspection method and device
CN110049313A (en) A kind of video measurement method and system
CN110618853B (en) Detection method, device and equipment for zombie container
CN101800672B (en) Equipment detection method and equipment
CN111314133A (en) Ammeter communication module upgrading method, ammeter control module and ammeter communication module
CN110532130A (en) Software failure recovery method, equipment and computer readable storage medium
CN105373474A (en) Business system-oriented test method and device
CN104424098B (en) A kind of application testing method and device
CN101237355B (en) Service monitoring method, and network monitoring device
CN104881341A (en) Server as well as testing method and system for computer hard disk
CN113721957A (en) Automatic testing method, device and system for firmware deployment and upgrade of embedded equipment
CN113687992A (en) Onboard equipment power-on and power-off test method
CN113157481A (en) Cluster-based server jump time fault processing method, device and system
US11031972B2 (en) Method for automatically selecting a frequency band
CN110581786A (en) Method, device, system and medium for testing communication stability of NCSI (network communication service) network
CN108988050A (en) A kind of Wi-Fi intelligent socket and Wi-Fi network availability safeguards system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20191203
