CN110532130A - Software failure recovery method, equipment and computer readable storage medium - Google Patents
Software failure recovery method, equipment and computer readable storage medium
- Publication number
- CN110532130A (application number CN201810500927.XA)
- Authority
- CN
- China
- Prior art keywords
- software
- normal device
- response
- failure recovery
- software package
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/14—Error detection or correction of the data by redundancy in operation
- G06F11/1402—Saving, restoring, recovering or retrying
- G06F11/1415—Saving, restoring, recovering or retrying at system level
- G06F11/142—Reconfiguring to eliminate the error
- G06F11/143—Reconfiguring to eliminate the error with loss of software functionality
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L12/00—Data switching networks
- H04L12/02—Details
- H04L12/16—Arrangements for providing special services to substations
- H04L12/18—Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0677—Localisation of faults
Abstract
The invention discloses a software failure recovery method. The method comprises: when faulty software is detected, broadcasting a software package acquisition request to normal devices in the network; receiving the software package acquisition responses sent by the normal devices; pairing with one of the normal devices that sent a software package acquisition response, the paired normal device being referred to as the paired device; and downloading software from the paired device to perform fault recovery. The invention also provides a device and a computer-readable storage medium. The invention can identify software faults automatically, improving the efficiency of locating faulty devices and the reliability of fault recovery.
Description
Technical field
The present invention relates to the current technical field, and more particularly to a software failure recovery method, a device, and a computer-readable storage medium.
Background art
In many industries, such as water, electricity, gas, and heat meter reading, devices deployed in a network are required, on the one hand, to be highly fault tolerant, with the device software running stably over the long term. On the other hand, when a device suffers a software fault, the device software is required to return to normal operation as quickly as possible. However, because the devices are deployed in scattered locations, among other reasons, the fault recovery process is bound to be time-consuming and laborious.
Device fault recovery consists mainly of three steps: locating the faulty device, transmitting the device software, and replacing the device software. Existing fault recovery techniques address problems in the recovery process by locating faulty devices quickly, improving software transmission efficiency, and guaranteeing the success rate of device software replacement. However, these fault recovery methods all share the following problems: they require human participation, so fault recovery is time-consuming, laborious, and inefficient; human error may cause fault recovery to fail, or even damage the device permanently; and because the process cannot be automated, a person must locate the fault and trigger the fault recovery procedure.
Summary of the invention
The main object of the present invention is to provide a software failure recovery method, a device, and a computer-readable storage medium, intended to solve the problem that software fault recovery cannot be automated.
To achieve the above object, the present invention provides a software failure recovery method, the method comprising:
when faulty software is detected, broadcasting a software package acquisition request to normal devices in the network;
receiving the software package acquisition responses sent by the normal devices;
pairing with one of the normal devices that sent a software package acquisition response, wherein the paired normal device is referred to as the paired device;
downloading software from the paired device to perform fault recovery.
In addition, to achieve the above object, the present invention also provides a device, the device comprising a processor and a memory;
the processor is configured to execute a software failure recovery program stored in the memory, so as to implement the above method.
In addition, to achieve the above object, the present invention also provides a computer-readable storage medium. The computer-readable storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to implement the above method.
With the software failure recovery method, device, and computer-readable storage medium provided by the present invention, when faulty software is detected, a software package acquisition request is broadcast to normal devices in the network. After the software package acquisition responses sent by the normal devices are received, pairing is performed with one of the normal devices that sent a response, the paired normal device being referred to as the paired device, and software is downloaded from the paired device to perform fault recovery. The invention can identify software faults automatically, which improves the efficiency of locating faulty devices. Because the devices are networked, software can be obtained automatically from any normal device in the network: as long as a normal device with a matching version exists in the network, the fault can be recovered, which ensures the reliability of fault recovery.
Brief description of the drawings
Fig. 1 is a flowchart of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 2 is a first sub-flowchart of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 3 is a second sub-flowchart of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 4 is a third sub-flowchart of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 5 is another flowchart of the software failure recovery method provided by the first embodiment of the present invention;
Fig. 6 is a schematic diagram of the device hardware architecture provided by the second embodiment of the present invention;
Fig. 7 is a module diagram of the software failure recovery program in Fig. 6.
The realization of the object, the functional features, and the advantages of the present invention are further described below with reference to the embodiments and the accompanying drawings.
Detailed description of the embodiments
It should be understood that the specific embodiments described herein merely illustrate the present invention and are not intended to limit it.
In the following description, suffixes such as "module", "component", or "unit" used to denote elements serve only to facilitate the explanation of the present invention and have no specific meaning in themselves. Therefore, "module", "component", and "unit" may be used interchangeably.
First embodiment
Fig. 1 is a flowchart of the software failure recovery method provided by the first embodiment of the present invention. In Fig. 1, the software failure recovery method includes:
Step 110: when faulty software is detected, broadcast a software package acquisition request to normal devices in the network.
Specifically, the current running state of the software is periodically detected and recorded. If a software fault is detected, the software in which the fault occurred is referred to as the faulty software, and a software package acquisition request is broadcast to the other devices in the network, where the other devices in the network are referred to as normal devices.
Optionally, a priority order can be set for recording the running state, for example, giving priority to recording whether the software is currently running normally, the current version information, the compatible version level, and so on.
Optionally, the software package acquisition request includes the version information required by the current device.
Optionally, there may be one normal device or several; the present invention places no particular limit on the number.
Optionally, if the software is detected to be running normally, detection waits for the next cycle, and the process repeats.
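A single detection cycle of step 110 can be sketched as follows. This is a minimal illustration rather than the patented implementation: the names `PackageRequest`, `check_once`, `is_running_normally`, and `broadcast` are hypothetical, and the health check and the network transport are passed in as callables because the text does not specify either.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class PackageRequest:
    """Software package acquisition request (field names are illustrative)."""
    device_id: str
    required_version: str  # version information the faulty device needs


def check_once(
    device_id: str,
    required_version: str,
    is_running_normally: Callable[[], bool],
    broadcast: Callable[[PackageRequest], None],
) -> Optional[PackageRequest]:
    """Run one detection cycle.

    Returns the request that was broadcast if a fault was detected, or None
    if the software is healthy and the caller should wait for the next cycle.
    """
    if is_running_normally():
        return None  # healthy: nothing to do until the next period
    request = PackageRequest(device_id, required_version)
    broadcast(request)  # sent to the other (normal) devices in the network
    return request
```

A caller would invoke `check_once` on a timer; the period itself is a deployment choice that the text leaves open.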
Step 120: receive the software package acquisition responses sent by the normal devices.
Specifically, when a normal device in the network receives the software package acquisition request, it first checks whether its own version is compatible with the version information in the request. If the versions are compatible, the normal device acknowledges the software package acquisition request and returns a software package acquisition response; if the versions are incompatible, it returns no response.
When a normal device returns a response, the software package acquisition response sent by that normal device is received. Those skilled in the art will understand that, because normal devices differ in response speed, the times at which responses are received also differ. The software package acquisition response is a response from a normal device that is compatible with the version information.
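The responder side of step 120 can be sketched as below. The text does not define what "compatible" means, so the rule used here (matching major version numbers in a dotted version string) is purely an assumption for illustration, as are the function names and the shape of the response.

```python
from typing import Optional


def is_compatible(own_version: str, requested_version: str) -> bool:
    """Assumed rule: versions are compatible when their major numbers match."""
    return own_version.split(".")[0] == requested_version.split(".")[0]


def handle_request(own_id: str, own_version: str, requested_version: str) -> Optional[dict]:
    """Return an acquisition response, or None to stay silent when incompatible."""
    if not is_compatible(own_version, requested_version):
        return None  # incompatible version: no response is returned
    return {"responder": own_id, "version": own_version}
```

Staying silent on a mismatch, rather than sending a negative reply, keeps broadcast traffic low, which matches the behaviour described above.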
Step 130: pair with one of the normal devices that sent a software package acquisition response.
Specifically, after the software package acquisition responses are received, one of the normal devices is selected for pairing, and the paired normal device is referred to as the paired device.
Step 140: download software from the paired device to perform fault recovery.
Specifically, after pairing with the normal device, a software transmission request is initiated and the paired device transmits the software program. After the transmission completes, the integrity and correctness of the software program are checked. If the check succeeds, the software is restarted; if the check fails, device pairing and file transmission are initiated again.
Optionally, as shown in Fig. 2, step 130 specifically includes:
Step 210: count the number of software package acquisition responses received and their response times;
Step 220: when the number of responses reaches a preset quantity threshold, select the normal device with the shortest response time for pairing.
Specifically, the number and the response times of the software package acquisition responses received for each software package acquisition request are counted. To avoid frequent network requests that would cause network congestion, pairing is completed and the fault recovery procedure continues only when the number of responses received is greater than a certain quantity threshold; otherwise pairing is terminated. That is, when the number of responses reaches the quantity threshold, the normal device with the shortest response time among all the software package acquisition responses received is selected for pairing.
Optionally, when the number of software package acquisition responses does not reach the preset quantity threshold, the method waits and then re-broadcasts the software package acquisition request.
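Steps 210 and 220 amount to a threshold check followed by a minimum search. The sketch below assumes that "reaches" means greater than or equal to the threshold; `Response` and `pick_fastest` are hypothetical names, and response time is taken to be the delay between the broadcast and the arrival of the response.

```python
from typing import NamedTuple, Optional, Sequence


class Response(NamedTuple):
    device_id: str
    response_time_s: float  # delay between broadcast and receipt, in seconds


def pick_fastest(responses: Sequence[Response], threshold: int) -> Optional[Response]:
    """Return the quickest responder once enough responses have arrived.

    Returns None when fewer than `threshold` responses were collected, in
    which case the caller waits and re-broadcasts the request.
    """
    if len(responses) < threshold:
        return None
    return min(responses, key=lambda r: r.response_time_s)
```

For example, with three responses at 0.30 s, 0.12 s, and 0.25 s and a threshold of 3, the device that answered in 0.12 s is chosen; with only two of them, no pairing takes place.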
Optionally, as shown in Fig. 3, step 130 specifically includes:
Step 310: obtain the signal strength of each normal device that sent a software package acquisition response;
Step 320: select the normal device with the highest signal strength for pairing.
Specifically, during fault recovery, the signal strength of each normal device that sent a software package acquisition response is detected, and pairing is performed with the device that has the highest signal strength.
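The signal-strength variant of pairing can be sketched in the same way. The mapping from device identifier to a strength reading in dBm is an assumption made for illustration; the text does not say how signal strength is measured or in what unit.

```python
from typing import Mapping, Optional


def pick_strongest(signal_dbm: Mapping[str, float]) -> Optional[str]:
    """Return the responder with the highest signal strength, or None if empty."""
    if not signal_dbm:
        return None
    # Readings in dBm are negative; the value closest to zero is the strongest.
    return max(signal_dbm, key=lambda device_id: signal_dbm[device_id])
```

Choosing by signal strength favours a reliable link for the subsequent download, whereas choosing by response time favours a lightly loaded peer; the text presents the two as alternatives.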
Optionally, as shown in Fig. 4, step 140 specifically includes:
Step 410: send a software transmission request to the paired device;
Step 420: receive the software program transmitted by the paired device in response to the software transmission request;
Step 430: verify the software program;
Step 440: when the software program is verified successfully, replace the faulty software with the software program.
Specifically, after pairing is complete, a software transmission request is sent to the paired device so that the paired device sends the corresponding software program in response. The integrity and correctness of the software program are then verified. If verification succeeds, the faulty software is replaced with the verified software program. If the replacement succeeds, fault recovery has succeeded; otherwise, pairing with a normal device and downloading of the software program are performed again.
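Steps 410 to 440, together with the retry on failure, can be sketched as follows. Several points are assumptions: the text does not name a verification method, so a SHA-256 digest supplied alongside the payload stands in for the integrity and correctness check; the `download` callable is taken to cover both re-pairing and transfer; and the `max_attempts` bound is added for safety, since the text does not limit retries.

```python
import hashlib
from typing import Callable, Tuple

# A download returns (payload bytes, expected SHA-256 hex digest).
Download = Callable[[], Tuple[bytes, str]]


def verify(payload: bytes, expected_sha256: str) -> bool:
    """Integrity check: compare the payload digest with the expected digest."""
    return hashlib.sha256(payload).hexdigest() == expected_sha256


def recover(download: Download, install: Callable[[bytes], bool], max_attempts: int = 3) -> bool:
    """Download, verify, and install; retry from pairing when any step fails."""
    for _ in range(max_attempts):
        payload, digest = download()  # pair with a normal device and transfer
        if verify(payload, digest) and install(payload):
            return True  # faulty software replaced: recovery succeeded
    return False
```

Because `install` is only called after `verify` passes, a corrupted transfer never overwrites the software on the device.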
Optionally, as shown in Fig. 5, after step 140 the software failure recovery method of this embodiment further includes:
Step 510: back up the software program.
Step 520: restart the software program.
Specifically, during software startup, the received software program is first backed up, and an attempt is then made to start it. If startup succeeds, fault recovery has succeeded; if startup fails, fault recovery has failed, and the device waits to initiate the next fault recovery.
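Steps 510 and 520 can be sketched as a backup followed by a start attempt. The file layout and the `start` callable are hypothetical; the text only states that the received program is backed up before the start is attempted.

```python
import shutil
from pathlib import Path
from typing import Callable


def backup_and_start(program: Path, backup_dir: Path, start: Callable[[Path], bool]) -> bool:
    """Back up the received program, then try to start it.

    Returns True when the start succeeds (recovery succeeded) and False when
    it fails, in which case the caller waits for the next recovery attempt.
    """
    backup_dir.mkdir(parents=True, exist_ok=True)
    shutil.copy2(program, backup_dir / program.name)  # keep a copy first
    return start(program)
```

Taking the backup before the start attempt means a verified copy of the program is kept on the device whether or not the start succeeds.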
With the software failure recovery method provided in this embodiment, when faulty software is detected, a software package acquisition request is broadcast to normal devices in the network. After the software package acquisition responses sent by the normal devices are received, pairing is performed with one of the normal devices that sent a response, the paired normal device being referred to as the paired device, and software is downloaded from the paired device to perform fault recovery. The invention can identify software faults automatically, which improves the efficiency of locating faulty devices. Because the devices are networked, software can be obtained automatically from any normal device in the network: as long as a normal device with a matching version exists in the network, the fault can be recovered, which ensures the reliability of fault recovery.
Second embodiment
Fig. 6 is a schematic diagram of a device hardware architecture provided by the second embodiment of the present invention. In Fig. 6, the device includes: a memory 610, a processor 620, and a software failure recovery program 630 that is stored on the memory 610 and can run on the processor 620. In this embodiment, the software failure recovery program 630 comprises a series of computer program instructions stored on the memory 610; when these computer program instructions are executed by the processor 620, the software failure recovery operations of the embodiments of the present invention can be implemented. In some embodiments, the software failure recovery program 630 may be divided into one or more modules according to the specific operations implemented by each part of the computer program instructions. As shown in Fig. 7, the software failure recovery program 630 includes: a broadcast module 710, a receiving module 720, a matching module 730, a failure recovery module 740, a backup module 750, and a software starting module 760. Wherein:
The broadcast module 710 is configured to broadcast a software package acquisition request to normal devices in the network when faulty software is detected.
Specifically, the current running state of the software is periodically detected and recorded. If a software fault is detected, the software in which the fault occurred is referred to as the faulty software, and the broadcast module 710 broadcasts a software package acquisition request to the other devices in the network, where the other devices in the network are referred to as normal devices.
Optionally, a priority order can be set for recording the running state, for example, giving priority to recording whether the software is currently running normally, the current version information, the compatible version level, and so on.
Optionally, the software package acquisition request includes the version information required by the current device.
Optionally, there may be one normal device or several; the present invention places no particular limit on the number.
Optionally, if the software is detected to be running normally, detection waits for the next cycle, and the process repeats.
Receiving module 720 obtains response for receiving the software package that the normal device is sent.
Specifically, the normal device in network gets the software package acquisition request, can first compare itself version whether
Mutually compatible with the version information in software package acquisition request, if edition compatibility, normal device confirmation obtains the software package and obtains
Request is taken, and returns to software package and obtains response, if incompatible version, not returning response.
In normal device returning response, then the software package that receiving module 720 receives that normal device is sent obtains response.This
For field technical staff it is understood that different according to the response speed of normal device, then the time for receiving response is not also identical.
It is the normal device response mutually compatible with the version information that the software package, which obtains response,.
The matching module 730 is configured to pair with one of the normal devices that sent a software package acquisition response.
Specifically, after the software package acquisition responses are received, the matching module 730 selects one of the normal devices for pairing, and the paired normal device is referred to as the paired device.
The failure recovery module 740 is configured to download software from the paired device to perform fault recovery.
Specifically, after pairing with the normal device, the failure recovery module 740 initiates a software transmission request and the paired device transmits the software program. After the transmission completes, the integrity and correctness of the software program are checked. If the check succeeds, the software is restarted; if the check fails, device pairing and file transmission are initiated again.
More specifically, the failure recovery module 740 is configured to:
send a software transmission request to the paired device;
receive the software program transmitted by the paired device in response to the software transmission request;
verify the software program;
when the software program is verified successfully, replace the faulty software with the software program.
Specifically, after pairing is complete, a software transmission request is sent to the paired device so that the paired device sends the corresponding software program in response. The integrity and correctness of the software program are then verified. If verification succeeds, the faulty software is replaced with the verified software program. If the replacement succeeds, fault recovery has succeeded; otherwise, pairing with a normal device and downloading of the software program are performed again.
In an optional embodiment, the matching module 730 may be specifically configured to:
count the number of software package acquisition responses received and their response times;
when the number of responses reaches a preset quantity threshold, select the normal device with the shortest response time for pairing.
Specifically, the matching module 730 counts the number and the response times of the software package acquisition responses received for each software package acquisition request. To avoid frequent network requests that would cause network congestion, pairing is completed and the fault recovery procedure continues only when the number of responses received is greater than a certain quantity threshold; otherwise pairing is terminated. That is, when the number of responses reaches the quantity threshold, the normal device with the shortest response time among all the software package acquisition responses received is selected for pairing.
Optionally, when the number of software package acquisition responses does not reach the preset quantity threshold, the module waits and the software package acquisition request is then re-broadcast.
In another optional embodiment, the matching module 730 may also be specifically configured to:
obtain the signal strength of each normal device that sent a software package acquisition response;
select the normal device with the highest signal strength for pairing.
Specifically, during fault recovery, the signal strength of each normal device that sent a software package acquisition response is detected, and pairing is performed with the device that has the highest signal strength.
The backup module 750 is configured to back up the software program.
The software starting module 760 is configured to restart the software program.
Specifically, during software startup, the backup module 750 backs up the received software program, and an attempt is then made to start it. If the software starting module 760 starts the program successfully, fault recovery has succeeded; if startup fails, fault recovery has failed, and the device waits to initiate the next fault recovery.
With the device provided in this embodiment, when faulty software is detected, the broadcast module 710 broadcasts a software package acquisition request to normal devices in the network. After the receiving module 720 receives the software package acquisition responses sent by the normal devices, the matching module 730 pairs with one of the normal devices that sent a response, and the failure recovery module 740 downloads software from the paired device to perform fault recovery. The invention can identify software faults automatically, which improves the efficiency of locating faulty devices. Because the devices are networked, software can be obtained automatically from any normal device in the network: as long as a normal device with a matching version exists in the network, the fault can be recovered, which ensures the reliability of fault recovery.
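The division of program 630 into modules 710 to 760 can be illustrated by wiring the modules together as injected callables. This layout is an assumption chosen to keep the sketch short and testable; `RecoveryProgram` and its field names are hypothetical, and the text does not prescribe how the modules call one another.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class RecoveryProgram:
    """Modules 710-760 modelled as injected callables (illustrative layout)."""
    broadcast: Callable[[], None]               # broadcast module 710
    receive: Callable[[], List[str]]            # receiving module 720
    pair: Callable[[List[str]], Optional[str]]  # matching module 730
    recover: Callable[[str], bool]              # failure recovery module 740
    backup: Callable[[], None]                  # backup module 750
    start: Callable[[], bool]                   # software starting module 760

    def run(self) -> bool:
        """Run one recovery pass; return True only if the software starts."""
        self.broadcast()
        peer = self.pair(self.receive())
        if peer is None or not self.recover(peer):
            return False  # no pairing or failed download: try again later
        self.backup()
        return self.start()
```

Injecting each module keeps the orchestration independent of the network and storage details, so either pairing strategy can be substituted without changing `run`.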
Third embodiment
An embodiment of the present invention also provides a computer-readable storage medium. The computer-readable storage medium stores one or more programs. The computer-readable storage medium may include volatile memory, such as random access memory; it may also include non-volatile memory, such as read-only memory, flash memory, a hard disk, or a solid-state drive; and it may also include a combination of the above kinds of memory. The one or more programs in the computer-readable storage medium can be executed by one or more processors to implement the software failure recovery method provided by the first embodiment above.
It should be noted that, in this document, the terms "include", "comprise", or any other variant thereof are intended to cover a non-exclusive inclusion, so that a process, method, article, or apparatus that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitation, an element defined by the phrase "including a ..." does not exclude the existence of other identical elements in the process, method, article, or apparatus that includes the element.
The serial numbers of the above embodiments of the present invention are for description only and do not represent the relative merits of the embodiments.
From the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and of course also by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal (which may be a mobile phone, a computer, a server, an air conditioner, a network device, or the like) to execute the methods described in the embodiments of the present invention.
The embodiments of the present invention have been described above with reference to the accompanying drawings, but the present invention is not limited to the specific embodiments described. The above embodiments are merely illustrative, not restrictive. Inspired by the present invention, those of ordinary skill in the art can devise many other forms without departing from the purpose of the present invention and the scope protected by the claims, all of which fall within the protection of the present invention.
Claims (10)
1. A software failure recovery method, characterized in that the method comprises:
when faulty software is detected, broadcasting a software package acquisition request to normal devices in the network;
receiving the software package acquisition responses sent by the normal devices;
pairing with one of the normal devices that sent a software package acquisition response, wherein the paired normal device is referred to as the paired device;
downloading software from the paired device to perform fault recovery.
2. The software failure recovery method according to claim 1, characterized in that pairing with one of the normal devices that sent a software package acquisition response specifically comprises:
counting the number of software package acquisition responses received and their response times;
when the number of responses reaches a preset quantity threshold, selecting the normal device with the shortest response time for pairing.
3. The software failure recovery method according to claim 2, characterized in that, when the number of responses does not reach the preset quantity threshold, the method waits and then re-broadcasts the software package acquisition request.
4. The software failure recovery method according to claim 1, characterized in that pairing with one of the normal devices that sent a software package acquisition response specifically comprises:
obtaining the signal strength of each normal device that sent a software package acquisition response;
selecting the normal device with the highest signal strength for pairing.
5. The software failure recovery method according to claim 1, characterized in that downloading software from the paired device to perform fault recovery specifically comprises:
sending a software transmission request to the paired device;
receiving the software program transmitted by the paired device in response to the software transmission request;
verifying the software program.
6. The software failure recovery method according to claim 5, characterized in that, when the software program is verified successfully, the method further comprises:
replacing the faulty software with the software program.
7. The software failure recovery method according to claim 6, characterized in that the method further comprises:
backing up the software program;
restarting the software program.
8. The software failure recovery method according to claim 1, characterized in that the software package acquisition request includes version information, so that the normal device determines whether it is compatible with the version information; correspondingly, the software package acquisition response is a response from a normal device that is compatible with the version information.
9. A device, characterized in that the device comprises a processor and a memory;
the processor is configured to execute a software failure recovery program stored in the memory, so as to implement the method according to any one of claims 1-8.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to implement the method according to any one of claims 1-8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810500927.XA CN110532130A (en) | 2018-05-23 | 2018-05-23 | Software failure recovery method, equipment and computer readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810500927.XA CN110532130A (en) | 2018-05-23 | 2018-05-23 | Software failure recovery method, equipment and computer readable storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110532130A true CN110532130A (en) | 2019-12-03 |
Family
ID=68657349
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810500927.XA Withdrawn CN110532130A (en) | 2018-05-23 | 2018-05-23 | Software failure recovery method, equipment and computer readable storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110532130A (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1809816A (en) * | 2003-06-30 | 2006-07-26 | 汤姆森许可贸易公司 | Network equipment and a method for monitoring the start up of such equipment |
CN101060427A (en) * | 2006-04-19 | 2007-10-24 | 华为技术有限公司 | A system and method for realizing the remote software updating |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112291621B (en) | System upgrading automation test method, device and test equipment | |
CN108429629A (en) | Equipment fault restoration methods and device | |
CN104850475B (en) | A kind of test method and device of equipment | |
CN104731701B (en) | Test system and the method and apparatus for performing test assignment | |
CN110290016A (en) | Equipment obstacle management method, apparatus, internet of things equipment and storage medium | |
CN111026581A (en) | Application program repairing method, device, system, storage medium and electronic device | |
CN111585841B (en) | Automatic test method and related device | |
CN102541739B (en) | The method of testing of (SuSE) Linux OS and device | |
CN108121612A (en) | A kind of collapse processing method and system based on linux kernel router | |
CN110659106B (en) | Container state inspection method and device | |
CN110049313A (en) | A kind of video measurement method and system | |
CN110618853B (en) | Detection method, device and equipment for zombie container | |
CN101800672B (en) | Equipment detection method and equipment | |
CN111314133A (en) | Ammeter communication module upgrading method, ammeter control module and ammeter communication module | |
CN110532130A (en) | Software failure recovery method, equipment and computer readable storage medium | |
CN105373474A (en) | Business system-oriented test method and device | |
CN104424098B (en) | A kind of application testing method and device | |
CN101237355B (en) | Service monitoring method, and network monitoring device | |
CN104881341A (en) | Server as well as testing method and system for computer hard disk | |
CN113721957A (en) | Automatic testing method, device and system for firmware deployment and upgrade of embedded equipment | |
CN113687992A (en) | Onboard equipment power-on and power-off test method | |
CN113157481A (en) | Cluster-based server jump time fault processing method, device and system | |
US11031972B2 (en) | Method for automatically selecting a frequency band | |
CN110581786A (en) | Method, device, system and medium for testing communication stability of NCSI (network communication service) network | |
CN108988050A (en) | A kind of Wi-Fi intelligent socket and Wi-Fi network availability safeguards system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication | Application publication date: 20191203 |