CN101488889B - Detecting and recovering method for network management systematic engineering failure - Google Patents

Detecting and recovering method for network management systematic engineering failure Download PDF

Info

Publication number
CN101488889B
CN101488889B CN2008100041044A CN200810004104A CN101488889B CN 101488889 B CN101488889 B CN 101488889B CN 2008100041044 A CN2008100041044 A CN 2008100041044A CN 200810004104 A CN200810004104 A CN 200810004104A CN 101488889 B CN101488889 B CN 101488889B
Authority
CN
China
Prior art keywords
network management
management system
external module
fault
tabulation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2008100041044A
Other languages
Chinese (zh)
Other versions
CN101488889A (en
Inventor
袁虎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN2008100041044A priority Critical patent/CN101488889B/en
Publication of CN101488889A publication Critical patent/CN101488889A/en
Application granted granted Critical
Publication of CN101488889B publication Critical patent/CN101488889B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a method for detecting and restoring network management system engineering fault, comprising following steps: (1) a plurality of external modules are provided in the external of the network management system, each external module can complete the functions of one or a plurality of working units in original network management system and has the function of detecting and restoring the function; (2) external modules with corresponding functions can be selected according to the fault of the network management system; (3) the external module are loaded when starting the network management system, the external modules can automatically replace the corresponding working units to complete the corresponding function when being loading; (4) for the fault to be detected, the positioning of the fault can be completed by monitoring the working abnormity of the loaded corresponding external modules; for the fault to be restored, the fault can be directly restored by loading the corresponding external modules. The invention can not be limited by whether the version comprises the debugging information or not, need not modify the original program and interrupt the operation of the network management system, meanwhile, the fault improvement can be implemented without upgrading the source program under the condition that the code changing is small.

Description

A kind of detection restorative procedure of network management system engineering failure
Technical field
The present invention relates to a kind of detection restorative procedure of network management system engineering failure.
Background technology
Because the existence of test for leaks, the version of the engineering of network management system certainly exists fault.In case network management system breaks down in engineering, the detection of fault and solve relatively difficulty, this be because:
1) the engineering released version generally is the Release version, does not comprise Debugging message, the bad track and localization of fault.
2) the network management system user is very tight to edition upgrading control, and the number of times of generally upgrading is limited.Require network management system to run without interruption simultaneously.
Need a kind of method, can not interrupt the operation of network management system on the other hand for engineering released version record Debugging message or daily record on the one hand, the source program realization fault of can not upgrading under the very little situation of code change is simultaneously improved.
Summary of the invention
The technical problem to be solved in the present invention provides a kind of detection restorative procedure of network management system engineering failure, can be engineering released version record Debugging message or daily record, can not interrupt the operation of network management system on the other hand, the source program of can not upgrading under the very little situation of code change is simultaneously realized the fault improvement.
In order to address the above problem, the invention provides a kind of detection restorative procedure of network management system engineering failure, may further comprise the steps:
(1) at the some external modules of network management system outer setting, each external module correspondence is finished the function of finishing the one or more working cell in the former network management system and is had fault detect or the fault restoration function;
(2), select one or more external modules with corresponding function according to network management system fault to be detected or to be repaired;
(3) network management system selected external module in the load step (2) when starting is replaced working cell corresponding in the original network management system automatically and is finished corresponding function when external module is loaded;
(4) for fault to be detected, the relevant work unit of the abnormality detection network management system by the respective external module work that loaded of monitoring is finished failure location; For fault to be repaired, pass through the respective external module that loaded, directly fault is repaired.
Further, method of the present invention, wherein, in the described step (2), after the selection respective external module, customize and be kept in the external module tabulation, in the described step (3), network management system reads described external module tabulation when starting, load the external module in the described external module tabulation successively.
Further, method of the present invention, wherein, in the described step (4), for fault to be detected, the relevant work unit of the abnormality detection network management system by the respective external module work that loaded of monitoring, finish after the failure location, if confirm as the fault of network management system, and be to deal with problems with this method, then externally unload current external module in the module list, load external module, this fault is repaired with correction network management system function by revising the external module tabulation.
Further, method of the present invention, wherein, in the webmaster running, add or the unloading external module, tabulate and finish by revising the external module loaded, load deleted external module in external module newly-increased in the tabulation or the unloading tabulation, revise and this tabulation is preserved after finishing.
Further, method of the present invention, wherein, after external module was unloaded, it stopped to continue to move finishing its corresponding function, and network management system is recovered the operation of relevant work unit, finishes this working cell original function.
Further, method of the present invention, wherein, the tabulation of described external module is kept in the database table or in the configuration file.
Further, method of the present invention, wherein, the external module that is loaded is dynamic link library or feature card.
Further, method of the present invention, wherein, the realization of described external module tabulation, adopt one of following mode: directly represent by the file name of external module, or represent all external modules of matching, or represent all external modules under this catalogue or the path by catalogue or pathname by include file name asterisk wildcard.
Compared with prior art, the method for the invention can load the external module with record Debugging message and journal function, thus auxiliary detection network management system fault; After confirming the abort situation of network management system simultaneously, can load external module with correction network management system function, thus the auxiliary network management system fault that solves; Use this method to carry out engineering failure and detect and solve, be not subjected to version whether to comprise the restriction of Debugging message, also do not need to revise original program, and do not need to interrupt the network management system operation success of just can upgrading, the issue of minimizing temporary version.
Description of drawings
Fig. 1 is the schematic diagram that loads external module in the embodiment of the invention on former network management system;
Fig. 2 is the method flow diagram of the embodiment of the invention;
Fig. 3 adopts dynamic link library to realize the method flow diagram that detects in the embodiment of the invention;
Fig. 4 is the interface schematic diagram of network management system customization dynamic link library in the embodiment of the invention.
Embodiment
The present invention is in order to solve the drawback that conventional solution exists, further set forth the detection restorative procedure of a kind of network management system engineering failure of the present invention by following specific embodiment, below embodiment is described in detail, but not as a limitation of the invention.
The fault location of wavelength-division network management system in engineering and solve to have some problems, for example the phenomenon that the historical performance of the one-tenth section of network element is lost has appearred in network management system.In order to confirm that problem has appearred in the message whether network management system receives, and uses method of the present invention to carry out fault detect and reparation.
Detection of the present invention and restorative procedure are to detect or repair for what finish the work, can change as required: if fault the unknown can be finished failure location by detection; If, can directly revise with this method by the position of other approach known fault; But no matter be to detect or repair, its core thinking all realizes by interim loading of external module.This also is an inventive point of the present invention.
As shown in Figure 1, the present invention has increased one or more external modules (21 in former network management system 11 outsides, 22, ..., 2n), assist the fault detect and the solution of network management system by interim loading external module, wherein, external module can be a dynamic link library, also can be the plug-in unit (as some feature card or the like) of other modes.
As shown in Figure 2, the method flow diagram for the embodiment of the invention may further comprise the steps:
Step 200, at the some external modules of former network management system outer setting, each external module correspondence is finished the function of finishing the one or more working cell in the former network management system and is had fault detect and repair function;
For some fault of failing to confirm, need to detect fault earlier, have the external module of fault detection capability, the step after finishing by loading; For some fault of having confirmed, then directly repair, by the external module that loading has the fault restoration function, the step after finishing;
Step 201, according to fault to be detected or to be repaired, select corresponding external module (in fault detect external module and fault restoration external module, the function of just finishing corresponding function is inequality), customize and be kept in the external module tabulation with fault detect or fault restoration function;
Described tabulation is stored in the configuration file and (for other different situations, also can be kept in the database table);
Realization for described tabulation, can adopt multiple mode, such as: directly represent in the similar present embodiment by the file name of a series of external modules, or represent all external modules of matching, or represent all external modules under this catalogue or the path or the like by catalogue or pathname by include file name asterisk wildcard;
Step 202, network management system read the external module tabulation that has customized and preserved when restarting, load the external module in the tabulation successively, replace the working cell of the correspondence in the former network management system when external module is loaded automatically, finish corresponding function;
Network management system reads the title of each external module in the configuration file when starting, and loads all external modules in the tabulation;
Step 203, for fault to be detected, the relevant work unit of the abnormality detection network management system by monitoring external module work is finished failure location; For fault to be repaired, the external module with correction network management system function that passes through to be loaded is directly repaired fault.
For in the step 203, for fault to be detected, the relevant work unit of the abnormality detection network management system by monitoring described external module work is finished after the failure location, if be the fault of network management system, then externally unload the external module that current detection is used in the module list, and, this fault is repaired by revising the external module that external module tabulation loading has correction network management system function.
As shown in Figure 3, realize the method flow diagram that detects for adopting dynamic link library in the embodiment of the invention, in the present embodiment, adopt dynamic link library, specific implementation as external module, because the message that the conjecture network management system receives has problem and causes performance to be lost, whether the message that needs the location webmaster to receive exists fault, so can write a message receiver function PP with journal function, and put into dynamic link libraries AA, by loading this dynamic link libraries AA, be used for replacing the reception message function OO of former webmaster, can determine by checking daily record whether the message that receives really has problem, finishes fault detect, may further comprise the steps:
Step 300, at dynamic link library AA of former network management system outer setting, dynamic link library AA correspondence is finished the message input in the former network management system;
Dynamic link library AA has record Debugging message and journal function and fault detection capability, thereby can auxiliary detection is that fault has appearred in reception message function OO in the network management system, or the input of webmaster has problem to cause the message mistake;
Step 301 customizes dynamic link library AA and be kept in the tabulation of dynamic link library, and this tabulation is kept in the configuration file;
Increase the window of the tabulation of customization dynamic link library in the network management system interface, as shown in Figure 4, wherein 33 is tabulations of the file name of dynamic link library, and can be by record of button 44 increases with by record of button 55 deletions; The tabulation of the file name of dynamic link library is stored in the configuration file behind the close;
Step 302, network management system reads the title of the dynamic link library in the dynamic link library tabulation that has customized and preserved when restarting, and the dynamic link library AA in the loading tabulation, when being loaded, replaces dynamic link library AA the working cell of the correspondence in the former network management system automatically, be the reception message function OO of former webmaster, finish corresponding function;
Loading dynamic link library mainly is to realize that by the loading function of dynamic link library such as under windows platform, function name is LoadLibrary;
In this step, if in running, add or the unloading dynamic link library, tabulate and finish by revising the dynamic link library loaded, the user can open window as shown in Figure 4, increase a record AA by button 44, network management system need load the dynamic link library among the record AA simultaneously; Or by a record of button 55 deletions BB, network management system need unload the dynamic link library among the record BB simultaneously, closes window after modification finishes, and amended tabulation is saved in the configuration literary composition;
Step 303, by checking the message of dynamic link library AA record, whether correct, finish failure location if detecting the received message of the reception message function OO of former network management system;
Write a message receiver function PP with journal function, and put among the dynamic link library AA, the address of function PP is CC; Network management system is searched original message receiver function OO after loading dynamic link library AA, and the address D D of record function OO, the function address of rewriting the message reception of network management system then is CC, like this, when later network management system was called the message receiver function at every turn, the actual function that calls was function PP.PP has log recording function because of function, so can supply that all message accountings check the raw address that finds fault to send in file;
It is incorrect to be checked through message itself herein, can confirm that then fault is that the input of webmaster is wrong, but the input that can only determine webmaster here is wrong, and emphasis solves the problem of webmaster input, sees whether go back loss performance again after whether network management system itself has problem need wait input correct.
If check and find, the webmaster input is correct, but exists performance to lose, that can determine that then webmaster itself has fault, and can't determine to receive message function OO has fault, needs further to analyze, if be the problem of OO really, just can revise OO, then can carry out next step:
Step 304, the dynamic link library AA that the unloading current detection is used in the dynamic link library tabulation, and be added with the dynamic link library of revising the network management system function, and load this dynamic link library fault is repaired, thus the auxiliary network management system fault that solves;
Behind the network management system unloading dynamic link library AA, it stops to continue to move finishing its corresponding function, simultaneously DD is changeed back in the address of the message receiver function OO of network management system, to recover former message receiver function OO operation, finishes its original function;
The address CC of top receiver function PP can directly use the pointer of function PP to obtain, and the address D D of former receiver function OO can find by function name in the function address table of network management system;
The unloading dynamic link library mainly is to realize that by the unloading function of dynamic link library such as under windows platform, function name is FreeLibrary.
Certainly; the present invention also can have other various embodiments; under the situation that does not deviate from spirit of the present invention and essence thereof; those of ordinary skill in the art can make various corresponding changes and distortion according to the present invention, but these corresponding changes and distortion all should belong to the protection range of the appended claim of the present invention.

Claims (8)

1. the detection restorative procedure of a network management system engineering failure is characterized in that, may further comprise the steps:
(1) at the some external modules of network management system outer setting, each external module correspondence is finished the function of finishing the one or more working cell in the former network management system and is had fault detect or the fault restoration function;
(2), select one or more external modules with corresponding function according to network management system fault to be detected or to be repaired;
(3) network management system selected external module in the load step (2) when starting is replaced working cell corresponding in the original network management system automatically and is finished corresponding function when external module is loaded;
(4) for fault to be detected, the relevant work unit of the abnormality detection network management system by the respective external module work that loaded of monitoring is finished failure location; For fault to be repaired, pass through the respective external module that loaded, directly fault is repaired.
2. the method for claim 1, it is characterized in that, in the described step (2), after selecting the respective external module, customization also is kept in the external module tabulation, and in the described step (3), network management system is when starting, read described external module tabulation, load the external module in the described external module tabulation successively.
3. method as claimed in claim 2, it is characterized in that, in the described step (4), for fault to be detected, the relevant work unit of the abnormality detection network management system by the respective external module work that loaded of monitoring is finished after the failure location, if confirm as the fault of network management system, then externally unload current external module in the module list, load external module, this fault is repaired with correction network management system function by revising the external module tabulation.
4. method as claimed in claim 3, it is characterized in that, in the webmaster running, add or the unloading external module, tabulate and finish by revising the external module loaded, load deleted external module in external module newly-increased in the tabulation or the unloading tabulation, revise and this tabulation is preserved after finishing.
5. as claim 2 or 4 described methods, it is characterized in that after external module was unloaded, it stopped to continue to move finishing its corresponding function, network management system is recovered the operation of relevant work unit, finishes this working cell original function.
6. as claim 3 or 4 described methods, it is characterized in that the tabulation of described external module is kept in the database table or in the configuration file.
7. the method for claim 1 is characterized in that, the external module that is loaded is dynamic link library or feature card.
8. as claim 3 or 4 described methods, it is characterized in that, the realization of described external module tabulation, adopt one of following mode: directly represent by the file name of external module, or represent all external modules of matching, or represent all external modules under this catalogue or the path by catalogue or pathname by include file name asterisk wildcard.
CN2008100041044A 2008-01-17 2008-01-17 Detecting and recovering method for network management systematic engineering failure Expired - Fee Related CN101488889B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2008100041044A CN101488889B (en) 2008-01-17 2008-01-17 Detecting and recovering method for network management systematic engineering failure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2008100041044A CN101488889B (en) 2008-01-17 2008-01-17 Detecting and recovering method for network management systematic engineering failure

Publications (2)

Publication Number Publication Date
CN101488889A CN101488889A (en) 2009-07-22
CN101488889B true CN101488889B (en) 2011-04-20

Family

ID=40891575

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2008100041044A Expired - Fee Related CN101488889B (en) 2008-01-17 2008-01-17 Detecting and recovering method for network management systematic engineering failure

Country Status (1)

Country Link
CN (1) CN101488889B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1917454A (en) * 2005-08-17 2007-02-21 华为技术有限公司 System and method for detecting faults in network
CN1968148A (en) * 2006-10-13 2007-05-23 华为技术有限公司 Network management system for integrative supervision and management of application software system and host resource
CN101047573A (en) * 2006-06-27 2007-10-03 华为技术有限公司 Fault testing system of communication equipment and fault testing method of communication equipment

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1917454A (en) * 2005-08-17 2007-02-21 华为技术有限公司 System and method for detecting faults in network
CN101047573A (en) * 2006-06-27 2007-10-03 华为技术有限公司 Fault testing system of communication equipment and fault testing method of communication equipment
CN1968148A (en) * 2006-10-13 2007-05-23 华为技术有限公司 Network management system for integrative supervision and management of application software system and host resource

Also Published As

Publication number Publication date
CN101488889A (en) 2009-07-22

Similar Documents

Publication Publication Date Title
CN109240886B (en) Exception handling method, exception handling device, computer equipment and storage medium
CN100549946C (en) Be used to carry out the apparatus and method of data processing operation
US20110283274A1 (en) Firmware image update and management
CN104834575A (en) Firmware recovery method and device
CN101647013B (en) Bookmark and configuration file for installation sequence
US7882388B2 (en) Dual independent non volatile memory systems
CN105550056B (en) A kind of fault self-recovery system and its implementation based on system reconfiguration
US7730029B2 (en) System and method of fault tolerant reconciliation for control card redundancy
CN103458086A (en) Smart phone and fault detection method thereof
CN100395713C (en) Method of automatic repairing basic input output system element and module
JP5167936B2 (en) Information processing device
CN109324918A (en) Processing method and processing device, terminal and the storage medium of application exception
US20170046249A1 (en) Testing of enterprise resource planning systems
CN104598256A (en) Computer BIOS (Basic Input /Output System) recovery system and method
CN108170456B (en) Firmware upgrading method and device for electronic equipment
CN105224416B (en) Restorative procedure and related electronic device
CN101488889B (en) Detecting and recovering method for network management systematic engineering failure
CN106909382B (en) Method and device for outputting different types of system starting information
CN100476745C (en) Method for implementing automatic fault-tolerance of image file in Linux operating system booting process
CN111694724B (en) Test method and device of distributed form system, electronic equipment and storage medium
CN110968456B (en) Method and device for processing fault disk in distributed storage system
CN113568834A (en) SDK code compatibility detection method, device, computer equipment and medium
CN100458719C (en) Method and system obtaining and compliling basic input and output system information
JP2001005693A (en) System and method for automatically restoring fault and recording medium recording automatic fault restoration program
CN111680000A (en) Configuration system and method of field programmable gate array

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110420

Termination date: 20180117