CN114116282B - Method and device for reporting and repairing network additional storage faults - Google Patents

Info

Publication number
CN114116282B
Authority
CN
China
Prior art keywords
alarm
alarm event
reporting
event
error code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111342238.9A
Other languages
Chinese (zh)
Other versions
CN114116282A (en)
Inventor
郑强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202111342238.9A
Publication of CN114116282A
Application granted
Publication of CN114116282B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0727 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a storage system, e.g. in a DASD or network based storage system
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079 Root cause analysis, i.e. error or fault diagnosis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793 Remedial or corrective actions
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0631 Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application provides a method, a system, a device and a storage medium for reporting and repairing network attached storage (NAS) faults. The method comprises the following steps: acquiring a NAS alarm information file, and filling in the alarm data information in the alarm information file; judging in sequence, according to the filled-in alarm data information, whether each alarm event triggers an alarm; in response to an alarm event triggering an alarm, calling a reporting function to report the alarm event; and calling a repair function in a fault mode library to repair the alarm event according to the identifier of the alarm event that occurred. The application makes NAS alarms visible to the user, so that faults can be handled effectively and the stability of the system is ensured. In addition, some alarms can be repaired automatically without manual intervention, so the repair is imperceptible to the user and user acceptance increases.

Description

Method and device for reporting and repairing network additional storage faults
Technical Field
The present application relates to the field of storage, and in particular, to a method, system, device, and storage medium for reporting and repairing a network attached storage failure.
Background
In the era of big data, the demands on storage reliability and on accurate fault localization keep rising. However, when the NAS (Network Attached Storage) service of the current MCS (a trimmed-down Linux system based on the Linux kernel) fails during use, the GUI (Graphical User Interface) shows no alarm event information related to the NAS service. The user therefore cannot learn of the fault in time or take timely measures, which poses a hidden risk to the stable operation of the system.
Disclosure of Invention
In view of the above, an object of the embodiments of the present application is to provide a method, a system, a computer device and a computer-readable storage medium for reporting and repairing a network attached storage fault.
Based on the above object, one aspect of the embodiments of the present application provides a method for reporting and repairing a network attached storage fault, including the following steps: acquiring a network attached storage alarm information file, and filling in the alarm data information in the alarm information file; judging in sequence, according to the filled-in alarm data information, whether each alarm event triggers an alarm; in response to an alarm event triggering an alarm, calling a reporting function to report the alarm event; and calling a repair function in a fault mode library to repair the alarm event according to the identifier of the alarm event that occurred.
In some embodiments, calling the reporting function to report the alarm event includes: activating the error corresponding to the alarm event in the manager corresponding to the alarm event, and checking whether any other manager has activated the error; and, in response to no other manager having activated the error, mapping the error code to a true node error code and setting an error flag.
In some embodiments, the method further comprises: in response to the alarm event not triggering an alarm, calling a clearing function to clear the alarm event.
In some embodiments, calling the clearing function to clear the alarm event comprises: clearing the error code information in the cache and judging whether the error code is a preset value; and, in response to the error code being the preset value, clearing the current mode of the platform main process and setting the platform main process to normal mode.
Another aspect of the embodiments of the present application provides a system for reporting and repairing a network attached storage fault, including: an acquisition module configured to acquire a network attached storage alarm information file and fill in the alarm data information in the alarm information file; a judging module configured to judge in sequence, according to the filled-in alarm data information, whether each alarm event triggers an alarm; a reporting module configured to call a reporting function to report an alarm event in response to the alarm event triggering an alarm; and a repair module configured to call a repair function in a fault mode library to repair the alarm event according to the identifier of the alarm event that occurred.
In some embodiments, the reporting module is configured to: activate the error corresponding to the alarm event in the manager corresponding to the alarm event, and check whether any other manager has activated the error; and, in response to no other manager having activated the error, map the error code to a true node error code and set an error flag.
In some embodiments, the system further comprises a clearing module configured to: in response to the alarm event not triggering an alarm, call a clearing function to clear the alarm event.
In some embodiments, the clearing module is further configured to: clear the error code information in the cache and judge whether the error code is a preset value; and, in response to the error code being the preset value, clear the current mode of the platform main process and set the platform main process to normal mode.
In yet another aspect of the embodiment of the present application, there is also provided a computer apparatus, including: at least one processor; and a memory storing computer instructions executable on the processor, which when executed by the processor, perform the steps of the method as above.
In yet another aspect of the embodiments of the present application, there is also provided a computer-readable storage medium storing a computer program which, when executed by a processor, implements the method steps as described above.
The application has the following beneficial technical effects: network attached storage alarms are displayed intuitively on the user's page, and when such an alarm occurs, automatic repair reduces manual intervention, increases user acceptance, and improves the stability of the system.
Drawings
In order to more clearly illustrate the embodiments of the application or the technical solutions in the prior art, the drawings that are necessary for the description of the embodiments or the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the application and that other embodiments may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a schematic diagram of an embodiment of a method for reporting and repairing a network attached storage failure provided by the present application;
FIG. 2 is a schematic diagram of an embodiment of a system for reporting and repairing network attached storage failures provided by the present application;
FIG. 3 is a schematic hardware architecture diagram of an embodiment of a computer device for reporting and repairing a network attached storage failure provided by the present application;
FIG. 4 is a schematic diagram of an embodiment of a computer storage medium for reporting and repairing network attached storage failures provided by the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the following embodiments of the present application will be described in further detail with reference to the accompanying drawings.
It should be noted that, in the embodiments of the present application, the expressions "first" and "second" are used to distinguish two different entities or different parameters that share the same name. "First" and "second" are used only for convenience of expression and should not be construed as limiting the embodiments of the present application; this is not repeated in the following embodiments.
In a first aspect of the embodiment of the present application, an embodiment of a method for reporting and repairing a network attached storage failure is provided. Fig. 1 is a schematic diagram of an embodiment of a method for reporting and repairing a network attached storage failure provided by the present application. As shown in fig. 1, the embodiment of the present application includes the following steps:
S1, acquiring a network attached storage alarm information file, and filling in the alarm data information in the alarm information file;
S2, judging in sequence, according to the filled-in alarm data information, whether each alarm event triggers an alarm;
S3, in response to an alarm event triggering an alarm, calling a reporting function to report the alarm event; and
S4, calling a repair function in a fault mode library to repair the alarm event according to the identifier of the alarm event that occurred.
Multiple fault sensors are embedded in the NAS virtual machine. If a fault occurs, the sensors quickly capture it and report an alarm to the MCS system. The collected items, which are made available for the MCS system to query, include NAS node failover, the NFS (Network File System) service, the CIFS (Common Internet File System) service, the FTP (File Transfer Protocol) service, the minios service, NAS restart faults, NAS Ethernet port faults, file system capacity, and so on. The implementation flow is as follows:
On the MCS, the daemon vm_daemon.py invokes nas_alarmd once every 5 seconds. nas_alarmd connects to the virtual machine over ssh (Secure Shell) and performs the query on a per-node basis. It obtains the states of the NAS node in the virtual machine, including failover, the Network File System service, the Common Internet File System service, the File Transfer Protocol service, the minios service, restart, network card and file system, and, if the query succeeds, writes the result to a fifo file for the MCS alarm code to query.
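The polling flow above could be sketched roughly as follows. This is only an illustration under stated assumptions: the status key names, the JSON line format and the function names `poll_once` and `run_daemon` are hypothetical, the ssh query is replaced by an injected callable, and the output is appended to an ordinary file path rather than a real FIFO so that the sketch runs anywhere.

```python
import json
import time
from typing import Callable, Dict, Iterable, Optional

POLL_INTERVAL_SECONDS = 5  # interval stated in the description

# Hypothetical key names; the description lists the monitored items but not their names.
STATUS_KEYS = ("failover", "nfs", "cifs", "ftp", "minios", "reboot", "nic", "filesystem")


def poll_once(node_id: int,
              query_node: Callable[[int], Optional[Dict[str, int]]],
              out_path: str) -> bool:
    """Query one node and write its states only if the query succeeded."""
    states = query_node(node_id)  # stands in for the ssh query into the NAS virtual machine
    if states is None:            # query failed: write nothing
        return False
    record = {"node": node_id,
              "states": {key: states.get(key, 0) for key in STATUS_KEYS}}
    # Append one line per query, mimicking a stream that the alarm code reads.
    with open(out_path, "a", encoding="utf-8") as f:
        f.write(json.dumps(record) + "\n")
    return True


def run_daemon(node_ids: Iterable[int],
               query_node: Callable[[int], Optional[Dict[str, int]]],
               out_path: str,
               cycles: int,
               sleep: Callable[[float], None] = time.sleep) -> int:
    """Poll every node once per cycle; return the number of successful writes."""
    node_ids = list(node_ids)
    written = 0
    for _ in range(cycles):
        for node_id in node_ids:
            if poll_once(node_id, query_node, out_path):
                written += 1
        sleep(POLL_INTERVAL_SECONDS)
    return written
```

Passing the query and the sleep function as parameters keeps the loop testable without a virtual machine; a real daemon would presumably run until stopped rather than for a fixed number of cycles.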
The network attached storage alarm information file is acquired, and the alarm data information in the alarm information file is filled in. Whether each alarm event triggers an alarm is then judged in sequence according to the filled-in alarm data information.
In response to an alarm event triggering an alarm, a reporting function is called to report the alarm event. Alarm detection in the MCS system is carried out by two modules, the EC module and the PL module. The EC module reads the NAS alarm information file, judges the alarm events in sequence, and fills in information such as error records, state data and activation flags. It then processes the alarm events in sequence according to the filled-in information, calling the alarm reporting function if an alarm exists and the alarm clearing function otherwise. The PL module sorts the error codes according to the received alarm event information and reports the alarm events. The specific flow is as follows: the MCS checks whether the event state is "starting" and, if so, exits; the MCS system reads the NAS alarm information state file, judges whether the acquired information is valid, and exits if it is invalid; the MCS system judges the NAS alarm information item by item and fills in the error records, state data, activation flags and other information; finally, the alarm events are processed in sequence according to the alarm data information filled in during the previous step. If an alarm exists for an alarm event, the ecmgr_sensor_report_node_error function is called to report the alarm; if not, the ecmgr_sensor_clear_node_error function is called to clear it.
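The EC-module flow described above can be sketched as follows. It is an illustration only: the `AlarmRecord` fields, the event names, the placeholder error codes and the rule that a nonzero state means a fault are all assumptions, and the `report` and `clear` callables merely stand in for ecmgr_sensor_report_node_error and ecmgr_sensor_clear_node_error, whose real signatures the source does not give.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class AlarmRecord:
    """Filled-in data for one alarm event (field names are illustrative)."""
    event_name: str
    error_code: int
    state: int
    active: bool


# Hypothetical mapping from status keys to error codes; the values are placeholders.
EVENT_ERROR_CODES: Dict[str, int] = {"nfs": 0x1001, "cifs": 0x1002, "ftp": 0x1003}


def process_alarm_file(event_state: str,
                       alarm_info: Optional[Dict[str, int]],
                       report: Callable[[AlarmRecord], None],
                       clear: Callable[[AlarmRecord], None]) -> List[AlarmRecord]:
    # 1. Exit while the event state is still "starting".
    if event_state == "starting":
        return []
    # 2. Exit if the status file content is missing or malformed.
    if not alarm_info or not all(isinstance(v, int) for v in alarm_info.values()):
        return []
    # 3. Fill in a record for each known event, in order.
    #    Assumption: a nonzero state means the event is in a fault condition.
    records = [AlarmRecord(name, EVENT_ERROR_CODES[name], state, active=(state != 0))
               for name, state in alarm_info.items()
               if name in EVENT_ERROR_CODES]
    # 4. Report events that have an alarm, clear those that do not.
    for record in records:
        if record.active:
            report(record)
        else:
            clear(record)
    return records
```

For example, calling `process_alarm_file("running", {"nfs": 1, "cifs": 0}, report, clear)` would pass the NFS record to `report` and the CIFS record to `clear`.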
In some embodiments, calling the reporting function to report the alarm event includes: activating the error corresponding to the alarm event in the manager corresponding to the alarm event, and checking whether any other manager has activated the error; and, in response to no other manager having activated the error, mapping the error code to a true node error code and setting an error flag. It is then checked whether the error code is 0x522: if so, the platform main process is forcibly set to 522 mode; if not, the function is called to report the alarm. The error code is cached so that the error information is not lost if the io process exits.
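A minimal in-memory sketch of this reporting logic follows. Everything except the value 0x522 is an assumption made for illustration: the `NodeErrorState` class, its fields and the `reported` list that stands in for the real alarm-report call. The source does not say what happens when another manager has already activated the error, or whether the 0x522 check applies to the raw or the mapped code; the sketch returns early in the first case and checks the mapped code in the second.

```python
from typing import Dict, List, Set

SPECIAL_ERROR_CODE = 0x522  # the only concrete error code named in the description


class NodeErrorState:
    """Toy state holder; all field names are assumptions for illustration."""

    def __init__(self, code_map: Dict[int, int]):
        self.code_map = code_map               # sensor error code -> true node error code
        self.active: Dict[str, Set[int]] = {}  # manager name -> codes it has activated
        self.error_flags: Set[int] = set()     # true node error codes currently flagged
        self.cache: Set[int] = set()           # cached codes, kept in case the io process exits
        self.platform_mode = "normal"
        self.reported: List[int] = []          # stands in for the real alarm-report call


def report_node_error(state: NodeErrorState, manager: str, error_code: int) -> None:
    # Has some other manager already activated this error?
    others_active = any(error_code in codes
                        for name, codes in state.active.items() if name != manager)
    # Activate the error in the manager that owns this alarm event.
    state.active.setdefault(manager, set()).add(error_code)
    if others_active:
        return  # assumption: nothing more to do if another manager already raised it
    true_code = state.code_map.get(error_code, error_code)
    state.error_flags.add(true_code)
    state.cache.add(true_code)
    if true_code == SPECIAL_ERROR_CODE:
        state.platform_mode = "522"  # force the platform main process into 522 mode
    else:
        state.reported.append(true_code)
```

Keeping the per-manager activation sets separate from the node-level flags is one way to ensure that the same fault, seen by two managers, is raised only once.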
In some embodiments, the method further comprises: in response to the alarm event not triggering an alarm, calling a clearing function to clear the alarm event.
In some embodiments, calling the clearing function to clear the alarm event comprises: clearing the error code information in the cache and judging whether the error code is a preset value; and, in response to the error code being the preset value, clearing the current mode of the platform main process and setting the platform main process to normal mode. Specifically, it is checked whether the error code is 0x522; if so, the 522 mode of the platform main process is cleared, and if not, the platform main process is set to normal mode.
Calling the clearing function to clear the alarm event also includes: activating the error corresponding to the alarm event in the manager corresponding to the alarm event, and checking whether any other manager has activated the error; and, in response to no other manager having activated the error, mapping the error code to a true node error code.
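The clearing path might look like the sketch below. It follows one reading of the text, stated here as an assumption: the error is removed from the owning manager, and the node-level state is touched only when no other manager still holds the error. The function name, the dictionary-based state and the `platform["mode"]` key are hypothetical; only the value 0x522 comes from the source.

```python
from typing import Dict, Set

SPECIAL_ERROR_CODE = 0x522  # preset value named in the description


def clear_node_error(active: Dict[str, Set[int]],
                     code_map: Dict[int, int],
                     cache: Set[int],
                     platform: Dict[str, str],
                     manager: str,
                     error_code: int) -> bool:
    """Clear one error for `manager`; return True if the node-level error was cleared."""
    active.get(manager, set()).discard(error_code)
    # If any other manager still holds the error, leave the node-level state untouched.
    if any(error_code in codes for name, codes in active.items() if name != manager):
        return False
    true_code = code_map.get(error_code, error_code)
    cache.discard(true_code)  # drop the cached error code information
    if true_code == SPECIAL_ERROR_CODE:
        platform["mode"] = "normal"  # leave 522 mode and return to normal mode
    return True
```

With two managers holding the same error, the first call returns `False` and changes nothing at node level; the second call clears the cache entry and, for 0x522, restores normal mode.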
According to the identifier of the alarm event that occurred, a repair function in the fault mode library is called to repair the alarm event.
NAS-related alarm event information can be displayed in the alarm interface of the graphical user interface front end. The interface lists the error code, timestamp, state, description, object type, object identifier and object name of each current alarm event, and right-clicking an alarm event allows operations such as viewing its attributes, clearing logs and running a repair. Some alarms are registered through a big data background script, after which the automatic repair module is called to locate and repair the fault automatically. Based on the principle of the automatic repair module and the identifier of the alarm that occurred, the corresponding automatic repair module in the fault mode library is called to perform the automatic repair.
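One way to picture a fault mode library keyed by alarm identifier is a simple registry of repair functions, sketched below. The registry layout, the decorator, the identifier "nfs_service_down" and the placeholder repair are all hypothetical; the source states only that a repair function is looked up by the identifier of the alarm and that only some alarms are repaired automatically.

```python
from typing import Callable, Dict, Optional

# Fault mode library: alarm identifier -> repair function. All entries are hypothetical.
FAULT_MODE_LIBRARY: Dict[str, Callable[[int], bool]] = {}


def repair_for(alarm_id: str):
    """Decorator that registers a repair function under an alarm identifier."""
    def register(func: Callable[[int], bool]) -> Callable[[int], bool]:
        FAULT_MODE_LIBRARY[alarm_id] = func
        return func
    return register


@repair_for("nfs_service_down")
def restart_nfs(node_id: int) -> bool:
    # Placeholder: a real repair would restart the service on the given node.
    return True


def auto_repair(alarm_id: str, node_id: int) -> Optional[bool]:
    """Run the registered repair; None means the alarm has no automatic repair."""
    func = FAULT_MODE_LIBRARY.get(alarm_id)
    return None if func is None else func(node_id)
```

Returning `None` for unregistered identifiers reflects the statement that only part of the alarms are repaired automatically; the rest would remain visible in the alarm interface for manual handling.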
It should be noted that the steps in the foregoing embodiments of the method for reporting and repairing a network attached storage fault may be interleaved, replaced, added to or deleted. Methods for reporting and repairing a network attached storage fault obtained through such reasonable permutations and combinations therefore also fall within the protection scope of the present application, and the protection scope of the present application should not be limited to the embodiments.
Based on the above object, a second aspect of the embodiments of the present application provides a system for reporting and repairing a network attached storage fault. As shown in fig. 2, the system 200 includes the following modules: an acquisition module configured to acquire a network attached storage alarm information file and fill in the alarm data information in the alarm information file; a judging module configured to judge in sequence, according to the filled-in alarm data information, whether each alarm event triggers an alarm; a reporting module configured to call a reporting function to report an alarm event in response to the alarm event triggering an alarm; and a repair module configured to call a repair function in a fault mode library to repair the alarm event according to the identifier of the alarm event that occurred.
In some embodiments, the reporting module is configured to: activate the error corresponding to the alarm event in the manager corresponding to the alarm event, and check whether any other manager has activated the error; and, in response to no other manager having activated the error, map the error code to a true node error code and set an error flag.
In some embodiments, the system further comprises a clearing module configured to: in response to the alarm event not triggering an alarm, call a clearing function to clear the alarm event.
In some embodiments, the clearing module is further configured to: clear the error code information in the cache and judge whether the error code is a preset value; and, in response to the error code being the preset value, clear the current mode of the platform main process and set the platform main process to normal mode.
Based on the above object, a third aspect of the embodiments of the present application provides a computer device, including: at least one processor; and a memory storing computer instructions executable on the processor, the instructions being executed by the processor to perform the following steps: S1, acquiring a network attached storage alarm information file, and filling in the alarm data information in the alarm information file; S2, judging in sequence, according to the filled-in alarm data information, whether each alarm event triggers an alarm; S3, in response to an alarm event triggering an alarm, calling a reporting function to report the alarm event; and S4, calling a repair function in a fault mode library to repair the alarm event according to the identifier of the alarm event that occurred.
In some embodiments, calling the reporting function to report the alarm event includes: activating the error corresponding to the alarm event in the manager corresponding to the alarm event, and checking whether any other manager has activated the error; and, in response to no other manager having activated the error, mapping the error code to a true node error code and setting an error flag.
In some embodiments, the steps further comprise: in response to the alarm event not triggering an alarm, calling a clearing function to clear the alarm event.
In some embodiments, calling the clearing function to clear the alarm event comprises: clearing the error code information in the cache and judging whether the error code is a preset value; and, in response to the error code being the preset value, clearing the current mode of the platform main process and setting the platform main process to normal mode.
As shown in fig. 3, a hardware structure diagram of an embodiment of the computer device for reporting and repairing the network attached storage fault provided by the present application is shown.
Taking the example of the device shown in fig. 3, a processor 301 and a memory 302 are included in the device.
The processor 301 and the memory 302 may be connected by a bus or otherwise, for example in fig. 3.
The memory 302 is used as a non-volatile computer readable storage medium, and may be used to store non-volatile software programs, non-volatile computer executable programs, and modules, such as program instructions/modules corresponding to a method for reporting and repairing network attached storage failures in an embodiment of the present application. The processor 301 executes various functional applications and data processing of the server, that is, a method of reporting and repairing network attached storage failures, by running nonvolatile software programs, instructions, and modules stored in the memory 302.
Memory 302 may include a storage program area that may store an operating system, at least one application program required for functionality, and a storage data area; the storage data area may store data created according to the use of the method of network attached storage failure reporting and repair, etc. In addition, memory 302 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid-state storage device. In some embodiments, memory 302 may optionally include memory located remotely from processor 301, which may be connected to the local module via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
Computer instructions 303 corresponding to the method for reporting and repairing one or more network attached storage failures are stored in the memory 302, and when executed by the processor 301, perform the method for reporting and repairing a network attached storage failure in any of the method embodiments described above.
Any embodiment of a computer device that performs the method for reporting and repairing a network attached storage failure described above may achieve the same or similar effects as any of the method embodiments described above that correspond to the embodiment.
The application also provides a computer readable storage medium storing a computer program which when executed by a processor performs a method of reporting and repairing network attached storage failures.
As shown in fig. 4, a schematic diagram of an embodiment of a computer storage medium for reporting and repairing the network-attached storage failure according to the present application is provided. Taking a computer storage medium as shown in fig. 4 as an example, the computer readable storage medium 401 stores a computer program 402 that when executed by a processor performs the above method.
Finally, it should be noted that, as those skilled in the art will appreciate, all or part of the flows in the above method embodiments may be implemented by a computer program instructing related hardware. The program of the method for reporting and repairing a network attached storage fault may be stored in a computer-readable storage medium, and when executed, the program may include the flows of the method embodiments described above. The storage medium of the program may be a magnetic disk, an optical disk, a read-only memory (ROM), a random-access memory (RAM), or the like. The computer program embodiments described above may achieve the same or similar effects as any of the corresponding method embodiments.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that as used herein, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly supports the exception. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items.
The serial numbers of the foregoing embodiments of the present application are for description only and do not represent the relative merits of the embodiments.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program for instructing relevant hardware, and the program may be stored in a computer readable storage medium, where the storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
Those of ordinary skill in the art will appreciate that: the above discussion of any embodiment is merely exemplary and is not intended to imply that the scope of the disclosure of embodiments of the application, including the claims, is limited to such examples; combinations of features of the above embodiments or in different embodiments are also possible within the idea of an embodiment of the application, and many other variations of the different aspects of the embodiments of the application as described above exist, which are not provided in detail for the sake of brevity. Therefore, any omission, modification, equivalent replacement, improvement, etc. of the embodiments should be included in the protection scope of the embodiments of the present application.

Claims (10)

1. A method for reporting and repairing a network attached storage fault, characterized by comprising the following steps:
acquiring a network attached storage alarm information file, and filling in the alarm data information in the alarm information file;
judging in sequence, according to the filled-in alarm data information, whether each alarm event triggers an alarm;
in response to an alarm event triggering an alarm, calling a reporting function to report the alarm event; and
calling a repair function in a fault mode library to repair the alarm event according to the identifier of the alarm event that occurred.
2. The method of claim 1, wherein calling the reporting function to report the alarm event comprises:
activating the error corresponding to the alarm event in the manager corresponding to the alarm event, and checking whether any other manager has activated the error; and
in response to no other manager having activated the error, mapping the error code to a true node error code and setting an error flag.
3. The method according to claim 1, wherein the method further comprises:
in response to the alarm event not triggering an alarm, calling a clearing function to clear the alarm event.
4. The method of claim 3, wherein calling the clearing function to clear the alarm event comprises:
clearing the error code information in the cache and judging whether the error code is a preset value; and
in response to the error code being the preset value, clearing the current mode of the platform main process and setting the platform main process to normal mode.
5. A system for reporting and repairing a network attached storage fault, comprising:
an acquisition module configured to acquire an alarm information file of the network attached storage and to fill alarm data information in the alarm information file;
a judging module configured to judge in sequence, according to the filled alarm data information, whether each alarm event triggers an alarm;
a reporting module configured to, in response to an alarm event triggering an alarm, call a reporting function to report the alarm event; and
a repair module configured to call, according to the identifier of the alarm event, a repair function in a fault mode library to repair the alarm event.
6. The system of claim 5, wherein the reporting module is configured to:
activate the error corresponding to the alarm event in the manager corresponding to the alarm event, and check whether any other manager has activated the error; and
in response to no other manager having activated the error, map the error code to a node true error code and set an error flag.
7. The system of claim 5, further comprising a purge module configured to:
in response to the alarm event failing to trigger an alarm, call a clearing function to clear the alarm event.
8. The system of claim 7, wherein the purge module is further configured to:
clear the error code information in the cache, and judge whether the error code is a preset value; and
in response to the error code being the preset value, clear the current mode of the platform main process and set the platform main process to a common mode.
9. A computer device, comprising:
at least one processor; and
a memory storing computer instructions executable on the processor, wherein the instructions, when executed by the processor, perform the steps of the method of any one of claims 1-4.
10. A computer readable storage medium storing a computer program, characterized in that the computer program, when executed by a processor, implements the steps of the method of any one of claims 1-4.
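
For illustration only, the flow recited in claims 1 to 4 can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the names AlarmEvent, Node, report, clear and process, the threshold-based trigger rule, the value of PRESET_ERROR_CODE, and the 0x8000 mapping to a "node true error code" are all hypothetical and do not come from the claims. The alarm information file is modelled as a plain dictionary from event identifier to measured value.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Set

NORMAL_MODE = "normal"     # hypothetical name for the "common mode" of claim 4
PRESET_ERROR_CODE = 100    # hypothetical preset value checked when clearing


@dataclass
class AlarmEvent:
    event_id: str          # identifier used to look up the repair function
    manager: str           # manager that owns this event
    error_code: int
    threshold: float       # assumed trigger rule: value >= threshold
    value: float = 0.0     # filled in from the alarm information file


@dataclass
class Node:
    active: Dict[str, Set[int]] = field(default_factory=dict)  # manager -> active error codes
    cache: Dict[str, int] = field(default_factory=dict)        # event_id -> cached error code
    node_error_code: Optional[int] = None
    error_flag: bool = False
    mode: str = NORMAL_MODE


def report(node: Node, ev: AlarmEvent) -> None:
    """Claim 2: activate the error, then map it only if no other manager holds it."""
    others = any(ev.error_code in codes
                 for mgr, codes in node.active.items() if mgr != ev.manager)
    node.active.setdefault(ev.manager, set()).add(ev.error_code)
    node.cache[ev.event_id] = ev.error_code
    if not others:
        node.node_error_code = 0x8000 | ev.error_code  # hypothetical mapping
        node.error_flag = True


def clear(node: Node, ev: AlarmEvent) -> None:
    """Claim 4: drop cached error code; reset the mode if the code is the preset value."""
    node.active.get(ev.manager, set()).discard(ev.error_code)
    node.cache.pop(ev.event_id, None)
    if ev.error_code == PRESET_ERROR_CODE:
        node.mode = NORMAL_MODE


def process(node: Node,
            events: List[AlarmEvent],
            alarm_data: Dict[str, float],
            fault_mode_library: Dict[str, Callable[[Node, AlarmEvent], None]]) -> List[str]:
    """Claims 1 and 3: fill data, judge each event in turn, then report and repair, or clear."""
    log: List[str] = []
    for ev in events:
        ev.value = alarm_data.get(ev.event_id, 0.0)   # fill alarm data
        if ev.value >= ev.threshold:                   # alarm triggered
            report(node, ev)
            repair = fault_mode_library.get(ev.event_id)
            if repair is not None:                     # repair keyed by event identifier
                repair(node, ev)
            log.append(f"reported:{ev.event_id}")
        else:                                          # alarm not triggered
            clear(node, ev)
            log.append(f"cleared:{ev.event_id}")
    return log
```

In this sketch the fault mode library is a dictionary keyed by event identifier, so supporting a new fault only requires registering one more repair callable; whether the patented fault mode library is organized this way is not stated in the claims.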
CN202111342238.9A 2021-11-12 2021-11-12 Method and device for reporting and repairing network additional storage faults Active CN114116282B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111342238.9A CN114116282B (en) 2021-11-12 2021-11-12 Method and device for reporting and repairing network additional storage faults

Publications (2)

Publication Number Publication Date
CN114116282A (en) 2022-03-01
CN114116282B (en) 2023-08-18

Family

ID=80379036

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111342238.9A Active CN114116282B (en) 2021-11-12 2021-11-12 Method and device for reporting and repairing network additional storage faults

Country Status (1)

Country Link
CN (1) CN114116282B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115473788B (en) * 2022-08-29 2023-08-11 Suzhou Inspur Intelligent Technology Co Ltd Storage alarm test method, device, equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106339297A (en) * 2016-09-14 2017-01-18 Zhengzhou Yunhai Information Technology Co Ltd Method and system for warning failures of storage system in real time
CN108763038A (en) * 2018-08-08 2018-11-06 Ping An Technology (Shenzhen) Co Ltd Management method, device, computer equipment and the storage medium of alarm data
CN110688280A (en) * 2019-09-25 2020-01-14 China Construction Bank Corp Management system, method, equipment and storage medium of alarm event
CN112035319A (en) * 2020-08-31 2020-12-04 Inspur Cloud Information Technology Co Ltd Monitoring alarm system for multi-path state
CN112131201A (en) * 2020-09-18 2020-12-25 Suzhou Inspur Intelligent Technology Co Ltd Method, system, equipment and medium for high availability of network additional storage
WO2021136247A1 (en) * 2019-12-31 2021-07-08 Huawei Technologies Co Ltd Alarm processing method and apparatus, and storage medium

Also Published As

Publication number Publication date
CN114116282A (en) 2022-03-01

Similar Documents

Publication Publication Date Title
CN111290918B (en) Server running state monitoring method and device and computer readable storage medium
CN112631913A (en) Method, device, equipment and storage medium for monitoring operation fault of application program
CN110427303A (en) A kind of fault alarming method and device
KR20150033711A (en) Run-time error repairing method, device and system
CN112202631A (en) Resource access method, device and system, electronic equipment and storage medium
CN113407383B (en) Main and standby system switching method and device, server and main and standby system
WO2019140739A1 (en) Method for determining return visits to client, and electronic apparatus and computer-readable storage medium
CN114116282B (en) Method and device for reporting and repairing network additional storage faults
CN111711542A (en) Disaster recovery switching method and device based on DNS (Domain name System), electronic equipment and storage medium
CN115333923B (en) Fault point tracing analysis method, device, equipment and medium
CN110889143A (en) File verification method and device
CN112749050B (en) Micro-service-framework-based safety circuit breaking method and device and computing equipment
CN114461445A (en) Data recovery method and device, electronic equipment and computer readable storage medium
CN103731315A (en) Server failure detecting method
CN113886122A (en) System operation exception handling method, device, equipment and storage medium
CN113179180A (en) Basalt client disaster fault repairing method, basalt client disaster fault repairing device and basalt client disaster storage medium
CN110231921B (en) Log printing method, device, equipment and computer readable storage medium
CN114866398B (en) Network fault diagnosis method and device
CN115314361B (en) Server cluster management method and related components thereof
CN116719657A (en) Firmware fault log generation method, device, server and readable medium
CN116560960A (en) Abnormality detection method, abnormality detection device, electronic device, and storage medium
CN115269252A (en) Application program fault processing method, device, equipment and storage medium
CN113961395A (en) Plug and play method of chip protection device and control device
US20150127976A1 (en) Method and Apparatus for Failure Recovery in a Machine-to-Machine Network
CN115473793B (en) Automatic recovery method, device, terminal and medium for cluster EI host environment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant