CN109933448B - Method and device for predicting fault of nonvolatile storage medium - Google Patents

Method and device for predicting fault of nonvolatile storage medium Download PDF

Info

Publication number
CN109933448B
CN109933448B CN201910108992.2A CN201910108992A CN109933448B CN 109933448 B CN109933448 B CN 109933448B CN 201910108992 A CN201910108992 A CN 201910108992A CN 109933448 B CN109933448 B CN 109933448B
Authority
CN
China
Prior art keywords
nonvolatile storage
storage media
storage medium
threshold value
fault threshold
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910108992.2A
Other languages
Chinese (zh)
Other versions
CN109933448A (en
Inventor
孔伟康
李定
李强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201910108992.2A priority Critical patent/CN109933448B/en
Publication of CN109933448A publication Critical patent/CN109933448A/en
Application granted granted Critical
Publication of CN109933448B publication Critical patent/CN109933448B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16ZINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS, NOT OTHERWISE PROVIDED FOR
    • G16Z99/00Subject matter not provided for in other main groups of this subclass

Landscapes

  • Techniques For Improving Reliability Of Storages (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention relates to the technical field of computers, and discloses a method and a device for predicting the fault of a nonvolatile storage medium, wherein the method comprises the following steps: aiming at any one of at least two nonvolatile storage media of the data center, respectively executing: calculating a condition value of any nonvolatile storage medium, wherein the condition value is used for representing the running condition of any nonvolatile storage medium; predicting that any nonvolatile storage medium will fail when the condition value is determined to be smaller than an initial preset failure threshold value corresponding to any nonvolatile storage medium; in the scheme, the nonvolatile storage media with different condition values respectively correspond to different initial preset fault threshold values, namely the nonvolatile storage media with different condition values respectively correspond to different alarm thresholds, so that the accuracy of the predicted fault nonvolatile storage media is improved.

Description

Method and device for predicting fault of nonvolatile storage medium
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a method and an apparatus for predicting a failure of a nonvolatile storage medium.
Background
Data storage is more and more important, and the data reliability is also very important to guarantee. Hard disks are still the most common storage media for storing data and are widely used in various data centers, so that the prediction of hard disk failure becomes an important means for ensuring data reliability nowadays, and the prediction becomes an important component in data center management software slowly. The data center starts the hard disk alarm and isolation when the hard disk fails or is about to fail by detecting the operating condition of each hard disk, and then starts data reconstruction.
The current DFP (Disk Failure Prediction) technology is to determine whether some indexes of the hard Disk reach a preset threshold value, and if not, send an alarm to consider that the hard Disk will fail. In order to reduce the repair rate, the alarm threshold generally set by the hard disk manufacturer is very low, which results in a very low failure prediction rate of the whole hard disk, but if the alarm threshold of the hard disk manufacturer is referred, the accuracy of predicting the failure of the hard disk is low. In order to improve the accuracy of predicting the hard disk fault, the data center using the hard disk resets the alarm threshold, so that the accuracy of predicting the hard disk fault is improved.
The alarm thresholds of all the hard disks of the data center are the same, but the conditions of the hard disks of the data center are different, the service time of some hard disks is longer, and the service time of some hard disks is shorter, so the method still has the defect of lower accuracy.
Disclosure of Invention
The embodiment of the invention provides a method and a device for predicting the fault of a nonvolatile storage medium, which are used for solving the defect of low accuracy in predicting the fault of a hard disk in the prior art.
The embodiment of the invention provides the following specific technical scheme:
in a first aspect, a method for predicting failure of a non-volatile storage medium is provided, including:
aiming at any one of at least two nonvolatile storage media of the data center, respectively executing:
calculating a condition value of the any one nonvolatile storage medium, wherein the condition value is used for representing the running condition of the any one nonvolatile storage medium;
predicting that any nonvolatile storage medium will fail when the condition value is determined to be smaller than an initial preset failure threshold value corresponding to any nonvolatile storage medium;
the initial preset fault threshold values corresponding to any two nonvolatile storage media with different condition values are different.
With reference to the first aspect, in a first possible implementation manner, after predicting that any nonvolatile storage medium will malfunction, the method further includes:
determining a total number of all non-volatile storage media predicted to fail;
when the determined total number of all the nonvolatile storage media is less than or equal to the number of the hot standby nonvolatile storage media of the data center, replacing all the predicted nonvolatile storage media which will be in failure with the hot standby nonvolatile storage media in the hot standby nonvolatile storage media;
the number of the hot standby nonvolatile storage media which take over the work of all the predicted nonvolatile storage media which will be in failure is the same as the total number of all the nonvolatile storage media.
With reference to the first possible implementation manner of the first aspect, in a second possible implementation manner, after determining a total number of all the nonvolatile storage media predicted to malfunction, the method further includes:
when the determined total number of all the nonvolatile storage media is judged to be larger than the number of the hot standby nonvolatile storage media of the data center, respectively executing the following steps for any one of the nonvolatile storage media:
reducing an initial preset fault threshold value corresponding to any nonvolatile storage medium to obtain a first preset fault threshold value, and
when the condition value of any nonvolatile storage medium is determined to be smaller than a first preset fault threshold value corresponding to any nonvolatile storage medium, further predicting that any nonvolatile storage medium will be in fault;
and when the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values is judged to be equal to or less than the number of the hot standby nonvolatile storage media of the data center, the hot standby nonvolatile storage media are used for replacing all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values.
With reference to the second possible implementation manner of the first aspect, in a third possible implementation manner, for any two nonvolatile storage media of all the nonvolatile storage media of the data center, the amplitude values of the initial preset fault threshold value reductions corresponding to the any two nonvolatile storage media are the same.
With reference to the second or third possible implementation manner of the first aspect, in a fourth possible implementation manner, after replacing, by using the hot standby nonvolatile storage medium, all nonvolatile storage media that are predicted to have a failure according to the corresponding first preset failure threshold value, the method further includes:
supplementing a preset number of hot standby nonvolatile storage media;
increasing a first preset fault threshold value corresponding to each nonvolatile storage medium of the data center after the initial preset fault threshold value is reduced to obtain a second preset fault threshold value;
for any nonvolatile storage medium with the first preset fault threshold value increased, when the condition value of any nonvolatile storage medium is smaller than a second preset fault threshold value corresponding to any nonvolatile storage medium, predicting that any nonvolatile storage medium will be in fault;
when the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold value is judged to be smaller than or equal to the number of the supplemented hot standby nonvolatile storage media with the preset number, the supplemented hot standby nonvolatile storage media with the preset number are used for replacing all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold value;
and aiming at each nonvolatile storage medium corresponding to the second preset fault threshold value, the second preset fault threshold value is smaller than or equal to the initial preset fault threshold value of the corresponding nonvolatile storage medium.
In a second aspect, an apparatus for predicting failure of a non-volatile storage medium is provided, including:
a computing unit, configured to execute, for any one of at least two nonvolatile storage media of a data center, respectively: calculating a condition value of the any one nonvolatile storage medium, wherein the condition value is used for representing the running condition of the any one nonvolatile storage medium;
the predicting unit is used for predicting that any nonvolatile storage medium will fail when the condition value is determined to be smaller than an initial preset failure threshold value corresponding to any nonvolatile storage medium;
the initial preset fault threshold values corresponding to any two nonvolatile storage media with different condition values are different.
With reference to the second aspect, in a first possible implementation manner, the method further includes a determining unit, and a succeeding unit:
the determination unit is used for determining the total number of all the predicted nonvolatile storage media which will fail;
the judging unit is used for judging that the determined total number of all the nonvolatile storage media is less than or equal to the number of the hot standby nonvolatile storage media of the data center;
the replacing unit is used for replacing all the predicted nonvolatile storage media which will fail with the hot standby nonvolatile storage media in the hot standby nonvolatile storage media when the judging unit judges that the determined total number of all the nonvolatile storage media is less than or equal to the number of the hot standby nonvolatile storage media in the data center;
the number of the hot standby nonvolatile storage media which take over the work of all the predicted nonvolatile storage media which will be in failure is the same as the total number of all the nonvolatile storage media.
With reference to the first possible implementation manner of the second aspect, in a second possible implementation manner, the determining unit is further configured to: when the determined total number of all the nonvolatile storage media is judged to be larger than the number of the hot standby nonvolatile storage media of the data center, respectively executing the following steps for any one of the nonvolatile storage media:
reducing an initial preset fault threshold value corresponding to any nonvolatile storage medium to obtain a first preset fault threshold value;
the prediction unit is used for further predicting that the any nonvolatile storage medium will fail when determining that the condition value of the any nonvolatile storage medium is smaller than a first preset failure threshold value corresponding to the any nonvolatile storage medium;
and when judging that the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values is equal to or less than the number of the hot standby nonvolatile storage media of the data center, the replacing unit replaces all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values by using the hot standby nonvolatile storage media.
With reference to the second possible implementation manner of the second aspect, in a third possible implementation manner, for any two nonvolatile storage media of all the nonvolatile storage media of the data center, the amplitude values of the initial preset fault threshold value reductions corresponding to the any two nonvolatile storage media are the same.
With reference to the second or third possible implementation manner of the second aspect, in a fourth possible implementation manner, the apparatus further includes a supplementing unit, configured to supplement a preset number of the hot standby nonvolatile storage media; increasing a first preset fault threshold value corresponding to each nonvolatile storage medium of the data center after the initial preset fault threshold value is reduced to obtain a second preset fault threshold value;
the prediction unit is further configured to predict that any nonvolatile storage medium will fail when a condition value of any nonvolatile storage medium is smaller than a second preset failure threshold value corresponding to any nonvolatile storage medium, for any nonvolatile storage medium for which the first preset failure threshold value is increased;
the judging unit is further configured to judge that the total number of all nonvolatile storage media which are predicted to have a fault according to the corresponding second preset fault threshold value is less than or equal to the number of the supplemented hot standby nonvolatile storage media in the preset number;
the replacing unit is further configured to replace, when the determining unit determines that the total number of all the nonvolatile storage media to be failed, which are predicted according to the corresponding second preset failure threshold value, is less than or equal to the number of the supplemented hot standby nonvolatile storage media of the preset number, with all the nonvolatile storage media to be failed, which are predicted according to the corresponding second preset failure threshold value, by the supplemented hot standby nonvolatile storage media of the preset number;
and aiming at each nonvolatile storage medium corresponding to the second preset fault threshold value, the second preset fault threshold value is smaller than or equal to the initial preset fault threshold value of the corresponding nonvolatile storage medium.
The invention has the following beneficial effects:
in the prior art, the alarm thresholds corresponding to all the nonvolatile storage media of the data center are the same, but the operating conditions of different nonvolatile storage media may be different, and if all the nonvolatile storage media correspond to the same alarm threshold, the accuracy of the predicted failed nonvolatile storage media is lower.
Drawings
FIG. 1 is a flow chart of predicting a failure of a non-volatile storage medium in an embodiment of the present invention;
FIG. 2 is a diagram illustrating an embodiment of predicting a failure of a hard disk according to the present invention;
FIG. 3A is a schematic diagram of an apparatus for predicting failure of a non-volatile storage medium according to an embodiment of the present invention;
fig. 3B is a schematic structural diagram of an apparatus for predicting failure of a non-volatile storage medium according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Additionally, the terms "system" and "network" are often used interchangeably herein. The term "and/or" herein is merely an association describing an associated object, meaning that three relationships may exist, e.g., a and/or B, may mean: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the letter "/" herein generally indicates that the former and latter associated objects are in an "or" relationship.
The preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings, it being understood that the preferred embodiments described herein are merely for illustrating and explaining the present invention, and are not intended to limit the present invention, and that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.
Preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.
Referring to fig. 1, in an embodiment of the present invention, a process for predicting the failure of the nonvolatile storage medium is as follows:
step 100: respectively executing any nonvolatile storage medium of at least two nonvolatile storage media of the data center;
step 110: calculating a condition value of any nonvolatile storage medium, wherein the condition value is used for representing the running condition of any nonvolatile storage medium;
step 120: predicting that any nonvolatile storage medium will fail when the condition value is determined to be smaller than an initial preset failure threshold value corresponding to any nonvolatile storage medium; the initial preset fault threshold values corresponding to any two nonvolatile storage media with different condition values are different.
The data center is pre-stored with a certain number of hot standby nonvolatile storage media, so in the embodiment of the present invention, after predicting that any nonvolatile storage medium will fail, the following operations are further included:
determining a total number of all non-volatile storage media predicted to fail;
when the determined total number of all the nonvolatile storage media is less than or equal to the number of the hot standby nonvolatile storage media of the data center, replacing all the predicted nonvolatile storage media which will fail with the hot standby nonvolatile storage media in the hot standby nonvolatile storage media;
the number of hot standby nonvolatile storage media which take over the predicted operation of all the nonvolatile storage media which will be in failure is the same as the total number of all the nonvolatile storage media.
For example: the data center has 10 hard disks, and a total of 2 hard disks are predicted: the hard disk 1 and the hard disk 2 will fail, and if there are 3 hot standby hard disks in the data center, any two hard disks in the 3 hot standby hard disks are used to take over the work of the hard disks 1 and 2.
Of course, in practical applications, the total number of all the determined nonvolatile storage media may be greater than the number of hot spare nonvolatile storage media of the data center, and the operation performed at this time is different from the operation performed when the total number of all the determined nonvolatile storage media is less than or equal to the number of hot spare nonvolatile storage media of the data center, and the specific implementation process is as follows:
after determining the total number of all non-volatile storage media predicted to fail, further comprising the operations of:
when the determined total number of all the nonvolatile storage media is judged to be larger than the number of the hot standby nonvolatile storage media of the data center, respectively executing the following steps for any nonvolatile storage media:
reducing an initial preset fault threshold value corresponding to any nonvolatile storage medium to obtain a first preset fault threshold value, and
when the condition value of any nonvolatile storage medium is determined to be smaller than a first preset fault threshold value corresponding to any nonvolatile storage medium after reduction, further predicting that any nonvolatile storage medium will be in fault;
and when the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values is judged to be less than or equal to the number of the hot standby nonvolatile storage media of the data center, the hot standby nonvolatile storage media are used for replacing all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values.
For example: the data center has 10 hard disks, and 5 hard disks are predicted in total: hard disk 1, hard disk 2, hard disk 3, hard disk 4 and hard disk 5 will break down, if data center's hot spare hard disk has 3, then reduce the initial fault threshold value of predetermineeing that each hard disk corresponds respectively in 10 hard disks, if before reducing, 10 hard disks correspond respectively the initial fault threshold value of predetermineeing is: x1, X2, X3, X4, X5, X6, X7, X8, X9, and X10, wherein the first preset fault threshold value after the first reduction is: y1, Y2, Y3, Y4, Y5, Y6, Y7, Y8, Y9 and Y10, wherein Y1 is smaller than X1, Y2 is smaller than X2, Y3 is smaller than X3, Y4 is smaller than X4, Y5 is smaller than X5, Y6 is smaller than X6, Y7 is smaller than X7, Y8 is smaller than X8, Y9 is smaller than X9, Y10 is smaller than X10, the total number of hard disks with faults predicted after the first preset fault threshold value is still larger than the number of hard disks with hot spare, the first preset fault threshold value is reduced, and if the total number of hard disks with faults predicted at the moment is still larger than the number of hard disks with hot spare, the first preset fault threshold value is reduced until the total number of hard disks with faults predicted at the moment is smaller than or equal to the number of hard disks with faults predicted finally.
In the embodiment of the present invention, optionally, in order to reduce implementation complexity, for any two nonvolatile storage media of all the nonvolatile storage media of the data center, the amplitude values of the initial preset fault threshold value reductions respectively corresponding to the any two nonvolatile storage media are the same.
For example: the data center has 5 hard disks: the hard disk 1, the hard disk 2, the hard disk 3, the hard disk 4, and the hard disk 5 have corresponding initial preset fault thresholds X1, X2, X3, X4, and X5, respectively, and the first preset fault thresholds obtained by reducing the initial preset fault thresholds are 70% X1, 70% X2, 70% X3, 70% X4, and 70% X5, respectively.
In the embodiment of the invention, when the determined total number of all the nonvolatile storage media is greater than the number of the hot standby nonvolatile storage media of the data center, the initial preset fault threshold corresponding to the non-volatile storage medium is lowered to obtain a first preset fault threshold, and thus, part of the nonvolatile storage media which are to be failed can be screened out, the existing hot standby nonvolatile storage media are used for replacing the nonvolatile storage media which are found out in advance and are to be failed, then, the hot standby nonvolatile storage medium is supplemented, the initial preset fault threshold value after the reduction is increased, namely the first preset fault threshold value is increased, in this way, the non-volatile storage media which are not screened for the first time and will have faults are screened out, and circulating the steps until the nonvolatile storage media which are judged according to the initial preset fault threshold value and are predicted to have faults are screened out. Specifically, the following method can be adopted:
for example, after replacing all the nonvolatile storage media predicted to have a failure according to the corresponding first preset failure threshold value with the hot standby nonvolatile storage media, the following operations are further included:
supplementing a preset number of hot standby nonvolatile storage media;
increasing a first preset fault threshold value corresponding to each nonvolatile storage medium of the data center after the initial preset fault threshold value is reduced to obtain a second preset fault threshold value;
for any nonvolatile storage medium with the first preset fault threshold value increased, when the condition value of any nonvolatile storage medium is smaller than a second preset fault threshold value corresponding to any nonvolatile storage medium, predicting that any nonvolatile storage medium will be in fault;
when the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold values is judged to be smaller than or equal to the number of the hot standby nonvolatile storage media with the supplemented preset number, the supplemented preset number of the hot standby nonvolatile storage media is used for replacing all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold values;
and aiming at each nonvolatile storage medium corresponding to the second preset fault threshold value, the second preset fault threshold value is smaller than or equal to the initial preset fault threshold value of the corresponding nonvolatile storage medium. For example: the data center has 10 hard disks: hard disk 1, hard disk 2, hard disk 3, hard disk 4, hard disk 5, hard disk 6, hard disk 7, hard disk 8, hard disk 9, hard disk 10, the initial fault threshold value of presetting that corresponds is X1, X2, X3, X4, X5, X6, X7, X8, X9, X10 respectively, hot spare disk has 3, the hard disk that will break down that screens out according to the initial fault threshold value of presetting has 8: the hard disks 1 to 8 are to reduce an initial preset fault threshold value, the first preset fault threshold values obtained by reducing the initial preset fault threshold value are respectively 50% X1, 50% X2, 50% X3, 50% X4, 50% X5, 50% X6, 50% X7, 50% X8, 50% X9 and 50% X10, and there are 3 hard disks with faults predicted according to the first preset fault threshold value: and replacing the hard disk 1, the hard disk 2 and the hard disk 3 with the hot standby hard disk, and supplementing 3 hot standby hard disks after replacement. Increasing the first preset fault threshold value to obtain a second preset fault threshold value: 60% X1, 60% X2, 60% X3, 60% X4, 60% X5, 60% X6, 60% X7, 60% X8, 60% X9, and 60% X10, where the hard disk with a failure predicted according to the second preset failure threshold value has a value of 3: replacing the hard disk 4, the hard disk 5 and the hard disk 6 with a supplemented hot spare hard disk, then supplementing 3 the hot spare hard disk, and increasing a second preset fault threshold value to obtain a third preset fault threshold value, wherein 80% X1, 80% X2, 80% X3, 80% X4, 80% X5, 80% X6, 80% X7, 80% X8, 80% X9 and 80% X10 are obtained, and the hard disk with faults predicted according to the third preset fault threshold value is 2: and the hard disk 7 and the hard disk 8 replace the hard disk 7 and the hard disk 8 with the supplemented hot standby hard disk, and replace the hard disk 7 and the hard disk 8 with the supplemented hot standby hard disk.
In the embodiment of the invention, the initial preset fault threshold value corresponding to the nonvolatile storage medium is related to the power-on time of the nonvolatile storage medium, the judgment condition is relaxed along with the increase of the power-on time, if the initial preset fault threshold value is increased and the judgment condition is relaxed, the initial preset fault threshold value is increased along with the increase of the power-on time, and if the initial preset fault threshold value is reduced and the judgment condition is relaxed, the initial preset fault threshold value is reduced along with the increase of the power-on time.
In summary, in the embodiments of the present invention, for any one of at least two nonvolatile storage media of a data center, the following are respectively performed: calculating a condition value of any nonvolatile storage medium, wherein the condition value is used for representing the running condition of any nonvolatile storage medium; predicting that any nonvolatile storage medium will fail when the condition value is determined to be smaller than an initial preset failure threshold value corresponding to any nonvolatile storage medium; in the scheme, the nonvolatile storage media with different condition values respectively correspond to different initial preset fault threshold values, that is, the nonvolatile storage media with different condition values respectively correspond to different alarm thresholds, so that the accuracy of the predicted nonvolatile storage media with faults is improved.
For better understanding of the embodiment of the present invention, the following specific application scenarios are given, and further detailed description is made with respect to a process for predicting failure of a non-volatile storage medium, and an architecture diagram is shown in fig. 2:
step 200: the data center has 10 hard disks: hard disk 1, hard disk 2, … … and hard disk 10, and calculating the status value of each hard disk in the 10 hard disks;
step 210: regarding any hard disk in 10 hard disks, taking the hard disk with the condition value smaller than the corresponding initial preset fault threshold value as the predicted hard disk to be in fault, wherein the initial preset fault threshold values corresponding to any two nonvolatile storage media with different condition values are different;
step 220: determining the total number of all predicted hard disks to be in fault, and judging whether the determined total number of all hard disks is less than or equal to the number of hot spare hard disks of the data center; if so, go to step 230, otherwise, go to step 240;
step 230: replacing all the predicted hard disks with hot spare hard disks;
in this step, the number of hot spare hard disks that take over the predicted operation of all the hard disks that will fail is the same as the total number of all the hard disks.
Step 240: reducing initial preset fault threshold values corresponding to the 10 hard disks respectively to obtain a first preset fault threshold value;
in this step, for any two hard disks of all the hard disks of the data center, the amplitude values of the initial preset fault threshold value reductions corresponding to the any two hard disks are the same.
Step 250: judging whether the number of all failed hard disks predicted according to the first preset failure threshold value is smaller than or equal to that of hot standby hard disks of the data center, if so, executing a step 260, otherwise, returning to the step 240;
step 260: replacing the hot standby hard disks for the work of the hard disks with faults predicted according to the first preset fault threshold value, and supplementing the hot standby hard disks with preset quantity;
step 270: increasing the first preset fault threshold value to obtain a second preset fault threshold value, and taking the hard disk with the condition value smaller than the corresponding second preset fault threshold value as the predicted hard disk to be in fault;
step 280: and judging whether the number of the predicted hard disks with faults is 0 and/or the second preset fault threshold value is the initial preset fault threshold value, if so, ending the process, and otherwise, returning to the step 220.
Based on the technical solution of the above corresponding method, referring to fig. 3A, an embodiment of the present invention provides an apparatus for predicting a failure of a non-volatile storage medium, where the apparatus includes a computing unit 30 and a predicting unit 31, where:
a computing unit 30, configured to execute, for any one of at least two nonvolatile storage media of the data center: calculating a condition value of any nonvolatile storage medium, wherein the condition value is used for representing the running condition of any nonvolatile storage medium;
the predicting unit 31 is configured to predict that any one of the nonvolatile storage media will fail when the condition value is determined to be smaller than an initial preset failure threshold value corresponding to the any one of the nonvolatile storage media;
the initial preset fault threshold values corresponding to any two nonvolatile storage media with different condition values are different.
In the embodiment of the present invention, further, the apparatus further includes a determining unit, a judging unit, and a succeeding unit:
a determination unit for determining a total number of all the nonvolatile storage media predicted to malfunction;
the judging unit is used for judging that the total number of all the determined nonvolatile storage media is less than or equal to the number of the hot standby nonvolatile storage media of the data center;
the replacing unit is used for replacing all the predicted nonvolatile storage media which will fail with the hot standby nonvolatile storage media in the hot standby nonvolatile storage media when the judging unit judges that the total number of all the determined nonvolatile storage media is less than or equal to the number of the hot standby nonvolatile storage media in the data center;
the number of hot standby nonvolatile storage media which take over the predicted operation of all the nonvolatile storage media which will be in failure is the same as the total number of all the nonvolatile storage media.
In this embodiment of the present invention, the determining unit is further configured to: when the determined total number of all the nonvolatile storage media is judged to be larger than the number of the hot standby nonvolatile storage media of the data center, respectively executing the following steps for any nonvolatile storage media:
reducing an initial preset fault threshold value corresponding to any nonvolatile storage medium to obtain a first preset fault threshold value;
the prediction unit 31 is configured to further predict that any nonvolatile storage medium will fail when determining that the condition value of any nonvolatile storage medium is smaller than a first preset failure threshold value corresponding to any nonvolatile storage medium;
and when judging that the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values is equal to or less than the number of the hot standby nonvolatile storage media of the data center, the replacing unit replaces all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values by using the hot standby nonvolatile storage media.
In the embodiment of the present invention, optionally, for any two nonvolatile storage media of all the nonvolatile storage media of the data center, the amplitude values of the initial preset fault threshold value reductions corresponding to the any two nonvolatile storage media are the same.
In the embodiment of the present invention, further, the apparatus further includes a supplement unit, configured to supplement a preset number of hot standby nonvolatile storage media; increasing a first preset fault threshold value corresponding to each nonvolatile storage medium of the data center after the initial preset fault threshold value is reduced to obtain a second preset fault threshold value;
the predicting unit 31 is further configured to predict, for any nonvolatile storage medium in which the first preset failure threshold value is increased, that any nonvolatile storage medium will fail when the condition value of any nonvolatile storage medium is smaller than a second preset failure threshold value corresponding to any nonvolatile storage medium;
the judging unit is further used for judging that the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold values is smaller than or equal to the number of the supplementary hot standby nonvolatile storage media with the preset number;
the replacing unit is further used for replacing all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold value by using the supplementary preset number of hot standby nonvolatile storage media when the judging unit judges that the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold value is smaller than or equal to the number of the supplementary preset number of hot standby nonvolatile storage media;
and aiming at each nonvolatile storage medium corresponding to the second preset fault threshold value, the second preset fault threshold value is smaller than or equal to the initial preset fault threshold value of the corresponding nonvolatile storage medium.
Fig. 3B is a schematic structural diagram of an apparatus for predicting failure of a non-volatile storage medium according to an embodiment of the present invention, which includes at least one processor 301, a communication bus 302, a memory 303, and at least one communication interface 304.
The communication bus 302 is used for realizing connection and communication among the above components, and the communication interface 304 is used for connecting and communicating with an external device.
The memory 303 is used for storing executable program codes, and the processor 301 executes the program codes to:
aiming at any one of at least two nonvolatile storage media of the data center, respectively executing:
calculating a condition value of any nonvolatile storage medium, wherein the condition value is used for representing the running condition of any nonvolatile storage medium;
predicting that any nonvolatile storage medium will fail when the condition value is determined to be smaller than an initial preset failure threshold value corresponding to any nonvolatile storage medium;
the initial preset fault threshold values corresponding to any two nonvolatile storage media with different condition values are different.
Further, in this embodiment of the present invention, after predicting that any nonvolatile storage medium will malfunction, the method further includes:
determining a total number of all non-volatile storage media predicted to fail;
when the determined total number of all the nonvolatile storage media is less than or equal to the number of the hot standby nonvolatile storage media of the data center, replacing all the predicted nonvolatile storage media which will fail with the hot standby nonvolatile storage media in the hot standby nonvolatile storage media;
the number of hot standby nonvolatile storage media which take over the predicted operation of all the nonvolatile storage media which will be in failure is the same as the total number of all the nonvolatile storage media.
Further, in this embodiment of the present invention, after determining the total number of all the nonvolatile storage media predicted to malfunction, the processor 301 is further configured to:
when the determined total number of all the nonvolatile storage media is judged to be larger than the number of the hot standby nonvolatile storage media of the data center, respectively executing the following steps for any nonvolatile storage media:
reducing an initial preset fault threshold value corresponding to any nonvolatile storage medium to obtain a first preset fault threshold value, and
when the condition value of any nonvolatile storage medium is determined to be smaller than a first preset fault threshold value corresponding to any nonvolatile storage medium, further predicting that any nonvolatile storage medium will be in fault;
and when the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values is judged to be equal to or less than the number of the hot standby nonvolatile storage media of the data center, the hot standby nonvolatile storage media are used for replacing all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values.
Optionally, in this embodiment of the present invention, the processor 301 is further configured to, for any two nonvolatile storage media of all the nonvolatile storage media of the data center, respectively have the same amplitude value for reducing the initial preset fault threshold value corresponding to any two nonvolatile storage media.
Further, in this embodiment of the present invention, after the processor 301 is further configured to use the hot standby nonvolatile storage medium to replace all the nonvolatile storage media that are predicted to have faults according to the corresponding first preset fault threshold value, the method further includes:
supplementing a preset number of hot standby nonvolatile storage media;
increasing a first preset fault threshold value corresponding to each nonvolatile storage medium of the data center after the initial preset fault threshold value is reduced to obtain a second preset fault threshold value;
for any nonvolatile storage medium with the first preset fault threshold value increased, when the condition value of any nonvolatile storage medium is smaller than a second preset fault threshold value corresponding to any nonvolatile storage medium, predicting that any nonvolatile storage medium will be in fault;
when the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold values is judged to be smaller than or equal to the number of the hot standby nonvolatile storage media with the supplemented preset number, the supplemented preset number of the hot standby nonvolatile storage media is used for replacing all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold values;
and aiming at each nonvolatile storage medium corresponding to the second preset fault threshold value, the second preset fault threshold value is smaller than or equal to the initial preset fault threshold value of the corresponding nonvolatile storage medium.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present invention have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the invention.
It will be apparent to those skilled in the art that various modifications and variations can be made in the embodiments of the present invention without departing from the spirit or scope of the embodiments of the invention. Thus, if such modifications and variations of the embodiments of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to encompass such modifications and variations.

Claims (10)

1. A method for predicting faults of nonvolatile storage media is characterized in that a data center comprises at least two nonvolatile storage media, each nonvolatile storage medium corresponds to an initial preset fault threshold value, the initial preset fault threshold value of each nonvolatile storage medium is related to the power-on time of the nonvolatile storage medium, and the initial preset fault threshold values of any two nonvolatile storage media with different power-on times are different;
the method comprises the following steps:
calculating a condition value of a first non-volatile storage medium, wherein the condition value is used for characterizing the operating condition of the first non-volatile storage medium, and the first non-volatile storage medium is any one of the at least two non-volatile storage media;
predicting that the first non-volatile storage medium will fail when it is determined that the condition value is less than an initial preset failure threshold value corresponding to the first non-volatile storage medium.
2. The method of claim 1, wherein the method further comprises:
calculating a condition value of each non-volatile storage medium;
determining a total number of all non-volatile storage media predicted to fail;
when the total number of all the predicted nonvolatile storage media to be failed is judged to be less than or equal to the number of the hot standby nonvolatile storage media of the data center, replacing the work of all the predicted nonvolatile storage media to be failed with the hot standby nonvolatile storage media in the hot standby nonvolatile storage media;
the number of hot standby nonvolatile storage media which work for replacing all the predicted nonvolatile storage media which will fail is the same as the total number of all the predicted nonvolatile storage media which will fail.
3. The method of claim 2, wherein determining the total number of all non-volatile storage media predicted to fail further comprises:
when the total number of all the determined nonvolatile storage media with predicted faults is judged to be larger than the number of the hot standby nonvolatile storage media of the data center, respectively executing the following steps for any one nonvolatile storage medium:
reducing an initial preset fault threshold value corresponding to any nonvolatile storage medium to obtain a first preset fault threshold value, and
when the condition value of any nonvolatile storage medium is determined to be smaller than a first preset fault threshold value corresponding to any nonvolatile storage medium, further predicting that any nonvolatile storage medium will be in fault;
and when the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values is judged to be equal to or less than the number of the hot standby nonvolatile storage media of the data center, the hot standby nonvolatile storage media are used for replacing all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values.
4. The method of claim 3, wherein the magnitude by which the initial predetermined fault threshold values corresponding to any two of the at least two non-volatile storage media are reduced is the same.
5. The method of claim 3 or 4, wherein after using the hot-standby non-volatile storage media to take over all non-volatile storage media predicted to fail according to the corresponding first preset failure threshold value, further comprising:
supplementing a preset number of hot standby nonvolatile storage media;
increasing a first preset fault threshold value corresponding to each nonvolatile storage medium of the data center after the initial preset fault threshold value is reduced to obtain a second preset fault threshold value;
for any nonvolatile storage medium with the first preset fault threshold value increased, when the condition value of any nonvolatile storage medium is smaller than a second preset fault threshold value corresponding to any nonvolatile storage medium, predicting that any nonvolatile storage medium will be in fault;
when the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold value is judged to be smaller than or equal to the number of the supplemented hot standby nonvolatile storage media with the preset number, the supplemented hot standby nonvolatile storage media with the preset number are used for replacing all the nonvolatile storage media which are predicted to have faults according to the corresponding second preset fault threshold value;
and aiming at each nonvolatile storage medium corresponding to the second preset fault threshold value, the second preset fault threshold value is smaller than or equal to the initial preset fault threshold value of the corresponding nonvolatile storage medium.
6. A device for predicting failure of a nonvolatile storage medium is characterized in that a data center comprises at least two nonvolatile storage media, each nonvolatile storage medium corresponds to an initial preset failure threshold value, the initial preset failure threshold value of each nonvolatile storage medium is related to the power-on time of the nonvolatile storage medium, and the initial preset failure threshold values of any two nonvolatile storage media with different power-on times are different; the device comprises:
a computing unit to: calculating a condition value of a first non-volatile storage medium, wherein the condition value is used for characterizing the operating condition of the first non-volatile storage medium, and the first non-volatile storage medium is any one of the at least two non-volatile storage media;
and the predicting unit is used for predicting that the first nonvolatile storage medium will fail when the condition value is determined to be smaller than an initial preset failure threshold value corresponding to the first nonvolatile storage medium.
7. The apparatus of claim 6, further comprising a determining unit, a judging unit, and a successor unit:
the computing unit is further to: calculating a condition value of each non-volatile storage medium;
the determination unit is used for determining the total number of all the predicted nonvolatile storage media which will fail;
the judging unit is used for judging that the total number of all the determined predicted nonvolatile storage media with faults is less than or equal to the number of hot standby nonvolatile storage media of the data center;
the replacing unit is used for replacing all the predicted nonvolatile storage media which are to be failed with the hot spare nonvolatile storage media in the hot spare nonvolatile storage media when the judging unit judges that the determined total number of all the predicted nonvolatile storage media which are to be failed is less than or equal to the number of the hot spare nonvolatile storage media of the data center;
the number of hot standby nonvolatile storage media which work for replacing all the predicted nonvolatile storage media which will fail is the same as the total number of all the predicted nonvolatile storage media which will fail.
8. The apparatus of claim 7, wherein the determination unit is further to: when the total number of all the determined nonvolatile storage media with predicted faults is judged to be larger than the number of the hot standby nonvolatile storage media of the data center, respectively executing the following steps for any one nonvolatile storage medium:
reducing an initial preset fault threshold value corresponding to any nonvolatile storage medium to obtain a first preset fault threshold value;
the prediction unit is used for further predicting that the any nonvolatile storage medium will fail when determining that the condition value of the any nonvolatile storage medium is smaller than a first preset failure threshold value corresponding to the any nonvolatile storage medium;
and when judging that the total number of all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values is equal to or less than the number of the hot standby nonvolatile storage media of the data center, the replacing unit replaces all the nonvolatile storage media which are predicted to have faults according to the corresponding first preset fault threshold values by using the hot standby nonvolatile storage media.
9. The apparatus of claim 8, wherein the magnitude by which the initial predetermined fault threshold values corresponding to any two of the at least two non-volatile storage media are reduced is the same.
10. The apparatus according to claim 8 or 9, further comprising a supplementing unit for supplementing a preset number of hot-standby nonvolatile storage media; increasing a first preset fault threshold value corresponding to each nonvolatile storage medium of the data center after the initial preset fault threshold value is reduced to obtain a second preset fault threshold value;
the prediction unit is further configured to predict that any nonvolatile storage medium will fail when a condition value of any nonvolatile storage medium is smaller than a second preset failure threshold value corresponding to any nonvolatile storage medium, for any nonvolatile storage medium for which the first preset failure threshold value is increased;
the judging unit is further configured to judge that the total number of all nonvolatile storage media which are predicted to have a fault according to the corresponding second preset fault threshold value is less than or equal to the number of the supplemented hot standby nonvolatile storage media in the preset number;
the replacing unit is further configured to replace, when the determining unit determines that the total number of all the nonvolatile storage media to be failed, which are predicted according to the corresponding second preset failure threshold value, is less than or equal to the number of the supplemented hot standby nonvolatile storage media of the preset number, with all the nonvolatile storage media to be failed, which are predicted according to the corresponding second preset failure threshold value, by the supplemented hot standby nonvolatile storage media of the preset number;
and aiming at each nonvolatile storage medium corresponding to the second preset fault threshold value, the second preset fault threshold value is smaller than or equal to the initial preset fault threshold value of the corresponding nonvolatile storage medium.
CN201910108992.2A 2014-12-25 2014-12-25 Method and device for predicting fault of nonvolatile storage medium Active CN109933448B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910108992.2A CN109933448B (en) 2014-12-25 2014-12-25 Method and device for predicting fault of nonvolatile storage medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910108992.2A CN109933448B (en) 2014-12-25 2014-12-25 Method and device for predicting fault of nonvolatile storage medium
CN201410822384.5A CN105787242B (en) 2014-12-25 2014-12-25 A kind of method and device that prediction non-volatile memory medium breaks down

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201410822384.5A Division CN105787242B (en) 2014-12-25 2014-12-25 A kind of method and device that prediction non-volatile memory medium breaks down

Publications (2)

Publication Number Publication Date
CN109933448A CN109933448A (en) 2019-06-25
CN109933448B true CN109933448B (en) 2021-04-20

Family

ID=56149223

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201410822384.5A Active CN105787242B (en) 2014-12-25 2014-12-25 A kind of method and device that prediction non-volatile memory medium breaks down
CN201910108992.2A Active CN109933448B (en) 2014-12-25 2014-12-25 Method and device for predicting fault of nonvolatile storage medium

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN201410822384.5A Active CN105787242B (en) 2014-12-25 2014-12-25 A kind of method and device that prediction non-volatile memory medium breaks down

Country Status (2)

Country Link
CN (2) CN105787242B (en)
WO (1) WO2016101786A1 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1716428A (en) * 2004-07-01 2006-01-04 华为技术有限公司 Hard disc detecting device and method
CN101604548A (en) * 2009-03-26 2009-12-16 成都市华为赛门铁克科技有限公司 A kind of solid state hard disc and date storage method
CN102033717A (en) * 2010-12-07 2011-04-27 清华大学 Data storage method and system based on RAID (Redundant Array of Inexpensive Disk)
CN102129397A (en) * 2010-12-29 2011-07-20 深圳市永达电子股份有限公司 Method and system for predicating self-adaptive disk array failure
CN103455397A (en) * 2013-09-06 2013-12-18 杭州华为数字技术有限公司 System self-test method, equipment and system
CN103580934A (en) * 2012-07-18 2014-02-12 深圳市腾讯计算机系统有限公司 Cloud business monitoring method and device
CN104020963A (en) * 2014-06-04 2014-09-03 浙江宇视科技有限公司 Method and device for preventing misjudgment of hard disk read-write errors

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6574754B1 (en) * 2000-02-14 2003-06-03 International Business Machines Corporation Self-monitoring storage device using neural networks
US7480828B2 (en) * 2004-06-10 2009-01-20 International Business Machines Corporation Method, apparatus and program storage device for extending dispersion frame technique behavior using dynamic rule sets
JP2006048789A (en) * 2004-08-02 2006-02-16 Hitachi Global Storage Technologies Netherlands Bv Failure predicting method of magnetic disk device and magnetic disk device using the same
US7523359B2 (en) * 2005-03-31 2009-04-21 International Business Machines Corporation Apparatus, system, and method for facilitating monitoring and responding to error events
US7376499B2 (en) * 2005-09-16 2008-05-20 Gm Global Technology Operations, Inc. State-of-health monitoring and fault diagnosis with adaptive thresholds for integrated vehicle stability system
US7627405B2 (en) * 2006-11-17 2009-12-01 Gm Global Technology Operations, Inc. Prognostic for loss of high-voltage isolation
CN101201786B (en) * 2006-12-13 2010-05-19 中兴通讯股份有限公司 Method and device for monitoring fault log
CN101764846B (en) * 2009-12-18 2012-07-11 西南交通大学 Implement method of remote centralized disk array operation monitoring system
CN101872641A (en) * 2009-12-28 2010-10-27 杭州海康威视数字技术股份有限公司 Method and device for early warning failure of hard disk in digital video recorder
US20120102367A1 (en) * 2010-10-26 2012-04-26 International Business Machines Corporation Scalable Prediction Failure Analysis For Memory Used In Modern Computers
CN103197995B (en) * 2012-01-04 2016-06-15 百度在线网络技术(北京)有限公司 Hard disk fault detection method and device
US9146855B2 (en) * 2012-01-09 2015-09-29 Dell Products Lp Systems and methods for tracking and managing non-volatile memory wear
CN104092440B (en) * 2014-07-21 2017-07-28 阳光电源股份有限公司 Photovoltaic system direct current arc fault detection method, device, processor and its system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1716428A (en) * 2004-07-01 2006-01-04 华为技术有限公司 Hard disc detecting device and method
CN101604548A (en) * 2009-03-26 2009-12-16 成都市华为赛门铁克科技有限公司 A kind of solid state hard disc and date storage method
CN102033717A (en) * 2010-12-07 2011-04-27 清华大学 Data storage method and system based on RAID (Redundant Array of Inexpensive Disk)
CN102129397A (en) * 2010-12-29 2011-07-20 深圳市永达电子股份有限公司 Method and system for predicating self-adaptive disk array failure
CN103580934A (en) * 2012-07-18 2014-02-12 深圳市腾讯计算机系统有限公司 Cloud business monitoring method and device
CN103455397A (en) * 2013-09-06 2013-12-18 杭州华为数字技术有限公司 System self-test method, equipment and system
CN104020963A (en) * 2014-06-04 2014-09-03 浙江宇视科技有限公司 Method and device for preventing misjudgment of hard disk read-write errors

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
A pragmatic approach to predict hardware failures in storage systems using MPP database and big data technologies;Rohit Kumar;《 2014 IEEE International Advance Computing Conference (IACC)》;20140327;全文 *
基于智能预警和自修复的高可靠磁盘阵列关键技术研究;胡维;《中国优秀硕士学位论文全文数据库信息科技辑》;20120229;全文 *

Also Published As

Publication number Publication date
WO2016101786A1 (en) 2016-06-30
CN105787242A (en) 2016-07-20
CN109933448A (en) 2019-06-25
CN105787242B (en) 2019-02-26

Similar Documents

Publication Publication Date Title
CN107957692B (en) Controller redundancy method, device and system
CN102890647B (en) Method and device for storing and updating data
CN104376875A (en) Methods and devices for predicting and determining life of storage device
EP3522449B1 (en) Service state transition method and device
CN105512457B (en) Computing system and method for providing information about maintenance activities
WO2010037814A4 (en) A computerised storage system comprising replaceable units for managing testing of replacement units
US9183139B2 (en) Mainboard and method of backing up of baseboard management controller
CN108762118B (en) Fault processing method and device between communication devices
CN105209982A (en) Method and apparatus for controlling a physical unit in an automation system
US9632871B2 (en) Reuse of problematic disks in a redundant disk system
CN103870244A (en) Task stack overflow detection algorithm for SPARC (scalable processor architecture) platform
CN109271270A (en) The troubleshooting methodology, system and relevant apparatus of bottom hardware in storage system
EP3316518A1 (en) Method and device for upgrading virtual network element, and computer storage medium
CN114416284A (en) Distributed operating system control method, apparatus, device, medium, and program product
CN109933448B (en) Method and device for predicting fault of nonvolatile storage medium
CN103890713A (en) Apparatus and method for managing register information in a processing system
JP2016066139A (en) Vehicle control unit
CN104407806A (en) Method and device for revising hard disk information of redundant array group of independent disk (RAID)
CN111130856A (en) Server configuration method, system, equipment and computer readable storage medium
CN115923844A (en) Vehicle driving mode control method, device, electronic equipment and storage medium
CN113986618B (en) Cluster brain fracture automatic repair method, system, device and storage medium
CN112269693B (en) Node self-coordination method, device and computer readable storage medium
CN111404715B (en) Network service instantiation method and device, electronic equipment and storage medium
US12072697B2 (en) Automated device maintenance
CN107729022B (en) Method and device for unloading application program and mobile terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant