WO2017129032A1 - Disk failure prediction method and apparatus - Google Patents

Disk failure prediction method and apparatus Download PDF

Info

Publication number
WO2017129032A1
WO2017129032A1 PCT/CN2017/071699 CN2017071699W WO2017129032A1 WO 2017129032 A1 WO2017129032 A1 WO 2017129032A1 CN 2017071699 W CN2017071699 W CN 2017071699W WO 2017129032 A1 WO2017129032 A1 WO 2017129032A1
Authority
WO
WIPO (PCT)
Prior art keywords
disk
data
sample
tested
dimension
Prior art date
Application number
PCT/CN2017/071699
Other languages
French (fr)
Chinese (zh)
Inventor
丁永明
周俊
崔卿
瞿神全
Original Assignee
阿里巴巴集团控股有限公司
丁永明
周俊
崔卿
瞿神全
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司, 丁永明, 周俊, 崔卿, 瞿神全 filed Critical 阿里巴巴集团控股有限公司
Publication of WO2017129032A1 publication Critical patent/WO2017129032A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2273Test methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3037Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a memory, e.g. virtual memory, cache
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3058Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations

Definitions

  • the present invention relates to the field of magnetic disks, and in particular to a method and apparatus for predicting failure of a magnetic disk.
  • the hard disk is the main medium for storing data, and once the hard disk fails, it will cause huge data loss. Therefore, how to ensure the stability of the hard disk can be very important.
  • the probability of a hard disk error in 24 hours is about one in ten thousand.
  • the probability of a server hard disk error will rise to one thousandth, and with the current website.
  • the number of hard disks that the server needs to use will increase, and the probability of multiple hard disks failing at the same time will increase.
  • data storage usually has multiple backups, such as mysql main and standby libraries, and GFS files default to 3 backups.
  • backups such as mysql main and standby libraries
  • GFS files default to 3 backups.
  • the embodiments of the present invention provide a method and a device for predicting a fault of a disk, so as to at least solve the technical problem that some factors in the prior art hard disk fault prediction system that are likely to cause the fault of the hard disk cannot be collected or quantized are inaccurate.
  • a method for predicting a fault of a magnetic disk includes: acquiring sample disk data of a disk by using a disk monitoring technology, where the sample disk data includes sample data in multiple dimensions; using a Bucking technology The sample disk data is subjected to binning processing to classify the sample disk data; the Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model; after receiving the disk data of the disk to be tested, the disk prediction model is used to treat The disk data of the disk is measured for processing to determine whether the disk to be tested is a failed disk.
  • a fault prediction apparatus for a magnetic disk including: an obtaining module, configured to acquire sample disk data of a disk by using a disk monitoring technology, where the sample disk data includes multiple dimensions. Sample data; a classification module for performing binning processing on sample disk data by using the Buckinging technique, classifying sample disk data; and training module for performing sample training on the classified sample disk data by using the Owlqn model to obtain a disk prediction model
  • the determining module is configured to process the disk data of the disk to be tested after the disk data of the disk to be tested is received, and determine whether the disk to be tested is a faulty disk.
  • the sample disk data of the disk is obtained by using a disk monitoring technology, wherein the sample disk data includes sample data in multiple dimensions; the sample disk data is binned by the Bucking technology, and the sample disk data is processed. Classification; using the Owlqn model to perform sample training on the classified sample disk data to obtain a disk prediction model.
  • the disk prediction data is processed using the disk prediction model, and the disk data is processed. The purpose of determining whether the disk to be tested is a failed disk is achieved.
  • the technical effect of predicting disk failure further solves the technical problem that some of the prior art hard disk failure prediction systems are incapable of causing inaccurate prediction results due to factors that cannot be collected or quantized.
  • FIG. 1 is a block diagram showing the hardware structure of a computer terminal for predicting a failure of a magnetic disk according to an embodiment of the present invention
  • FIG. 2 is a flowchart of a method for predicting a failure of a magnetic disk according to Embodiment 1 of the present invention
  • FIG. 3 is a flowchart of an optional disk fault prediction method according to an embodiment of the present invention.
  • FIG. 4 is a schematic structural diagram of a fault prediction apparatus for a magnetic disk according to an embodiment of the present invention.
  • FIG. 5 is a schematic structural diagram of an optional disk fault prediction apparatus according to an embodiment of the present invention.
  • FIG. 6 is a schematic structural diagram of an optional disk fault prediction apparatus according to an embodiment of the present invention.
  • FIG. 7 is a schematic structural diagram of an optional disk fault prediction apparatus according to an embodiment of the present invention.
  • FIG. 8 is a schematic structural diagram of an optional disk fault prediction apparatus according to an embodiment of the present invention.
  • FIG. 9 is a structural block diagram of a computer terminal according to an embodiment of the present invention.
  • an embodiment of a method for predicting a failure of a magnetic disk there is provided an embodiment of a method for predicting a failure of a magnetic disk. It is to be noted that the steps illustrated in the flowchart of the accompanying drawings may be performed in a computer system such as a set of computer executable instructions, and Although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in a different order than the ones described herein.
  • FIG. 1 is a hardware block diagram of a computer terminal of a method for predicting a failure of a magnetic disk according to an embodiment of the present invention.
  • computer terminal 10 may include one or more (only one shown) processor 102 (processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA)
  • processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA)
  • a memory 104 for storing data
  • a transmission module 106 for communication functions.
  • computer terminal 10 may also include more or fewer components than those shown in FIG. 1, or have a different configuration than that shown in FIG.
  • the memory 104 can be used to store software programs and modules of the application software, such as program instructions/modules corresponding to the fault prediction method of the disk in the embodiment of the present invention, and the processor 102 executes by executing the software program and the module stored in the memory 104.
  • Various functional applications and data processing that is, the above-described method for predicting the failure of the disk.
  • Memory 104 may include high speed random access memory, and may also include non-volatile memory such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory.
  • memory 104 may further include memory remotely located relative to processor 102, which may be coupled to computer terminal 10 via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • Transmission device 106 is for receiving or transmitting data via a network.
  • the network specific examples described above may include a wireless network provided by a communication provider of the computer terminal 10.
  • the transmission device 106 includes a Network Interface Controller (NIC) that can be connected to other network devices through a base station to communicate with the Internet.
  • the transmission device 106 can be a Radio Frequency (RF) module for communicating with the Internet wirelessly.
  • NIC Network Interface Controller
  • RF Radio Frequency
  • FIG. 2 is a flow chart of a method for predicting a failure of a magnetic disk according to a first embodiment of the present invention.
  • Step S21 Obtain sample disk data of the disk by using a disk monitoring technology, where the sample disk data includes sample data in multiple dimensions.
  • disk monitoring technology is used to monitor and record the disk status.
  • the sample disk data may be data throughput performance of the sample disk, motor startup time, seek error rate, and the like.
  • the solution of the foregoing embodiment establishes a model for predicting faults by training the samples of the disk sample data, so that after inputting the sample data of the disk to be tested to the disk monitoring system, the fault state analysis of the disk to be tested can be performed according to the model of the predicted fault, thereby avoiding When analyzing disk failures, the analysis of single or fixed multiple sample data results in the impact of non-statistical or non-quantitable disk data on disk failure prediction results.
  • step S23 the sample disk data is subjected to binning processing by using the Buckinging technology, and the sample disk data is classified.
  • the method for binning the sample disk data includes smoothing the data according to the average value of the data in the box.
  • the data is smoothed according to the intermediate value of the data in the box and the data is smoothed according to the boundary value of the data in the box.
  • multiple sample data in the sample disk data set may be divided into multiple bins.
  • the sample disk data is divided into 5 bins, and the sample disk is When the data is divided into different bins, the sample disk data can be sorted in ascending order, and then the amount of data in each bin can be calculated.
  • the sample disk data is divided into 5 points according to the amount of data that should be in each bin.
  • the box then processes the data in each bin.
  • the method is used to smooth the data according to the average value of the data in the box, that is, the average value of the data in each bin is calculated, and then the score is obtained. All data in the box becomes the average.
  • sample disk data is binned for smoothing the data in each bin. Since the data in each bin is similar, the binning process achieves stable and smooth data. Based on this, it does not affect the results of training the sample disk data in the next step.
  • the method for performing binning processing on the sample disk data includes any one of the foregoing embodiments, and is not limited thereto, and any method capable of achieving smooth or stable data can be used for the sample. Box processing of disk data.
  • step S25 the Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model.
  • the sample disk data is trained to input the processed sample disk data to the Owlqn model, wherein the sample disk data is a sample that knows the true value in advance, and the true value of the sample may be 1 or 0. Indicates that the sample is a positive or negative sample, a positive sample indicates that the sample is a failed disk, and a negative sample indicates that the sample is a normal disk.
  • each input sample disk data can obtain a corresponding output value from the Owlqn model, and after obtaining the corresponding output value of each sample in the sample disk data set, all positive samples are obtained.
  • the output value constitutes a positive sample output value interval, and the output values of all negative samples are also obtained to form an output interval of the negative sample, thereby obtaining a disk prediction model.
  • Step S27 After receiving the disk data of the disk to be tested, use the disk prediction model to process the disk data of the disk to be tested, and determine whether the disk to be tested is a faulty disk.
  • sample data of the sample disk data is trained by using the Owlqn model
  • the sample data after classifying the sample disk data is used, and the classified sample data is subjected to binning processing, so that the classified samples are processed.
  • the sample data in each category is discretized so that the sample data of the sample disk can be trained.
  • the sample disk data may include: an underlying data read error rate, a start/stop count, a number of remapping sectors, a power-on time accumulation, a spindle spin retries, and a disk calibration retry.
  • the number of times, the number of disk power-ons, the temperature, and the write error rate can be used to obtain sample disk data based on historical disk failure conditions. For example, sample acquisition can be performed at a ratio of 1:5 to positive and negative samples, where the positive sample is the faulty disk and the negative sample is the disk with no fault.
  • the disks used by the various organizations that predict the disk failure are not necessarily the same, and the environmental factors such as temperature and humidity of the various mechanisms affect the disk.
  • the ratio of the disks of different organizations is different.
  • the sample disk data can also be obtained according to the actual disk damage of the mechanism.
  • the sample disk data is SMART disk data, wherein the sample disk data includes at least sample data in the following four dimensions: original value, standard value, worst value, and accumulation. value.
  • the above-mentioned original value is the current parameter of the disk running time; the above-mentioned standard value is the value of each parameter of the normal disk running; the above-mentioned worst value is that when the disk is running, the detection parameters of the disk have the largest deviation from the normal value.
  • Normal value is the cumulative result of each disk's detection parameters from disk usage to the current time.
  • the parameters of the disk may be information describing various attributes of the disk, and may include an error read rate, a power-on frequency, a number of re-allocated sectors, a number of rotation retries, One or more of the number of disk calibration retries and the parity error rate may also include other attribute information of the disk.
  • the sample disk data can be obtained by using software such as HDTune or CrystalDiskInfo.
  • the method further includes:
  • Step S211 performing any one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distribution sum operation, so that the sample data in any one dimension is expanded to the sample data in the new dimension.
  • the original value in the sample data is subjected to a difference operation, a square operation, and a distribution sum operation, thereby obtaining a difference value of the original value, a pool value of the original value, and a distribution sum of the original values.
  • Value so based on the original value of the sample disk data, the sample disk data of the other four dimensions is obtained; the standard value, the worst value, and the accumulated value in the sample data can also be used as the above operations, and more Sample disk data for dimensions.
  • performing multiple operations on the sample disk data to obtain more dimensional sample disk data can improve the utilization of the sample disk data and train the sample disk data.
  • the accuracy of the fault prediction model is improved.
  • step S23 uses the Bucking technology to perform binning processing on the sample disk data, and classifies the sample disk data, including:
  • step S231 the value range of each bin divided in advance and the ID value corresponding to each bin are determined.
  • the purpose of the range of values of each bin is to determine the bin corresponding to the data in the sample disk data set, that is, the bin corresponding to the range to which the sample disk data belongs is the data of the sample disk.
  • Binning Determine the ID value of each bin to distinguish between different bins.
  • Step S233 classifying the sample disk data by discretizing the sample data in each dimension to a corresponding bin, and obtaining an ID value corresponding to the sample data in each dimension.
  • the data assigned to the bin is replaced by the ID number of the bin, that is, the sample disk data in each dimension is It is replaced with the bin ID value corresponding to the sample disk data, so that the data in each dimension of the original sample disk data is replaced with the above integer value.
  • each bin has an ID value of 1, 2, 3, 4, 5, each segment
  • the ID value of the sample disk data A may be 10100. According to the scheme in the above embodiment, the sample data in each dimension can be obtained with the ID value corresponding thereto.
  • the Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model, including:
  • Step S251 the Owlqn model trains the ID values corresponding to the sample data in each dimension to obtain the weight values of the sample data in each dimension.
  • the weight value of the sample data in each dimension is the probability that the sample is "1", that is, the probability that the sample is a positive sample.
  • the disk data to be tested is represented as among them, y i is 0 or 1.
  • the Owlqn model outputs the weight value of each disk characteristic data, that is, the probability that each disk characteristic data is faulty disk data.
  • the weight value can be calculated by the following formula: weight value i is used to represent the i-th sample, n is used to represent n dimensions, k is to represent any dimension between 1 and n, and w k is used to represent the weight value in the k-dimension, where w 0 is the intercept, requiring attention
  • the weight value of the output needs to meet the conditions: The minimum value can be obtained, and J is the optimization objective function.
  • Step S253 determining a disk prediction model according to the sample data and the corresponding weight value in each dimension, wherein the disk prediction model includes the prediction result of the sample data in each dimension.
  • calculating a predicted value of the disk to be tested may be calculated according to the following formula:
  • the above predicted value is the predicted result obtained from the training sample disk data. Since the sample disk is a faulty disk, the known value is obtained. Therefore, after the prediction result is obtained, the prediction result of the positive sample disk and the prediction result of the negative sample disk are distinguished. , get the range of the predicted value of the failed disk and the range of the predicted value of the normal disk.
  • the ID value corresponding to the sample data is input to the Owlqn model, and the fault state of the sample disk corresponding to the ID value is input to the Owlqn model, so that the Owlqn model stores the ID value and the disk fault status corresponding to the ID value. Then, input the ID value repeatedly to the Owlqn model to verify whether the Owlqn model can output the fault status corresponding to the ID value.
  • the prediction result of the sample data in each dimension is a predicted value obtained by classifying the sample disk data.
  • the step S27 uses the disk prediction model to process the disk data of the disk to be tested, and determines whether the disk to be tested is a failed disk.
  • Step S271 After receiving the disk data of the disk to be tested, the disk data of the disk to be tested is discretized to a corresponding bin, and the ID value corresponding to the disk data of the disk to be tested is obtained.
  • the disc data of the disk to be tested is discretized to the corresponding sub-box, and the ID value corresponding to the disk data of the disk to be tested is obtained, which can be implemented by using the solution in step S231 to step S233 in the above embodiment. .
  • Step S273 Determine a weight value of the disk data of the disk to be tested according to the ID value corresponding to the disk data of the disk to be tested.
  • the disk data to be tested is represented as among them, y i is 0 or 1.
  • the weight value of each disk characteristic data is output, that is, the probability that each disk characteristic data is faulty disk data.
  • Step S275 Determine, according to the weight value of the disk data of the disk to be tested, whether the disk to be tested is a failed disk from the disk prediction model.
  • calculating a predicted value of the disk to be tested may be calculated according to the following formula: After obtaining the predicted value of the disk to be tested, compare the predicted value of the disk to be tested with the value range of the positive sample obtained by the training sample disk data and the value range of the negative sample, if the predicted value of the disk to be tested falls into the positive value If the value of the sample is a faulty disk, the disk to be tested is considered to be a faulty disk. If the predicted value of the disk to be tested falls within the range of the negative sample, the disk to be tested can be considered as a normal disk.
  • the method may include the following steps S31 to S37:
  • the sample data of the sample disk may be SMART disk data.
  • the sample disk data can be obtained by using software such as HDTune or CrystalDiskInfo.
  • the difference operation refers to a value obtained by performing difference calculation between the feature data of the disk at a certain time and the feature data of the disk before 24 hours.
  • the above steps perform any one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distribution sum operation, so that the sample data in any one dimension is expanded to the sample data in the new dimension.
  • the purpose of the value range of each bin divided by the above steps is to determine the bin corresponding to the data in the sample disk data set, that is, the bin corresponding to the range to which the sample disk data belongs is the bin to which the sample disk data belongs. . Determine the ID value of each bin to distinguish different bins and discretize the data in each bin.
  • the sample data of the sample disk is trained by the Owlqn model. Disk prediction model.
  • the disk prediction model constructed by the above steps is used to predict the disk to be tested, and after obtaining the predicted value, the predicted value range in the model is compared to obtain the prediction result of the disk to be tested.
  • the apparatus includes: an acquisition module 40, a classification module 42, a training module 44, and a determination module 46.
  • the obtaining module 40 is configured to obtain sample disk data of the disk by using a disk monitoring technology, where the sample disk data includes sample data in multiple dimensions, and the classifying module 42 is configured to perform binning processing on the sample disk data by using a bucketing technology.
  • the training module 44 is configured to perform sample training on the classified sample disk data by using the Owlqn model to obtain a disk prediction model, and the determining module 46 is configured to: after receiving the disk data of the disk to be tested, Use the disk prediction model to process the disk data of the disk to be tested to determine whether the disk to be tested is a failed disk.
  • the example and the application scenario implemented by the step S21 to the step S27 of the first embodiment are the same as the application scenario, but are not limited to the above embodiment.
  • a public content It should be noted that the above module can be operated as part of the device in the computer terminal 10 provided in the first embodiment.
  • the sample disk data is SMART disk data, wherein the sample disk data includes at least sample data in the following four dimensions: original value, standard value, worst value, and accumulation. value.
  • the device further includes:
  • the operation module 50 is configured to perform any one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distribution sum operation, so that the sample data in any one dimension is expanded into a new dimension. Sample data.
  • step S211 of the embodiment the example and the application scenario implemented by the above-mentioned obtaining module 50 corresponding to step S211 of the embodiment are the same, but are not limited to the content disclosed in the first embodiment. It should be noted that the above module can be operated as part of the device in the computer terminal 10 provided in the first embodiment.
  • the classification module 42 includes:
  • a first determining sub-module 60 configured to determine a range of values of each binning pre-divided and an ID value corresponding to each bin; a sub-module 62 for discretizing sample data in each dimension to The corresponding bins are used to classify the sample disk data to obtain the ID value corresponding to the sample data in each dimension.
  • the first determining sub-module 60 and the categorizing sub-module 62 are the same as the example and the application scenario implemented by the step S231 and the step S233 of the embodiment, but are not limited to the one disclosed in the first embodiment. content. It should be noted that the above module can be operated as part of the device in the computer terminal 10 provided in the first embodiment.
  • the training module 44 includes:
  • the training sub-module 70 is configured to train the Owlqn model to match the ID value corresponding to the sample data in each dimension to obtain the weight value of the sample data in each dimension; and the second determining sub-module 72 is configured to use each dimension according to each dimension
  • the disk prediction model is determined by the sample data and the corresponding weight value, wherein the disk prediction model includes prediction results of the sample data in each dimension.
  • the training sub-module 70 and the second determining sub-module 72 are the same as the example and the application scenario implemented by the step S251 and the step S253 of the embodiment, but are not limited to the one disclosed in the first embodiment. content. It should be noted that the above module can be operated as part of the device in the computer terminal 10 provided in the first embodiment.
  • the prediction result of the sample data in each dimension is a predicted value obtained by classifying the sample disk data.
  • the determining module 46 further includes:
  • the discretization module 80 is configured to discretize the disk data of the disk to be tested to the corresponding sub-box, and obtain the ID value corresponding to the disk data of the disk to be tested; the third determining sub-module 82 Determining, according to the ID value corresponding to the disk data of the disk to be tested, the weight value of the disk data of the disk to be tested; the fourth determining sub-module 84, configured to predict the model from the disk according to the weight value of the disk data of the disk to be tested. Determine if the disk to be tested is a failed disk.
  • the above-mentioned discrete module 80, the third determining sub-module 82, and the fourth determining sub-module 84 are the same as the examples and application scenarios implemented in step S271 and step S275 of the embodiment, but are not limited to the above.
  • Embodiments of the present invention may provide a computer terminal, which may be any one of computer terminal groups.
  • the foregoing computer terminal may also be replaced with a terminal device such as a mobile terminal.
  • the computer terminal may be located in at least one network device of the plurality of network devices of the computer network.
  • the computer terminal may execute the program code of the following steps in the method for predicting the fault of the disk: acquiring the sample disk data of the disk by using the disk monitoring technology, wherein the sample disk data includes sample data in multiple dimensions; using Bucking The technology performs binning processing on the sample disk data, classifies the sample disk data, and uses the Owlqn model to perform sample training on the classified sample disk data to obtain a disk prediction model; after receiving the disk data of the disk to be tested, using the disk prediction The model processes the disk data of the disk to be tested to determine whether the disk to be tested is a failed disk.
  • FIG. 9 is a structural block diagram of a computer terminal according to an embodiment of the present invention.
  • the computer terminal A may include one or more (only one shown in the figure) processor 91, memory 93, and transmission device 95.
  • the memory can be used to store the software program and the module, such as the fault prediction method of the disk and the program instruction/module corresponding to the device in the embodiment of the present invention, and the processor executes various programs by running the software program and the module stored in the memory. Functional application and data processing, that is, the above-described method for predicting the failure of the disk.
  • the memory may include a high speed random access memory, and may also include non-volatile memory such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory.
  • the memory can further include memory remotely located relative to the processor, which can be connected to terminal A via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the processor may call the memory stored information and the application program by the transmission device to perform the following steps: acquiring the sample disk data of the disk by using a disk monitoring technology, where the sample disk data includes sample data in multiple dimensions; using the Bucking technology The sample disk data is subjected to binning processing to classify the sample disk data; the Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model; after receiving the disk data of the disk to be tested, the disk prediction model is used to treat The disk data of the disk is measured for processing to determine whether the disk to be tested is a failed disk.
  • the foregoing processor may further execute the following program code: the sample disk data is SMART disk data, wherein the sample disk data includes at least sample data in the following four dimensions: original value, standard value, worst value, and Cumulative value.
  • the foregoing processor may further execute the following program code: perform any one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distributed sum operation, so that any one dimension The sample data is expanded out of the sample data on the new dimension.
  • the foregoing processor may further execute the following program code: determining a range of values of each of the pre-divided bins and an ID value corresponding to each bin; and discretizing the sample data in each dimension to The corresponding bins are used to classify the sample disk data to obtain the ID value corresponding to the sample data in each dimension.
  • the foregoing processor may further execute the following program code: the Owlqn model trains the ID value corresponding to the sample data in each dimension, and obtains the weight value of the sample data in each dimension; according to each dimension
  • the disk prediction model is determined by the sample data and the corresponding weight value, wherein the disk prediction model includes prediction results of the sample data in each dimension.
  • the foregoing processor may further execute program code of the following steps: the prediction result of the sample data in each dimension is a predicted value obtained by classifying the sample disk data.
  • the foregoing processor may further execute the following program code: after receiving the disk data of the disk to be tested, discretizing the disk data of the disk to be tested to a corresponding bin, and obtaining the disk data of the disk to be tested. ID value; determining the weight value of the disk data of the disk to be tested according to the ID value corresponding to the disk data of the disk to be tested; determining whether the disk to be tested is a fault disk from the disk prediction model according to the weight value of the disk data of the disk to be tested .
  • the sample disk data of the disk is obtained by using a disk monitoring technology, wherein the sample disk data includes sample data in multiple dimensions; the sample disk data is binned by the Bucking technology, and the sample disk data is processed. Classification; using the Owlqn model to perform sample training on the classified sample disk data to obtain a disk prediction model. After receiving the disk data of the disk to be tested, the disk prediction data is processed using the disk prediction model, and the disk data is processed.
  • the purpose of determining whether the disk to be tested is a faulty disk is to achieve the technical effect of predicting the disk failure, thereby solving the problem that some factors in the prior art hard disk fault prediction system that are likely to cause the hard disk failure cannot be collected or quantized. Accurate technical issues.
  • FIG. 9 is only an illustration, and the computer terminal can also be a smart phone (such as an Android mobile phone, an iOS mobile phone, etc.), a tablet computer, an applause computer, and a mobile Internet device (Mobile Internet Devices, MID). ), PAD and other terminal devices.
  • FIG. 9 does not limit the structure of the above electronic device.
  • computer terminal A may also include more or fewer components (such as a network interface, display device, etc.) than shown in FIG. 9, or have a different configuration than that shown in FIG.
  • the storage medium may include a flash disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disk, and the like.
  • Embodiments of the present invention also provide a storage medium.
  • the storage medium may be used to save the program code executed by the fault prediction method of the disk provided in the first embodiment.
  • the foregoing storage medium may be located in any one of the computer terminal groups in the computer network, or in any one of the mobile terminal groups.
  • the storage medium is configured to store program code for performing the following steps: acquiring sample disk data of the disk by using a disk monitoring technology, wherein the sample disk data includes sample data in multiple dimensions;
  • the sample disk data is binned by the Bucking technology to classify the sample disk data.
  • the Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model.
  • After receiving the disk data of the disk to be tested, the disk data is used.
  • the disk prediction model processes the disk data of the disk to be tested to determine whether the disk to be tested is a failed disk.
  • the storage medium is further configured to store program code for performing the following steps: the sample disk data is SMART disk data, wherein the sample disk data includes at least sample data in the following four dimensions: original value, standard value , the worst value and the cumulative value.
  • the storage medium is further configured to store program code for performing the following steps: performing one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distribution sum operation,
  • the sample data in any one dimension is expanded to the sample data on the new dimension.
  • the foregoing storage medium is further configured to store program code for performing the following steps: determining a range of values of each of the pre-divided bins and an ID value corresponding to each bin; The sample data in each dimension is discretized to the corresponding bins to classify the sample disk data to obtain the ID value corresponding to the sample data in each dimension.
  • the storage medium is further configured to store program code for performing the following steps: the Owlqn model trains the ID value corresponding to the sample data in each dimension to obtain the weight value of the sample data in each dimension.
  • the disk prediction model is determined based on the sample data and the corresponding weight values in each dimension, wherein the disk prediction model includes prediction results of the sample data in each dimension.
  • the storage medium is further configured to store program code for performing the following steps: the prediction result of the sample data on each dimension is a predicted value obtained by classifying the sample disk data.
  • the foregoing storage medium is further configured to store program code for performing the following steps: after receiving the disk data of the disk to be tested, discretizing the disk data of the disk to be tested to a corresponding bin, and obtaining the disk to be tested.
  • the ID value corresponding to the disk data determining the weight value of the disk data of the disk to be tested according to the ID value corresponding to the disk data of the disk to be tested; determining the test value from the disk prediction model according to the weight value of the disk data of the disk to be tested Whether the disk is a failed disk.
  • the disclosed technical contents may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, unit or module, and may be electrical or otherwise.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place. Square, or it can be distributed to multiple network elements. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • the technical solution of the present invention which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium.
  • a number of instructions are included to cause a computer device (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, and the like. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Computer Hardware Design (AREA)
  • Mathematical Physics (AREA)
  • Debugging And Monitoring (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Recording Or Reproducing By Magnetic Means (AREA)

Abstract

Disclosed are a disk failure prediction method and apparatus. The method comprises: acquiring sample disk data of a disk through disk monitoring technology, the sample disk data comprising sample data on a plurality of dimensions; binning the sample disk data by using Bucketing technology, and classifying the sample disk data; performing sample training on the classified sample disk data by using an Owlqn model, to obtain a disk prediction model; and after disk data of a disk to be predicted is received, processing the disk data of the disk to be predicted by using the disk prediction model, and determining whether the disk to be predicted is a faulty disk. The method solves the technical problem in the prior art of an inaccurate prediction result caused by the fact that some factors easily causing hard disk failures cannot be collected or quantized in a hard disk failure prediction system.

Description

磁盘的故障预测方法和装置Disk failure prediction method and device 技术领域Technical field
本发明涉及磁盘领域,具体而言,涉及一种磁盘的故障预测方法和装置。The present invention relates to the field of magnetic disks, and in particular to a method and apparatus for predicting failure of a magnetic disk.
背景技术Background technique
目前,硬盘是存储数据的主要介质,硬盘一旦出故障,便会造成巨大的数据损失。因此如何保证硬盘的稳定性能非常重要。在通常状态下,硬盘在24小时中出错的概率在是万分之一左右,当一台服务器具有十块硬盘时,服务器硬盘出错的概率就会上升到千分之一,而随着当前网站等业务的发展,服务器需要使用的硬盘会越来越多,多块硬盘同时出错的概率也会提升。At present, the hard disk is the main medium for storing data, and once the hard disk fails, it will cause huge data loss. Therefore, how to ensure the stability of the hard disk can be very important. Under normal conditions, the probability of a hard disk error in 24 hours is about one in ten thousand. When a server has ten hard disks, the probability of a server hard disk error will rise to one thousandth, and with the current website. As the business develops, the number of hard disks that the server needs to use will increase, and the probability of multiple hard disks failing at the same time will increase.
通常情况下,数据存储通常会有多个备份,如mysql主备库,GFS文件默认3个备份。在大量数据存储平台上,如果多个硬盘同时出故障,那么这些硬盘上存储着同一个文件的备份的概率就会很高,即如果多块硬盘同时出现故障,就会导致一些文件的丢失,对于一些线上的服务,大都依赖于服务器中存储的海量数据,如果硬盘出故障,就会导致上述在线服务异常,甚至暂停使用。Usually, data storage usually has multiple backups, such as mysql main and standby libraries, and GFS files default to 3 backups. On a large number of data storage platforms, if multiple hard disks fail at the same time, the probability of storing the same file on these hard disks will be high. That is, if multiple hard disks fail at the same time, some files will be lost. For some online services, most of them depend on the huge amount of data stored in the server. If the hard disk fails, the above online service will be abnormal or even suspended.
由于上述原因,需要具有预测硬盘是否会出错的系统需要有一套系统能提前告诉我们哪些硬盘会出错,数据可能丢失导致硬盘故障的原因有很多,最常见的有以下几种:外部振动、温度和湿度、电器元件损坏、声音和灰尘,在上述因素中,有些因素能够被采集到,比如温度和湿度、一些元器件数据,但是更多的数据无法被采集和量化,因此便会导致预测结果不准确。For the above reasons, systems that need to predict whether the hard disk will go wrong need a system that can tell us in advance which hard disks will go wrong. There are many reasons why the data may be lost. The most common ones are: external vibration, temperature and Humidity, electrical component damage, sound and dust, some of the above factors can be collected, such as temperature and humidity, some component data, but more data can not be collected and quantified, so it will lead to prediction results accurate.
针对现有技术的硬盘故障预测系统中一些容易致使硬盘故障的因素 不能被采集胡或量化导致的预测结果不准确的问题,目前尚未提出有效的解决方案。Some factors in the prior art hard disk failure prediction system that easily cause hard disk failure There is no effective solution to the problem of inaccurate prediction results that cannot be collected or quantified.
发明内容Summary of the invention
本发明实施例提供了一种磁盘的故障预测方法和装置,以至少解决现有技术的硬盘故障预测系统中一些容易致使硬盘故障的因素不能被采集或量化导致的预测结果不准确的技术问题。The embodiments of the present invention provide a method and a device for predicting a fault of a disk, so as to at least solve the technical problem that some factors in the prior art hard disk fault prediction system that are likely to cause the fault of the hard disk cannot be collected or quantized are inaccurate.
根据本发明实施例的一个方面,提供了一种磁盘的故障预测方法,包括:通过磁盘监控技术获取磁盘的样本磁盘数据,其中,样本磁盘数据包括多个维度上的样本数据;采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类;采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型;在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,确定待测磁盘是否为故障磁盘。According to an aspect of the embodiments of the present invention, a method for predicting a fault of a magnetic disk includes: acquiring sample disk data of a disk by using a disk monitoring technology, where the sample disk data includes sample data in multiple dimensions; using a Bucking technology The sample disk data is subjected to binning processing to classify the sample disk data; the Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model; after receiving the disk data of the disk to be tested, the disk prediction model is used to treat The disk data of the disk is measured for processing to determine whether the disk to be tested is a failed disk.
根据本发明实施例的另一方面,还提供了一种磁盘的故障预测装置,包括:获取模块,用于通过磁盘监控技术获取磁盘的样本磁盘数据,其中,样本磁盘数据包括多个维度上的样本数据;分类模块,用于采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类;训练模块,用于采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型;确定模块,用于在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,确定待测磁盘是否为故障磁盘。According to another aspect of the present invention, a fault prediction apparatus for a magnetic disk is provided, including: an obtaining module, configured to acquire sample disk data of a disk by using a disk monitoring technology, where the sample disk data includes multiple dimensions. Sample data; a classification module for performing binning processing on sample disk data by using the Buckinging technique, classifying sample disk data; and training module for performing sample training on the classified sample disk data by using the Owlqn model to obtain a disk prediction model The determining module is configured to process the disk data of the disk to be tested after the disk data of the disk to be tested is received, and determine whether the disk to be tested is a faulty disk.
在本发明实施例中,采用通过磁盘监控技术获取磁盘的样本磁盘数据,其中,样本磁盘数据包括多个维度上的样本数据;采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类;采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型的方式,通过在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,达到了确定待测磁盘是否为故障磁盘的目的,从而实现了 预测磁盘故障的技术效果,进而解决了现有技术的硬盘故障预测系统中一些容易致使硬盘故障的因素不能被采集或量化导致的预测结果不准确的技术问题。In the embodiment of the present invention, the sample disk data of the disk is obtained by using a disk monitoring technology, wherein the sample disk data includes sample data in multiple dimensions; the sample disk data is binned by the Bucking technology, and the sample disk data is processed. Classification; using the Owlqn model to perform sample training on the classified sample disk data to obtain a disk prediction model. After receiving the disk data of the disk to be tested, the disk prediction data is processed using the disk prediction model, and the disk data is processed. The purpose of determining whether the disk to be tested is a failed disk is achieved. The technical effect of predicting disk failure further solves the technical problem that some of the prior art hard disk failure prediction systems are incapable of causing inaccurate prediction results due to factors that cannot be collected or quantized.
附图说明DRAWINGS
此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:The drawings described herein are intended to provide a further understanding of the invention, and are intended to be a part of the invention. In the drawing:
图1是根据本发明实施例的一种磁盘的故障预测方法的计算机终端的硬件结构框图;1 is a block diagram showing the hardware structure of a computer terminal for predicting a failure of a magnetic disk according to an embodiment of the present invention;
图2是根据本发明实施例一的磁盘的故障预测方法的流程图;2 is a flowchart of a method for predicting a failure of a magnetic disk according to Embodiment 1 of the present invention;
图3是根据本发明实施例的一种可选的磁盘的故障预测方法的流程图;3 is a flowchart of an optional disk fault prediction method according to an embodiment of the present invention;
图4是根据本发明实施例的一种磁盘的故障预测装置的结构示意图;4 is a schematic structural diagram of a fault prediction apparatus for a magnetic disk according to an embodiment of the present invention;
图5是根据本发明实施例的一种可选的磁盘的故障预测装置的结构示意图;FIG. 5 is a schematic structural diagram of an optional disk fault prediction apparatus according to an embodiment of the present invention; FIG.
图6是根据本发明实施例的一种可选的磁盘的故障预测装置的结构示意图;6 is a schematic structural diagram of an optional disk fault prediction apparatus according to an embodiment of the present invention;
图7是根据本发明实施例的一种可选的磁盘的故障预测装置的结构示意图;7 is a schematic structural diagram of an optional disk fault prediction apparatus according to an embodiment of the present invention;
图8是根据本发明实施例的一种可选的磁盘的故障预测装置的结构示意图;以及8 is a schematic structural diagram of an optional disk fault prediction apparatus according to an embodiment of the present invention;
图9是根据本发明实施例的一种计算机终端的结构框图。FIG. 9 is a structural block diagram of a computer terminal according to an embodiment of the present invention.
具体实施方式detailed description
为了使本技术领域的人员更好地理解本发明方案,下面将结合本发明 实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分的实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围。In order to provide a better understanding of the present invention by those skilled in the art, the present invention will be described below. The embodiments of the present invention are clearly and completely described in the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts shall fall within the scope of the present invention.
需要说明的是,本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本发明的实施例能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。It is to be understood that the terms "first", "second" and the like in the specification and claims of the present invention are used to distinguish similar objects, and are not necessarily used to describe a particular order or order. It is to be understood that the data so used may be interchanged where appropriate, so that the embodiments of the invention described herein can be implemented in a sequence other than those illustrated or described herein. In addition, the terms "comprises" and "comprises" and "the" and "the" are intended to cover a non-exclusive inclusion, for example, a process, method, system, product, or device that comprises a series of steps or units is not necessarily limited to Those steps or units may include other steps or units not explicitly listed or inherent to such processes, methods, products or devices.
实施例1Example 1
根据本发明实施例,提供了一种磁盘的故障预测方法实施例,需要说明的是,在附图的流程图示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行,并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。According to an embodiment of the present invention, there is provided an embodiment of a method for predicting a failure of a magnetic disk. It is to be noted that the steps illustrated in the flowchart of the accompanying drawings may be performed in a computer system such as a set of computer executable instructions, and Although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in a different order than the ones described herein.
本申请实施例一所提供的方法实施例可以在移动终端、计算机终端或者类似的运算装置中执行。以运行在计算机终端上为例,图1是根据本发明实施例的一种磁盘的故障预测方法的计算机终端的硬件结构框图。如图1所示,计算机终端10可以包括一个或多个(图中仅示出一个)处理器102(处理器102可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置)、用于存储数据的存储器104、以及用于通信功能的传输模块106。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对上述电子装置的结构造成限定。例如,计算机终端10还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。 The method embodiment provided in Embodiment 1 of the present application can be executed in a mobile terminal, a computer terminal or the like. Taking a computer terminal as an example, FIG. 1 is a hardware block diagram of a computer terminal of a method for predicting a failure of a magnetic disk according to an embodiment of the present invention. As shown in FIG. 1, computer terminal 10 may include one or more (only one shown) processor 102 (processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA) A memory 104 for storing data, and a transmission module 106 for communication functions. It will be understood by those skilled in the art that the structure shown in FIG. 1 is merely illustrative and does not limit the structure of the above electronic device. For example, computer terminal 10 may also include more or fewer components than those shown in FIG. 1, or have a different configuration than that shown in FIG.
存储器104可用于存储应用软件的软件程序以及模块,如本发明实施例中的磁盘的故障预测方法对应的程序指令/模块,处理器102通过运行存储在存储器104内的软件程序以及模块,从而执行各种功能应用以及数据处理,即实现上述的磁盘的故障预测方法。存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器104可进一步包括相对于处理器102远程设置的存储器,这些远程存储器可以通过网络连接至计算机终端10。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 104 can be used to store software programs and modules of the application software, such as program instructions/modules corresponding to the fault prediction method of the disk in the embodiment of the present invention, and the processor 102 executes by executing the software program and the module stored in the memory 104. Various functional applications and data processing, that is, the above-described method for predicting the failure of the disk. Memory 104 may include high speed random access memory, and may also include non-volatile memory such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory. In some examples, memory 104 may further include memory remotely located relative to processor 102, which may be coupled to computer terminal 10 via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
传输装置106用于经由一个网络接收或者发送数据。上述的网络具体实例可包括计算机终端10的通信供应商提供的无线网络。在一个实例中,传输装置106包括一个网络适配器(Network Interface Controller,NIC),其可通过基站与其他网络设备相连从而可与互联网进行通讯。在一个实例中,传输装置106可以为射频(Radio Frequency,RF)模块,其用于通过无线方式与互联网进行通讯。 Transmission device 106 is for receiving or transmitting data via a network. The network specific examples described above may include a wireless network provided by a communication provider of the computer terminal 10. In one example, the transmission device 106 includes a Network Interface Controller (NIC) that can be connected to other network devices through a base station to communicate with the Internet. In one example, the transmission device 106 can be a Radio Frequency (RF) module for communicating with the Internet wirelessly.
在上述运行环境下,本申请提供了如图2所示的磁盘的故障预测方法。图2是根据本发明实施例一的磁盘的故障预测方法的流程图。In the above operating environment, the present application provides a method for predicting a failure of a disk as shown in FIG. 2. 2 is a flow chart of a method for predicting a failure of a magnetic disk according to a first embodiment of the present invention.
步骤S21,通过磁盘监控技术获取磁盘的样本磁盘数据,其中,样本磁盘数据包括多个维度上的样本数据。Step S21: Obtain sample disk data of the disk by using a disk monitoring technology, where the sample disk data includes sample data in multiple dimensions.
在上述步骤中,磁盘监控技术用于监视并记录磁盘状态,In the above steps, disk monitoring technology is used to monitor and record the disk status.
在一种可选的实施例中,样本磁盘数据可以是样本磁盘的数据吞吐性能、马达启动时间、寻道错误率等。In an alternative embodiment, the sample disk data may be data throughput performance of the sample disk, motor startup time, seek error rate, and the like.
此处需要说明的是,在使用现有技术(例如S.M.A.R.T,自我监测、分析及报告技术)对磁盘进行监测时,能够得到多维度的体现磁盘状态的数据,以及根据监测得到的数据对磁盘是否故障,或是否会在未来的较短时间内发生故障做出分析,这样的分析在磁盘监测技术监测的数据的基础上进行的,然而磁盘的状态还能通过其他数据量体现,这些数据量可能是 不能被检测或不能被量化的数据量,因此本申请建立了磁盘预测模型,使用磁盘预测模型对磁盘的故障状态进行分析,其中,磁盘预测模型由Owlqn模型对样本磁盘数据进行样本训练得到。上述实施例的方案通过对磁盘样本数据的样本训练建立了预测故障的模型,使得向磁盘监控系统输入待测磁盘样本数据后,能够根据预测故障的模型,对待测磁盘进行故障状态分析,避免了在分析磁盘故障时,采用对单一或固定的多个样本数据进行分析,导致的不可统计或不可量化的磁盘数据对磁盘故障预测结果的影响。It should be noted here that when using the existing technology (such as SMART, self-monitoring, analysis and reporting technology) to monitor the disk, it is possible to obtain multi-dimensional data reflecting the state of the disk, and whether the disk is based on the monitored data. Failure, or whether it will be analyzed in the short-term future, such analysis is based on the data monitored by the disk monitoring technology, but the state of the disk can also be reflected by other data volumes, which may be Yes The amount of data that cannot be detected or can not be quantified, so the present application establishes a disk prediction model, which uses a disk prediction model to analyze the fault state of the disk, wherein the disk prediction model is sampled by the Owlqn model for sample disk data. The solution of the foregoing embodiment establishes a model for predicting faults by training the samples of the disk sample data, so that after inputting the sample data of the disk to be tested to the disk monitoring system, the fault state analysis of the disk to be tested can be performed according to the model of the predicted fault, thereby avoiding When analyzing disk failures, the analysis of single or fixed multiple sample data results in the impact of non-statistical or non-quantitable disk data on disk failure prediction results.
步骤S23,采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类。In step S23, the sample disk data is subjected to binning processing by using the Buckinging technology, and the sample disk data is classified.
在上述步骤中,在对样本磁盘数据进行分箱处理时能够采用多种分箱方法达到平滑数据的目的,其中,对样本磁盘数据进行分箱的方法包括按照箱内数据的平均值平滑数据、按照箱内数据的中间值平滑数据以及按照箱内数据的边界值平滑数据。In the above steps, when the sample disk data is binned, a plurality of binning methods can be used to achieve smooth data. The method for binning the sample disk data includes smoothing the data according to the average value of the data in the box. The data is smoothed according to the intermediate value of the data in the box and the data is smoothed according to the boundary value of the data in the box.
在一种可选的实施例中,可以先将样本磁盘数据集合中的多个样本数据分至多个分箱中,在此示例中,将样本磁盘数据分至5个分箱中,将样本磁盘数据分至不同的分箱中时可将样本磁盘数据按照升序排列,然后计算每个分箱中的数据量,在将样本磁盘数据按照每个分箱中应该有的数据量分至5个分箱,然后对每个分箱中的数据进行处理,在此实施例中采用按照箱内数据的平均值平滑数据的方法进行处理,即计算得到每个分箱中数据的平均值,然后该分箱内所有数据均变为该平均值。In an optional embodiment, multiple sample data in the sample disk data set may be divided into multiple bins. In this example, the sample disk data is divided into 5 bins, and the sample disk is When the data is divided into different bins, the sample disk data can be sorted in ascending order, and then the amount of data in each bin can be calculated. The sample disk data is divided into 5 points according to the amount of data that should be in each bin. The box then processes the data in each bin. In this embodiment, the method is used to smooth the data according to the average value of the data in the box, that is, the average value of the data in each bin is calculated, and then the score is obtained. All data in the box becomes the average.
此处需要说明当时,对样本磁盘数据进行分箱处理用于将每个分箱中的数据进行平滑处理,由于每个分箱中的数据都较为相近,因此分箱处理在达到稳定平滑数据的基础上,并不会影响下一步骤中对样本磁盘数据进行训练的结果。It needs to be explained here that the sample disk data is binned for smoothing the data in each bin. Since the data in each bin is similar, the binning process achieves stable and smooth data. Based on this, it does not affect the results of training the sample disk data in the next step.
此处还需要说明的是,对样本磁盘数据进行分箱处理的方法包括上述实施例中的任意一种分箱方法,且不限于此,任何能够达到平滑或稳定数据目的方法都可用于对样本磁盘数据的分箱处理。 It should be noted that the method for performing binning processing on the sample disk data includes any one of the foregoing embodiments, and is not limited thereto, and any method capable of achieving smooth or stable data can be used for the sample. Box processing of disk data.
步骤S25,采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型。In step S25, the Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model.
在上述步骤中,对样本磁盘数据进行训练可以为将处理后的样本磁盘数据输入至Owlqn模型,其中,上述样本磁盘数据为预先知晓真实值的样本,样本的真实值可以是1或者0,用于表示样本为正样本或负样本,正样本用于表示该样本为故障磁盘,负样本表示该样本为正常磁盘。In the above steps, the sample disk data is trained to input the processed sample disk data to the Owlqn model, wherein the sample disk data is a sample that knows the true value in advance, and the true value of the sample may be 1 or 0. Indicates that the sample is a positive or negative sample, a positive sample indicates that the sample is a failed disk, and a negative sample indicates that the sample is a normal disk.
在一种可选的实施例中,每个输入的样本磁盘数据都能够从Owlqn模型中得到相应的输出值,在得到样本磁盘数据集合中每个样本相应的输出值之后,获取所有正样本的输出值,构成正样本输出值区间,同样获取所有负样本的输出值,构成负样本的输出区间,由此得到磁盘预测模型。In an optional embodiment, each input sample disk data can obtain a corresponding output value from the Owlqn model, and after obtaining the corresponding output value of each sample in the sample disk data set, all positive samples are obtained. The output value constitutes a positive sample output value interval, and the output values of all negative samples are also obtained to form an output interval of the negative sample, thereby obtaining a disk prediction model.
步骤S27,在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,确定待测磁盘是否为故障磁盘。Step S27: After receiving the disk data of the disk to be tested, use the disk prediction model to process the disk data of the disk to be tested, and determine whether the disk to be tested is a faulty disk.
需要进一步说明的是,在使用Owlqn模型对样本磁盘数据进行样本训练时,使用的是对样本磁盘数据进行分类后的样本数据,并且对分类后的样本数据进行了分箱处理,使得分类后的每个类别中的样本数据离散化,从而能够对样本磁盘的样本数据进行训练。It should be further explained that when the sample data of the sample disk data is trained by using the Owlqn model, the sample data after classifying the sample disk data is used, and the classified sample data is subjected to binning processing, so that the classified samples are processed. The sample data in each category is discretized so that the sample data of the sample disk can be trained.
在一种可选的实施例中,上述样本磁盘数据可以包括:底层数据读取错误率、启动/停止计数、重映射扇区数、通电时间累计、主轴起旋重试次数、磁盘校准重试次数、磁盘通电次数、温度以及写错误率,可以根据磁盘历史故障情况获取样本磁盘数据。例如,可以按照正负样本比例为1:5的比例进行样本获取,其中,正样本为存在故障的磁盘,负样本为不存在故障的磁盘。In an optional embodiment, the sample disk data may include: an underlying data read error rate, a start/stop count, a number of remapping sectors, a power-on time accumulation, a spindle spin retries, and a disk calibration retry. The number of times, the number of disk power-ons, the temperature, and the write error rate can be used to obtain sample disk data based on historical disk failure conditions. For example, sample acquisition can be performed at a ratio of 1:5 to positive and negative samples, where the positive sample is the faulty disk and the negative sample is the disk with no fault.
此处需要说明的是,在通过磁盘监控技术获取磁盘的样本磁盘数据时,由于预测磁盘故障的各个机构使用的磁盘并不一定相同,且由于各个机构不同温湿度等环境因素对磁盘的影响,使得不同机构的磁盘的好坏比例并不相同,为了使样本磁盘数据的训练提供更可靠的样本磁盘数据,还可以根据机构的实际上磁盘损坏情况进行获取样本磁盘数据。 It should be noted that when the disk data of the disk is obtained by the disk monitoring technology, the disks used by the various organizations that predict the disk failure are not necessarily the same, and the environmental factors such as temperature and humidity of the various mechanisms affect the disk. The ratio of the disks of different organizations is different. In order to provide more reliable sample disk data for the training of sample disk data, the sample disk data can also be obtained according to the actual disk damage of the mechanism.
由此,解决了现有技术的硬盘故障预测系统中一些容易致使硬盘故障的因素不能被采集或量化导致的预测结果不准确的技术问题Therefore, the technical problem that the prediction result of the hard disk failure prediction system in the prior art is easy to cause the failure of the hard disk cannot be collected or quantized is solved.
根据本申请上述实施例,在一种优选的方案中,样本磁盘数据为SMART磁盘数据,其中,样本磁盘数据至少包括如下四个维度上的样本数据:原始值、标准值、最差值和累积值。According to the above embodiment of the present application, in a preferred solution, the sample disk data is SMART disk data, wherein the sample disk data includes at least sample data in the following four dimensions: original value, standard value, worst value, and accumulation. value.
上述原始值为磁盘运行时的当前参数;上述标准值为正常磁盘运行时各项参数的数值;上述最差值为磁盘运行时,磁盘的各项检测参数曾出现过与正常值偏差最大的非正常值;上述累计值为磁盘的各项检测参数从磁盘使用至当前时刻的累计结果。The above-mentioned original value is the current parameter of the disk running time; the above-mentioned standard value is the value of each parameter of the normal disk running; the above-mentioned worst value is that when the disk is running, the detection parameters of the disk have the largest deviation from the normal value. Normal value; the above cumulative value is the cumulative result of each disk's detection parameters from disk usage to the current time.
在一种可选的实施例中,磁盘的各项参数可以是对磁盘的各项属性进行描述的信息,可以包括错误读取率、加电次数、重新分配扇区数、旋转重试次数、磁盘校准重试次数以及奇偶校验错误率中的一项或多项,也可以包括磁盘的其他属性信息。In an optional embodiment, the parameters of the disk may be information describing various attributes of the disk, and may include an error read rate, a power-on frequency, a number of re-allocated sectors, a number of rotation retries, One or more of the number of disk calibration retries and the parity error rate may also include other attribute information of the disk.
在一种可选的实施例中,可以采用HDTune、CrystalDiskInfo等软件获取样本磁盘数据。In an optional embodiment, the sample disk data can be obtained by using software such as HDTune or CrystalDiskInfo.
根据本申请上述实施例,在一种优选的方案中,步骤S21在通过磁盘监控技术获取磁盘的样本磁盘数据之后,上述方法还包括:According to the above embodiment of the present application, in a preferred solution, after the step S21 acquires the sample disk data of the disk by using the disk monitoring technology, the method further includes:
步骤S211,对每个维度上的样本数据进行如下任意一种或多种运算:差分运算、平方运算和分布求和运算,使得任意一个维度上的样本数据被扩展出新的维度上的样本数据。Step S211, performing any one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distribution sum operation, so that the sample data in any one dimension is expanded to the sample data in the new dimension. .
在一种可选的实施例中,对样本数据中的原始值进行差分运算、平方运算和分布求和运算,从而能得到原始值的差分值、原始值的平房值以及原始值的分布求和值,因此在知晓样本磁盘数据的原始值的基础上,得到另外四个维度的样本磁盘数据;同样可以将样本数据中的标准值、最差值以及累积值分别作上述运算,到的更多维度的样本磁盘数据。In an optional embodiment, the original value in the sample data is subjected to a difference operation, a square operation, and a distribution sum operation, thereby obtaining a difference value of the original value, a pool value of the original value, and a distribution sum of the original values. Value, so based on the original value of the sample disk data, the sample disk data of the other four dimensions is obtained; the standard value, the worst value, and the accumulated value in the sample data can also be used as the above operations, and more Sample disk data for dimensions.
需要说明的是,对样本磁盘数据进行多种运算得到更多维度的样本磁盘数据,能够提高对样本磁盘数据的利用率以及对样本磁盘数据进行训练 时,样本磁盘数据的敏感度,从而提高故障预测模型的准确度。It should be noted that performing multiple operations on the sample disk data to obtain more dimensional sample disk data can improve the utilization of the sample disk data and train the sample disk data. When the sample disk data is sensitive, the accuracy of the fault prediction model is improved.
根据本申请上述实施例,在一种优选的方案中,步骤S23采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类,包括:According to the above embodiment of the present application, in a preferred solution, step S23 uses the Bucking technology to perform binning processing on the sample disk data, and classifies the sample disk data, including:
步骤S231,确定预先划分的每个分箱的取值范围以及每个分箱对应的ID值。In step S231, the value range of each bin divided in advance and the ID value corresponding to each bin are determined.
在上述步骤中,划分的每个分箱的取值范围的目的在于确定与样本磁盘数据集合中的数据对应的分箱,即样本磁盘数据所属的范围对应的分箱即为该样本磁盘数据所属的分箱。确定每个分箱的ID值用于区分不同的分箱。In the above steps, the purpose of the range of values of each bin is to determine the bin corresponding to the data in the sample disk data set, that is, the bin corresponding to the range to which the sample disk data belongs is the data of the sample disk. Binning. Determine the ID value of each bin to distinguish between different bins.
步骤S233,通过将每个维度上的样本数据离散化至对应的分箱来对样本磁盘数据进行分类,得到每个维度上的样本数据所对应的ID值。Step S233, classifying the sample disk data by discretizing the sample data in each dimension to a corresponding bin, and obtaining an ID value corresponding to the sample data in each dimension.
在一种可选的实施例中,将样本磁盘数据分配至不同的分箱后,以分箱的ID号对分配至该分箱的数据进行替换,即将每一维度上的样本磁盘数据都被替换为该样本磁盘数据对应的分箱ID值,使得原始的样本磁盘数据的每一个维度上的数据都被替换为如上的整数值。In an optional embodiment, after the sample disk data is allocated to different bins, the data assigned to the bin is replaced by the ID number of the bin, that is, the sample disk data in each dimension is It is replaced with the bin ID value corresponding to the sample disk data, so that the data in each dimension of the original sample disk data is replaced with the above integer value.
在另一种可选的实施例中,例如在设置5个取值范围不同的分箱,且每个分箱的ID值分别为1,2,3,4,5的情况下,每个分箱中都包含不同的数据,当样本磁盘数据A落入分箱1和分箱3的取值范围内时,样本磁盘数据A的ID值可以为10100。按照上述实施例中的方案,使得每个维度上的样本数据都能够得到与之对应的ID值。In another optional embodiment, for example, when five bins having different value ranges are set, and each bin has an ID value of 1, 2, 3, 4, 5, each segment The box contains different data. When the sample disk data A falls within the range of the values of the bin 1 and the bin 3, the ID value of the sample disk data A may be 10100. According to the scheme in the above embodiment, the sample data in each dimension can be obtained with the ID value corresponding thereto.
根据本申请上述实施例,在一种优选的方案中,步骤S25,采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型,包括:According to the above embodiment of the present application, in a preferred solution, in step S25, the Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model, including:
步骤S251,Owlqn模型对每个维度上的样本数据所对应的ID值进行训练,得到每个维度上的样本数据的权重值。Step S251, the Owlqn model trains the ID values corresponding to the sample data in each dimension to obtain the weight values of the sample data in each dimension.
在上述步骤中,每个维度上的样本数据的权重值为该样本为“1”的概率,即为该样本为正样本的概率。 In the above steps, the weight value of the sample data in each dimension is the probability that the sample is "1", that is, the probability that the sample is a positive sample.
在一种可选的是实例中,将待测磁盘数据表示为
Figure PCTCN2017071699-appb-000001
其中,
Figure PCTCN2017071699-appb-000002
yi为0或1,Owlqn模型获取用于训练的样本数据后,输出每个磁盘特征数据的权重值,即每个磁盘特征数据为故障磁盘数据的概率。权重值可以通过如下公式计算得到:权重值
Figure PCTCN2017071699-appb-000003
i用于表示第i个样本,n用于表示n个维度,k表示1至n之间任意一个维度,wk用于表示k维度上的权重值,其中,w0为截距,需要注意的是,输出的权重值需要满足条件:
Figure PCTCN2017071699-appb-000004
能取得最小值,J为最优化目标函数。
In an optional example, the disk data to be tested is represented as
Figure PCTCN2017071699-appb-000001
among them,
Figure PCTCN2017071699-appb-000002
y i is 0 or 1. After obtaining the sample data for training, the Owlqn model outputs the weight value of each disk characteristic data, that is, the probability that each disk characteristic data is faulty disk data. The weight value can be calculated by the following formula: weight value
Figure PCTCN2017071699-appb-000003
i is used to represent the i-th sample, n is used to represent n dimensions, k is to represent any dimension between 1 and n, and w k is used to represent the weight value in the k-dimension, where w 0 is the intercept, requiring attention The weight value of the output needs to meet the conditions:
Figure PCTCN2017071699-appb-000004
The minimum value can be obtained, and J is the optimization objective function.
步骤S253,根据每个维度上的样本数据及对应的权重值,确定磁盘预测模型,其中,磁盘预测模型包括每个维度上的样本数据的预测结果。Step S253, determining a disk prediction model according to the sample data and the corresponding weight value in each dimension, wherein the disk prediction model includes the prediction result of the sample data in each dimension.
在一种可选的实施例中,得到待测磁盘的磁盘数据后,计算待测磁盘的预测值,其中,计算待测磁盘的预测值可以根据如下公式进行计算:
Figure PCTCN2017071699-appb-000005
上述预测值即为训练样本磁盘数据得到的预测结果,由于样本磁盘是否为故障磁盘为已知量,因此,在得到预测结果后,将正样本磁盘的预测结果和负样本磁盘的预测结果进行区分,得到故障磁盘的预测值的取值范围和正常磁盘的预测值的取值范围。
In an optional embodiment, after obtaining the disk data of the disk to be tested, calculating a predicted value of the disk to be tested, wherein calculating the predicted value of the disk to be tested may be calculated according to the following formula:
Figure PCTCN2017071699-appb-000005
The above predicted value is the predicted result obtained from the training sample disk data. Since the sample disk is a faulty disk, the known value is obtained. Therefore, after the prediction result is obtained, the prediction result of the positive sample disk and the prediction result of the negative sample disk are distinguished. , get the range of the predicted value of the failed disk and the range of the predicted value of the normal disk.
在一种可选的实施例中,向Owlqn模型输入样本数据对应的ID值,并向Owlqn模型输入ID值对应的样本磁盘的故障状态,使Owlqn模型记忆ID值以及ID值对应的磁盘故障状态,再向Owlqn模型重复输入ID值,验证Owlqn模型是否能够输出ID值对应的故障状态。In an optional embodiment, the ID value corresponding to the sample data is input to the Owlqn model, and the fault state of the sample disk corresponding to the ID value is input to the Owlqn model, so that the Owlqn model stores the ID value and the disk fault status corresponding to the ID value. Then, input the ID value repeatedly to the Owlqn model to verify whether the Owlqn model can output the fault status corresponding to the ID value.
根据本申请上述实施例,在一种优选的方案中,每个维度上的样本数据的预测结果为样本磁盘数据进行分类后得到的预测值。According to the above embodiment of the present application, in a preferred embodiment, the prediction result of the sample data in each dimension is a predicted value obtained by classifying the sample disk data.
根据本申请上述实施例,在一种优选的方案中,步骤S27在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,确定待测磁盘是否为故障磁盘,包括:According to the above embodiment of the present application, in a preferred solution, after receiving the disk data of the disk to be tested, the step S27 uses the disk prediction model to process the disk data of the disk to be tested, and determines whether the disk to be tested is a failed disk. include:
步骤S271,接收到待测磁盘的磁盘数据之后,将待测磁盘的磁盘数据离散化至对应的分箱,得到待测磁盘的磁盘数据所对应的ID值。 Step S271: After receiving the disk data of the disk to be tested, the disk data of the disk to be tested is discretized to a corresponding bin, and the ID value corresponding to the disk data of the disk to be tested is obtained.
在上述步骤中,将待测磁盘的磁盘数据离散化至对应的分箱,得到待测磁盘的磁盘数据所对应的ID值,可以采用上实施例中的步骤S231至步骤S233中提出的方案实施。In the above step, the disc data of the disk to be tested is discretized to the corresponding sub-box, and the ID value corresponding to the disk data of the disk to be tested is obtained, which can be implemented by using the solution in step S231 to step S233 in the above embodiment. .
步骤S273,根据待测磁盘的磁盘数据所对应的ID值确定待测磁盘的磁盘数据的权重值。Step S273: Determine a weight value of the disk data of the disk to be tested according to the ID value corresponding to the disk data of the disk to be tested.
在一种可选的是实例中,将待测磁盘数据表示为
Figure PCTCN2017071699-appb-000006
其中,
Figure PCTCN2017071699-appb-000007
yi为0或1,owlqn模型获取用于训练的样本数据后,输出每个磁盘特征数据的权重值,即每个磁盘特征数据为故障磁盘数据的概率。
Figure PCTCN2017071699-appb-000008
In an optional example, the disk data to be tested is represented as
Figure PCTCN2017071699-appb-000006
among them,
Figure PCTCN2017071699-appb-000007
y i is 0 or 1. After the owlqn model obtains the sample data for training, the weight value of each disk characteristic data is output, that is, the probability that each disk characteristic data is faulty disk data.
Figure PCTCN2017071699-appb-000008
步骤S275,根据待测磁盘的磁盘数据的权重值从磁盘预测模型中确定待测磁盘是否为故障磁盘。Step S275: Determine, according to the weight value of the disk data of the disk to be tested, whether the disk to be tested is a failed disk from the disk prediction model.
在一种可选的实施例中,得到待测磁盘的磁盘数据后,计算待测磁盘的预测值,其中,计算待测磁盘的预测值可以根据如下公式进行计算:
Figure PCTCN2017071699-appb-000009
得到待测磁盘的预测值后,将待测磁盘的预测值与训练样本磁盘数据得到的正样本的取值范围和负样本的取值范围进行比对,若待测磁盘的预测值落入正样本的取值范围,则可以认为该待测磁盘为故障磁盘,若待测磁盘的预测值落入负样本的取值范围,则可以认为该待测磁盘为正常磁盘。
In an optional embodiment, after obtaining the disk data of the disk to be tested, calculating a predicted value of the disk to be tested, wherein calculating the predicted value of the disk to be tested may be calculated according to the following formula:
Figure PCTCN2017071699-appb-000009
After obtaining the predicted value of the disk to be tested, compare the predicted value of the disk to be tested with the value range of the positive sample obtained by the training sample disk data and the value range of the negative sample, if the predicted value of the disk to be tested falls into the positive value If the value of the sample is a faulty disk, the disk to be tested is considered to be a faulty disk. If the predicted value of the disk to be tested falls within the range of the negative sample, the disk to be tested can be considered as a normal disk.
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,因为依据本发明,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本发明所必须的。It should be noted that, for the foregoing method embodiments, for the sake of simple description, they are all expressed as a series of action combinations, but those skilled in the art should understand that the present invention is not limited by the described action sequence. Because certain steps may be performed in other sequences or concurrently in accordance with the present invention. In addition, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the present invention.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到根据上述实施例的方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理 解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本发明各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the method according to the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course, by hardware, but in many cases, the former is A better implementation. Based on this rationale The solution of the technical solution of the present invention in essence or contribution to the prior art can be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk, CD). A number of instructions are included to cause a terminal device (which may be a cell phone, computer, server, or network device, etc.) to perform the methods described in various embodiments of the present invention.
如图3所示,提供了一种磁盘的故障预测方法,该方法可以包括如下步骤S31至步骤S37:As shown in FIG. 3, a method for predicting a fault of a magnetic disk is provided. The method may include the following steps S31 to S37:
S31,获取样本磁盘的样本数据。S31. Obtain sample data of the sample disk.
在上述步骤中,样本磁盘的样本数据可以是SMART磁盘数据。具体的,在上述步骤中,可以通过HDTune、CrystalDiskInfo等软件获取样本磁盘数据。In the above steps, the sample data of the sample disk may be SMART disk data. Specifically, in the above steps, the sample disk data can be obtained by using software such as HDTune or CrystalDiskInfo.
S32,对样本数据进行差分运算。S32, performing differential operations on the sample data.
具体的,在上述步骤中,差分运算指磁盘在某一时刻的特征数据与过该磁盘在24小时之前的特征数据做差运算得到的值。Specifically, in the above steps, the difference operation refers to a value obtained by performing difference calculation between the feature data of the disk at a certain time and the feature data of the disk before 24 hours.
S33,对差分运算得到的结果进行分布求和和/或平方运算。S33, performing a distribution summation and/or a square operation on the result obtained by the difference operation.
上述步骤对每个维度上的样本数据进行如下任意一种或多种运算:差分运算、平方运算和分布求和运算,使得任意一个维度上的样本数据被扩展出新的维度上的样本数据。The above steps perform any one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distribution sum operation, so that the sample data in any one dimension is expanded to the sample data in the new dimension.
S34,得到训练和预测使用的数据。S34, obtaining data for training and prediction use.
S35,采用分箱进行离散化。S35, using a bin to discretize.
上述步骤划分的每个分箱的取值范围的目的在于确定与样本磁盘数据集合中的数据对应的分箱,即样本磁盘数据所属的范围对应的分箱即为该样本磁盘数据所属的分箱。确定每个分箱的ID值用于区分不同的分箱,并对每个分箱中的数据进行离散化处理。The purpose of the value range of each bin divided by the above steps is to determine the bin corresponding to the data in the sample disk data set, that is, the bin corresponding to the range to which the sample disk data belongs is the bin to which the sample disk data belongs. . Determine the ID value of each bin to distinguish different bins and discretize the data in each bin.
S36,通过Owlqn模型进行训练。S36, training through the Owlqn model.
在上述步骤中,通过Owlqn模型对样本磁盘的样本数据进行训练得到 磁盘预测模型。In the above steps, the sample data of the sample disk is trained by the Owlqn model. Disk prediction model.
S37,得到磁盘的预测结果。S37, the predicted result of the disk is obtained.
在上述步骤中,使用上述步骤构建的磁盘预测模型对待测磁盘进行预测,得到预测值后,与模型中的预测取值范围进行比对,得到待测磁盘的预测结果。In the above steps, the disk prediction model constructed by the above steps is used to predict the disk to be tested, and after obtaining the predicted value, the predicted value range in the model is compared to obtain the prediction result of the disk to be tested.
实施例2Example 2
根据本发明实施例,还提供了一种用于实施上述磁盘的故障预测方法的装置,如图4所示,该装置包括:获取模块40、分类模块42、训练模块44和确定模块46。According to an embodiment of the present invention, there is also provided an apparatus for implementing the above-described failure prediction method for a magnetic disk. As shown in FIG. 4, the apparatus includes: an acquisition module 40, a classification module 42, a training module 44, and a determination module 46.
其中,获取模块40,用于通过磁盘监控技术获取磁盘的样本磁盘数据,其中,样本磁盘数据包括多个维度上的样本数据;分类模块42,用于采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类;训练模块44,用于采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型;确定模块46,用于在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,确定待测磁盘是否为故障磁盘。The obtaining module 40 is configured to obtain sample disk data of the disk by using a disk monitoring technology, where the sample disk data includes sample data in multiple dimensions, and the classifying module 42 is configured to perform binning processing on the sample disk data by using a bucketing technology. And the training module 44 is configured to perform sample training on the classified sample disk data by using the Owlqn model to obtain a disk prediction model, and the determining module 46 is configured to: after receiving the disk data of the disk to be tested, Use the disk prediction model to process the disk data of the disk to be tested to determine whether the disk to be tested is a failed disk.
此处需要说明的是,上述获取模块40、分类模块42、训练模块44和确定模块46对应于实施例一种的步骤S21至步骤S27所实现的实例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中。It should be noted that the example and the application scenario implemented by the step S21 to the step S27 of the first embodiment are the same as the application scenario, but are not limited to the above embodiment. A public content. It should be noted that the above module can be operated as part of the device in the computer terminal 10 provided in the first embodiment.
根据本申请上述实施例,在一种优选的方案中,样本磁盘数据为SMART磁盘数据,其中,样本磁盘数据至少包括如下四个维度上的样本数据:原始值、标准值、最差值和累积值。According to the above embodiment of the present application, in a preferred solution, the sample disk data is SMART disk data, wherein the sample disk data includes at least sample data in the following four dimensions: original value, standard value, worst value, and accumulation. value.
根据本申请上述实施例,在一种优选的方案中,结合图5所示,上述装置还包括: According to the above embodiment of the present application, in a preferred solution, as shown in FIG. 5, the device further includes:
运算模块50,用于对每个维度上的样本数据进行如下任意一种或多种运算:差分运算、平方运算和分布求和运算,使得任意一个维度上的样本数据被扩展出新的维度上的样本数据。The operation module 50 is configured to perform any one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distribution sum operation, so that the sample data in any one dimension is expanded into a new dimension. Sample data.
此处需要说明的是,上述获取模块50对应于实施例一种的步骤S211所实现的实例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中。It should be noted that the example and the application scenario implemented by the above-mentioned obtaining module 50 corresponding to step S211 of the embodiment are the same, but are not limited to the content disclosed in the first embodiment. It should be noted that the above module can be operated as part of the device in the computer terminal 10 provided in the first embodiment.
根据本申请上述实施例,在一种优选的方案中,结合图6所示,上述分类模块42包括:According to the above embodiment of the present application, in a preferred solution, as shown in FIG. 6, the classification module 42 includes:
第一确定子模块60,用于确定预先划分的每个分箱的取值范围以及每个分箱对应的ID值;分类子模块62,用于通过将每个维度上的样本数据离散化至对应的分箱来对样本磁盘数据进行分类,得到每个维度上的样本数据所对应的ID值。a first determining sub-module 60, configured to determine a range of values of each binning pre-divided and an ID value corresponding to each bin; a sub-module 62 for discretizing sample data in each dimension to The corresponding bins are used to classify the sample disk data to obtain the ID value corresponding to the sample data in each dimension.
此处需要说明的是,上述第一确定子模块60和分类子模块62对应于实施例一种的步骤S231和步骤S233所实现的实例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中。It should be noted that the first determining sub-module 60 and the categorizing sub-module 62 are the same as the example and the application scenario implemented by the step S231 and the step S233 of the embodiment, but are not limited to the one disclosed in the first embodiment. content. It should be noted that the above module can be operated as part of the device in the computer terminal 10 provided in the first embodiment.
根据本申请上述实施例,在一种优选的方案中,结合图7所示,上述训练模块44包括:According to the above embodiment of the present application, in a preferred solution, as shown in FIG. 7, the training module 44 includes:
训练子模块70,用于Owlqn模型对每个维度上的样本数据所对应的ID值进行训练,得到每个维度上的样本数据的权重值;第二确定子模块72,用于根据每个维度上的样本数据及对应的权重值,确定磁盘预测模型,其中,磁盘预测模型包括每个维度上的样本数据的预测结果。The training sub-module 70 is configured to train the Owlqn model to match the ID value corresponding to the sample data in each dimension to obtain the weight value of the sample data in each dimension; and the second determining sub-module 72 is configured to use each dimension according to each dimension The disk prediction model is determined by the sample data and the corresponding weight value, wherein the disk prediction model includes prediction results of the sample data in each dimension.
此处需要说明的是,上述训练子模块70和第二确定子模块72对应于实施例一种的步骤S251和步骤S253所实现的实例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中。 It should be noted that the training sub-module 70 and the second determining sub-module 72 are the same as the example and the application scenario implemented by the step S251 and the step S253 of the embodiment, but are not limited to the one disclosed in the first embodiment. content. It should be noted that the above module can be operated as part of the device in the computer terminal 10 provided in the first embodiment.
根据本申请上述实施例,在一种优选的方案中,每个维度上的样本数据的预测结果为样本磁盘数据进行分类后得到的预测值。According to the above embodiment of the present application, in a preferred embodiment, the prediction result of the sample data in each dimension is a predicted value obtained by classifying the sample disk data.
根据本申请上述实施例,在一种优选的方案中,结合图8所示,上述确定模块46还包括:According to the above embodiment of the present application, in a preferred solution, as shown in FIG. 8, the determining module 46 further includes:
离散模块80,用于接收到待测磁盘的磁盘数据之后,将待测磁盘的磁盘数据离散化至对应的分箱,得到待测磁盘的磁盘数据所对应的ID值;第三确定子模块82,用于根据待测磁盘的磁盘数据所对应的ID值确定待测磁盘的磁盘数据的权重值;第四确定子模块84,用于根据待测磁盘的磁盘数据的权重值从磁盘预测模型中确定待测磁盘是否为故障磁盘。The discretization module 80 is configured to discretize the disk data of the disk to be tested to the corresponding sub-box, and obtain the ID value corresponding to the disk data of the disk to be tested; the third determining sub-module 82 Determining, according to the ID value corresponding to the disk data of the disk to be tested, the weight value of the disk data of the disk to be tested; the fourth determining sub-module 84, configured to predict the model from the disk according to the weight value of the disk data of the disk to be tested. Determine if the disk to be tested is a failed disk.
此处需要说明的是,上述离散模块80、第三确定子模块82和第四确定子模块84对应于实施例一种的步骤S271和步骤S275所实现的实例和应用场景相同,但不限于上述实施例一所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例一提供的计算机终端10中。It should be noted that the above-mentioned discrete module 80, the third determining sub-module 82, and the fourth determining sub-module 84 are the same as the examples and application scenarios implemented in step S271 and step S275 of the embodiment, but are not limited to the above. The content disclosed in the first embodiment. It should be noted that the above module can be operated as part of the device in the computer terminal 10 provided in the first embodiment.
实施例3Example 3
本发明的实施例可以提供一种计算机终端,该计算机终端可以是计算机终端群中的任意一个计算机终端设备。可选地,在本实施例中,上述计算机终端也可以替换为移动终端等终端设备。Embodiments of the present invention may provide a computer terminal, which may be any one of computer terminal groups. Optionally, in this embodiment, the foregoing computer terminal may also be replaced with a terminal device such as a mobile terminal.
可选地,在本实施例中,上述计算机终端可以位于计算机网络的多个网络设备中的至少一个网络设备。Optionally, in this embodiment, the computer terminal may be located in at least one network device of the plurality of network devices of the computer network.
在本实施例中,上述计算机终端可以执行磁盘的故障预测方法中以下步骤的程序代码:通过磁盘监控技术获取磁盘的样本磁盘数据,其中,样本磁盘数据包括多个维度上的样本数据;采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类;采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型;在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,确定待测磁盘是否为故障磁盘。 In this embodiment, the computer terminal may execute the program code of the following steps in the method for predicting the fault of the disk: acquiring the sample disk data of the disk by using the disk monitoring technology, wherein the sample disk data includes sample data in multiple dimensions; using Bucking The technology performs binning processing on the sample disk data, classifies the sample disk data, and uses the Owlqn model to perform sample training on the classified sample disk data to obtain a disk prediction model; after receiving the disk data of the disk to be tested, using the disk prediction The model processes the disk data of the disk to be tested to determine whether the disk to be tested is a failed disk.
可选地,图9是根据本发明实施例的一种计算机终端的结构框图。如图9所示,该计算机终端A可以包括:一个或多个(图中仅示出一个)处理器91、存储器93、以及传输装置95。Optionally, FIG. 9 is a structural block diagram of a computer terminal according to an embodiment of the present invention. As shown in FIG. 9, the computer terminal A may include one or more (only one shown in the figure) processor 91, memory 93, and transmission device 95.
其中,存储器可用于存储软件程序以及模块,如本发明实施例中的磁盘的故障预测方法和装置对应的程序指令/模块,处理器通过运行存储在存储器内的软件程序以及模块,从而执行各种功能应用以及数据处理,即实现上述的磁盘的故障预测方法。存储器可包括高速随机存储器,还可以包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器可进一步包括相对于处理器远程设置的存储器,这些远程存储器可以通过网络连接至终端A。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory can be used to store the software program and the module, such as the fault prediction method of the disk and the program instruction/module corresponding to the device in the embodiment of the present invention, and the processor executes various programs by running the software program and the module stored in the memory. Functional application and data processing, that is, the above-described method for predicting the failure of the disk. The memory may include a high speed random access memory, and may also include non-volatile memory such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory. In some examples, the memory can further include memory remotely located relative to the processor, which can be connected to terminal A via a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
处理器可以通过传输装置调用存储器存储的信息及应用程序,以执行下述步骤:通过磁盘监控技术获取磁盘的样本磁盘数据,其中,样本磁盘数据包括多个维度上的样本数据;采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类;采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型;在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,确定待测磁盘是否为故障磁盘。The processor may call the memory stored information and the application program by the transmission device to perform the following steps: acquiring the sample disk data of the disk by using a disk monitoring technology, where the sample disk data includes sample data in multiple dimensions; using the Bucking technology The sample disk data is subjected to binning processing to classify the sample disk data; the Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model; after receiving the disk data of the disk to be tested, the disk prediction model is used to treat The disk data of the disk is measured for processing to determine whether the disk to be tested is a failed disk.
可选的,上述处理器还可以执行如下步骤的程序代码:样本磁盘数据为SMART磁盘数据,其中,样本磁盘数据至少包括如下四个维度上的样本数据:原始值、标准值、最差值和累积值。Optionally, the foregoing processor may further execute the following program code: the sample disk data is SMART disk data, wherein the sample disk data includes at least sample data in the following four dimensions: original value, standard value, worst value, and Cumulative value.
可选的,上述处理器还可以执行如下步骤的程序代码:对每个维度上的样本数据进行如下任意一种或多种运算:差分运算、平方运算和分布求和运算,使得任意一个维度上的样本数据被扩展出新的维度上的样本数据。Optionally, the foregoing processor may further execute the following program code: perform any one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distributed sum operation, so that any one dimension The sample data is expanded out of the sample data on the new dimension.
可选的,上述处理器还可以执行如下步骤的程序代码:确定预先划分的每个分箱的取值范围以及每个分箱对应的ID值;通过将每个维度上的样本数据离散化至对应的分箱来对样本磁盘数据进行分类,得到每个维度上的样本数据所对应的ID值。 Optionally, the foregoing processor may further execute the following program code: determining a range of values of each of the pre-divided bins and an ID value corresponding to each bin; and discretizing the sample data in each dimension to The corresponding bins are used to classify the sample disk data to obtain the ID value corresponding to the sample data in each dimension.
可选的,上述处理器还可以执行如下步骤的程序代码:Owlqn模型对每个维度上的样本数据所对应的ID值进行训练,得到每个维度上的样本数据的权重值;根据每个维度上的样本数据及对应的权重值,确定磁盘预测模型,其中,磁盘预测模型包括每个维度上的样本数据的预测结果。Optionally, the foregoing processor may further execute the following program code: the Owlqn model trains the ID value corresponding to the sample data in each dimension, and obtains the weight value of the sample data in each dimension; according to each dimension The disk prediction model is determined by the sample data and the corresponding weight value, wherein the disk prediction model includes prediction results of the sample data in each dimension.
可选的,上述处理器还可以执行如下步骤的程序代码:每个维度上的样本数据的预测结果为样本磁盘数据进行分类后得到的预测值。Optionally, the foregoing processor may further execute program code of the following steps: the prediction result of the sample data in each dimension is a predicted value obtained by classifying the sample disk data.
可选的,上述处理器还可以执行如下步骤的程序代码:接收到待测磁盘的磁盘数据之后,将待测磁盘的磁盘数据离散化至对应的分箱,得到待测磁盘的磁盘数据所对应的ID值;根据待测磁盘的磁盘数据所对应的ID值确定待测磁盘的磁盘数据的权重值;根据待测磁盘的磁盘数据的权重值从磁盘预测模型中确定待测磁盘是否为故障磁盘。Optionally, the foregoing processor may further execute the following program code: after receiving the disk data of the disk to be tested, discretizing the disk data of the disk to be tested to a corresponding bin, and obtaining the disk data of the disk to be tested. ID value; determining the weight value of the disk data of the disk to be tested according to the ID value corresponding to the disk data of the disk to be tested; determining whether the disk to be tested is a fault disk from the disk prediction model according to the weight value of the disk data of the disk to be tested .
在本发明实施例中,采用通过磁盘监控技术获取磁盘的样本磁盘数据,其中,样本磁盘数据包括多个维度上的样本数据;采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类;采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型的方式,通过在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,达到了确定待测磁盘是否为故障磁盘的目的,从而实现了预测磁盘故障的技术效果,进而解决了现有技术的硬盘故障预测系统中一些容易致使硬盘故障的因素不能被采集或量化导致的预测结果不准确的技术问题。In the embodiment of the present invention, the sample disk data of the disk is obtained by using a disk monitoring technology, wherein the sample disk data includes sample data in multiple dimensions; the sample disk data is binned by the Bucking technology, and the sample disk data is processed. Classification; using the Owlqn model to perform sample training on the classified sample disk data to obtain a disk prediction model. After receiving the disk data of the disk to be tested, the disk prediction data is processed using the disk prediction model, and the disk data is processed. The purpose of determining whether the disk to be tested is a faulty disk is to achieve the technical effect of predicting the disk failure, thereby solving the problem that some factors in the prior art hard disk fault prediction system that are likely to cause the hard disk failure cannot be collected or quantized. Accurate technical issues.
本领域普通技术人员可以理解,图9所示的结构仅为示意,计算机终端也可以是智能手机(如Android手机、iOS手机等)、平板电脑、掌声电脑以及移动互联网设备(Mobile Internet Devices,MID)、PAD等终端设备。图9其并不对上述电子装置的结构造成限定。例如,计算机终端A还可包括比图9中所示更多或者更少的组件(如网络接口、显示装置等),或者具有与图9所示不同的配置。A person skilled in the art can understand that the structure shown in FIG. 9 is only an illustration, and the computer terminal can also be a smart phone (such as an Android mobile phone, an iOS mobile phone, etc.), a tablet computer, an applause computer, and a mobile Internet device (Mobile Internet Devices, MID). ), PAD and other terminal devices. FIG. 9 does not limit the structure of the above electronic device. For example, computer terminal A may also include more or fewer components (such as a network interface, display device, etc.) than shown in FIG. 9, or have a different configuration than that shown in FIG.
本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令终端设备相关的硬件来完成,该程序可以存 储于一计算机可读存储介质中,存储介质可以包括:闪存盘、只读存储器(Read-Only Memory,ROM)、随机存取器(Random Access Memory,RAM)、磁盘或光盘等。A person of ordinary skill in the art may understand that all or part of the steps of the foregoing embodiments may be completed by a program to instruct terminal device related hardware, and the program may be saved. The storage medium may include a flash disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disk, and the like.
实施例4Example 4
本发明的实施例还提供了一种存储介质。可选地,在本实施例中,上述存储介质可以用于保存上述实施例一所提供的一种磁盘的故障预测方法所执行的程序代码。Embodiments of the present invention also provide a storage medium. Optionally, in the embodiment, the storage medium may be used to save the program code executed by the fault prediction method of the disk provided in the first embodiment.
可选地,在本实施例中,上述存储介质可以位于计算机网络中计算机终端群中的任意一个计算机终端中,或者位于移动终端群中的任意一个移动终端中。Optionally, in this embodiment, the foregoing storage medium may be located in any one of the computer terminal groups in the computer network, or in any one of the mobile terminal groups.
可选地,在本实施例中,存储介质被设置为存储用于执行以下步骤的程序代码:通过磁盘监控技术获取磁盘的样本磁盘数据,其中,样本磁盘数据包括多个维度上的样本数据;采用Bucketing技术对样本磁盘数据进行分箱处理,对样本磁盘数据进行分类;采用Owlqn模型对分类后的样本磁盘数据进行样本训练,得到磁盘预测模型;在接收到待测磁盘的磁盘数据之后,使用磁盘预测模型对待测磁盘的磁盘数据进行处理,确定待测磁盘是否为故障磁盘。Optionally, in this embodiment, the storage medium is configured to store program code for performing the following steps: acquiring sample disk data of the disk by using a disk monitoring technology, wherein the sample disk data includes sample data in multiple dimensions; The sample disk data is binned by the Bucking technology to classify the sample disk data. The Owlqn model is used to perform sample training on the classified sample disk data to obtain a disk prediction model. After receiving the disk data of the disk to be tested, the disk data is used. The disk prediction model processes the disk data of the disk to be tested to determine whether the disk to be tested is a failed disk.
可选地,上述存储介质还被设置为存储用于执行以下步骤的程序代码:样本磁盘数据为SMART磁盘数据,其中,样本磁盘数据至少包括如下四个维度上的样本数据:原始值、标准值、最差值和累积值。Optionally, the storage medium is further configured to store program code for performing the following steps: the sample disk data is SMART disk data, wherein the sample disk data includes at least sample data in the following four dimensions: original value, standard value , the worst value and the cumulative value.
可选地,上述存储介质还被设置为存储用于执行以下步骤的程序代码:对每个维度上的样本数据进行如下任意一种或多种运算:差分运算、平方运算和分布求和运算,使得任意一个维度上的样本数据被扩展出新的维度上的样本数据。Optionally, the storage medium is further configured to store program code for performing the following steps: performing one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distribution sum operation, The sample data in any one dimension is expanded to the sample data on the new dimension.
可选地,上述存储介质还被设置为存储用于执行以下步骤的程序代码:确定预先划分的每个分箱的取值范围以及每个分箱对应的ID值;通过将 每个维度上的样本数据离散化至对应的分箱来对样本磁盘数据进行分类,得到每个维度上的样本数据所对应的ID值。Optionally, the foregoing storage medium is further configured to store program code for performing the following steps: determining a range of values of each of the pre-divided bins and an ID value corresponding to each bin; The sample data in each dimension is discretized to the corresponding bins to classify the sample disk data to obtain the ID value corresponding to the sample data in each dimension.
可选地,上述存储介质还被设置为存储用于执行以下步骤的程序代码:Owlqn模型对每个维度上的样本数据所对应的ID值进行训练,得到每个维度上的样本数据的权重值;根据每个维度上的样本数据及对应的权重值,确定磁盘预测模型,其中,磁盘预测模型包括每个维度上的样本数据的预测结果。Optionally, the storage medium is further configured to store program code for performing the following steps: the Owlqn model trains the ID value corresponding to the sample data in each dimension to obtain the weight value of the sample data in each dimension. The disk prediction model is determined based on the sample data and the corresponding weight values in each dimension, wherein the disk prediction model includes prediction results of the sample data in each dimension.
可选地,上述存储介质还被设置为存储用于执行以下步骤的程序代码:每个维度上的样本数据的预测结果为样本磁盘数据进行分类后得到的预测值。Optionally, the storage medium is further configured to store program code for performing the following steps: the prediction result of the sample data on each dimension is a predicted value obtained by classifying the sample disk data.
可选地,上述存储介质还被设置为存储用于执行以下步骤的程序代码:接收到待测磁盘的磁盘数据之后,将待测磁盘的磁盘数据离散化至对应的分箱,得到待测磁盘的磁盘数据所对应的ID值;根据待测磁盘的磁盘数据所对应的ID值确定待测磁盘的磁盘数据的权重值;根据待测磁盘的磁盘数据的权重值从磁盘预测模型中确定待测磁盘是否为故障磁盘。Optionally, the foregoing storage medium is further configured to store program code for performing the following steps: after receiving the disk data of the disk to be tested, discretizing the disk data of the disk to be tested to a corresponding bin, and obtaining the disk to be tested. The ID value corresponding to the disk data; determining the weight value of the disk data of the disk to be tested according to the ID value corresponding to the disk data of the disk to be tested; determining the test value from the disk prediction model according to the weight value of the disk data of the disk to be tested Whether the disk is a failed disk.
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the embodiments of the present invention are merely for the description, and do not represent the advantages and disadvantages of the embodiments.
在本发明的上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above-mentioned embodiments of the present invention, the descriptions of the various embodiments are different, and the parts that are not detailed in a certain embodiment can be referred to the related descriptions of other embodiments.
在本申请所提供的几个实施例中,应该理解到,所揭露的技术内容,可通过其它的方式实现。其中,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,单元或模块的间接耦合或通信连接,可以是电性或其它的形式。In the several embodiments provided by the present application, it should be understood that the disclosed technical contents may be implemented in other manners. The device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner. For example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, unit or module, and may be electrical or otherwise.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地 方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place. Square, or it can be distributed to multiple network elements. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium. A number of instructions are included to cause a computer device (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention. The foregoing storage medium includes: a U disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, and the like. .
以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。 The above description is only a preferred embodiment of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It should be considered as the scope of protection of the present invention.

Claims (14)

  1. 一种磁盘的故障预测方法,其特征在于,包括:A method for predicting a fault of a disk, comprising:
    通过磁盘监控技术获取磁盘的样本磁盘数据,其中,所述样本磁盘数据包括多个维度上的样本数据;Obtaining sample disk data of the disk by using a disk monitoring technology, wherein the sample disk data includes sample data in multiple dimensions;
    采用Bucketing技术对所述样本磁盘数据进行分箱处理,对所述样本磁盘数据进行分类;Performing a binning process on the sample disk data by using a bucketing technique to classify the sample disk data;
    采用Owlqn模型对所述分类后的样本磁盘数据进行样本训练,得到磁盘预测模型;Performing sample training on the classified sample disk data by using the Owlqn model to obtain a disk prediction model;
    在接收到待测磁盘的磁盘数据之后,使用所述磁盘预测模型对所述待测磁盘的磁盘数据进行处理,确定所述待测磁盘是否为故障磁盘。After the disk data of the disk to be tested is received, the disk data of the disk to be tested is processed by using the disk prediction model to determine whether the disk to be tested is a faulty disk.
  2. 根据权利要求1所述的方法,其特征在于,所述样本磁盘数据为SMART磁盘数据,其中,所述样本磁盘数据至少包括如下四个维度上的样本数据:原始值、标准值、最差值和累积值。The method according to claim 1, wherein the sample disk data is SMART disk data, wherein the sample disk data includes at least sample data in four dimensions: original value, standard value, and worst value. And cumulative values.
  3. 根据权利要求2所述的方法,其特征在于,在通过磁盘监控技术获取磁盘的样本磁盘数据之后,所述方法还包括:The method of claim 2, after the obtaining the sample disk data of the disk by the disk monitoring technology, the method further comprises:
    对每个维度上的样本数据进行如下任意一种或多种运算:差分运算、平方运算和分布求和运算,使得任意一个维度上的样本数据被扩展出新的维度上的样本数据。The sample data in each dimension is subjected to any one or more of the following operations: a difference operation, a square operation, and a distribution sum operation, so that the sample data in any one dimension is expanded to the sample data in the new dimension.
  4. 根据权利要求1至3中任意一项所述的方法,其特征在于,采用Bucketing技术对所述样本磁盘数据进行分箱处理,对所述样本磁盘数据进行分类,包括:The method according to any one of claims 1 to 3, wherein the sample disk data is subjected to binning processing by using a bucketing technique, and the sample disk data is classified, including:
    确定预先划分的每个分箱的取值范围以及每个分箱对应的ID值;Determining the range of values of each bin that is pre-divided and the ID value corresponding to each bin;
    通过将每个维度上的样本数据离散化至对应的分箱来对所述样本磁盘数据进行分类,得到所述每个维度上的样本数据所对应的ID值。The sample disk data is classified by discretizing the sample data in each dimension to a corresponding bin, and the ID value corresponding to the sample data in each dimension is obtained.
  5. 根据权利要求4所述的方法,其特征在于,采用Owlqn模型对所述分 类后的样本磁盘数据进行样本训练,得到磁盘预测模型,包括:The method of claim 4 wherein said score is determined using an Owlqn model The sample disk data after the class is sampled and trained to obtain a disk prediction model, including:
    所述Owlqn模型对所述每个维度上的样本数据所对应的ID值进行训练,得到每个维度上的样本数据的权重值;The Owlqn model trains ID values corresponding to the sample data in each dimension to obtain weight values of sample data in each dimension;
    根据所述每个维度上的样本数据及对应的权重值,确定所述磁盘预测模型,其中,所述磁盘预测模型包括所述每个维度上的样本数据的预测结果。The disk prediction model is determined according to sample data in each dimension and a corresponding weight value, wherein the disk prediction model includes prediction results of sample data in each dimension.
  6. 根据权利要求5所述的方法,其特征在于,所述每个维度上的样本数据的预测结果为所述样本磁盘数据进行分类后得到的预测值。The method according to claim 5, wherein the prediction result of the sample data in each dimension is a predicted value obtained by classifying the sample disk data.
  7. 根据权利要求6所述的方法,其特征在于,在接收到待测磁盘的磁盘数据之后,使用所述磁盘预测模型对所述待测磁盘的磁盘数据进行处理,确定所述待测磁盘是否为故障磁盘,包括:The method according to claim 6, wherein after receiving the disk data of the disk to be tested, the disk data of the disk to be tested is processed by using the disk prediction model to determine whether the disk to be tested is Faulty disk, including:
    接收到所述待测磁盘的磁盘数据之后,将所述待测磁盘的磁盘数据离散化至对应的分箱,得到所述待测磁盘的磁盘数据所对应的ID值;After the disk data of the disk to be tested is received, the disk data of the disk to be tested is discretized to a corresponding bin, and the ID value corresponding to the disk data of the disk to be tested is obtained;
    根据所述待测磁盘的磁盘数据所对应的ID值确定所述待测磁盘的磁盘数据的权重值;Determining, according to an ID value corresponding to the disk data of the disk to be tested, a weight value of the disk data of the disk to be tested;
    根据所述待测磁盘的磁盘数据的权重值从所述磁盘预测模型中确定所述待测磁盘是否为故障磁盘。Determining, according to the weight value of the disk data of the disk to be tested, whether the disk to be tested is a failed disk from the disk prediction model.
  8. 一种磁盘的故障预测装置,其特征在于,包括:A fault prediction device for a magnetic disk, comprising:
    获取模块,用于通过磁盘监控技术获取磁盘的样本磁盘数据,其中,所述样本磁盘数据包括多个维度上的样本数据;An obtaining module, configured to acquire sample disk data of a disk by using a disk monitoring technology, where the sample disk data includes sample data in multiple dimensions;
    分类模块,用于采用Bucketing技术对所述样本磁盘数据进行分箱处理,对所述样本磁盘数据进行分类;a classifying module, configured to perform binning processing on the sample disk data by using a bucketing technology, and classify the sample disk data;
    训练模块,用于采用Owlqn模型对所述分类后的样本磁盘数据进行样本训练,得到磁盘预测模型;a training module, configured to perform sample training on the classified sample disk data by using an Owlqn model to obtain a disk prediction model;
    确定模块,用于在接收到待测磁盘的磁盘数据之后,使用所述磁盘预测模型对所述待测磁盘的磁盘数据进行处理,确定所述待测磁盘 是否为故障磁盘。Determining a module, after receiving the disk data of the disk to be tested, using the disk prediction model to process the disk data of the disk to be tested, and determining the disk to be tested Whether it is a failed disk.
  9. 根据权利要求8所述的装置,其特征在于,所述样本磁盘数据为SMART磁盘数据,其中,所述样本磁盘数据至少包括如下四个维度上的样本数据:原始值、标准值、最差值和累积值。The apparatus according to claim 8, wherein said sample disk data is SMART disk data, and wherein said sample disk data includes at least sample data in four dimensions: original value, standard value, and worst value. And cumulative values.
  10. 根据权利要求9所述的装置,其特征在于,所述装置还包括:The device according to claim 9, wherein the device further comprises:
    运算模块,用于对每个维度上的样本数据进行如下任意一种或多种运算:差分运算、平方运算和分布求和运算,使得任意一个维度上的样本数据被扩展出新的维度上的样本数据。The operation module is configured to perform any one or more of the following operations on the sample data in each dimension: a difference operation, a square operation, and a distribution sum operation, so that the sample data in any one dimension is expanded into a new dimension. sample.
  11. 根据权利要求8至10中任意一项所述的装置,其特征在于,所述分类模块包括:The apparatus according to any one of claims 8 to 10, wherein the classification module comprises:
    第一确定子模块,用于确定预先划分的每个分箱的取值范围以及每个分箱对应的ID值;a first determining sub-module, configured to determine a value range of each binning pre-divided and an ID value corresponding to each bin;
    分类子模块,用于通过将每个维度上的样本数据离散化至对应的分箱来对所述样本磁盘数据进行分类,得到所述每个维度上的样本数据所对应的ID值。And a classification sub-module, configured to classify the sample disk data by discretizing sample data in each dimension to a corresponding bin, to obtain an ID value corresponding to the sample data in each dimension.
  12. 根据权利要求11所述的装置,其特征在于,所述训练模块包括:The apparatus according to claim 11, wherein the training module comprises:
    训练子模块,用于所述Owlqn模型对所述每个维度上的样本数据所对应的ID值进行训练,得到每个维度上的样本数据的权重值;a training sub-module, configured to: the Owlqn model trains an ID value corresponding to the sample data in each dimension to obtain a weight value of the sample data in each dimension;
    第二确定子模块,用于根据所述每个维度上的样本数据及对应的权重值,确定所述磁盘预测模型,其中,所述磁盘预测模型包括所述每个维度上的样本数据的预测结果。a second determining submodule, configured to determine the disk prediction model according to the sample data and the corresponding weight value in each dimension, wherein the disk prediction model includes prediction of sample data in each dimension result.
  13. 根据权利要求12所述的装置,其特征在于,所述每个维度上的样本数据的预测结果为所述样本磁盘数据进行分类后得到的预测值。The apparatus according to claim 12, wherein the prediction result of the sample data in each dimension is a predicted value obtained by classifying the sample disk data.
  14. 根据权利要求13所述的装置,其特征在于,所述确定模块还包括:The device according to claim 13, wherein the determining module further comprises:
    离散模块,用于接收到所述待测磁盘的磁盘数据之后,将所述待测磁盘的磁盘数据离散化至对应的分箱,得到所述待测磁盘的磁盘数 据所对应的ID值;a discrete module, after receiving the disk data of the disk to be tested, discretizing the disk data of the disk to be tested into a corresponding bin, and obtaining the number of disks of the disk to be tested According to the corresponding ID value;
    第三确定子模块,用于根据所述待测磁盘的磁盘数据所对应的ID值确定所述待测磁盘的磁盘数据的权重值;a third determining submodule, configured to determine, according to an ID value corresponding to the disk data of the disk to be tested, a weight value of the disk data of the disk to be tested;
    第四确定子模块,用于根据所述待测磁盘的磁盘数据的权重值从所述磁盘预测模型中确定所述待测磁盘是否为故障磁盘。 And a fourth determining submodule, configured to determine, according to the weight value of the disk data of the disk to be tested, whether the disk to be tested is a faulty disk.
PCT/CN2017/071699 2016-01-29 2017-01-19 Disk failure prediction method and apparatus WO2017129032A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610065609.6 2016-01-29
CN201610065609.6A CN107025153B (en) 2016-01-29 2016-01-29 Disk failure prediction method and device

Publications (1)

Publication Number Publication Date
WO2017129032A1 true WO2017129032A1 (en) 2017-08-03

Family

ID=59397441

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/071699 WO2017129032A1 (en) 2016-01-29 2017-01-19 Disk failure prediction method and apparatus

Country Status (3)

Country Link
CN (1) CN107025153B (en)
TW (1) TW201732789A (en)
WO (1) WO2017129032A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108647136A (en) * 2018-05-10 2018-10-12 南京道熵信息技术有限公司 Hard disk corruptions prediction technique and device based on SMART information and deep learning
CN109828869A (en) * 2018-12-05 2019-05-31 中兴通讯股份有限公司 Predict the method, apparatus and storage medium of hard disk failure time of origin
CN109842521A (en) * 2019-01-28 2019-06-04 西安科技大学 A kind of mobile terminal delay machine forecasting system and method
WO2020000404A1 (en) 2018-06-29 2020-01-02 Microsoft Technology Licensing, Llc. Multi-factor cloud service storage device error prediction
CN111242150A (en) * 2018-11-29 2020-06-05 阿里巴巴集团控股有限公司 Data training method and device
CN111429003A (en) * 2020-03-23 2020-07-17 北京互金新融科技有限公司 Data processing method and device
CN113778791A (en) * 2021-08-19 2021-12-10 苏州浪潮智能科技有限公司 Fault early warning method and system for distributed storage disk

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108052979A (en) * 2017-12-15 2018-05-18 阿里巴巴集团控股有限公司 The method, apparatus and equipment merged to model predication value
CN108681496A (en) * 2018-05-09 2018-10-19 北京奇艺世纪科技有限公司 Prediction technique, device and the electronic equipment of disk failure
JP7021000B2 (en) * 2018-05-24 2022-02-16 株式会社東芝 Information processing equipment, programs and information processing methods
CN109240867A (en) * 2018-09-18 2019-01-18 鸿秦(北京)科技有限公司 Hard disk failure prediction technique
CN109669796B (en) * 2018-12-20 2022-02-11 湖南快乐阳光互动娱乐传媒有限公司 Disk fault prediction method and device
CN110175100B (en) * 2019-04-17 2020-05-19 华中科技大学 Storage disk fault prediction method and prediction system
CN113127274B (en) * 2019-12-31 2024-03-19 中移动信息技术有限公司 Disk failure prediction method, device, equipment and computer storage medium
CN111459706B (en) * 2020-01-16 2021-07-27 华中科技大学 NAND flash memory error rate prediction method and system
CN112395179B (en) * 2020-11-24 2023-03-10 创新奇智(西安)科技有限公司 Model training method, disk prediction method, device and electronic equipment
CN114610560B (en) * 2020-12-07 2024-04-02 腾讯科技(深圳)有限公司 System abnormality monitoring method, device and storage medium
CN112652351A (en) * 2020-12-25 2021-04-13 平安科技(深圳)有限公司 Hardware state detection method and device, computer equipment and storage medium
CN116259337B (en) * 2023-05-15 2023-09-05 合肥联宝信息技术有限公司 Disk abnormality detection method, model training method and related device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102129397A (en) * 2010-12-29 2011-07-20 深圳市永达电子股份有限公司 Method and system for predicating self-adaptive disk array failure
CN104268040A (en) * 2014-09-16 2015-01-07 深圳市腾讯计算机系统有限公司 Method and device for detecting disk performance
CN104503874A (en) * 2014-12-29 2015-04-08 南京大学 Hard disk failure prediction method for cloud computing platform

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140222724A1 (en) * 2013-02-02 2014-08-07 Microsoft Corporation Generation of log-linear models using l-1 regularization
CN103646114B (en) * 2013-12-26 2017-04-05 北京百度网讯科技有限公司 Characteristic extracting method and device in hard disk SMART data

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102129397A (en) * 2010-12-29 2011-07-20 深圳市永达电子股份有限公司 Method and system for predicating self-adaptive disk array failure
CN104268040A (en) * 2014-09-16 2015-01-07 深圳市腾讯计算机系统有限公司 Method and device for detecting disk performance
CN104503874A (en) * 2014-12-29 2015-04-08 南京大学 Hard disk failure prediction method for cloud computing platform

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108647136A (en) * 2018-05-10 2018-10-12 南京道熵信息技术有限公司 Hard disk corruptions prediction technique and device based on SMART information and deep learning
CN108647136B (en) * 2018-05-10 2021-05-04 南京道熵信息技术有限公司 Hard disk damage prediction method and device based on SMART information and deep learning
US11748185B2 (en) 2018-06-29 2023-09-05 Microsoft Technology Licensing, Llc Multi-factor cloud service storage device error prediction
WO2020000404A1 (en) 2018-06-29 2020-01-02 Microsoft Technology Licensing, Llc. Multi-factor cloud service storage device error prediction
CN111242150B (en) * 2018-11-29 2024-05-17 阿里巴巴集团控股有限公司 Data training method and device
CN111242150A (en) * 2018-11-29 2020-06-05 阿里巴巴集团控股有限公司 Data training method and device
CN109828869B (en) * 2018-12-05 2020-12-04 南京中兴软件有限责任公司 Method, device and storage medium for predicting hard disk fault occurrence time
CN109828869A (en) * 2018-12-05 2019-05-31 中兴通讯股份有限公司 Predict the method, apparatus and storage medium of hard disk failure time of origin
CN109842521A (en) * 2019-01-28 2019-06-04 西安科技大学 A kind of mobile terminal delay machine forecasting system and method
CN111429003A (en) * 2020-03-23 2020-07-17 北京互金新融科技有限公司 Data processing method and device
CN111429003B (en) * 2020-03-23 2023-11-03 北京互金新融科技有限公司 Data processing method and device
CN113778791A (en) * 2021-08-19 2021-12-10 苏州浪潮智能科技有限公司 Fault early warning method and system for distributed storage disk
CN113778791B (en) * 2021-08-19 2023-07-18 苏州浪潮智能科技有限公司 Fault early warning method and system for distributed storage disk

Also Published As

Publication number Publication date
TW201732789A (en) 2017-09-16
CN107025153A (en) 2017-08-08
CN107025153B (en) 2021-02-12

Similar Documents

Publication Publication Date Title
WO2017129032A1 (en) Disk failure prediction method and apparatus
Lu et al. Making disk failure predictions {SMARTer}!
Xu et al. Improving service availability of cloud systems by predicting disk error
US20230385141A1 (en) Multi-factor cloud service storage device error prediction
WO2017129030A1 (en) Disk failure prediction method and apparatus
US10216558B1 (en) Predicting drive failures
US10223224B1 (en) Method and system for automatic disk failure isolation, diagnosis, and remediation
US11327742B2 (en) Affinity recommendation in software lifecycle management
CN107168995B (en) Data processing method and server
US10069702B2 (en) Dynamic discovery of applications, external dependencies, and relationships
CN112214369A (en) Hard disk fault prediction model establishing method based on model fusion and application thereof
US9009536B2 (en) Test case production utilizing problem reports
WO2022001125A1 (en) Method, system and device for predicting storage failure in storage system
JP2023544493A (en) Detecting performance degradation in remotely deployed applications
JP2020144493A (en) Learning model generation support device and learning model generation support method
CN114325405A (en) Battery pack consistency analysis method, modeling method, device, equipment and medium
US10817365B2 (en) Anomaly detection for incremental application deployments
CN115878400A (en) Test method, test apparatus, computer device, storage medium, and program product
CN115543665A (en) Memory reliability evaluation method and device and storage medium
US10380339B1 (en) Reactively identifying software products exhibiting anomalous behavior
KR20220043844A (en) Method for failure prediction and apparatus implementing the same method
CN116057902A (en) Health index of service
CN117421145B (en) Heterogeneous hard disk system fault early warning method and device
US20230396511A1 (en) Capacity Aware Cloud Environment Node Recovery System
CN113434088A (en) Disk identification method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17743647

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17743647

Country of ref document: EP

Kind code of ref document: A1