WO2022001125A1 - Method, system and device for predicting storage failure in storage system - Google Patents

Method, system and device for predicting storage failure in storage system Download PDF

Info

Publication number
WO2022001125A1
WO2022001125A1 PCT/CN2021/076815 CN2021076815W WO2022001125A1 WO 2022001125 A1 WO2022001125 A1 WO 2022001125A1 CN 2021076815 W CN2021076815 W CN 2021076815W WO 2022001125 A1 WO2022001125 A1 WO 2022001125A1
Authority
WO
WIPO (PCT)
Prior art keywords
neural network
state data
network model
storage medium
operating state
Prior art date
Application number
PCT/CN2021/076815
Other languages
French (fr)
Chinese (zh)
Inventor
晏海龙
张东
颜秉珩
Original Assignee
苏州浪潮智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 苏州浪潮智能科技有限公司 filed Critical 苏州浪潮智能科技有限公司
Publication of WO2022001125A1 publication Critical patent/WO2022001125A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3034Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a storage system, e.g. DASD based or network based
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

A method, system and device for predicting storage failure in a storage system. On the basis of operating state data of a storage medium having time series characteristics and a cyclic neural network for processing time series characteristic data, failure prediction for a storage medium is achieved, which can significantly advance the time at which failure is predicted, being capable of predicting the failure of the storage medium at least a few days in advance, thereby increasing system security. Furthermore, the operating state data of the storage medium is processed to obtain operating state data, the correlation between said data and the operating variation conditions of the storage medium being higher than a certain value, and model training is performed, thus the amount of data for model training is reduced under the principle of ensuring that important data and information experience less loss, so as to increase the speed of model training.

Description

一种存储系统的存储故障预测方法、系统及装置A storage failure prediction method, system and device for a storage system
本申请要求于2020年06月30日提交至中国专利局、申请号为202010616525.3、发明名称为“一种存储系统的存储故障预测方法、系统及装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed on June 30, 2020 with the application number 202010616525.3 and the invention titled "A storage failure prediction method, system and device for a storage system", the entire contents of which are Incorporated herein by reference.
技术领域technical field
本发明涉及存储领域,特别是涉及一种存储系统的存储故障预测方法、系统及装置。The present invention relates to the field of storage, and in particular, to a storage failure prediction method, system and device of a storage system.
背景技术Background technique
随着互联网的发展,各行各业都趋向于数据化,所需存储的数据量都呈现爆发式增长。目前,这些数据大都存储在互联网存储系统中,具体存储于存储系统的存储介质中,所以存储介质的好坏决定了存储系统的存储性能。一旦存储介质发生故障,轻则会造成存储系统对外提供的数据服务不可用,重则可能会导致存储在内的数据永久丢失,给用户带来巨大损失。With the development of the Internet, all walks of life tend to be digitized, and the amount of data that needs to be stored shows an explosive growth. At present, most of these data are stored in the Internet storage system, specifically in the storage medium of the storage system, so the quality of the storage medium determines the storage performance of the storage system. Once the storage medium fails, the external data services provided by the storage system may become unavailable, or the stored data may be permanently lost, causing huge losses to users.
现有技术中,存储系统的存储故障处理机制主要分为两种:In the prior art, the storage fault handling mechanism of the storage system is mainly divided into two types:
1)被动容错机制:被动容错机制是指系统在存储介质发生故障之后,对存储在内的数据进行备份,以对系统进行恢复。但是,对数据进行备份需要以大量存储介质为基础,增加了系统运营负担;而且,若用户在系统数据备份时发起数据请求,此数据请求将会有一定的响应延迟,不利于用户体验。1) Passive fault tolerance mechanism: Passive fault tolerance mechanism means that the system backs up the data stored in the storage medium after the failure of the storage medium to restore the system. However, backing up data needs to be based on a large number of storage media, which increases the operating burden of the system; moreover, if a user initiates a data request during system data backup, the data request will have a certain response delay, which is not conducive to user experience.
2)主动容错机制:主动容错机制是指系统在存储介质发生故障之前提前预知其故障,以提前对即将故障的存储介质进行数据迁移及数据备份,从而大大减少了数据丢失的风险。目前,通常采用的系统存储故障预测方法为:提前为存储介质的多个运行参数一一设置安全阈值,在存储系统运行过程中,监测存储介质的各运行参数值,并当存储介质的任一运行参数值超过其对应的安全阈值时,认为存储介质即将在24小时内故障,系统会发出预警信息。但是,此系统存储故障预测方法可提前预测出的存储介质即将发生故障的时间较短(24小时以内),即留给管理人员处理系统数据 的时间较短,不利于系统整体的安全性。2) Active fault tolerance mechanism: The active fault tolerance mechanism means that the system predicts the failure of the storage medium in advance, so as to perform data migration and data backup for the storage medium that is about to fail in advance, thereby greatly reducing the risk of data loss. At present, the commonly used method for predicting system storage failures is to set safety thresholds for multiple operating parameters of the storage medium in advance, monitor the operating parameter values of the storage medium during the operation of the storage system, and determine the value of each operating parameter of the storage medium when the storage medium is in operation. When the operating parameter value exceeds its corresponding safety threshold, it is considered that the storage medium will fail within 24 hours, and the system will issue an early warning message. However, this system storage failure prediction method can predict in advance that the storage medium will fail in a short time (within 24 hours), that is, the time left for managers to process system data is short, which is not conducive to the overall security of the system.
因此,如何提供一种解决上述技术问题的方案是本领域的技术人员目前需要解决的问题。Therefore, how to provide a solution to the above technical problem is a problem that those skilled in the art need to solve at present.
发明内容SUMMARY OF THE INVENTION
本发明的目的是提供一种存储系统的存储故障预测方法、系统及装置,基于具有时序特性的存储介质的运行状态数据及用于处理时序特性数据的循环神经网络,实现对存储介质的故障进行预测,可以显著提前故障预测时间,至少可提前几天预测存储介质的故障,从而提高系统安全性;而且,本申请将存储介质的运行状态数据处理得到与存储介质的运行变化情况的相关性高于一定值的运行状态数据进行模型训练,从而在保证重要数据信息丢失较少的原则下减少模型训练的数据量,以加快模型训练速度。The purpose of the present invention is to provide a storage fault prediction method, system and device for a storage system, based on the running state data of the storage medium with time series characteristics and a cyclic neural network for processing the time series characteristic data, to realize the fault detection of the storage medium. Prediction, the failure prediction time can be significantly advanced, and the failure of the storage medium can be predicted at least a few days in advance, thereby improving system security; moreover, the application processes the operation status data of the storage medium to obtain a high correlation with the operation change of the storage medium Model training is performed on a certain value of running state data, so as to reduce the amount of data for model training under the principle of ensuring less loss of important data information, so as to speed up model training.
为解决上述技术问题,本发明提供了一种存储系统的存储故障预测方法,包括:In order to solve the above technical problems, the present invention provides a storage failure prediction method of a storage system, including:
预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内运行的第二运行状态数据;Acquiring in advance first operating state data of the storage medium of the storage system running normally within a preset first time and second operating state data of running within a preset second time before the failure occurs;
将所述第一运行状态数据和所述第二运行状态数据进行预处理,以得到与所述存储介质的运行变化情况的相关性高于一定值的运行状态数据;Preprocessing the first operating state data and the second operating state data to obtain operating state data whose correlation with the operating change of the storage medium is higher than a certain value;
基于所述运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型;training a pre-established recurrent neural network model based on the operating state data to obtain a recurrent neural network model for predicting the failure of the storage medium;
在所述存储系统运行过程中,基于所述循环神经网络模型对所述存储介质的当前运行状态数据进行分析处理,得到所述存储介质的故障预测结果。During the operation of the storage system, the current operating state data of the storage medium is analyzed and processed based on the cyclic neural network model to obtain a fault prediction result of the storage medium.
优选地,预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内的运行的第二运行状态数据的过程,包括:Preferably, the process of pre-obtaining the first operating state data of the storage medium of the storage system running normally within the preset first time and the second operating state data of running within the preset second time before the failure occurs, includes:
预先获取存储系统的存储介质在预设第一时间内均正常运行的多个第一运行状态数据,并将多个所述第一运行状态数据作为负样本;Acquiring in advance a plurality of first operating state data in which the storage medium of the storage system operates normally within a preset first time, and using the plurality of first operating state data as negative samples;
获取所述存储介质在故障发生前预设第二时间内运行的多个第二运行状态数据,并将多个所述第二运行状态数据作为正样本;Acquiring multiple pieces of second operating state data that are run by the storage medium within a preset second time before the failure occurs, and using the multiple pieces of second operating state data as positive samples;
其中,所述正样本和所述负样本的比例均衡,二者共同组成用于训练所述循环神经网络模型的样本集。The proportions of the positive samples and the negative samples are balanced, and the two together form a sample set for training the recurrent neural network model.
优选地,将所述第一运行状态数据和所述第二运行状态数据进行预处理,以得到与所述存储介质的运行变化情况的相关性高于一定值的运行状态数据;基于所述运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型的过程,包括:Preferably, the first operating state data and the second operating state data are preprocessed, so as to obtain operating state data whose correlation with the operating variation of the storage medium is higher than a certain value; The process of training the pre-established recurrent neural network model with the state data to obtain the recurrent neural network model for predicting the failure of the storage medium, including:
基于获取的n个样本x i=(x i1,x i2,...,x ip) T,i=1,2,...,n构造样本矩阵;其中,每个样本采集p维向量数据x=(x 1,x 2,...,x p) T,n>p且n、p均为正整数; A sample matrix is constructed based on the acquired n samples x i =(x i1 ,x i2 ,...,x ip ) T , i=1,2,...,n; wherein, each sample collects p-dimensional vector data x=(x 1 , x 2 ,...,x p ) T , n>p and both n and p are positive integers;
基于标准变换关系式
Figure PCTCN2021076815-appb-000001
对所述样本矩阵进行标准变换,得到标准化矩阵Z;其中,
Figure PCTCN2021076815-appb-000002
Figure PCTCN2021076815-appb-000003
based on standard transformation relations
Figure PCTCN2021076815-appb-000001
Standard transformation is performed on the sample matrix to obtain a standardized matrix Z; wherein,
Figure PCTCN2021076815-appb-000002
Figure PCTCN2021076815-appb-000003
基于样本相关矩阵求取关系式
Figure PCTCN2021076815-appb-000004
得到样本相关矩阵R,并对样本相关矩阵R的特征方程|R-λI p|=0进行求解,得到p个特征根;
Relational Expression Based on Sample Correlation Matrix
Figure PCTCN2021076815-appb-000004
Obtain the sample correlation matrix R, and solve the characteristic equation |R-λI p |=0 of the sample correlation matrix R to obtain p characteristic roots;
基于
Figure PCTCN2021076815-appb-000005
确定m值,并基于Rb=λ jb对每个λ j,j=1,2,...,m进行求解,得到单位矩阵
Figure PCTCN2021076815-appb-000006
其中,Q为预设信息最低利用率,p>m且m为正整数;
based on
Figure PCTCN2021076815-appb-000005
Determine the value of m and solve for each λ j ,j=1,2,...,m based on Rb=λ j b to get the identity matrix
Figure PCTCN2021076815-appb-000006
Among them, Q is the minimum utilization rate of preset information, p>m and m is a positive integer;
基于指标转换关系式
Figure PCTCN2021076815-appb-000007
得到样本新变量U ij,并基于样本新变量U ij对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型。
Converting Relational Expressions Based on Indicators
Figure PCTCN2021076815-appb-000007
A new sample variable U ij is obtained , and a pre-established recurrent neural network model is trained based on the new sample variable U ij to obtain a recurrent neural network model for predicting the failure of the storage medium.
优选地,基于样本新变量U ij对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型的过程,包括: Preferably, the process of training a pre-established RNN model based on the new sample variable U ij to obtain a RNN model for predicting the failure of the storage medium includes:
获取各样本新变量U ij的算术平均值μ和标准差σ,并基于标准化关系式g2=(g1-μ)/σ对每个新变量进行标准化处理,得到各标准化变量值;其中,g1为每个新变量标准化处理之前的变量值,g2为每个新变量标准化处理之后的变量值; Obtain the arithmetic mean μ and standard deviation σ of the new variables U ij of each sample, and standardize each new variable based on the standardized relationship g2=(g1-μ)/σ to obtain the value of each standardized variable; among them, g1 is The variable value before each new variable is standardized, and g2 is the variable value after each new variable is standardized;
基于所述各标准化变量值的绝对值对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型。The pre-established recurrent neural network model is trained based on the absolute value of each standardized variable value, so as to obtain a recurrent neural network model for predicting the failure of the storage medium.
优选地,基于所述运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型的过程,包括:Preferably, the process of training a pre-established recurrent neural network model based on the operating state data to obtain a recurrent neural network model for predicting the failure of the storage medium includes:
将所述运行状态数据组成的样本集划分为训练集、验证集及测试集;dividing the sample set composed of the operating state data into a training set, a verification set and a test set;
基于所述训练集对预先建立好的循环神经网络模型进行训练,得到第一循环神经网络模型;The pre-established cyclic neural network model is trained based on the training set to obtain a first cyclic neural network model;
基于所述验证集对所述第一循环神经网络模型进行验证,并根据验证结果判断所述第一循环神经网络模型的训练是否达标;Verifying the first RNN model based on the verification set, and judging whether the training of the first RNN model meets the standard according to the verification result;
若训练达标,则基于所述测试集对所述第一循环神经网络模型进行测试,并根据测试结果判断所述第一循环神经网络模型的测试是否通过;If the training meets the standard, then test the first recurrent neural network model based on the test set, and determine whether the test of the first recurrent neural network model passes according to the test result;
若测试通过,则将测试通过的第一循环神经网络模型作为用于对所述存储介质的故障进行预测的循环神经网络模型;If the test is passed, the first recurrent neural network model that has passed the test is used as the recurrent neural network model for predicting the failure of the storage medium;
若测试未通过,则再次获取新样本集对所述第一循环神经网络模型继续训练,并返回基于所述测试集对所述第一循环神经网络模型进行测试的步骤;If the test fails, obtain a new sample set again to continue training the first recurrent neural network model, and return to the step of testing the first recurrent neural network model based on the test set;
若训练未达标,则再次获取新样本集对所述第一循环神经网络模型继续训练,并返回基于所述验证集对所述第一循环神经网络模型进行验证的步骤。If the training fails to meet the standard, a new sample set is obtained again to continue training the first recurrent neural network model, and the step of verifying the first recurrent neural network model based on the verification set is returned.
优选地,所述存储介质的第一运行状态数据及第二运行状态数据均具体为所述存储介质的SMART数据。Preferably, both the first operating state data and the second operating state data of the storage medium are specifically SMART data of the storage medium.
优选地,所述循环神经网络模型具体为BERT或Transformer。Preferably, the recurrent neural network model is specifically BERT or Transformer.
优选地,所述存储故障预测方法还包括:Preferably, the storage failure prediction method further includes:
将所述存储介质的故障预测结果记录在系统日志中,并在所述存储系统的管理界面上显示所述故障预测结果。The failure prediction result of the storage medium is recorded in a system log, and the failure prediction result is displayed on the management interface of the storage system.
为解决上述技术问题,本发明还提供了一种存储系统的存储故障预测系统,包括:In order to solve the above technical problems, the present invention also provides a storage failure prediction system of a storage system, including:
数据获取模块,用于预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内的运行的第二运行状态数据;a data acquisition module, used for pre-acquiring first operating state data of the storage medium of the storage system operating normally within a preset first time and second operating state data of operating within a preset second time before the failure occurs;
数据提取模块,用于将所述第一运行状态数据和所述第二运行状态数据进行预处理,以得到与所述存储介质的运行变化情况的相关性高于一定值的运行状态数据;a data extraction module, configured to preprocess the first operating state data and the second operating state data to obtain operating state data whose correlation with the operating change of the storage medium is higher than a certain value;
模型训练模块,用于基于所述运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型;a model training module for training a pre-established recurrent neural network model based on the operating state data to obtain a recurrent neural network model for predicting the failure of the storage medium;
故障预测模块,用于在所述存储系统运行过程中,基于所述循环神经网络模型对所述存储介质的当前运行状态数据进行分析处理,得到所述存储介质的故障预测结果。The fault prediction module is configured to analyze and process the current operating state data of the storage medium based on the cyclic neural network model during the operation of the storage system to obtain a fault prediction result of the storage medium.
为解决上述技术问题,本发明还提供了一种存储系统的存储故障预测装置,包括:In order to solve the above technical problems, the present invention also provides a storage failure prediction device of a storage system, including:
存储器,用于存储计算机程序;memory for storing computer programs;
处理器,用于在执行所述计算机程序时实现上述任一种存储系统的存储故障预测方法的步骤。The processor is configured to implement the steps of any one of the above storage system storage failure prediction methods when executing the computer program.
本发明提供了一种存储系统的存储故障预测方法,预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内运行的第二运行状态数据;将第一运行状态数据和第二运行状态数据进行预处理,以得到与存储介质的运行变化情况的相关性高于一定值的运行状态数据;基于运行状态数据,对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型;在存储系统运行过程中,基于循环神经网络模型对存储介质的当前运行状态数据进行分析处理,得到存储介质的故障预测结果。可见,本申请基于具有时序特性的存储介质的运行状态数据及用于处理时序特性数据的循环神经网络,实现对存储介质的故障进行预测,可以显著提前故障预测时间,至少可提前几天预测存储介质的故障,从而提高系统安全性;而且,本申请将存储介质的运行状态数据处理得到与存储介质的运行变化情况的相关性高于一定值的运行状态数据进行模型训练,从而在保证重要数据信息丢失较少的原则下减少模型训练的数据量,以加快模型训练速度。The present invention provides a storage failure prediction method for a storage system, which includes pre-acquiring first operating state data of the storage medium of the storage system running normally within a preset first time and data of running in a preset second time before the failure occurs. second operating state data; preprocessing the first operating state data and the second operating state data to obtain operating state data whose correlation with the operation change of the storage medium is higher than a certain value; The established recurrent neural network model is trained to obtain a recurrent neural network model for predicting the failure of the storage medium; during the operation of the storage system, the current operating state data of the storage medium is analyzed and processed based on the recurrent neural network model , to obtain the failure prediction result of the storage medium. It can be seen that the present application realizes the prediction of the failure of the storage medium based on the operating state data of the storage medium with time-series characteristics and the recurrent neural network for processing the time-series characteristic data, which can significantly advance the failure prediction time, at least several days in advance. In addition, the application processes the operation state data of the storage medium to obtain the operation state data whose correlation with the operation change of the storage medium is higher than a certain value for model training, so as to ensure important data. Reduce the amount of data for model training under the principle of less information loss to speed up model training.
本发明还提供了一种存储系统的存储故障预测系统及装置,与上述存储故障预测方法具有相同的有益效果。The present invention also provides a storage failure prediction system and device for a storage system, which have the same beneficial effects as the above storage failure prediction method.
附图说明Description of drawings
为了更清楚地说明本发明实施例中的技术方案,下面将对现有技术和实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present invention more clearly, the following briefly introduces the prior art and the accompanying drawings required in the embodiments. Obviously, the drawings in the following description are only some of the present invention. In the embodiments, for those of ordinary skill in the art, other drawings can also be obtained according to these drawings without any creative effort.
图1为本发明实施例提供的一种存储系统的存储故障预测方法的流程图;1 is a flowchart of a method for predicting a storage failure of a storage system according to an embodiment of the present invention;
图2为本发明实施例提供的一种存储系统的整体预测示意图;FIG. 2 is a schematic diagram of an overall prediction of a storage system according to an embodiment of the present invention;
图3为本发明实施例提供的一种循环神经网络的训练示意图。FIG. 3 is a schematic diagram of training a recurrent neural network according to an embodiment of the present invention.
具体实施方式detailed description
本发明的核心是提供一种存储系统的存储故障预测方法、系统及装置, 基于具有时序特性的存储介质的运行状态数据及用于处理时序特性数据的循环神经网络,实现对存储介质的故障进行预测,可以显著提前故障预测时间,至少可提前几天预测存储介质的故障,从而提高系统安全性。The core of the present invention is to provide a storage fault prediction method, system and device of a storage system, based on the running state data of the storage medium with time series characteristics and the cyclic neural network for processing the time series characteristic data, to realize the fault detection of the storage medium. Prediction, the failure prediction time can be significantly advanced, and the failure of the storage medium can be predicted at least a few days in advance, thereby improving system security.
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
请参照图1,图1为本发明实施例提供的一种存储系统的存储故障预测方法的流程图。Please refer to FIG. 1. FIG. 1 is a flowchart of a method for predicting a storage failure of a storage system according to an embodiment of the present invention.
该存储系统的存储故障预测方法包括:The storage failure prediction method of the storage system includes:
步骤S1:预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内运行的第二运行状态数据。Step S1: Pre-acquire first operating state data of the storage system of the storage system that operates normally within a preset first time and second operating state data that operate within a preset second time before the failure occurs.
需要说明的是,本申请的预设是提前设置好的,只需要设置一次,除非根据实际情况需要修改,否则不需要重新设置。It should be noted that the preset of this application is set in advance, and only needs to be set once, and does not need to be reset unless it needs to be modified according to the actual situation.
具体地,本申请提前获取存储系统(如云服务器)的存储介质(如机械硬盘、固态硬盘、闪存等存储介质)在预设第一时间内均正常运行的第一运行状态数据,同时获取存储系统的存储介质在故障发生前预设第二时间内运行的第二运行状态数据,目的是获取具有时序特性的运行状态数据,以供后续较适用于处理时序特性数据的循环神经网络模型训练使用。Specifically, the present application obtains in advance the first operating state data of the storage medium (such as a mechanical hard disk, solid-state hard disk, flash memory, etc.) of a storage system (such as a cloud server) in normal operation within a preset first time, and simultaneously obtains the storage medium The storage medium of the system presets the second operating state data that runs for a second time before the failure occurs, the purpose is to obtain the operating state data with time series characteristics for subsequent training of recurrent neural network models that are more suitable for processing time series characteristic data. .
步骤S2:将第一运行状态数据和第二运行状态数据进行预处理,以得到与存储介质的运行变化情况的相关性高于一定值的运行状态数据。Step S2: Preprocess the first operating state data and the second operating state data to obtain operating state data whose correlation with the operating change of the storage medium is higher than a certain value.
具体地,考虑到步骤S1获取的第一运行状态数据和第二运行状态数据中,并不是所有运行状态数据都能够很好地表征存储介质的运行变化情况,所以本申请将第一运行状态数据和第二运行状态数据进行预处理,目的是得到与存储介质的运行变化情况的相关性高于一定值的运行状态数据,以 基于处理得到的这些运行状态数据进行后续模型训练,从而在保证重要数据信息丢失较少的原则下减少模型训练的数据量,进而加快模型训练速度。Specifically, considering that in the first operating state data and the second operating state data acquired in step S1, not all the operating state data can well represent the operating changes of the storage medium, this application uses the first operating state data The purpose of preprocessing with the second operating state data is to obtain the operating state data whose correlation with the operating changes of the storage medium is higher than a certain value, so as to perform subsequent model training based on the processed operating state data, so as to ensure the important Reduce the amount of data for model training under the principle of less loss of data information, thereby speeding up model training.
步骤S3:基于运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型。Step S3: Train the pre-established recurrent neural network model based on the operating state data to obtain a recurrent neural network model for predicting the failure of the storage medium.
具体地,第一运行状态数据为表示存储介质在预设第一时间内均正常运行的数据,其作为用于告知循环神经网络模型存储介质的正常运行状态的样本数据;第二运行状态数据为表示存储介质在故障发生前预设第二时间内运行的数据,其作为用于告知循环神经网络模型存储介质故障发生前的运行状态的样本数据。Specifically, the first operating state data is data indicating that the storage medium operates normally within a preset first time, which is used as sample data for informing the recurrent neural network model of the normal operating state of the storage medium; the second operating state data is Data representing the operation of the storage medium for a preset second time before the failure occurs, which is used as sample data for informing the recurrent neural network model of the operation state of the storage medium before the failure occurs.
基于将第一运行状态数据和第二运行状态数据进行处理得到的运行状态数据,对预先建立好的循环神经网络模型进行训练,目的是得到用于对存储介质的故障进行预测的循环神经网络模型,以供后续预测存储介质故障使用。Based on the operating state data obtained by processing the first operating state data and the second operating state data, the pre-established recurrent neural network model is trained to obtain a recurrent neural network model for predicting the failure of the storage medium , for subsequent prediction of storage media failures.
步骤S4:在存储系统运行过程中,基于循环神经网络模型对存储介质的当前运行状态数据进行分析处理,得到存储介质的故障预测结果。Step S4: During the operation of the storage system, analyze and process the current operating state data of the storage medium based on the cyclic neural network model to obtain a fault prediction result of the storage medium.
具体地,在存储系统运行过程中,实时获取存储系统的存储介质的运行状态数据,并基于循环神经网络模型对获取的存储介质的运行状态数据进行分析处理,以得到存储介质的故障预测结果,供管理人员参考。需要说明的是,基于具有时序特性的存储介质的运行状态数据及用于处理时序特性数据的循环神经网络,至少可提前几天预测存储介质的故障,从而留给管理人员较多处理系统数据的时间,利于系统整体的安全性。Specifically, during the operation of the storage system, the running status data of the storage medium of the storage system is acquired in real time, and the acquired running status data of the storage medium is analyzed and processed based on the cyclic neural network model, so as to obtain the fault prediction result of the storage medium, For managers' reference. It should be noted that, based on the operating state data of the storage medium with time series characteristics and the recurrent neural network used to process the time series characteristic data, the failure of the storage medium can be predicted at least a few days in advance, thus leaving more time for managers to process system data. time, which is beneficial to the overall security of the system.
本发明提供了一种存储系统的存储故障预测方法,预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内运行的第二运行状态数据;将第一运行状态数据和第二运行状态数据进行预处理,以得到与存储介质的运行变化情况的相关性高于一定值的运行状态数据;基于运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型;在存储系统运行过程中,基于循环神经网络模型对存储介质的当前运行状态数据进行分析处理,得到存储介质的故障预测结果。可见,本 申请基于具有时序特性的存储介质的运行状态数据及用于处理时序特性数据的循环神经网络,实现对存储介质的故障进行预测,可以显著提前故障预测时间,至少可提前几天预测存储介质的故障,从而提高系统安全性;而且,本申请将存储介质的运行状态数据处理得到与存储介质的运行变化情况的相关性高于一定值的运行状态数据进行模型训练,从而在保证重要数据信息丢失较少的原则下减少模型训练的数据量,以加快模型训练速度。The present invention provides a storage failure prediction method for a storage system, which includes pre-acquiring first operating state data of the storage medium of the storage system running normally within a preset first time and data of running in a preset second time before the failure occurs. second operating state data; preprocessing the first operating state data and the second operating state data to obtain operating state data whose correlation with the operating change of the storage medium is higher than a certain value; based on the operating state data A good recurrent neural network model is trained to obtain a recurrent neural network model for predicting the failure of the storage medium; during the operation of the storage system, the current operating state data of the storage medium is analyzed and processed based on the recurrent neural network model. Obtain the failure prediction result of the storage medium. It can be seen that the present application realizes the prediction of the failure of the storage medium based on the operating state data of the storage medium with time-series characteristics and the recurrent neural network for processing the time-series characteristic data, which can significantly advance the failure prediction time, at least several days in advance. In addition, the application processes the operation state data of the storage medium to obtain the operation state data whose correlation with the operation change of the storage medium is higher than a certain value for model training, so as to ensure important data. Reduce the amount of data for model training under the principle of less information loss to speed up model training.
在上述实施例的基础上:On the basis of the above-mentioned embodiment:
请参照图2,图2为本发明实施例提供的一种存储系统的整体预测示意图。Please refer to FIG. 2 , which is a schematic diagram of an overall prediction of a storage system according to an embodiment of the present invention.
作为一种可选的实施例,预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内的运行的第二运行状态数据的过程,包括:As an optional embodiment, pre-acquire first operating state data of the storage medium of the storage system running normally within a preset first time and second operating state data of running within a preset second time before the fault occurs data process, including:
预先获取存储系统的存储介质在预设第一时间内均正常运行的多个第一运行状态数据,并将多个第一运行状态数据作为负样本;Acquiring in advance a plurality of first operating state data in which the storage medium of the storage system operates normally within a preset first time, and using the plurality of first operating state data as negative samples;
获取存储介质在故障发生前预设第二时间内运行的多个第二运行状态数据,并将多个第二运行状态数据作为正样本;Acquiring multiple pieces of second operating state data that the storage medium runs within a preset second time before the failure occurs, and using the multiple pieces of second operating state data as positive samples;
其中,正样本和负样本的比例均衡,二者共同组成用于训练循环神经网络模型的样本集。Among them, the proportion of positive samples and negative samples is balanced, and the two together form a sample set for training the recurrent neural network model.
具体地,本申请提前获取的存储介质在预设第一时间内均正常运行的第一运行状态数据的数量为多个,且多个第一运行状态数据作为训练循环神经网络模型的负样本;同样地,本申请提前获取的存储介质在故障发生前预设第二时间内运行的第二运行状态数据的数量为多个,且多个第二运行状态数据作为训练循环神经网络模型的正样本。Specifically, the storage medium obtained in advance by the present application has a plurality of first operating state data that all run normally within the preset first time, and the plurality of first operating state data are used as negative samples for training the recurrent neural network model; Similarly, the storage medium obtained in advance by the present application has a plurality of second operating state data that run within the preset second time before the failure occurs, and the plurality of second operating state data are used as positive samples for training the recurrent neural network model. .
需要说明的是,这里的正样本和负样本的比例应尽量保证均衡,即构成正样本的数据量与构成负样本的数据量尽量保证相等。It should be noted that the ratio of positive samples and negative samples here should be as balanced as possible, that is, the amount of data that constitutes a positive sample and the amount of data that constitutes a negative sample should be as equal as possible.
作为一种可选的实施例,将第一运行状态数据和第二运行状态数据进行预处理,以得到与存储介质的运行变化情况的相关性高于一定值的运行 状态数据;基于运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型的过程,包括:As an optional embodiment, the first operation state data and the second operation state data are preprocessed to obtain operation state data whose correlation with the operation change of the storage medium is higher than a certain value; based on the operation state data The process of training the pre-established recurrent neural network model to obtain the recurrent neural network model for predicting the failure of the storage medium, including:
基于获取的n个样本x i=(x i1,x i2,...,x ip) T,i=1,2,...,n构造样本矩阵;其中,每个样本采集p维向量数据x=(x 1,x 2,...,x p) T,n>p且n、p均为正整数; A sample matrix is constructed based on the acquired n samples x i =(x i1 ,x i2 ,...,x ip ) T , i=1,2,...,n; wherein, each sample collects p-dimensional vector data x=(x 1 , x 2 ,...,x p ) T , n>p and both n and p are positive integers;
基于标准变换关系式
Figure PCTCN2021076815-appb-000008
对样本矩阵进行标准变换,得到标准化矩阵Z;其中,
Figure PCTCN2021076815-appb-000009
based on standard transformation relations
Figure PCTCN2021076815-appb-000008
Perform standard transformation on the sample matrix to obtain a standardized matrix Z; among them,
Figure PCTCN2021076815-appb-000009
基于样本相关矩阵求取关系式
Figure PCTCN2021076815-appb-000010
得到样本相关矩阵R,并对样本相关矩阵R的特征方程|R-λI p|=0进行求解,得到p个特征根;
Relational Expression Based on Sample Correlation Matrix
Figure PCTCN2021076815-appb-000010
Obtain the sample correlation matrix R, and solve the characteristic equation |R-λI p |=0 of the sample correlation matrix R to obtain p characteristic roots;
基于
Figure PCTCN2021076815-appb-000011
确定m值,并基于Rb=λ jb对每个λ j,j=1,2,...,m进行求解,得到单位矩阵
Figure PCTCN2021076815-appb-000012
其中,Q为预设信息最低利用率,p>m且m为正整数;
based on
Figure PCTCN2021076815-appb-000011
Determine the value of m and solve for each λ j ,j=1,2,...,m based on Rb=λ j b to get the identity matrix
Figure PCTCN2021076815-appb-000012
Among them, Q is the minimum utilization rate of preset information, p>m and m is a positive integer;
基于指标转换关系式
Figure PCTCN2021076815-appb-000013
得到样本新变量U ij,并基于样本新变量U ij对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型。
Converting Relational Expressions Based on Indicators
Figure PCTCN2021076815-appb-000013
A new sample variable U ij is obtained , and the pre-established recurrent neural network model is trained based on the new sample variable U ij to obtain a recurrent neural network model for predicting the failure of the storage medium.
具体地,在基于第一运行状态数据和第二运行状态数据对循环神经网络模型进行训练时,可先对第一运行状态数据和第二运行状态数据进行如下预处理:Specifically, when training the recurrent neural network model based on the first operating state data and the second operating state data, the first operating state data and the second operating state data may be preprocessed as follows:
在获取第一运行状态数据和第二运行状态数据时,具体获取了n个样本,表示为:x i=(x i1,x i2,...,x ip) T,i=1,2,...,n;其中,每个样本包含p维向量数据,具体是p种运行状态在一段时间内的数据构成p维运行状态数据,p维运行状态数据构成一个样本,表示为:x=(x 1,x 2,...,x p) TWhen acquiring the first operating state data and the second operating state data, n samples are specifically acquired, which are expressed as: x i =(x i1 , x i2 ,...,x ip ) T , i=1,2, ...,n; wherein, each sample contains p-dimensional vector data, specifically, the data of p-type operating states within a period of time constitute p-dimensional operating state data, and the p-dimensional operating state data constitute a sample, expressed as: x = (x 1 ,x 2 ,...,x p ) T .
基于获取的n个样本x i=(x i1,x i2,...,x ip) T,i=1,2,...,n构造样本矩阵,并基于标准变换关系式
Figure PCTCN2021076815-appb-000014
对样本矩阵进行标准变换,得到标准化矩阵Z,然后基于样本相关矩阵求取关系式
Figure PCTCN2021076815-appb-000015
得到样本相关矩阵R,并对样本相关矩阵R的特征方程|R-λI p|=0进行求解,得到p个特征根,表示为λ j,j=1,2,...,p。
Based on the acquired n samples x i =(x i1 ,x i2 ,...,x ip ) T , i=1,2,...,n construct a sample matrix, and based on the standard transformation relation
Figure PCTCN2021076815-appb-000014
Perform standard transformation on the sample matrix to obtain the standardized matrix Z, and then obtain the relational expression based on the sample correlation matrix
Figure PCTCN2021076815-appb-000015
The sample correlation matrix R is obtained, and the characteristic equation |R-λI p |=0 of the sample correlation matrix R is solved to obtain p characteristic roots, which are expressed as λ j , j=1,2,...,p.
基于
Figure PCTCN2021076815-appb-000016
确定m值,设Q=85%,即使信息的利用率达到85%以上,并基于Rb=λ jb对每个λ j,j=1,2,...,m进行求解,得到单位矩阵
Figure PCTCN2021076815-appb-000017
然后基于指标转换关系式
Figure PCTCN2021076815-appb-000018
得到样本新变量U ij,即样本新变量U ij包含n个样本,每个样本包含m维新向量数据。
based on
Figure PCTCN2021076815-appb-000016
Determine the value of m, set Q=85%, even if the utilization rate of information reaches more than 85%, and solve each λ j ,j=1,2,...,m based on Rb=λ j b to obtain the identity matrix
Figure PCTCN2021076815-appb-000017
Then convert the relational expression based on the indicator
Figure PCTCN2021076815-appb-000018
The new sample variable U ij is obtained , that is, the new sample variable U ij contains n samples, and each sample contains m-dimensional new vector data.
可见,本申请用m维的Y空间代替p维的X空间(m<p,对多变量数据进行最佳综合简化),而低维的Y空间代替高维的X空间所损失的重要信息很少,即在保证重要数据信息丢失较少的原则下,对高维变量空间进行降维处理,以减少模型训练的数据量,加快循环神经网络模型的训练速度。It can be seen that in this application, the m-dimensional Y space is used to replace the p-dimensional X space (m<p, the best comprehensive simplification for multivariate data), and the important information lost by the low-dimensional Y space instead of the high-dimensional X space is very important. Less, that is, under the principle of ensuring less loss of important data information, dimensionality reduction processing is performed on the high-dimensional variable space to reduce the amount of data for model training and speed up the training speed of the recurrent neural network model.
基于此,本申请基于样本新变量U ij对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型。 Based on this, the present application trains the pre-established cyclic neural network model based on the new sample variable U ij to obtain a cyclic neural network model for predicting the failure of the storage medium.
作为一种可选的实施例,基于样本新变量U ij对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型的过程,包括: As an optional embodiment, the process of training the pre-established recurrent neural network model based on the new sample variable U ij to obtain the recurrent neural network model for predicting the failure of the storage medium includes:
获取各样本新变量U ij的算术平均值μ和标准差σ,并基于标准化关系式g2=(g1-μ)/σ对每个新变量进行标准化处理,得到各标准化变量值;其中, g1为每个新变量标准化处理之前的变量值,g2为每个新变量标准化处理之后的变量值; Obtain the arithmetic mean μ and standard deviation σ of the new variables U ij of each sample, and standardize each new variable based on the standardized relationship g2=(g1-μ)/σ to obtain the value of each standardized variable; among them, g1 is The variable value before each new variable is standardized, and g2 is the variable value after each new variable is standardized;
基于各标准化变量值的绝对值对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型。The pre-established recurrent neural network model is trained based on the absolute value of each standardized variable value, so as to obtain a recurrent neural network model for predicting the failure of the storage medium.
具体地,在基于第一运行状态数据和第二运行状态数据对循环神经网络模型进行训练时,还可对第一运行状态数据和第二运行状态数据进行如下处理:Specifically, when the recurrent neural network model is trained based on the first operating state data and the second operating state data, the first operating state data and the second operating state data can also be processed as follows:
考虑到在多变量体系中,由于各变量的性质不同,通常具有不同的量纲和数量级,当各变量间的水平相差很大时,如果直接用原始变量值进行分析,就会突出数值较高的变量在综合分析中的作用,相对削弱数值水平较低变量的作用,所以为了保证综合分析结果的可靠性,本申请还对各样本新变量U ij进行标准化处理,具体是获取各样本新变量U ij的算术平均值μ和标准差σ,并基于标准化关系式g2=(g1-μ)/σ对每个新变量进行标准化处理,得到各标准化变量值。 Considering that in a multivariate system, due to the different nature of each variable, it usually has different dimensions and orders of magnitude. When the level of each variable is very different, if the original variable value is directly used for analysis, it will highlight that the value is higher. The role of the variables in the comprehensive analysis relatively weakens the role of the variables with lower numerical levels. Therefore, in order to ensure the reliability of the comprehensive analysis results, this application also standardizes the new variables U ij for each sample. Specifically, the new variables for each sample are obtained. The arithmetic mean μ and the standard deviation σ of U ij , and each new variable is standardized based on the standardized relational expression g2=(g1-μ)/σ to obtain the value of each standardized variable.
基于此,本申请基于各标准化变量值的绝对值对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型。Based on this, the present application trains a pre-established recurrent neural network model based on the absolute value of each standardized variable value, so as to obtain a recurrent neural network model for predicting the failure of the storage medium.
请参照图3,图3为本发明实施例提供的一种循环神经网络的训练示意图。Please refer to FIG. 3 , which is a schematic diagram of training a recurrent neural network according to an embodiment of the present invention.
作为一种可选的实施例,基于运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型的过程,包括:As an optional embodiment, the process of training a pre-established recurrent neural network model based on the operating state data to obtain a recurrent neural network model for predicting the failure of the storage medium includes:
将运行状态数据组成的样本集划分为训练集、验证集及测试集;Divide the sample set composed of running state data into training set, validation set and test set;
基于训练集对预先建立好的循环神经网络模型进行训练,得到第一循环神经网络模型;The pre-established recurrent neural network model is trained based on the training set to obtain the first recurrent neural network model;
基于验证集对第一循环神经网络模型进行验证,并根据验证结果判断第一循环神经网络模型的训练是否达标;Verifying the first recurrent neural network model based on the verification set, and judging whether the training of the first recurrent neural network model meets the standard according to the verification result;
若训练达标,则基于测试集对第一循环神经网络模型进行测试,并根据测试结果判断第一循环神经网络模型的测试是否通过;If the training meets the standard, then test the first recurrent neural network model based on the test set, and determine whether the test of the first recurrent neural network model passes according to the test result;
若测试通过,则将测试通过的第一循环神经网络模型作为用于对存储介质的故障进行预测的循环神经网络模型;If the test is passed, the first recurrent neural network model that has passed the test is used as the recurrent neural network model for predicting the failure of the storage medium;
若测试未通过,则再次获取新样本集对第一循环神经网络模型继续训练,并返回基于测试集对第一循环神经网络模型进行测试的步骤;If the test fails, obtain a new sample set again to continue training the first recurrent neural network model, and return to the step of testing the first recurrent neural network model based on the test set;
若训练未达标,则再次获取新样本集对第一循环神经网络模型继续训练,并返回基于验证集对第一循环神经网络模型进行验证的步骤。If the training fails to meet the standard, obtain a new sample set again to continue training the first recurrent neural network model, and return to the step of verifying the first recurrent neural network model based on the verification set.
具体地,本申请提前将基于运行状态数据构成的样本集划分为训练集、验证集及测试集;其中,训练集用于训练循环神经网络模型;验证集用于验证已训练过的循环神经网络模型;测试集用于测试训练通过的循环神经网络模型,使得循环神经网络模型的预测准确性较高。Specifically, the present application divides the sample set based on the operating state data into training set, verification set and test set in advance; wherein, the training set is used to train the recurrent neural network model; the verification set is used to verify the trained recurrent neural network Model; the test set is used to test the trained recurrent neural network model, so that the prediction accuracy of the recurrent neural network model is high.
基于此,循环神经网络模型的整个训练过程包括:1)基于训练集对预先建立好的循环神经网络模型进行训练,得到训练完成的循环神经网络模型(称为第一循环神经网络模型)。2)基于验证集对第一循环神经网络模型进行验证,并根据验证结果判断第一循环神经网络模型的训练是否达标(若第一循环神经网络模型基于验证集可准确预测验证集表示的存储介质的故障信息,则第一循环神经网络模型的训练达标,否则不达标);若训练达标,则执行后续基于测试集对第一循环神经网络模型进行测试的步骤;若训练未达标,则不执行后续基于测试集对第一循环神经网络模型进行测试的步骤,而是重新获取新样本集,并基于新样本集对第一循环神经网络模型继续训练,并返回基于验证集对第一循环神经网络模型进行验证的步骤,直至验证结果为第一循环神经网络模型的训练达标,才执行后续基于测试集对第一循环神经网络模型进行测试的步骤。3)基于测试集对第一循环神经网络模型进行测试,并根据测试结果判断第一循环神经网络模型的测试是否通过(若第一循环神经网络模型基于测试集可准确预测测试集表示的存储介质的故障信息,则第一循环神经网络模型的测试通过,否则不通过),若测试通过,则将测试通过的第一循环神经网络模型作为用于对存储介质的故障进行预测的循环神经网络模型,即可投入使用;若测试未通过,则重新获取新样本集,并基于新样本集对第一循环神经网络模型继续 训练,并返回基于测试集对第一循环神经网络模型进行测试的步骤,直至测试结果为第一循环神经网络模型的测试通过,才将其投入使用。Based on this, the entire training process of the RNN model includes: 1) training the pre-established RNN model based on the training set to obtain the trained RNN model (called the first RNN model). 2) Verify the first RNN model based on the verification set, and judge whether the training of the first RNN model meets the standard according to the verification result (if the first RNN model can accurately predict the storage medium represented by the verification set based on the verification set; If the training meets the standard, then execute the subsequent steps of testing the first recurrent neural network model based on the test set; if the training fails to meet the standard, do not execute The subsequent step of testing the first recurrent neural network model based on the test set, but to re-acquire a new sample set, and continue to train the first recurrent neural network model based on the new sample set, and return to the first recurrent neural network based on the validation set. In the step of verifying the model, the subsequent step of testing the first recurrent neural network model based on the test set is not performed until the verification result is that the training of the first recurrent neural network model meets the standard. 3) Test the first recurrent neural network model based on the test set, and judge whether the test of the first recurrent neural network model passes according to the test result (if the first recurrent neural network model can accurately predict the storage medium represented by the test set based on the test set; If the test is passed, the first RNN model that has passed the test is used as the RNN model for predicting the failure of the storage medium. , it can be put into use; if the test fails, a new sample set will be re-acquired, and the first cyclic neural network model will continue to be trained based on the new sample set, and return to the steps of testing the first cyclic neural network model based on the test set, It is not put into use until the test result is that the test of the first recurrent neural network model passes.
作为一种可选的实施例,存储介质的第一运行状态数据及第二运行状态数据均具体为存储介质的SMART数据。As an optional embodiment, both the first operating state data and the second operating state data of the storage medium are specifically SMART data of the storage medium.
具体地,本申请的存储介质的第一运行状态数据及第二运行状态数据可直接采用存储介质的SMART(Self-Monitoring Analysis and Reporting Technology,自我监测、分析及报告技术)数据,SMART数据是与存储介质的健康状况密切相关的一些数据,如寻道错误率、盘片启动时间、重新映射扇区计数、加电时间、磁头写入高度、温度等。Specifically, the first operating state data and the second operating state data of the storage medium of the present application can directly adopt the SMART (Self-Monitoring Analysis and Reporting Technology, self-monitoring, analysis and reporting technology) data of the storage medium, and the SMART data is related to Some data closely related to the health of the storage medium, such as seek error rate, disk startup time, remap sector count, power-on time, head write height, temperature, etc.
作为一种可选的实施例,循环神经网络模型具体为BERT或Transformer。As an optional embodiment, the recurrent neural network model is specifically BERT or Transformer.
具体地,本申请的循环神经网络模型可采用高精度的BERT(Bidirectional Encoder Representation from Transformers,双向编码器)或Transformer(循环神经网络的一种),也可采用LSTM(Long Short-Term Memory,长短期记忆网络),本申请在此不做特别的限定。Specifically, the cyclic neural network model of the present application can adopt high-precision BERT (Bidirectional Encoder Representation from Transformers, bidirectional encoder) or Transformer (a kind of cyclic neural network), or LSTM (Long Short-Term Memory, long short-term memory network), which is not specifically limited in this application.
作为一种可选的实施例,存储故障预测方法还包括:As an optional embodiment, the storage failure prediction method further includes:
将存储介质的故障预测结果记录在系统日志中,并在存储系统的管理界面上显示故障预测结果。Record the fault prediction result of the storage medium in the system log, and display the fault prediction result on the management interface of the storage system.
进一步地,本申请可将存储系统的存储故障预测结果记录在系统日志中,作为后续分析系统存储故障的依据;同时,本申请还可将存储故障预测结果显示在存储系统的管理界面上,供管理人员及时查看。Further, the application can record the storage failure prediction result of the storage system in the system log, as a basis for subsequent analysis of the storage failure of the system; at the same time, the application can also display the storage failure prediction result on the management interface of the storage system for use. Managers check in time.
本申请还提供了一种存储系统的存储故障预测系统,包括:The present application also provides a storage fault prediction system for a storage system, including:
数据获取模块,用于预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内的运行的第二运行状态数据;a data acquisition module, used for pre-acquiring first operating state data of the storage medium of the storage system operating normally within a preset first time and second operating state data of operating within a preset second time before the failure occurs;
数据提取模块,用于将第一运行状态数据和第二运行状态数据进行预处理,以得到与存储介质的运行变化情况的相关性高于一定值的运行状态数据;a data extraction module, configured to preprocess the first operating state data and the second operating state data to obtain operating state data whose correlation with the operating change of the storage medium is higher than a certain value;
模型训练模块,用于基于运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对存储介质的故障进行预测的循环神经网络模型;The model training module is used to train the pre-established recurrent neural network model based on the running state data, so as to obtain the recurrent neural network model for predicting the failure of the storage medium;
故障预测模块,用于在存储系统运行过程中,基于循环神经网络模型对存储介质的当前运行状态数据进行分析处理,得到存储介质的故障预测结果。The fault prediction module is used for analyzing and processing the current running state data of the storage medium based on the cyclic neural network model during the operation of the storage system, so as to obtain the fault prediction result of the storage medium.
本申请提供的存储故障预测系统的介绍请参考上述存储故障预测方法的实施例,本申请在此不再赘述。For the introduction of the storage fault prediction system provided by the present application, please refer to the above-mentioned embodiments of the storage fault prediction method, which will not be repeated in this application.
本申请还提供了一种存储系统的存储故障预测装置,包括:The present application also provides a storage failure prediction device for a storage system, including:
存储器,用于存储计算机程序;memory for storing computer programs;
处理器,用于在执行计算机程序时实现上述任一种存储系统的存储故障预测方法的步骤。The processor is configured to implement the steps of any one of the above storage system storage failure prediction methods when executing the computer program.
本申请提供的存储故障预测装置的介绍请参考上述存储故障预测方法的实施例,本申请在此不再赘述。For the introduction of the storage fault prediction device provided by the present application, please refer to the above-mentioned embodiments of the storage fault prediction method, which will not be repeated in this application.
还需要说明的是,在本说明书中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should also be noted that, in this specification, relational terms such as first and second, etc. are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply these entities or operations. There is no such actual relationship or sequence between operations. Moreover, the terms "comprising", "comprising" or any other variation thereof are intended to encompass a non-exclusive inclusion such that a process, method, article or device that includes a list of elements includes not only those elements, but also includes not explicitly listed or other elements inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element.
对所公开的实施例的上述说明,使本领域专业技术人员能够实现或使用本发明。对这些实施例的多种修改对本领域的专业技术人员来说将是显而易见的,本文中所定义的一般原理可以在不脱离本发明的精神或范围的情况下,在其他实施例中实现。因此,本发明将不会被限制于本文所示的 这些实施例,而是要符合与本文所公开的原理和新颖特点相一致的最宽的范围。The above description of the disclosed embodiments enables any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be implemented in other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

  1. 一种存储系统的存储故障预测方法,其特征在于,包括:A storage failure prediction method for a storage system, comprising:
    预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内运行的第二运行状态数据;Acquiring in advance first operating state data of the storage medium of the storage system running normally within a preset first time and second operating state data of running within a preset second time before the failure occurs;
    将所述第一运行状态数据和所述第二运行状态数据进行预处理,以得到与所述存储介质的运行变化情况的相关性高于一定值的运行状态数据;Preprocessing the first operating state data and the second operating state data to obtain operating state data whose correlation with the operating change of the storage medium is higher than a certain value;
    基于所述运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型;training a pre-established recurrent neural network model based on the operating state data to obtain a recurrent neural network model for predicting the failure of the storage medium;
    在所述存储系统运行过程中,基于所述循环神经网络模型对所述存储介质的当前运行状态数据进行分析处理,得到所述存储介质的故障预测结果。During the operation of the storage system, the current operating state data of the storage medium is analyzed and processed based on the cyclic neural network model to obtain a fault prediction result of the storage medium.
  2. 如权利要求1所述的存储系统的存储故障预测方法,其特征在于,预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内的运行的第二运行状态数据的过程,包括:The method for predicting a storage failure of a storage system according to claim 1, wherein the first operating state data of the storage medium of the storage system running normally within a preset first time and the preset first data before the failure occur are obtained in advance. The process of running the second running state data within the second time, including:
    预先获取存储系统的存储介质在预设第一时间内均正常运行的多个第一运行状态数据,并将多个所述第一运行状态数据作为负样本;Acquiring in advance a plurality of first operating state data in which the storage medium of the storage system operates normally within a preset first time, and using the plurality of first operating state data as negative samples;
    获取所述存储介质在故障发生前预设第二时间内运行的多个第二运行状态数据,并将多个所述第二运行状态数据作为正样本;Acquiring multiple pieces of second operating state data that are run by the storage medium within a preset second time before the failure occurs, and using the multiple pieces of second operating state data as positive samples;
    其中,所述正样本和所述负样本的比例均衡,二者共同组成用于训练所述循环神经网络模型的样本集。The proportions of the positive samples and the negative samples are balanced, and the two together form a sample set for training the recurrent neural network model.
  3. 如权利要求2所述的存储系统的存储故障预测方法,其特征在于,将所述第一运行状态数据和所述第二运行状态数据进行预处理,以得到与所述存储介质的运行变化情况的相关性高于一定值的运行状态数据;基于所述运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型的过程,包括:The method for predicting a storage failure of a storage system according to claim 2, wherein the first operating state data and the second operating state data are preprocessed to obtain an operating change condition related to the storage medium. The process of training the pre-established recurrent neural network model based on the operating state data to obtain the recurrent neural network model for predicting the failure of the storage medium ,include:
    基于获取的n个样本x i=(x i1,x i2,...,x ip) T,i=1,2,...,n构造样本矩阵;其中,每个样本采集p维向量数据x=(x 1,x 2,...,x p) T,n>p且n、p均为正整数; A sample matrix is constructed based on the acquired n samples x i =(x i1 ,x i2 ,...,x ip ) T , i=1,2,...,n; wherein, each sample collects p-dimensional vector data x=(x 1 , x 2 ,...,x p ) T , n>p and both n and p are positive integers;
    基于标准变换关系式
    Figure PCTCN2021076815-appb-100001
    对所述样本矩阵进行标准变换,得到标准化矩阵Z;其中,
    Figure PCTCN2021076815-appb-100002
    Figure PCTCN2021076815-appb-100003
    based on standard transformation relations
    Figure PCTCN2021076815-appb-100001
    Standard transformation is performed on the sample matrix to obtain a standardized matrix Z; wherein,
    Figure PCTCN2021076815-appb-100002
    Figure PCTCN2021076815-appb-100003
    基于样本相关矩阵求取关系式
    Figure PCTCN2021076815-appb-100004
    得到样本相关矩阵R,并对样本相关矩阵R的特征方程|R-λI p|=0进行求解,得到p个特征根;
    Relational Expression Based on Sample Correlation Matrix
    Figure PCTCN2021076815-appb-100004
    Obtain the sample correlation matrix R, and solve the characteristic equation |R-λI p |=0 of the sample correlation matrix R to obtain p characteristic roots;
    基于
    Figure PCTCN2021076815-appb-100005
    确定m值,并基于Rb=λ jb对每个λ j,j=1,2,...,m进行求解,得到单位矩阵
    Figure PCTCN2021076815-appb-100006
    其中,Q为预设信息最低利用率,p>m且m为正整数;
    based on
    Figure PCTCN2021076815-appb-100005
    Determine the value of m and solve for each λ j ,j=1,2,...,m based on Rb=λ j b to get the identity matrix
    Figure PCTCN2021076815-appb-100006
    Among them, Q is the minimum utilization rate of preset information, p>m and m is a positive integer;
    基于指标转换关系式
    Figure PCTCN2021076815-appb-100007
    得到样本新变量U ij,并基于样本新变量U ij对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型。
    Converting Relational Expressions Based on Indicators
    Figure PCTCN2021076815-appb-100007
    A new sample variable U ij is obtained , and a pre-established recurrent neural network model is trained based on the new sample variable U ij to obtain a recurrent neural network model for predicting the failure of the storage medium.
  4. 如权利要求3所述的存储系统的存储故障预测方法,其特征在于,基于样本新变量U ij对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型的过程,包括: The storage fault prediction method of a storage system according to claim 3, wherein the pre-established recurrent neural network model is trained based on the new sample variable U ij , so as to obtain a method for predicting the fault of the storage medium The process of the recurrent neural network model, including:
    获取各样本新变量U ij的算术平均值μ和标准差σ,并基于标准化关系式g2=(g1-μ)/σ对每个新变量进行标准化处理,得到各标准化变量值;其中,g1为每个新变量标准化处理之前的变量值,g2为每个新变量标准化处理之后的变量值; Obtain the arithmetic mean μ and standard deviation σ of the new variables U ij of each sample, and standardize each new variable based on the standardized relationship g2=(g1-μ)/σ to obtain the value of each standardized variable; among them, g1 is The variable value before each new variable is standardized, and g2 is the variable value after each new variable is standardized;
    基于所述各标准化变量值的绝对值对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型。The pre-established recurrent neural network model is trained based on the absolute value of each standardized variable value, so as to obtain a recurrent neural network model for predicting the failure of the storage medium.
  5. 如权利要求2所述的存储系统的存储故障预测方法,其特征在于,基于所述运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型的过程,包括:The storage fault prediction method of a storage system according to claim 2, wherein a pre-established recurrent neural network model is trained based on the operating state data, so as to obtain a method for predicting the fault of the storage medium. The process of the recurrent neural network model, including:
    将所述运行状态数据组成的样本集划分为训练集、验证集及测试集;dividing the sample set composed of the operating state data into a training set, a verification set and a test set;
    基于所述训练集对预先建立好的循环神经网络模型进行训练,得到第一循环神经网络模型;The pre-established cyclic neural network model is trained based on the training set to obtain a first cyclic neural network model;
    基于所述验证集对所述第一循环神经网络模型进行验证,并根据验证结果判断所述第一循环神经网络模型的训练是否达标;Verifying the first RNN model based on the verification set, and judging whether the training of the first RNN model meets the standard according to the verification result;
    若训练达标,则基于所述测试集对所述第一循环神经网络模型进行测试,并根据测试结果判断所述第一循环神经网络模型的测试是否通过;If the training meets the standard, then test the first recurrent neural network model based on the test set, and determine whether the test of the first recurrent neural network model passes according to the test result;
    若测试通过,则将测试通过的第一循环神经网络模型作为用于对所述存储介质的故障进行预测的循环神经网络模型;If the test is passed, the first recurrent neural network model that has passed the test is used as the recurrent neural network model for predicting the failure of the storage medium;
    若测试未通过,则再次获取新样本集对所述第一循环神经网络模型继续训练,并返回基于所述测试集对所述第一循环神经网络模型进行测试的步骤;If the test fails, obtain a new sample set again to continue training the first recurrent neural network model, and return to the step of testing the first recurrent neural network model based on the test set;
    若训练未达标,则再次获取新样本集对所述第一循环神经网络模型继续训练,并返回基于所述验证集对所述第一循环神经网络模型进行验证的步骤。If the training fails to meet the standard, obtain a new sample set again to continue training the first recurrent neural network model, and return to the step of verifying the first recurrent neural network model based on the verification set.
  6. 如权利要求1所述的存储系统的存储故障预测方法,其特征在于,所述存储介质的第一运行状态数据及第二运行状态数据均具体为所述存储介质的SMART数据。The method for predicting a storage failure of a storage system according to claim 1, wherein the first operation state data and the second operation state data of the storage medium are both specifically SMART data of the storage medium.
  7. 如权利要求1所述的存储系统的存储故障预测方法,其特征在于,所述循环神经网络模型具体为BERT或Transformer。The storage fault prediction method of a storage system according to claim 1, wherein the recurrent neural network model is specifically a BERT or a Transformer.
  8. 如权利要求1所述的存储系统的存储故障预测方法,其特征在于,所述存储故障预测方法还包括:The storage failure prediction method of a storage system according to claim 1, wherein the storage failure prediction method further comprises:
    将所述存储介质的故障预测结果记录在系统日志中,并在所述存储系统的管理界面上显示所述故障预测结果。The failure prediction result of the storage medium is recorded in a system log, and the failure prediction result is displayed on the management interface of the storage system.
  9. 一种存储系统的存储故障预测系统,其特征在于,包括:A storage failure prediction system for a storage system, characterized in that it includes:
    数据获取模块,用于预先获取存储系统的存储介质在预设第一时间内均正常运行的第一运行状态数据及在故障发生前预设第二时间内的运行的第二运行状态数据;a data acquisition module, used for pre-acquiring first operating state data of the storage medium of the storage system running normally within a preset first time and second operating state data of running within a preset second time before the failure occurs;
    数据提取模块,用于将所述第一运行状态数据和所述第二运行状态数据进行预处理,以得到与所述存储介质的运行变化情况的相关性高于一定值的运行状态数据;a data extraction module, configured to preprocess the first operating state data and the second operating state data to obtain operating state data whose correlation with the operating change of the storage medium is higher than a certain value;
    模型训练模块,用于基于所述运行状态数据对预先建立好的循环神经网络模型进行训练,以得到用于对所述存储介质的故障进行预测的循环神经网络模型;a model training module for training a pre-established recurrent neural network model based on the operating state data to obtain a recurrent neural network model for predicting the failure of the storage medium;
    故障预测模块,用于在所述存储系统运行过程中,基于所述循环神经网络模型对所述存储介质的当前运行状态数据进行分析处理,得到所述存储介质的故障预测结果。The fault prediction module is configured to analyze and process the current operating state data of the storage medium based on the cyclic neural network model during the operation of the storage system to obtain a fault prediction result of the storage medium.
  10. 一种存储系统的存储故障预测装置,其特征在于,包括:A storage failure prediction device for a storage system, characterized in that it includes:
    存储器,用于存储计算机程序;memory for storing computer programs;
    处理器,用于在执行所述计算机程序时实现如权利要求1-8任一种所述的存储系统的存储故障预测方法的步骤。The processor is configured to implement the steps of the storage failure prediction method of the storage system according to any one of claims 1-8 when executing the computer program.
PCT/CN2021/076815 2020-06-30 2021-02-19 Method, system and device for predicting storage failure in storage system WO2022001125A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010616525.3A CN111858265A (en) 2020-06-30 2020-06-30 Storage fault prediction method, system and device of storage system
CN202010616525.3 2020-06-30

Publications (1)

Publication Number Publication Date
WO2022001125A1 true WO2022001125A1 (en) 2022-01-06

Family

ID=72989652

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/076815 WO2022001125A1 (en) 2020-06-30 2021-02-19 Method, system and device for predicting storage failure in storage system

Country Status (2)

Country Link
CN (1) CN111858265A (en)
WO (1) WO2022001125A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111858265A (en) * 2020-06-30 2020-10-30 苏州浪潮智能科技有限公司 Storage fault prediction method, system and device of storage system
CN112737834A (en) * 2020-12-25 2021-04-30 北京浪潮数据技术有限公司 Cloud hard disk fault prediction method, device, equipment and storage medium
CN112822099A (en) * 2020-12-29 2021-05-18 北京浪潮数据技术有限公司 Method, device and medium for switching network card working modes
CN115758225B (en) * 2023-01-06 2023-08-29 中建科技集团有限公司 Fault prediction method and device based on multi-mode data fusion and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110106734A1 (en) * 2009-04-24 2011-05-05 Terrance Boult System and appartus for failure prediction and fusion in classification and recognition
CN108647136A (en) * 2018-05-10 2018-10-12 南京道熵信息技术有限公司 Hard disk corruptions prediction technique and device based on SMART information and deep learning
CN109634790A (en) * 2018-11-22 2019-04-16 华中科技大学 A kind of disk failure prediction technique based on Recognition with Recurrent Neural Network
CN109919335A (en) * 2019-03-11 2019-06-21 西安电子科技大学 Disk failure forecasting system based on deep learning
CN110471820A (en) * 2019-08-05 2019-11-19 南开大学 A kind of cloud storage system disk failure prediction technique based on Recognition with Recurrent Neural Network
CN111858265A (en) * 2020-06-30 2020-10-30 苏州浪潮智能科技有限公司 Storage fault prediction method, system and device of storage system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110106734A1 (en) * 2009-04-24 2011-05-05 Terrance Boult System and appartus for failure prediction and fusion in classification and recognition
CN108647136A (en) * 2018-05-10 2018-10-12 南京道熵信息技术有限公司 Hard disk corruptions prediction technique and device based on SMART information and deep learning
CN109634790A (en) * 2018-11-22 2019-04-16 华中科技大学 A kind of disk failure prediction technique based on Recognition with Recurrent Neural Network
CN109919335A (en) * 2019-03-11 2019-06-21 西安电子科技大学 Disk failure forecasting system based on deep learning
CN110471820A (en) * 2019-08-05 2019-11-19 南开大学 A kind of cloud storage system disk failure prediction technique based on Recognition with Recurrent Neural Network
CN111858265A (en) * 2020-06-30 2020-10-30 苏州浪潮智能科技有限公司 Storage fault prediction method, system and device of storage system

Also Published As

Publication number Publication date
CN111858265A (en) 2020-10-30

Similar Documents

Publication Publication Date Title
WO2022001125A1 (en) Method, system and device for predicting storage failure in storage system
Bodik et al. Fingerprinting the datacenter: automated classification of performance crises
Lin et al. Predicting node failure in cloud service systems
CN108052528B (en) A kind of storage equipment timing classification method for early warning
Oliner et al. Alert detection in system logs
KR102522005B1 (en) Apparatus for VNF Anomaly Detection based on Machine Learning for Virtual Network Management and a method thereof
US10373065B2 (en) Generating database cluster health alerts using machine learning
US11558272B2 (en) Methods and systems for predicting time of server failure using server logs and time-series data
CN110164501B (en) Hard disk detection method, device, storage medium and equipment
CN111177714B (en) Abnormal behavior detection method and device, computer equipment and storage medium
WO2022089202A1 (en) Fault identification model training method, fault identification method, apparatus and electronic device
US20120158364A1 (en) Method and system for it resources performance analysis
KR101948634B1 (en) Failure prediction method of system resource for smart computing
Lim et al. Identifying recurrent and unknown performance issues
CN110581834A (en) communication capability opening abnormity detection method and device
CN112596964A (en) Disk failure prediction method and device
CN112951311A (en) Hard disk fault prediction method and system based on variable weight random forest
CN116821646A (en) Data processing chain construction method, data reduction method, device, equipment and medium
CN113515434A (en) Abnormity classification method, abnormity classification device, abnormity classification equipment and storage medium
CN116701033A (en) Host switching abnormality detection method, device, computer equipment and storage medium
CN114325405A (en) Battery pack consistency analysis method, modeling method, device, equipment and medium
CN111858108A (en) Hard disk fault prediction method and device, electronic equipment and storage medium
CN117170915A (en) Data center equipment fault prediction method and device and computer equipment
CN111400122A (en) Hard disk health degree assessment method and device
Wang et al. SaaS software performance issue identification using HMRF‐MAP framework

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21833792

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21833792

Country of ref document: EP

Kind code of ref document: A1