CN111897696A - Server cluster hard disk state detection method and device, electronic equipment and storage medium - Google Patents

Server cluster hard disk state detection method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN111897696A
CN111897696A CN202010779492.4A CN202010779492A CN111897696A CN 111897696 A CN111897696 A CN 111897696A CN 202010779492 A CN202010779492 A CN 202010779492A CN 111897696 A CN111897696 A CN 111897696A
Authority
CN
China
Prior art keywords
hard disk
server cluster
determining
protection parameter
server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010779492.4A
Other languages
Chinese (zh)
Inventor
牛犇
李靖
姜凯
傅欢
严勇
刘裕勋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN202010779492.4A priority Critical patent/CN111897696A/en
Publication of CN111897696A publication Critical patent/CN111897696A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3034Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a storage system, e.g. DASD based or network based
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2273Test methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245Protecting personal data, e.g. for financial or medical purposes

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Computing Systems (AREA)
  • Bioethics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Hardware Design (AREA)
  • Mathematical Physics (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • Software Systems (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides a method and a device for detecting the state of a server cluster hard disk, electronic equipment and a storage medium, wherein the method comprises the following steps: monitoring the operation data of the server cluster hard disk; acquiring detection parameters matched with the server cluster hard disk by triggering a matched state detection process based on the operation data of the server cluster hard disk; carrying out multi-dimensional processing on detection parameters matched with the server cluster hard disk, and determining detection results of the server cluster hard disk in different dimensions; the detection results of the server cluster hard disks in different dimensions are subjected to fusion processing, and the detection result of the state detection process is determined, so that the fault type of the server cluster hard disks can be automatically detected and pre-judged in real time, the replacement rate of the server cluster hard disks is reduced, the operation cost of a cloud server system is reduced, the maintenance efficiency of the server cluster hard disks is improved, the data safety of cloud server users is ensured, and the use experience of the users is improved.

Description

Server cluster hard disk state detection method and device, electronic equipment and storage medium
Technical Field
The invention relates to a hard disk system fault detection processing technology, in particular to a server cluster hard disk state detection method and device, electronic equipment and a storage medium.
Background
With the continuous development of computer technology, a Cloud Virtual Machine (CVM) can provide secure and reliable resilient computing services, and can also provide different instance types to meet the specific use scenarios of users. The example types comprise different combinations of a CPU, an internal memory, a storage and a network, but when the hard disk of the cloud server has the problems of disconnection, read-only and the like in the operation process, the submachine service deployed on the server by a user is also influenced. In the related technology, the state of the hard disk can be determined based on certain parameters in the SMART information of the hard disk only when the hard disk has problems, and the data security of a user using the cloud server is influenced.
Disclosure of Invention
In view of this, embodiments of the present invention provide a method and an apparatus for detecting a state of a server cluster hard disk, an electronic device, and a storage medium, which can automatically detect and pre-determine a fault type of the server cluster hard disk in real time, reduce a replacement rate of the server cluster hard disk, reduce an operation cost of a cloud server system, improve an efficiency of maintaining the server cluster hard disk, ensure data security of a cloud server user, and improve user experience. The technical scheme of the embodiment of the invention is realized as follows:
the embodiment of the invention provides a method for detecting the state of a server cluster hard disk, which comprises the following steps:
monitoring the operation data of the server cluster hard disk;
acquiring detection parameters matched with the server cluster hard disk by triggering a matched state detection process based on the operation data of the server cluster hard disk;
carrying out multi-dimensional processing on detection parameters matched with the server cluster hard disk, and determining detection results of the server cluster hard disk in different dimensions;
and fusing detection results of the server cluster hard disk in different dimensions, and determining the detection result of the state detection process.
The embodiment of the invention also provides a server cluster hard disk state detection device, which comprises:
the information transmission module is used for monitoring the operation data of the server cluster hard disk;
the information processing module is used for acquiring detection parameters matched with the server cluster hard disk by triggering a matched state detection process based on the operation data of the server cluster hard disk;
the information processing module is used for carrying out multi-dimensional processing on the detection parameters matched with the server cluster hard disk and determining the detection results of the server cluster hard disk in different dimensions;
and the information processing module is used for fusing detection results of the server cluster hard disk in different dimensions and determining the detection result of the state detection process.
An embodiment of the present invention further provides an electronic device, where the electronic device includes:
a memory for storing executable instructions;
and the processor is used for realizing the preorder server hard disk state detection method when the executable instruction stored in the memory is operated.
The embodiment of the invention also provides a computer readable storage medium, which stores executable instructions, and the executable instructions are executed by a processor to realize the preorder server cluster hard disk state detection method.
The embodiment of the invention has the following beneficial effects:
the embodiment of the invention monitors the operation data of the server cluster hard disk; acquiring detection parameters matched with the server cluster hard disk by triggering a matched state detection process based on the operation data of the server cluster hard disk; carrying out multi-dimensional processing on detection parameters matched with the server cluster hard disk, and determining detection results of the server cluster hard disk in different dimensions; and fusing detection results of the server cluster hard disk in different dimensions, and determining the detection result of the state detection process. Therefore, the fault type of the server cluster hard disk can be automatically detected in real time, the replacement rate of the server cluster hard disk is reduced, the operation cost of a cloud server system is reduced, the maintenance efficiency of the server cluster hard disk is improved, the data safety of a cloud server user is guaranteed, and the use experience of the user is improved.
Drawings
Fig. 1 is a schematic view of a usage scenario of a server cluster hard disk state detection method according to an embodiment of the present invention;
fig. 2 is a schematic structural diagram of an electronic device according to an embodiment of the present invention;
fig. 3 is an optional schematic flow chart of the server cluster hard disk state detection method provided in the embodiment of the present invention;
fig. 4 is an optional schematic flow chart of the server cluster hard disk state detection method provided in the embodiment of the present invention;
FIG. 5 is a schematic diagram of an alternative state calculation in an embodiment of the present invention;
FIG. 6 is a schematic diagram of an alternative state calculation in an embodiment of the present invention;
FIG. 7 is a schematic diagram of an alternative state calculation in an embodiment of the present invention;
FIG. 8 is a schematic diagram of an alternative state calculation in an embodiment of the present invention;
FIG. 9 is a schematic diagram of an alternative state calculation in an embodiment of the present invention;
fig. 10 is a schematic front-end display diagram of a server cluster hard disk state detection method provided in the present application;
fig. 11 is a data architecture diagram of a server cluster hard disk state detection method provided in the present application;
fig. 12 is a schematic front-end display diagram of a server cluster hard disk state detection method provided in the present application;
fig. 13 is a schematic view illustrating a processing effect of the method for detecting a hard disk state of a server cluster according to the present application;
fig. 14 is a schematic processing effect diagram of the server cluster hard disk state detection method provided in the present application;
fig. 15 is a schematic processing effect diagram of the server cluster hard disk state detection method provided by the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail with reference to the accompanying drawings, the described embodiments should not be construed as limiting the present invention, and all other embodiments obtained by a person of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
In the following description, reference is made to "some embodiments" which describe a subset of all possible embodiments, but it is understood that "some embodiments" may be the same subset or different subsets of all possible embodiments, and may be combined with each other without conflict.
Before further detailed description of the embodiments of the present invention, terms and expressions mentioned in the embodiments of the present invention are explained, and the terms and expressions mentioned in the embodiments of the present invention are applied to the following explanations.
1) In response to the condition or state on which the performed operation depends, one or more of the performed operations may be in real-time or may have a set delay when the dependent condition or state is satisfied; there is no restriction on the order of execution of the operations performed unless otherwise specified.
2) Terminals, including but not limited to: the system comprises a common terminal and a special terminal, wherein the common terminal is in long connection and/or short connection with a sending channel, and the special terminal is in long connection with the sending channel.
3) The client and the carrier for realizing the specific function in the terminal, for example, a mobile client (APP) is a carrier for realizing the specific function in the mobile terminal, for example, a function of executing report making or a function of displaying a report.
4) Firmware: the code running in the chip is a binary code used for realizing fault detection of the server cluster hard disk.
5) A Mini Program (Program) is a Program developed based on a front-end-oriented Language (e.g., JavaScript) and implementing a service in a hypertext Markup Language (HTML) page, and software downloaded by a client (e.g., a browser or any client embedded in a browser core) via a network (e.g., the internet) and interpreted and executed in a browser environment of the client saves steps installed in the client. For example, applets for implementing various services such as air ticket purchase, report making, data presentation, and the like can be downloaded and run in the social network client.
6) The runtime environment, the engine for interpreting and executing code, for example, for an applet, may be the JavaScript Core of the iOS platform, the X5 JS Core of the android platform.
7) Boot loader code: also known as Bootloader (Bootloader), boot mode, boot load, etc., refers to code that runs at chip boot time, typically to initialize the hardware environment, and to load code that the firmware runs, typically without it needing to be updated.
8) A Component (Component), which is a functional module of a view of an applet, also called the front-end Component, buttons, titles, tables, sidebars, content, and footers in a page, includes modular code to facilitate reuse among different pages of the applet.
9) Server cluster (Server cluster) refers to a collection of servers that together perform the same service, appearing to a client as if there is only one Server. The server cluster can utilize a plurality of computers to perform parallel computation so as to obtain high computation speed, and also can use a plurality of computers to perform backup so as to ensure that any one machine damages the whole system or can normally run. The server cluster hard disk fault processing method provided by the application can be applied to a Cloud server use scene and a distributed server use scene, and can be used for detecting the state of the server hard disk in different use scenes and repairing the fault, and particularly, a Cloud server (CVM Cloud Virtual Machine) is a computing service which is simple, efficient, safe and reliable and has elastically telescopic processing capacity. The management mode is simpler and more efficient than that of the traditional single physical server. A user can quickly create or release any plurality of cloud servers for the service process of the user to use without purchasing hardware in advance, and data of the cloud server user is stored. The data and programs of users in the use environment of the distributed server may not be located on one server, but are dispersed into a plurality of servers, and similarly, the use environment of the distributed server also needs to be configured with a large number of hard disks, and the state detection and fault repair of the hard disks of the servers need to be realized by the server cluster hard disk fault processing method provided by the application.
Fig. 1 is a schematic view of a usage scenario of a server cluster hard disk state detection method provided by an embodiment of the present invention, and referring to fig. 1, with continuous development of computer technology, a Cloud Virtual Machine (CVM) may provide a safe and reliable elastic computing service, and may also provide different instance types to meet a specific usage scenario of a user. The terminal (including the terminal 10-1 and the terminal 10-2) is provided with corresponding clients capable of executing different functions, wherein the clients are the terminals (including the terminal 10-1 and the terminal 10-2) which acquire different information from the corresponding cloud server 200 through the network 300, and different services can be deployed in the cloud server. The terminal is connected to the cloud server 200 through a network 300, and the network 300 may be a wide area network or a local area network, or a combination of the two, and uses a wireless link to implement data transmission. The example types provided by the cloud server are different combinations of a CPU, a memory, a storage and a network, and the service data of the user is stored in a hard disk of the cloud server, but when the hard disk of the cloud server has the problems of disconnection, read-only and the like, the sub-machine service deployed on the server by the user is also influenced. In the embodiment provided by the present invention, the cloud server application running in the cloud server 200 may be written in software code environments of different programming languages, and the code objects may be different types of code entities. For example, in the software code of C language, one code object may be one function. In the software code of JAVA language, a code object may be a class, and the OC language of IOS terminal may be a target code. In the software code of C + + language, a code object may be a class or a function to execute processing instructions from different terminals. In the application, the source of the compiling environment of the name cloud server is not distinguished any more.
As described in detail below with respect to the structure of the server cluster hard disk state detection apparatus according to the embodiment of the present invention, the server cluster hard disk state detection apparatus may be implemented in various forms, such as a dedicated terminal with a processing function of the server cluster hard disk state detection apparatus, or a server with a processing function of the server cluster hard disk state detection apparatus, for example, the cloud server 200 in the foregoing fig. 1. Fig. 2 is a schematic structural diagram of an electronic device according to an embodiment of the present invention, and it can be understood that fig. 2 only shows an exemplary structure of a server cluster hard disk state detection apparatus in the electronic device, and a part of or all of the structure shown in fig. 2 may be implemented according to needs.
The electronic equipment provided by the embodiment of the invention comprises: at least one processor 201, memory 202, user interface 203, and at least one network interface 204. The various components in the server cluster hard disk status detection apparatus are coupled together by a bus system 205. It will be appreciated that the bus system 205 is used to enable communications among the components. The bus system 205 includes a power bus, a control bus, and a status signal bus in addition to a data bus. For clarity of illustration, however, the various buses are labeled as bus system 205 in fig. 2.
The user interface 203 may include, among other things, a display, a keyboard, a mouse, a trackball, a click wheel, a key, a button, a touch pad, or a touch screen.
It will be appreciated that the memory 202 can be either volatile memory or nonvolatile memory, and can include both volatile and nonvolatile memory. The memory 202 in embodiments of the present invention is capable of storing data to support operation of the terminal (e.g., 10-1). Examples of such data include: any computer program, such as an operating system and application programs, for operating on a terminal (e.g., 10-1). The operating system includes various system programs, such as a framework layer, a core library layer, a driver layer, and the like, and is used for implementing various basic services and processing hardware-based tasks. The application program may include various application programs.
In some embodiments, the server cluster hard disk state detection apparatus provided in the embodiments of the present invention may be implemented by a combination of software and hardware, and as an example, the server cluster hard disk state detection apparatus provided in the embodiments of the present invention may be a processor in a form of a hardware decoding processor, which is programmed to execute the server cluster hard disk state detection method provided in the embodiments of the present invention. For example, a processor in the form of a hardware decoding processor may employ one or more Application Specific Integrated Circuits (ASICs), DSPs, Programmable Logic Devices (PLDs), Complex Programmable Logic Devices (CPLDs), Field Programmable Gate Arrays (FPGAs), or other electronic components.
As an example that the server cluster hard disk state detection apparatus provided by the embodiment of the present invention is implemented by combining software and hardware, the server cluster hard disk state detection apparatus provided by the embodiment of the present invention may be directly embodied as a combination of software modules executed by the processor 201, where the software modules may be located in a storage medium, the storage medium is located in the memory 202, the processor 201 reads executable instructions included in the software modules in the memory 202, and the server cluster hard disk state detection method provided by the embodiment of the present invention is completed by combining necessary hardware (for example, including the processor 201 and other components connected to the bus 205).
By way of example, the Processor 201 may be an integrated circuit chip having Signal processing capabilities, such as a general purpose Processor, a Digital Signal Processor (DSP), or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or the like, wherein the general purpose Processor may be a microprocessor or any conventional Processor or the like.
As an example of the hard disk state detection apparatus of the server cluster provided by the embodiment of the present invention implemented by hardware, the apparatus provided by the embodiment of the present invention may be implemented by directly using a processor 201 in the form of a hardware decoding processor, for example, by being executed by one or more Application Specific Integrated Circuits (ASICs), DSPs, Programmable Logic Devices (PLDs), Complex Programmable Logic Devices (CPLDs), Field Programmable Gate Arrays (FPGAs), or other electronic elements, to implement the hard disk state detection method of the server cluster provided by the embodiment of the present invention.
The memory 202 in the embodiment of the present invention is used for storing various types of data to support the operation of the server cluster hard disk state detection apparatus. Examples of such data include: any executable instruction for operating on the server cluster hard disk state detection device, such as an executable instruction, may be included in the executable instruction, and a program for implementing the slave server cluster hard disk state detection method according to the embodiment of the present invention may be included in the executable instruction.
In other embodiments, the server cluster hard disk state detection apparatus provided in the embodiments of the present invention may be implemented in a software manner, fig. 2 shows the server cluster hard disk state detection apparatus stored in the memory 202, which may be software in the form of a program, a plug-in, and the like, and includes a series of modules, as an example of the program stored in the memory 202, and may include the server cluster hard disk state detection apparatus, where the server cluster hard disk state detection apparatus includes the following software module information transmission module 2081 and information processing module 2082. When the software modules in the server cluster hard disk state detection device are read into the RAM by the processor 201 and executed, the server cluster hard disk state detection method provided by the embodiment of the invention is implemented, wherein the functions of each software module in the server cluster hard disk state detection device include:
the information transmission module 2081 is used for monitoring the operation data of the server cluster hard disk;
the information processing module 2082 is configured to obtain a detection parameter matched with the server cluster hard disk by triggering a matched state detection process based on the operation data of the server cluster hard disk;
the information processing module 2082 is configured to perform multidimensional processing on the detection parameters matched with the server cluster hard disk, and determine detection results of the server cluster hard disk in different dimensions;
the information processing module 2082 is configured to perform fusion processing on the detection results of the server cluster hard disk in different dimensions, and determine the detection result of the state detection process.
As described in the foregoing embodiments, the monitoring of the state of the server hard disk in the related art is only calculated by using single-dimensional information, for example, based on some parameters of the hard disk SMART. Setting a threshold through the value of the parameter to quantify the health of the hard disk; or the hard disk is read/written to judge whether the hard disk can be read or written normally. However, in the two modes, the scheme that only the SMART current parameter is used as the basis for judging the state of the hard disk has great limitation, on one hand, the SMART only contains a few shallow parameters and lacks model construction of bottom layer parameters, and the state of the hard disk cannot be accurately evaluated only through the SMART parameter snapshot, so that misjudgment of the state of the hard disk is easily caused; on the other hand, the scheme lacks the transverse comparison of dynamically changed modeling and other hard disks in the same cluster, and the algorithm lacks comparison logic (including the comparison of the hard disk of the target cloud server and other hard disks of the same cloud server, and certainly lacks the change comparison of the hard disks of the same server cluster), so that the evaluation lacks accuracy, and is not favorable for accurately judging the state of the hard disks of the server cluster.
Further, if reading and writing are added to determine the state of the disk, the disk is not applicable in most cloud server service scenarios. On one hand, reading and writing are only carried out in a short area, only the hard disk can be judged to be capable of executing reading and writing commands, the internal problems of the hard disk such as disk scratching and magnetic head degradation cannot be judged, if the whole disk is covered for reading and writing, the consumed time is very high, and the operability is not realized; on the other hand, the data center runs services under a strong load, and read-write operations, especially write operations, are not allowed in the background, so that client data is lost or service performance is seriously affected. The scheme is not applicable to the cloud server service environment.
In order to overcome the above defect, referring to fig. 3, the present application provides a method for detecting a state of a server cluster hard disk, and in order to overcome the above defect, an embodiment of the present invention provides a method for detecting a state of a server cluster hard disk, referring to fig. 3, fig. 3 is an optional schematic flow diagram of the method for detecting a state of a server cluster hard disk provided by the embodiment of the present invention, and it can be understood that the steps shown in fig. 3 may be executed by various electronic devices operating the apparatus for detecting a state of a server cluster hard disk, for example, a mobile phone or a tablet computer with a function of detecting a state of a server cluster hard disk. The dedicated terminal with the server cluster hard disk state detection device may be packaged in the terminal 10-1 shown in fig. 1 to execute the corresponding software module in the server cluster hard disk state detection device in the electronic device shown in the foregoing fig. 2. The following is a description of the steps shown in fig. 3.
Step 301: the server cluster hard disk state detection device monitors the operation data of the server cluster hard disk.
The embodiment of the present invention may be implemented by combining a Cloud technology, where the Cloud technology (Cloud technology) is a hosting technology for unifying series resources such as hardware, software, and a network in a wide area network or a local area network to implement calculation, storage, processing, and sharing of data, and may also be understood as a generic term of a network technology, an information technology, an integration technology, a management platform technology, an application technology, and the like applied based on a Cloud computing business model. Background services of the technical network system require a large amount of computing and storage resources, such as video websites, photo-like websites and more portal websites, so cloud technology needs to be supported by cloud computing.
It should be noted that cloud computing is a computing mode, and distributes computing tasks on a resource pool formed by a large number of computers, so that various application systems can obtain computing power, storage space and information services as required. The network that provides the resources is referred to as the "cloud". Resources in the "cloud" appear to the user as being infinitely expandable and available at any time, available on demand, expandable at any time, and paid for on-demand. As a basic capability provider of cloud computing, a cloud computing resource pool platform, which is called an Infrastructure as a Service (IaaS) for short, is established, and multiple types of virtual resources are deployed in a resource pool and are used by external clients selectively. The cloud computing resource pool mainly comprises: a computing device (which may be a virtualized machine, including an operating system), a storage device, and a network device. When a user uses the cloud server to store data or deploys different application processes, the operation parameters of the server cluster hard disk are monitored, possible server cluster hard disk faults can be found in time, and user data loss caused by the server cluster hard disk faults with failure warning is avoided.
Step 302: the server cluster hard disk state detection device acquires detection parameters matched with the server cluster hard disk by triggering a matched state detection process based on the running data of the server cluster hard disk.
The detection parameter SMART (Self-Monitoring Analysis and Reporting Technology) is an automatic hard disk status detection and early warning system and specification. The running conditions of hardware of the hard disk, such as a magnetic head, a disk, a motor and a circuit, are monitored and recorded through a detection instruction in the hard disk hardware, and are compared with a preset safety value set by a manufacturer, if the monitoring conditions are or exceed the safety range of the preset safety value, a warning can be automatically given to a user through the monitoring hardware or software of a host computer, and slight automatic repair can be carried out, so that the safety of hard disk data is ensured in advance. Binary codes are adopted as basic instructions of smart, and specified writing is carried out in a standard register to form a specific smart information table for normal detection and operation. The smart instruction is divided into a main instruction (Command) and a sub instruction (Subcommands). The primary instruction mainly provides information whether the device supports smart or ignores certain primary instruction features. And the secondary instruction provides detection information supporting smart devices. By acquiring the detection parameters matched with the server cluster hard disks, the running state of the server cluster hard disks can be detected in real time, the server cluster hard disks with higher risks can be found in time, the standby hardware deployment is made in advance, and data loss caused by the downtime of the server cluster hard disks of a user is reduced.
Step 303: the server cluster hard disk state detection device carries out multi-dimensional processing on detection parameters matched with the server cluster hard disks, and determines detection results of the server cluster hard disks in different dimensions.
With continuing reference to fig. 4, fig. 4 is an optional flowchart of the server cluster hard disk state detection method according to the embodiment of the present invention, and it can be understood that the steps shown in fig. 4 may be executed by various electronic devices operating the server cluster hard disk state detection apparatus, for example, a mobile phone or a tablet computer with a server cluster hard disk state detection function. The dedicated terminal with the server cluster hard disk state detection device may be packaged in the terminal 10-1 shown in fig. 1 to execute the corresponding software module in the server cluster hard disk state detection device in the electronic device shown in fig. 2 in the foregoing sequence. The following is a description of the steps shown in fig. 4.
Step 401: and determining a corresponding parameter health detection result based on the hard disk protection parameters corresponding to the server cluster hard disk.
Different from a conventional SMART judgment mode, the server cluster hard disk fault detection method provided by the application can calculate the health degree score of the hard disk in a weighting manner through different algorithms from multiple dimensions, and specifically comprises the following steps: SMART parameter euclidean distance algorithm (health quantification of the weighted SMART parameter); SMART parameter statistics z-score algorithm (on-set of hard disk parameters)Statistical quantification of the distribution within the population); SMART parameter dynamic slope algorithm (dynamic trend quantification of parameter deterioration trend); and a hard disk underlying parameter machine learning failure prediction algorithm (parameter reflecting health degree in a hard disk is developed by cooperating with a hard disk supplier, and big data machine learning is carried out). The health degree score of the hard disk is calculated in a weighted mode according to a formula 1, and in service environments corresponding to different server clusters, operation and maintenance personnel can dynamically adjust different weights in the formula 1 according to service types and environment parameters, wherein in the service environment of the cloud server, selectable values of the weights are as follows: a is0=0.2,a1=0.2,a2=0.3,a3When 0.3, equation 1 is: :
Figure BDA0002619682220000111
step 402: and determining a distribution characteristic detection result of the hard disk protection parameter through the standardized processing of the hard disk protection parameter.
In some embodiments of the present invention, the determining the distribution characteristic detection result of the hard disk protection parameter by normalizing the hard disk protection parameter may be implemented by:
determining a target value parameter matched with the hard disk protection parameter; determining the offset of the hard disk protection parameter relative to the target parameter value through the standardization processing of the hard disk protection parameter; and determining a distribution characteristic detection result of the hard disk protection parameter based on the offset of the hard disk protection parameter relative to the target parameter value. Referring to fig. 5, fig. 5 is a schematic diagram of an optional state calculation according to an embodiment of the present invention, in combination with preamble formula 1, a0,a1,a2,a3A weighted value for four dimensions; meanwhile, basescore is the SMART parameter basic score, and an Euclidean distance algorithm is adopted to calculate a centrifugal value (target parameter value is targetparam) for the emphasized SMART basic parameter (baseparam), wherein the farther the distance from the target value targetparam is, the larger the calculated weighted value is. Wherein the basescore calculates the referenceEquation 2:
Figure BDA0002619682220000121
step 403: and determining a deterioration trend result of the hard disk protection parameter by processing the dynamic slope of the hard disk protection parameter. Referring to fig. 6, fig. 6 is a schematic diagram illustrating an optional state calculation in an embodiment of the present invention; zscore is the distribution statistical score of the SMART parameter, whether the distribution of the disk in the whole cluster is deviated or not is calculated by adopting a statistical z distribution value, and the deviation of the emphasized SMART parameter (param) relative to a target value (u) is reflected by the z distribution value, so that the Z distribution value is quantized into the distribution statistical score. The statistical z-distribution values are shown in fig. 6. Where zscore calculation refers to equation 3:
Figure BDA0002619682220000122
in some embodiments of the present invention, determining the result of the degradation trend of the hard disk protection parameter by performing dynamic slope processing on the hard disk protection parameter may be implemented by:
determining the dynamic slope change of the hard disk protection parameter in a single sampling period by processing the dynamic slope of the hard disk protection parameter; and determining a deterioration trend result of the hard disk protection parameter based on the dynamic slope change of the hard disk protection parameter in a single sampling period and a matched slope threshold. Referring to fig. 7, fig. 7 is a schematic diagram of an optional state calculation in an embodiment of the present invention, where dynamic score is a dynamic trend score of a SMART parameter, a dynamic slope algorithm is used to model a deterioration condition of the SMART parameter of an emphasis, and in a sampling period, t e [ t1, t2], if a dynamic slope k of the parameter is greater than a predetermined threshold, an average increase amplitude of the parameter in the acquisition period is calculated, and a square sum and a square of a plurality of parameters are performed. The dynamic slope algorithm is shown in fig. 7, wherein the dynamic slope calculation refers to equation 4:
Figure BDA0002619682220000123
step 404: and determining a fault probability result corresponding to the hard disk protection parameter by processing the prediction function of the hard disk protection parameter.
In some embodiments of the present invention, the determining the failure probability result corresponding to the hard disk protection parameter by processing the prediction function of the hard disk protection parameter may be implemented by:
determining hard disk attribute parameters of the server cluster hard disk, wherein the hard disk attribute parameters comprise: the hard disk type, the server type, the shelf time, the version number and the partition identification of the hard disk; determining a prediction function corresponding to the hard disk protection parameter based on the hard disk attribute parameter of the server cluster hard disk; and determining a fault probability result corresponding to the hard disk protection parameter through a prediction function corresponding to the hard disk protection parameter based on the operation data of the server cluster hard disk stored in a storage medium. The cloud server system comprises a plurality of server cluster hard disks, wherein the server cluster hard disks are used by a plurality of hard disks, and the server cluster hard disks can come from different hardware manufacturers or hard disks customized by an operator of the cloud server, so that the fault type can be judged more accurately by cross comparison of the hard disk type, the server type, the time to put on shelf, the version number, the partition identification of the hard disks and the fault type characteristics, and false reporting of the fault type caused by inconsistent versions of hard disk equipment are avoided. Specifically, referring to fig. 8 and 9, fig. 8 is a schematic diagram of an optional state calculation in an embodiment of the present invention, and fig. 9 is a schematic diagram of an optional state calculation in an embodiment of the present invention; the predictionscore is the probability score of the hard disk failure, a cloud server operator can develop a customized big data machine learning algorithm through deep cooperation with a hard disk supplier, the probability of the hard disk failure in a future specific time is deduced by modeling internal parameters of the hard disk, and the risk prediction score is obtained by weighting and quantifying the probability value.
Step 304 is continuously executed after determining the detection results of the server cluster hard disks in different dimensions.
Step 304: the server cluster hard disk state detection device performs fusion processing on detection results of the server cluster hard disks in different dimensions, and determines the detection result of the state detection process.
With reference to fig. 10, the following describes a method for detecting the hard disk status of a server cluster by taking financial transaction data stored in a cloud server as an example, wherein a user obtains the stored transaction data of financial resources, such as funds and stocks, from a corresponding cloud server 200 through a network 300 by using terminals (including a terminal 10-1 and a terminal 10-2) shown in fig. 1.
Referring to fig. 10, fig. 10 is a schematic front-end display diagram of the server cluster hard disk state detection method provided in the present application, where a terminal (for example, terminal 10-1 in fig. 1) is provided with a cloud server client or a cloud server operating plug-in capable of displaying corresponding software for performing financial information, and a user may store financial data of financial services such as payment, loan and financing provided by a bank, a security, mutual fund, P2P, and the like in the cloud server through the corresponding client. A management terminal (for example, a terminal 10-2 in fig. 1) of a cloud server detects an operating state of a server cluster hard disk through a front-end display schematic diagram of a server cluster hard disk state detection method shown in fig. 10, and specifically, displays a user interface, where the user interface includes a person name view angle picture for observing an operating environment of the cloud server at a fixed person name view angle, and the user interface includes different cloud server identifiers; the user interface also comprises a detection component and a display component; monitoring the operation parameters of the server cluster hard disk through the detection component; acquiring a state detection result of the server cluster hard disk through the detection component; and presenting the state detection result of the server cluster hard disk in the user interface based on the display component.
Fig. 11 is a data architecture diagram of the server cluster hard disk state detection method provided in the present application, where, taking a service environment in which a server cluster is a cloud server as an example, a data acquisition module: the part logs can be reported to the unified access layer through financial data operated by the data acquisition cloud server and stored in a Kafka message queue for consumption. The real-time calculation scoring module may calculate the real-time hardware score in real-time by consuming the raw hardware log data stored in Kafka. An offline scoring module: and calculating the statistical score of the same type of components and the dynamic change parameters of each component in the period according to the structured data. The API access module may perform de-IP authentication based on SHA 512-bit encryption authentication.
Further, fig. 12 is a schematic front-end display diagram of the server cluster hard disk state detection method provided in the present application, and an interface of a called target server cluster hard disk is presented in the user interface through the display component; and based on the detection component, sending a query instruction through an interface of the target server cluster hard disk to verify the state of at least one target server.
Further, fig. 13 is a schematic view of a processing effect of the method for detecting a hard disk state of a server cluster provided by the present application, and fig. 14 is a schematic view of a processing effect of the method for detecting a hard disk state of a server cluster provided by the present application; fig. 15 is a schematic view illustrating a processing effect of the method for detecting a hard disk state of a server cluster according to the present application; the financial transaction data are stored in the server cluster hard disk, the states of the server cluster hard disk in the three ABC use scenes can be determined in real time, the reserve of server cluster hard disk hardware is reduced, the spare parts of different cloud server clusters and machine rooms are pre-scheduled through the determined states of the server cluster hard disk, and the reserve flexibility of the spare parts is improved.
Furthermore, by the server cluster hard disk state detection method, the health degree of hard disks can be accurately identified, server cluster hard disks with higher risks in a cluster can be found in advance, and the risk of data loss caused by server cluster hard disk faults can be effectively and lowly avoided when the server cluster hard disk state detection method is applied to a million-scale server-scale super-large data center. Meanwhile, the state of a server cluster hard disk is analyzed manually, the running cost of the cloud server is saved, the safety of financial data saved in the cloud server by a user is guaranteed, and the risk of data loss is reduced.
The beneficial technical effects are as follows:
the embodiment of the invention monitors the operation data of the server cluster hard disk; acquiring detection parameters matched with the server cluster hard disk by triggering a matched state detection process based on the operation data of the server cluster hard disk; carrying out multi-dimensional processing on detection parameters matched with the server cluster hard disk, and determining detection results of the server cluster hard disk in different dimensions; and fusing detection results of the server cluster hard disk in different dimensions, and determining the detection result of the state detection process. Therefore, the fault type of the server cluster hard disk can be automatically detected in real time, the replacement rate of the server cluster hard disk is reduced, the operation cost of a cloud server system is reduced, the maintenance efficiency of the server cluster hard disk is improved, the data safety of a cloud server user is guaranteed, and the use experience of the user is improved.
The above description is only exemplary of the present invention and should not be taken as limiting the scope of the present invention, and any modifications, equivalents, improvements, etc. made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (10)

1. A method for detecting the state of a hard disk of a server cluster is characterized by comprising the following steps:
monitoring the operation data of the server cluster hard disk;
acquiring detection parameters matched with the server cluster hard disk by triggering a matched state detection process based on the operation data of the server cluster hard disk;
carrying out multi-dimensional processing on detection parameters matched with the server cluster hard disk, and determining detection results of the server cluster hard disk in different dimensions;
and fusing detection results of the server cluster hard disk in different dimensions, and determining the detection result of the state detection process.
2. The method of claim 1, wherein the performing multidimensional processing on the detection parameters matched with the server cluster hard disk to determine the detection results of the server cluster hard disk in different dimensions comprises:
determining a corresponding parameter health detection result based on the hard disk protection parameters corresponding to the server cluster hard disk;
determining a distribution characteristic detection result of the hard disk protection parameter through the standardized processing of the hard disk protection parameter;
determining a deterioration trend result of the hard disk protection parameter by processing the dynamic slope of the hard disk protection parameter;
and determining a fault probability result corresponding to the hard disk protection parameter by processing the prediction function of the hard disk protection parameter.
3. The method according to claim 2, wherein the determining the distribution characteristic detection result of the hard disk protection parameter by the normalization processing of the hard disk protection parameter comprises:
determining a target value parameter matched with the hard disk protection parameter;
determining the offset of the hard disk protection parameter relative to the target parameter value through the standardization processing of the hard disk protection parameter;
and determining a distribution characteristic detection result of the hard disk protection parameter based on the offset of the hard disk protection parameter relative to the target parameter value.
4. The method of claim 2, wherein determining the result of the hard disk protection parameter degradation trend through dynamic slope processing of the hard disk protection parameter comprises:
determining the dynamic slope change of the hard disk protection parameter in a single sampling period by processing the dynamic slope of the hard disk protection parameter;
and determining a deterioration trend result of the hard disk protection parameter based on the dynamic slope change of the hard disk protection parameter in a single sampling period and a matched slope threshold.
5. The method of claim 2, wherein the determining the failure probability result corresponding to the hard disk protection parameter through the prediction function processing on the hard disk protection parameter comprises:
determining hard disk attribute parameters of the server cluster hard disk, wherein the hard disk attribute parameters comprise: the hard disk type, the server type, the shelf time, the version number and the partition identification of the hard disk;
determining a prediction function corresponding to the hard disk protection parameter based on the hard disk attribute parameter of the server cluster hard disk;
and determining a fault probability result corresponding to the hard disk protection parameter through a prediction function corresponding to the hard disk protection parameter based on the operation data of the server cluster hard disk stored in a storage medium.
6. The method of claim 1, further comprising:
displaying a user interface, wherein the user interface comprises a person name view angle picture for observing the cloud server operating environment by using a fixed person name view angle, and the user interface comprises different cloud server identifications;
the user interface also comprises a detection component and a display component;
monitoring the operation parameters of the server cluster hard disk through the detection component;
acquiring a state detection result of the server cluster hard disk through the detection component;
and presenting the state detection result of the server cluster hard disk in the user interface based on the display component.
7. The method of claim 6, further comprising:
presenting, by the display component, the interface of the called target server cluster hard disk in the user interface;
and based on the detection component, sending a query instruction through an interface of the target server cluster hard disk to verify the state of at least one target server.
8. A server cluster hard disk state detection device is characterized by comprising:
the information transmission module is used for monitoring the operation data of the server cluster hard disk;
the information processing module is used for acquiring detection parameters matched with the server cluster hard disk by triggering a matched state detection process based on the operation data of the server cluster hard disk;
the information processing module is used for carrying out multi-dimensional processing on the detection parameters matched with the server cluster hard disk and determining the detection results of the server cluster hard disk in different dimensions;
and the information processing module is used for fusing detection results of the server cluster hard disk in different dimensions and determining the detection result of the state detection process.
9. An electronic device, characterized in that the electronic device comprises:
a memory for storing executable instructions;
a processor, configured to execute the executable instructions stored in the memory, and implement the method for detecting a hard disk state of a server cluster according to any one of claims 1 to 7.
10. A computer-readable storage medium storing executable instructions, wherein the executable instructions, when executed by a processor, implement the server cluster hard disk state detection method of any one of claims 1 to 7.
CN202010779492.4A 2020-08-05 2020-08-05 Server cluster hard disk state detection method and device, electronic equipment and storage medium Pending CN111897696A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010779492.4A CN111897696A (en) 2020-08-05 2020-08-05 Server cluster hard disk state detection method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010779492.4A CN111897696A (en) 2020-08-05 2020-08-05 Server cluster hard disk state detection method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN111897696A true CN111897696A (en) 2020-11-06

Family

ID=73245805

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010779492.4A Pending CN111897696A (en) 2020-08-05 2020-08-05 Server cluster hard disk state detection method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111897696A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112486755A (en) * 2020-12-11 2021-03-12 北京金山云网络技术有限公司 Server detection method, detection device, electronic equipment and storage medium
CN112860668A (en) * 2021-02-23 2021-05-28 浪潮云信息技术股份公司 Implementation method of Store disabling and enabling functions
CN113590406A (en) * 2021-08-16 2021-11-02 湖南博匠信息科技有限公司 Method and system for detecting solid state disk fault based on electrical variable

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112486755A (en) * 2020-12-11 2021-03-12 北京金山云网络技术有限公司 Server detection method, detection device, electronic equipment and storage medium
CN112860668A (en) * 2021-02-23 2021-05-28 浪潮云信息技术股份公司 Implementation method of Store disabling and enabling functions
CN113590406A (en) * 2021-08-16 2021-11-02 湖南博匠信息科技有限公司 Method and system for detecting solid state disk fault based on electrical variable

Similar Documents

Publication Publication Date Title
CN111897696A (en) Server cluster hard disk state detection method and device, electronic equipment and storage medium
EP3874372B1 (en) Automatically performing and evaluating pilot testing of software
US8516499B2 (en) Assistance in performing action responsive to detected event
US10467081B2 (en) Enabling symptom verification
US20140372347A1 (en) Methods and systems for identifying action for responding to anomaly in cloud computing system
US7685469B2 (en) Method and apparatus of analyzing computer system interruptions
US20190026805A1 (en) Issue resolution utilizing feature mapping
CN111897686A (en) Server cluster hard disk fault processing method and device, electronic equipment and storage medium
US11797416B2 (en) Detecting performance degradation in remotely deployed applications
US8141050B2 (en) Deadlock detection by lock classification
CN104321780A (en) Devices, systems, and methods for monitoring and asserting trust level using persistent trust log
US20150081875A1 (en) Method and system for operating system (os) verification
US20170346842A1 (en) Ordered correction of application based on dependency topology
GB2604007A (en) Software upgrade stability recommendations
EP3011454A1 (en) Generating a fingerprint representing a response of an application to a simulation of a fault of an external service
CN113656314A (en) Pressure test processing method and device
JP6142878B2 (en) Information system performance evaluation apparatus, method and program
WO2016053231A1 (en) Retain data above threshold
KR20200065683A (en) An apparatus for classify log massage to patterns
CN116661758B (en) Method, device, electronic equipment and medium for optimizing log framework configuration
US11929867B1 (en) Degradation engine execution triggering alerts for outages
US20240179044A1 (en) Monitoring service health statuses to raise alerts
Ushakova et al. Approaches to web application performance testing and real-time visualization of results
CN112306885A (en) Script debugging method and device
Jin et al. Impact of Extensions on Browser Performance: An Empirical Study on Google Chrome

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination