CN116501523A - Abnormality processing method and device for equipment, storage medium and electronic device - Google Patents

Abnormality processing method and device for equipment, storage medium and electronic device Download PDF

Info

Publication number
CN116501523A
CN116501523A CN202310301248.0A CN202310301248A CN116501523A CN 116501523 A CN116501523 A CN 116501523A CN 202310301248 A CN202310301248 A CN 202310301248A CN 116501523 A CN116501523 A CN 116501523A
Authority
CN
China
Prior art keywords
target
hard disk
mechanical hard
disk device
temperature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310301248.0A
Other languages
Chinese (zh)
Inventor
刘严雪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202310301248.0A priority Critical patent/CN116501523A/en
Publication of CN116501523A publication Critical patent/CN116501523A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • G06F11/0754Error or fault detection not based on redundancy by exceeding limits
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3058Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Debugging And Monitoring (AREA)
  • Moving Of Head For Track Selection And Changing (AREA)

Abstract

The embodiment of the application provides an exception handling method and device of equipment, a storage medium and an electronic device, wherein the method comprises the following steps: acquiring temperature and drive letter information of each of N mechanical hard disk devices, wherein N is a natural number greater than or equal to 1; under the condition that the temperature of the target mechanical hard disk device is greater than a target temperature threshold value, searching a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device, and searching a target slot position of the target mechanical hard disk device based on the target serial number, wherein the target mechanical hard disk device is any one of N mechanical hard disk devices; and executing exception processing on the target slot. Through the method and the device, the problem that the detection efficiency of the mechanical hard disk device is low in the related technology is solved, and the effect of improving the efficiency of detecting the mechanical hard disk device is achieved.

Description

Abnormality processing method and device for equipment, storage medium and electronic device
Technical Field
The embodiment of the application relates to the field of computers, in particular to an abnormality processing method and device for equipment, a storage medium and an electronic device.
Background
The computer memory is widely used in server industry to store data, for example, a Hard Disk Drive (HDD), with the progress of server technology and the increase of data storage requirements of users, one server is often configured with a plurality of HDDs, during a heat dissipation test, the temperature of the plurality of HDDs needs to be monitored, and a baseboard management controller (Baseboard Management Controller, abbreviated as BMC) provides the temperature of each HDD, but the Drive symbol is random, and a tester cannot correspond a specific Drive symbol to a specific slot position of the HDD one by one, so that the position of the HDD with poor heat dissipation cannot be determined. If the temperature of a certain HDD is required to be further determined, stopping the current test, independently pressurizing each HDD again, and observing the state of the HDD for recording; the pressure test process of the HDD comprises working conditions such as different numbers of hard disks, different environment temperatures, different fan speeds, different fan failures and the like, the situation is complex, and if all the test working conditions adopt a mode of independently pressurizing and recording each HDD to identify the position of the HDD, a great deal of time is consumed, and the test efficiency is seriously affected.
Disclosure of Invention
The embodiment of the application provides an equipment abnormality processing method and device, a storage medium and an electronic device, which are used for at least solving the problem of low detection efficiency of mechanical hard disk equipment in the related technology.
According to one embodiment of the present application, there is provided an exception handling method for a device, including: acquiring temperature and drive letter information of each of N mechanical hard disk devices, wherein N is a natural number greater than or equal to 1; searching a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device and searching a target slot position of the target mechanical hard disk device based on the target serial number under the condition that the temperature of the target mechanical hard disk device is greater than a target temperature threshold, wherein the target mechanical hard disk device is any one of N mechanical hard disk devices; and executing exception processing on the target slot position.
According to another embodiment of the present application, there is provided an abnormality processing apparatus of a device, including: a first obtaining module, configured to obtain temperature and drive letter information of each of N mechanical hard disk devices, where N is a natural number greater than or equal to 1; the first searching module is used for searching a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device and searching a target slot position of the target mechanical hard disk device based on the target serial number when the temperature of the target mechanical hard disk device is greater than a target temperature threshold, wherein the target mechanical hard disk device is any one of N mechanical hard disk devices; and the first processing module is used for executing exception processing on the target slot position.
In an exemplary embodiment, the above apparatus further includes: the first response module is used for responding to a first acquisition instruction before acquiring the temperature and the drive letter information of each of the N mechanical hard disk devices, and acquiring the drive letter information of each mechanical hard disk device; the second response module is used for responding to the second acquisition instruction and acquiring the serial number of each mechanical hard disk device; the third response module is used for responding to a third acquisition instruction and acquiring slot position information of the slot position corresponding to the serial number of each mechanical hard disk device; and the first binding module is used for binding the drive letter information, the serial number and the slot position information of each mechanical hard disk device.
In an exemplary embodiment, the first search module includes: a first searching unit, configured to search a target serial number of the target mechanical hard disk device according to the target identifier information; and a second searching unit, configured to search a slot corresponding to the target sequence number to obtain the target slot, where a binding relationship among the target sequence number, the target drive symbol information, and the target slot is stored in advance.
In an exemplary embodiment, the first processing module includes: and the first control unit is used for controlling the indicator lamp of the target slot position to display according to an abnormal mode, wherein the abnormal mode is used for indicating that the target slot position is in an abnormal state.
In an exemplary embodiment, the above apparatus further includes: a first determining module, configured to determine, as an exception log, a correspondence between slot information of the target slot and a temperature of each of the N mechanical hard disk devices, after acquiring the temperature and the drive letter information of the target mechanical hard disk device when the temperature of the target mechanical hard disk device is greater than a target temperature threshold; and the first storage module is used for storing the exception log.
In an exemplary embodiment, the above apparatus further includes: the first setting module is configured to set a temperature threshold of each of the N mechanical hard disk devices according to a type and a slot where each of the mechanical hard disk devices is located before acquiring temperature and drive letter information of each of the mechanical hard disk devices, where, when a current temperature of the mechanical hard disk device is greater than the temperature threshold, it indicates that an abnormality occurs in the temperature of the mechanical hard disk device.
In an exemplary embodiment, the above apparatus further includes: the second setting module is used for setting the temperature threshold value of each mechanical hard disk device according to the type and the slot position of each mechanical hard disk device and then setting the temperature threshold value of each mechanical hard disk device in the configuration file of each mechanical hard disk device; and the first triggering module is used for triggering the temperature monitoring of each mechanical hard disk device based on the configuration file.
According to a further embodiment of the present application, there is also provided a computer readable storage medium having stored therein a computer program, wherein the computer program is arranged to perform the steps of any of the method embodiments described above when run.
According to a further embodiment of the present application, there is also provided an electronic device comprising a memory having stored therein a computer program and a processor arranged to run the computer program to perform the steps of any of the method embodiments described above.
By the method, the temperature and the drive letter information of each of N mechanical hard disk devices are acquired, wherein N is a natural number greater than or equal to 1; the first searching module is used for searching a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device and searching a target slot position of the target mechanical hard disk device based on the target serial number under the condition that the temperature of the target mechanical hard disk device is greater than a target temperature threshold, wherein the target mechanical hard disk device is any one of N mechanical hard disk devices; and executing exception processing on the target slot. According to the method, after the temperature of each mechanical hard disk device is obtained, the slot position of the mechanical hard disk device can be found according to the identifier information of each mechanical hard disk device, the mechanical hard disk device does not need to be pressurized independently, the test of the mechanical hard disk device does not need to be stopped, a great deal of time is saved, and therefore the efficiency of detecting whether the mechanical hard disk device is abnormal or not can be improved. Therefore, the problem of low detection efficiency of the mechanical hard disk device can be solved, and the effect of improving the efficiency of detecting the mechanical hard disk device is achieved.
Drawings
Fig. 1 is a hardware block diagram of a mobile terminal of an abnormality processing method of a device according to an embodiment of the present application;
FIG. 2 is a flow chart of an exception handling method of a device according to an embodiment of the present application;
FIG. 3 is a particular flow chart according to an embodiment of the present application;
fig. 4 is a block diagram of the configuration of an abnormality processing apparatus of the device according to the embodiment of the present application.
Detailed Description
Embodiments of the present application will be described in detail below with reference to the accompanying drawings in conjunction with the embodiments.
It should be noted that the terms "first," "second," and the like in the description and claims of the present application and the above figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order.
The method embodiments provided in the embodiments of the present application may be performed in a mobile terminal, a computer terminal or similar computing device. Taking the operation on the mobile terminal as an example, fig. 1 is a block diagram of a hardware structure of the mobile terminal of an abnormality processing method of a device according to an embodiment of the present application. As shown in fig. 1, a mobile terminal may include one or more (only one is shown in fig. 1) processors 102 (the processor 102 may include, but is not limited to, a microprocessor MCU or a processing device such as a programmable logic device FPGA) and a memory 104 for storing data, wherein the mobile terminal may also include a transmission device 106 for communication functions and an input-output device 108. It will be appreciated by those skilled in the art that the structure shown in fig. 1 is merely illustrative and not limiting of the structure of the mobile terminal described above. For example, the mobile terminal may also include more or fewer components than shown in fig. 1, or have a different configuration than shown in fig. 1.
The memory 104 may be used to store a computer program, for example, a software program of application software and a module, such as a computer program corresponding to an abnormality processing method of an apparatus in an embodiment of the present application, and the processor 102 executes the computer program stored in the memory 104, thereby performing various functional applications and data processing, that is, implementing the above-described method. Memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include memory remotely located relative to the processor 102, which may be connected to the mobile terminal via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The transmission device 106 is used to receive or transmit data via a network. Specific examples of the network described above may include a wireless network provided by a communication provider of the mobile terminal. In one example, the transmission device 106 includes a network adapter (Network Interface Controller, simply referred to as NIC) that can connect to other network devices through a base station to communicate with the internet. In one example, the transmission device 106 may be a Radio Frequency (RF) module, which is configured to communicate with the internet wirelessly.
In this embodiment, there is provided an exception handling method of a device, and fig. 2 is a flowchart of an exception handling method of a device according to an embodiment of the present application, as shown in fig. 2, where the flowchart includes the following steps:
step S202, acquiring temperature and drive letter information of each of N mechanical hard disk devices, wherein N is a natural number greater than or equal to 1;
step S204, when the temperature of the target mechanical hard disk device is greater than the target temperature threshold value, searching a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device, and searching a target slot position of the target mechanical hard disk device based on the target serial number, wherein the target mechanical hard disk device is any one of N mechanical hard disk devices;
in step S206, exception processing is performed on the target slot.
The above operations may be performed by a controller or a processor (for example, a baseboard management controller BMC), or a device or a system with control capability, or a controller or a processor disposed in the device or the system, or a controller or a processor that exists alone, or other processing devices or processing units with similar processing capability, etc. Note that the main body performing the above-described operation is not limited to the above-described example.
In the above embodiment, the mechanical hard disk device may be an HDD, or may be another computer storage device, such as a random access memory (Random Access Memory, simply referred to as RAM). For example, when the mechanical hard disk device is an HDD, 1 or more HDDs are provided in one server. In the case where a plurality of HDDs are provided in the server, it is necessary to detect the temperatures of the plurality of HDDs during the heat dissipation test. In this embodiment, in order to enable a plurality of HDDs to be in one-to-one correspondence with slot positions, the serial number of each HDD, the corresponding relation between the located drive letter and the slot position are stored in advance. In the process of performing the temperature test, the serial number of the HDD is obtained while the temperature of the HDD is obtained. Therefore, the slot position of the HDD can be quickly found, and the slot position can be quickly positioned. It should be noted that the above description of the mechanical hard disk device is only an exemplary embodiment, and the mechanical hard disk device is not limited to the above description.
Through the steps, the temperature and the drive letter information of each of N mechanical hard disk devices are acquired, wherein N is a natural number greater than or equal to 1; the first searching module is used for searching a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device and searching a target slot position of the target mechanical hard disk device based on the target serial number under the condition that the temperature of the target mechanical hard disk device is greater than a target temperature threshold, wherein the target mechanical hard disk device is any one of N mechanical hard disk devices; and executing exception processing on the target slot. According to the method, after the temperature of each mechanical hard disk device is obtained, the slot position of the mechanical hard disk device can be found according to the identifier information of each mechanical hard disk device, the mechanical hard disk device does not need to be pressurized independently, the test of the mechanical hard disk device does not need to be stopped, a great deal of time is saved, and therefore the efficiency of detecting whether the mechanical hard disk device is abnormal or not can be improved. Therefore, the problem of low detection efficiency of the mechanical hard disk device can be solved, and the effect of improving the efficiency of detecting the mechanical hard disk device is achieved.
In an exemplary embodiment, before acquiring the temperature and the drive letter information of each of the N mechanical hard disk devices, the method further includes: responding to the first acquisition instruction, and acquiring the drive letter information of each mechanical hard disk device; responding to the second acquisition instruction, and acquiring the serial number of each mechanical hard disk device; responding to the third acquisition instruction, and acquiring slot position information of the slot position corresponding to the serial number of each mechanical hard disk device; and binding the drive letter information, the serial number and the slot position information of each mechanical hard disk device. In this embodiment, the first obtaining instruction may be a lsblk command triggered by the terminal device, for example, obtaining the disk identifiers sdX of all the HDDs through the lsblk command; obtaining a Serial Number (SN) code of the HDD through a smartctl-a/dev/sdX |grepservial command; the specific slot of the HDD corresponding to the SN code is obtained through the command of the store li 64/call/ea/sal show all|grep 'SN' -B2|grep 'drive' |awk '{ print $2 }'. By acquiring the drive letter information, the serial number and the slot position information of each mechanical hard disk device, the binding between the drive letter information and the serial number of each mechanical hard disk device and the binding between the slot position and the drive letter of each mechanical hard disk device can be realized. Thereby improving the efficiency of searching the slot position where the mechanical hard disk device is positioned. It should be noted that the foregoing description is only an exemplary embodiment, and the description of the manner in which the drive letter information, the serial number, and the slot information of each mechanical hard disk device are bound is not limited to the foregoing description.
In one exemplary embodiment, in a case where the temperature of the target mechanical hard disk device is greater than the target temperature threshold, searching for a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device, and searching for a target slot in which the target mechanical hard disk device is located based on the target serial number: searching a target serial number of the target mechanical hard disk device according to the target disk character information; and searching a slot corresponding to the target serial number to obtain a target slot, wherein the binding relation among the target serial number, the target drive letter information and the target slot is prestored. In this embodiment, the efficiency of searching the target slot where the target mechanical hard disk device is located may be improved by storing the binding relationship among the target serial number, the target drive and the target slot in advance.
In one exemplary embodiment, performing exception handling on a target slot includes: the indicator lamp for controlling the target slot position is displayed according to an abnormal mode, wherein the abnormal mode is used for indicating that the target slot position is in an abnormal state. In this embodiment, the indicator light is connected to the target slot. The abnormal mode of the indicator lamp can be a mode of displaying red light or a mode of flashing light. Through the display of the indicator lamp, the temperature of the target mechanical hard disk device can be rapidly checked to be too high. It should be noted that the foregoing description is only one exemplary embodiment, and the indication lamp abnormality mode is not limited to the foregoing description.
In an exemplary embodiment, after acquiring the temperature and the drive letter information of each of the N mechanical hard disk devices, the method further includes: determining a correspondence between slot position information of a target slot position and the temperature of the target mechanical hard disk device as an abnormal log under the condition that the temperature of the target mechanical hard disk device is greater than a target temperature threshold value; an exception log is stored. In the present embodiment, the target temperature threshold value may be preset, and may be set to 60 degrees, 70 degrees, or the like. The working temperature of the mechanical hard disk device is between 30 and 50 ℃, the working temperature limit of the mechanical hard disk device cannot exceed 70 ℃, the mechanical hard disk device is damaged to a certain extent when the working temperature exceeds 70 ℃, when the temperature of the mechanical hard disk device reaches about 100 ℃, the mechanical hard disk device can cause data loss, and the mechanical hard disk device is damaged seriously. Under normal conditions. The working temperature of the mechanical hard disk device is 10-20 degrees higher than the indoor temperature, and can reach 60-70 degrees in summer and only 30-40 degrees in winter. For example, in the case where the temperature of the mechanical hard disk device reaches 70 degrees, slot information of 70 degrees and the target slot is correspondingly stored in the database in the form of a log. According to the embodiment, the temperature of each HDD and the specific physical slot position are recorded in the form of a log, so that later-stage testers can conveniently conduct arrangement judgment. It should be noted that the above description is only one exemplary embodiment, and the description of the exception log is not limited to the above description.
In an exemplary embodiment, before acquiring the temperature and the drive letter information of each of the N mechanical hard disk devices, the method further includes: setting a temperature threshold value of each mechanical hard disk device according to the type and the slot position of each mechanical hard disk device, wherein the temperature abnormality of the mechanical hard disk device is indicated when the current temperature of the mechanical hard disk device is greater than the temperature threshold value. In this embodiment, the temperatures at which different types of mechanical hard disk devices normally operate are different. For example, the HDD may operate at a temperature of 30-40 degrees. According to the embodiment, by setting different temperature thresholds for different types of mechanical hard disk equipment, whether the temperature of the mechanical hard disk equipment is too high can be accurately judged.
In one exemplary embodiment, after setting the temperature threshold of each mechanical hard disk device according to the type of each mechanical hard disk device and the slot in which each mechanical hard disk device is located, the method further comprises: setting a temperature threshold of each mechanical hard disk device in a configuration file of each mechanical hard disk device; the temperature monitoring of each mechanical hard disk device is triggered based on the configuration file. In this embodiment, the profile may further include attribute information of the mechanical hard disk device, for example, a model number, a storage capacity, and the like of the mechanical hard disk device. By setting the temperature threshold in the configuration file, the purpose of automatically monitoring the temperature of the mechanical hard disk device can be achieved.
The invention is illustrated below with reference to specific examples:
according to the method for intelligently monitoring the temperature of the HDD in the pressurization test of the server HDD in the heat dissipation field, a shell language is adopted to write a monitoring program, an HDD overtemperature point is configured in a configuration file, the pressurization of the HDD is automatically executed, the intelligent monitoring of the temperature of the HDD in the pressurization test process is performed, when a certain HDD is monitored to reach the overtemperature point, the specific slot position of the HDD can be analyzed, the HDD is lighted, a tester is prompted to pay attention to the HDD, and the overture Wen Rizhi is collected to a designated catalog for analysis by the tester, so that the test and time effectiveness are guaranteed, the manpower investment is reduced, the test duration is shortened, and the work efficiency is improved.
As shown in fig. 3, the method specifically comprises the following steps:
s301, writing the information of the HDD over-temperature point (corresponding to the temperature threshold value in the above description) into a configuration file, and starting an intelligent monitoring method of the HDD temperature;
s302, binding the HDD drive symbol and the specific slot information of the corresponding HDD with the equipment SN code through HDD slot matching;
the method mainly comprises the following steps of:
s11, obtaining the disk drives sdX (corresponding to the disk drive information in the above) of all the HDDs through the lsblk command;
s12, obtaining an SN code (corresponding to the Serial number) of the HDD through a smartctl-a/dev/sdX |grep Serial command;
s13, acquiring a specific slot bit (corresponding to the slot bit information in the above) of the HDD corresponding to the SN code in the last step through a store li 64/call/earl/sall show all|grep 'SN' -B2|grep 'Drive' |awk '{ print $2}' command. The binding of the HDD drive letter and the device SN code is realized through S1 and S2. Binding of the specific slot of the HDD and the HDD drive letter is achieved through S3.
In the present embodiment, the above-described command is flexibly settable, and is not limited to the command in the above example.
S303, calling related commands in the HDD driver, acquiring the temperature and SN code of each HDD, and analyzing the temperature of the HDD; wherein, analyzing the HDD temperature includes: the temperature of each HDD is obtained by the HDD driver using a smarttl-a/dev/sdX |grep-i temp command.
S304, dynamically monitoring the temperature of each HDD in real time; dynamically monitoring the temperature of each HDD in real time includes: and circularly calling the HDD temperature analysis function, obtaining the temperature of each HDD one by one, storing the temperature in a list, and simultaneously comparing the temperature of each HDD with the overtemperature point parameter value set in the configuration file to judge whether the temperature of the HDD is overtemperature. In this embodiment, the value of the overtemperature point parameter may be flexibly set, for example, to 70 degrees and 60 degrees. The temperature at which the HDD is operating properly may be determined by the type of HDD and may be related to the ambient temperature, for example, 30-40 degrees during summer.
S305, after the HDD overtemperature is monitored, matching the HDD with the slot position of the HDD is realized through an SN code, and lighting corresponding to the slot position of the HDD is performed (corresponding to the indicator lamp in the process of displaying according to an abnormal mode);
the lighting of the corresponding HDD slot mainly comprises: and comparing the temperature of each HDD with the overtemperature point parameter value set in the configuration file, and when the HDD overtemperature, calling the stop position of the stop 64[ HDD slot ] to light the overtemperature HDD slot. In this embodiment, the lighting of the HDD slot includes lighting an indicator light connected to the HDD slot, or controlling the color of the indicator light to red, or controlling the indicator light to flash. The temperature over-temperature of the HDD can also be prompted in the form of voice.
S306, stopping the pressurization of the HDD and collecting the excess Wen Rizhi (corresponding to the abnormal log in the above description) under the appointed directory, and if the whole process has no excess temperature phenomenon, automatically collecting the test process log under the appointed directory after the pressurization process is finished; after the script execution is finished, the test logs are summarized into a file, and after the whole flow is finished, the test result is fed back to the tester. In this embodiment, the super Wen Rizhi may be stored in a designated directory in the peripheral storage device or may be stored in a memory in the server. The over-temperature log may include other information of the HDD, such as attribute information, temperature information of normal operation, and the like, in addition to the temperature and the serial number.
The present embodiment collects super Wen Rizhi (corresponding to the abnormal log in the above) by a log collection function, and collecting the over-temperature log by the log collection function includes: the log collection function is mainly used for classifying and sorting files of data of the HDD pressurization test, when the HDD is triggered to be overheated, a failure log folder is created, the temperature and specific physical slot records of the overheated HDD are stored, if the overtemperature phenomenon does not occur in the whole pressurization process, a successful log folder is created, and the temperature and specific physical slot records of each HDD are recorded.
From the description of the above embodiments, it will be clear to a person skilled in the art that the method according to the above embodiments may be implemented by means of software plus the necessary general hardware platform, but of course also by means of hardware, but in many cases the former is a preferred embodiment. Based on such understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art in the form of a software product stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk), comprising several instructions for causing a terminal device (which may be a mobile phone, a computer, a server, or a network device, etc.) to perform the method described in the embodiments of the present application.
The embodiment also provides an apparatus for processing an abnormality of a device, which is used to implement the foregoing embodiments and preferred embodiments, and is not described in detail. As used below, the term "module" may be a combination of software and/or hardware that implements a predetermined function. While the means described in the following embodiments are preferably implemented in software, implementation in hardware, or a combination of software and hardware, is also possible and contemplated.
Fig. 4 is a block diagram of an abnormality processing apparatus of a device according to an embodiment of the present application, as shown in fig. 4, the apparatus including:
a first obtaining module 42, configured to obtain temperature and drive letter information of each of N mechanical hard disk devices, where N is a natural number greater than or equal to 1;
a first searching module 44, configured to search, when the temperature of the target mechanical hard disk device is greater than the target temperature threshold, for a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device, and for searching, based on the target serial number, a target slot in which the target mechanical hard disk device is located, where the target mechanical hard disk device is any one of the N mechanical hard disk devices;
the first processing module 46 is configured to perform exception processing on the target slot.
In the above embodiment, the mechanical hard disk device may be an HDD, or may be another computer storage device, such as a random access memory (Random Access Memory, simply referred to as RAM). For example, when the mechanical hard disk device is an HDD, 1 or more HDDs are provided in one server. In the case where a plurality of HDDs are provided in the server, it is necessary to detect the temperatures of the plurality of HDDs during the heat dissipation test. In this embodiment, in order to enable a plurality of HDDs to be in one-to-one correspondence with slot positions, the serial number of each HDD, the corresponding relation between the located drive letter and the slot position are stored in advance. In the process of performing the temperature test, the serial number of the HDD is obtained while the temperature of the HDD is obtained. Therefore, the slot position of the HDD can be quickly found, and the slot position can be quickly positioned. It should be noted that the above description of the mechanical hard disk device is only an exemplary embodiment, and the mechanical hard disk device is not limited to the above description.
By the device, the temperature and the serial number of each of N mechanical hard disk devices are acquired, wherein N is a natural number greater than or equal to 1; under the condition that the temperature of the target mechanical hard disk device is larger than a target temperature threshold value, searching a target slot position of the target mechanical hard disk device based on a target serial number of the target mechanical hard disk device, wherein the target mechanical hard disk device is any one of N mechanical hard disk devices; and executing exception processing on the target slot. According to the method, after the temperature of each mechanical hard disk device is obtained, the slot position of the mechanical hard disk device can be found according to the serial number of each mechanical hard disk device, the mechanical hard disk device does not need to be pressurized independently, the test of the mechanical hard disk device does not need to be stopped, a great deal of time is saved, and therefore the efficiency of detecting whether the mechanical hard disk device is abnormal or not can be improved. Therefore, the problem of lower detection efficiency of the mechanical hard disk device in the related technology can be solved, and the effect of improving the efficiency of detecting the mechanical hard disk device is achieved.
In an exemplary embodiment, the above apparatus further includes: the first response module is used for responding to a first acquisition instruction before acquiring the temperature and the drive letter information of each of the N mechanical hard disk devices, and acquiring the drive letter information of each mechanical hard disk device;
the second response module is used for responding to the second acquisition instruction and acquiring the serial number of each mechanical hard disk device;
the third response module is used for responding to a third acquisition instruction and acquiring slot position information of the slot position corresponding to the serial number of each mechanical hard disk device;
and the first binding module is used for binding the drive letter information, the serial number and the slot position information of each mechanical hard disk device.
In an exemplary embodiment, the first search module includes:
a first searching unit, configured to search a target serial number of the target mechanical hard disk device according to the target identifier information;
and a second searching unit, configured to search a slot corresponding to the target sequence number to obtain the target slot, where a binding relationship among the target sequence number, the target drive symbol information, and the target slot is stored in advance.
In an exemplary embodiment, the first processing module includes:
and the first control unit is used for controlling the indicator lamp of the target slot position to display according to an abnormal mode, wherein the abnormal mode is used for indicating that the target slot position is in an abnormal state.
In an exemplary embodiment, the above apparatus further includes: a first determining module, configured to determine, as an exception log, a correspondence between slot information of the target slot and a temperature of each of the N mechanical hard disk devices, after acquiring the temperature and the drive letter information of the target mechanical hard disk device when the temperature of the target mechanical hard disk device is greater than a target temperature threshold;
and the first storage module is used for storing the exception log.
In an exemplary embodiment, the above apparatus further includes:
the first setting module is configured to set a temperature threshold of each of the N mechanical hard disk devices according to a type and a slot where each of the mechanical hard disk devices is located before acquiring temperature and drive letter information of each of the mechanical hard disk devices, where, when a current temperature of the mechanical hard disk device is greater than the temperature threshold, it indicates that an abnormality occurs in the temperature of the mechanical hard disk device.
In an exemplary embodiment, the above apparatus further includes:
the second setting module is used for setting the temperature threshold value of each mechanical hard disk device according to the type and the slot position of each mechanical hard disk device and then setting the temperature threshold value of each mechanical hard disk device in the configuration file of each mechanical hard disk device;
and the first triggering module is used for triggering the temperature monitoring of each mechanical hard disk device based on the configuration file.
It should be noted that each of the above modules may be implemented by software or hardware, and for the latter, it may be implemented by, but not limited to: the modules are all located in the same processor; alternatively, the above modules may be located in different processors in any combination.
Embodiments of the present application also provide a computer readable storage medium having a computer program stored therein, wherein the computer program is arranged to perform the steps of any of the method embodiments described above when run.
In one exemplary embodiment, the computer readable storage medium may include, but is not limited to: a usb disk, a Read-Only Memory (ROM), a random access Memory (Random Access Memory, RAM), a removable hard disk, a magnetic disk, or an optical disk, or other various media capable of storing a computer program.
Embodiments of the present application also provide an electronic device comprising a memory having stored therein a computer program and a processor arranged to run the computer program to perform the steps of any of the method embodiments described above.
In an exemplary embodiment, the electronic device may further include a transmission device connected to the processor, and an input/output device connected to the processor.
Specific examples in this embodiment may refer to the examples described in the foregoing embodiments and the exemplary implementation, and this embodiment is not described herein.
It will be appreciated by those skilled in the art that the modules or steps of the application described above may be implemented in a general purpose computing device, they may be concentrated on a single computing device, or distributed across a network of computing devices, they may be implemented in program code executable by computing devices, so that they may be stored in a storage device for execution by computing devices, and in some cases, the steps shown or described may be performed in a different order than that shown or described herein, or they may be separately fabricated into individual integrated circuit modules, or multiple modules or steps of them may be fabricated into a single integrated circuit module. Thus, the present application is not limited to any specific combination of hardware and software.
The foregoing description is only of the preferred embodiments of the present application and is not intended to limit the same, but rather, various modifications and variations may be made by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the principles of the present application should be included in the protection scope of the present application.

Claims (10)

1. An exception handling method for a device, comprising:
acquiring temperature and drive letter information of each of N mechanical hard disk devices, wherein N is a natural number greater than or equal to 1;
under the condition that the temperature of the target mechanical hard disk device is greater than a target temperature threshold value, searching a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device, and searching a target slot position where the target mechanical hard disk device is located based on the target serial number, wherein the target mechanical hard disk device is any one of N mechanical hard disk devices;
and executing exception processing on the target slot.
2. The method of claim 1, wherein prior to acquiring the temperature and drive letter information for each of the N mechanical hard disk devices, the method further comprises:
responding to a first acquisition instruction, and acquiring the drive letter information of each mechanical hard disk device;
responding to a second acquisition instruction, and acquiring the serial number of each mechanical hard disk device;
responding to a third acquisition instruction, and acquiring slot position information of a slot position corresponding to the serial number of each mechanical hard disk device;
and binding the drive letter information, the serial number and the slot position information of each mechanical hard disk device.
3. The method of claim 1, wherein, in the case where the temperature of the target mechanical hard disk device is greater than the target temperature threshold, searching for a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device, and searching for a target slot in which the target mechanical hard disk device is located based on the target serial number, comprises:
searching a target serial number of the target mechanical hard disk device according to the target drive letter information;
and searching a slot corresponding to the target serial number to obtain the target slot, wherein the binding relation among the target serial number, the target drive letter information and the target slot is prestored.
4. The method of claim 1, wherein performing exception handling on the target slot comprises:
and controlling the indicator lamp of the target slot position to display according to an abnormal mode, wherein the abnormal mode is used for indicating that the target slot position is in an abnormal state.
5. The method of claim 1, wherein after obtaining the temperature and drive letter information for each of the N mechanical hard disk devices, the method further comprises:
determining a correspondence between slot information of the target slot and the temperature of the target mechanical hard disk device as an anomaly log when the temperature of the target mechanical hard disk device is greater than a target temperature threshold;
and storing the exception log.
6. The method of claim 1, wherein prior to obtaining temperature and drive letter information for each of the N mechanical hard disk devices, the method further comprises:
setting a temperature threshold value of each mechanical hard disk device according to the type and the slot position of each mechanical hard disk device, wherein the temperature abnormality of the mechanical hard disk device is indicated when the current temperature of the mechanical hard disk device is greater than the temperature threshold value.
7. The method of claim 6, wherein after setting the temperature threshold for each of the mechanical hard disk devices according to the type and slot in which each of the mechanical hard disk devices is located, the method further comprises:
setting a temperature threshold of each mechanical hard disk device in a configuration file of each mechanical hard disk device;
triggering temperature monitoring of each mechanical hard disk device based on the configuration file.
8. An abnormality processing apparatus for a device, comprising:
a first obtaining module, configured to obtain temperature and drive letter information of each of N mechanical hard disk devices, where N is a natural number greater than or equal to 1;
the first searching module is used for searching a target serial number of the target mechanical hard disk device based on target identifier information of the target mechanical hard disk device and searching a target slot position of the target mechanical hard disk device based on the target serial number under the condition that the temperature of the target mechanical hard disk device is greater than a target temperature threshold, wherein the target mechanical hard disk device is any one of N mechanical hard disk devices;
and the first processing module is used for executing exception processing on the target slot position.
9. A computer readable storage medium, characterized in that a computer program is stored in the computer readable storage medium, wherein the computer program, when being executed by a processor, implements the steps of the method according to any of the claims 1 to 7.
10. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the method of any one of claims 1 to 7 when the computer program is executed.
CN202310301248.0A 2023-03-24 2023-03-24 Abnormality processing method and device for equipment, storage medium and electronic device Pending CN116501523A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310301248.0A CN116501523A (en) 2023-03-24 2023-03-24 Abnormality processing method and device for equipment, storage medium and electronic device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310301248.0A CN116501523A (en) 2023-03-24 2023-03-24 Abnormality processing method and device for equipment, storage medium and electronic device

Publications (1)

Publication Number Publication Date
CN116501523A true CN116501523A (en) 2023-07-28

Family

ID=87329345

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310301248.0A Pending CN116501523A (en) 2023-03-24 2023-03-24 Abnormality processing method and device for equipment, storage medium and electronic device

Country Status (1)

Country Link
CN (1) CN116501523A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118312109A (en) * 2024-06-07 2024-07-09 深圳市源微创新实业有限公司 Bad block management method, system, medium and product of industrial solid state disk

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118312109A (en) * 2024-06-07 2024-07-09 深圳市源微创新实业有限公司 Bad block management method, system, medium and product of industrial solid state disk
CN118312109B (en) * 2024-06-07 2024-09-03 深圳市源微创新实业有限公司 Bad block management method, system, medium and product of industrial solid state disk

Similar Documents

Publication Publication Date Title
CN111400121B (en) Server hard disk slot positioning and maintaining method
CN105223889A (en) Method for automatically monitoring PMC RAID card log suitable for production line
CN116501523A (en) Abnormality processing method and device for equipment, storage medium and electronic device
CN113708986B (en) Server monitoring apparatus, method and computer-readable storage medium
CN111694752B (en) Application testing method, electronic device and storage medium
CN116302989A (en) Pressure testing method and system, storage medium and computer equipment
CN109918221B (en) Hard disk error reporting analysis method, system, terminal and storage medium
CN113676368B (en) Method and device applied to ATS network performance test
CN113590405A (en) Hard disk error detection method and device, storage medium and electronic device
CN111611138B (en) Log data grabbing method, device, computer equipment and storage medium
CN113778960A (en) Fault determination method and device for Internet of things system and storage medium
CN110733037B (en) Signal processing method and device, storage medium and processor
CN115373916A (en) Abnormality detection method, abnormality detection device, electronic apparatus, and computer-readable storage medium
CN114237957A (en) Method and device for processing abnormal disk partition
CN114064510A (en) Function testing method and device, electronic equipment and storage medium
CN114372003A (en) Test environment monitoring method and device and electronic equipment
CN111124889B (en) ICOS system-based host Numa test method, system and equipment
CN112631868B (en) Performance monitoring method and device of CentOS system
CN113746680B (en) Physical position determining method and device, and device testing method and device
CN114942623B (en) Method and device for testing controller, vehicle and computer readable storage medium
US20240045949A1 (en) Information collection control apparatus, information collection system, information collection control method, and information collection control program
CN116662085A (en) Disk fault simulation test method, test device and electronic equipment
CN107122275A (en) It is a kind of to check the device and method that RMT tests information
CN118152187A (en) File system detection and repair method and device, electronic equipment and storage medium
CN116627730A (en) Smart log acquisition and analysis method, system, device and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination