CN110989926A - Fault disk slot positioning method and device and electronic equipment - Google Patents
Fault disk slot positioning method and device and electronic equipment Download PDFInfo
- Publication number
- CN110989926A CN110989926A CN201911092449.4A CN201911092449A CN110989926A CN 110989926 A CN110989926 A CN 110989926A CN 201911092449 A CN201911092449 A CN 201911092449A CN 110989926 A CN110989926 A CN 110989926A
- Authority
- CN
- China
- Prior art keywords
- disk
- fault
- information
- disks
- slot
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 53
- 238000012545 processing Methods 0.000 claims abstract description 23
- 238000004458 analytical method Methods 0.000 claims description 8
- 238000007405 data analysis Methods 0.000 claims description 3
- 239000000126 substance Substances 0.000 claims description 3
- 238000004590 computer program Methods 0.000 description 13
- 238000010586 diagram Methods 0.000 description 11
- 230000008569 process Effects 0.000 description 7
- 230000006870 function Effects 0.000 description 6
- 238000011156 evaluation Methods 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 239000000835 fiber Substances 0.000 description 3
- 238000013461 design Methods 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 206010033799 Paralysis Diseases 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/0671—In-line storage system
- G06F3/0673—Single storage device
- G06F3/0674—Disk device
- G06F3/0676—Magnetic disk device
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0602—Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
- G06F3/0604—Improving or facilitating administration, e.g. storage management
- G06F3/0607—Improving or facilitating administration, e.g. storage management by facilitating the process of upgrading existing storage systems, e.g. for improving compatibility between host and storage device
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0602—Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
- G06F3/0614—Improving the reliability of storage systems
- G06F3/0617—Improving the reliability of storage systems in relation to availability
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0653—Monitoring storage devices or systems
Abstract
The embodiment of the disclosure provides a method and a device for positioning a slot position of a fault disk and electronic equipment, belonging to the technical field of data processing, wherein the method comprises the following steps: acquiring equipment information including disk symbols of the disks according to different types of the disks of the data center, wherein the equipment information of the disks of different types is different; establishing a corresponding relation between the equipment information and the slot position identification information based on the acquired slot position identification information of all the disks; performing data processing on the read disk log through a preset fault model so as to judge the fault state of the disk; and when the fault disk exists through the fault model, positioning the fault disk based on the established corresponding relation between the equipment information and the slot position identification information. Through the scheme disclosed by the invention, the fault disk slot position can be quickly and accurately positioned.
Description
Technical Field
The present disclosure relates to the field of data processing technologies, and in particular, to a method and an apparatus for locating a slot position of a failed disk, and an electronic device.
Background
In the process of daily operation of the hard disk, data loss of the hard disk due to various failures may occur. For example, a hard disk may be damaged by virus infection, mis-formatting or mis-partitioning, mis-cloning, mis-deleting or overwriting, hacking software man-made damage, zero track damage, a hard disk logic lock, power failure during operation, data loss or damage caused by unexpected electromagnetic interference, file loss or damage caused by system errors or paralysis, and the like, which easily causes data loss of the hard disk.
As an application scenario, a mass of storage disks are arranged in a data center, a disk signature of a disk is reported by a disk failure of the data center, the disk signature does not have a fixed corresponding relationship with a disk slot position (the disk signature can drift), and great efficiency influence is generated on data center repair reporting.
Disclosure of Invention
In view of this, embodiments of the present disclosure provide a method and an apparatus for locating a failed disk slot, and an electronic device, which at least partially solve the problems in the prior art.
In a first aspect, an embodiment of the present disclosure provides a method for locating a slot of a failed disk, including:
acquiring equipment information including disk symbols of the disks according to different types of the disks of the data center, wherein the equipment information of the disks of different types is different;
establishing a corresponding relation between the equipment information and the slot position identification information based on the acquired slot position identification information of all the disks;
performing data processing on the read disk log through a preset fault model so as to judge the fault state of the disk;
and when the fault disk exists through the fault model, positioning the fault disk based on the established corresponding relation between the equipment information and the slot position identification information.
According to a specific implementation manner of the embodiment of the present disclosure, the obtaining, according to different types of disks in a data center, device information including a disk drive identifier includes:
and acquiring host paths and associated drive symbols of all the sas type disks.
According to a specific implementation manner of the embodiment of the present disclosure, the obtaining, according to different types of disks in a data center, device information including a disk drive identifier includes:
and acquiring the equipment information and the associated drive identifier of all non-sas scsi type disks.
According to a specific implementation manner of the embodiment of the present disclosure, the obtaining, according to different types of disks in a data center, device information including a disk drive identifier includes:
and acquiring the disk device information, the associated drive symbols and the pci paths of all the nvme type disk devices.
According to a specific implementation manner of the embodiment of the present disclosure, the establishing a corresponding relationship between the device information and the slot identification information based on the obtained slot identification information of all the disks includes:
searching a disk drive symbol corresponding to the slot position identification information;
and establishing a corresponding relation between the slot position identification information and the disk drive letter corresponding to the slot position identification information.
According to a specific implementation manner of the embodiment of the present disclosure, before performing data processing on the read disk log through a preset fault model, the method further includes:
acquiring log information on a magnetic disk within a preset time period;
and merging the log information and the disk equipment information on the disk to form a disk log.
According to a specific implementation manner of the embodiment of the present disclosure, the performing data processing on the read disk log through a preset fault model includes:
performing data analysis on the disk log to obtain analysis parameters related to the disk log;
comparing the model parameters in the fault model with the analysis parameters to obtain a comparison result;
and judging whether the disk has a fault or not based on the comparison result.
According to a specific implementation manner of the embodiment of the present disclosure, when a failed disk is found by the failure model, after the failed disk is located based on the established correspondence between the device information and the slot identification information, the method further includes:
and performing offline and repairing operations on the disk with the fault.
In a second aspect, an embodiment of the present disclosure provides a device for locating a slot of a failed disk, including:
the acquisition module is used for acquiring equipment information including disk drive symbols according to different types of disks of the data center, wherein the equipment information of the different types of disks is different;
the establishing module is used for establishing a corresponding relation between the equipment information and the slot position identification information based on the acquired slot position identification information of all the disks;
the processing module is used for carrying out data processing on the read disk logs through a preset fault model so as to judge the fault state of the disk;
and the positioning module is used for positioning the fault disk based on the established corresponding relation between the equipment information and the slot position identification information when the fault disk is found to exist through the fault model.
In a third aspect, an embodiment of the present disclosure further provides an electronic device, where the electronic device includes:
at least one processor; and the number of the first and second groups,
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method for locating a failed disk slot in any of the first aspects or any implementation manner of the first aspect.
In a fourth aspect, the disclosed embodiments also provide a non-transitory computer-readable storage medium storing computer instructions for causing a computer to execute the method for locating a failed disk slot in any one of the foregoing first aspect or the implementation manner of the first aspect.
In a fifth aspect, the disclosed embodiments also provide a computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions that, when executed by a computer, cause the computer to perform the method for locating a failed disk slot in the foregoing first aspect or any implementation manner of the first aspect.
The fault disk slot positioning scheme in the embodiment of the disclosure includes acquiring device information including disk drives according to different types of disks of a data center, wherein the device information of the different types of disks is different; establishing a corresponding relation between the equipment information and the slot position identification information based on the acquired slot position identification information of all the disks; performing data processing on the read disk log through a preset fault model so as to judge the fault state of the disk; and when the fault disk exists through the fault model, positioning the fault disk based on the established corresponding relation between the equipment information and the slot position identification information. Through the scheme disclosed by the invention, the fault disk can be automatically positioned.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present disclosure, the drawings needed to be used in the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present disclosure, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic diagram of a fault disk slot positioning process provided in an embodiment of the present disclosure;
fig. 2 is a schematic diagram of another fault disk slot positioning process provided in the embodiment of the present disclosure;
fig. 3 is a schematic diagram illustrating another fault disk slot positioning process provided in the embodiment of the present disclosure;
fig. 4 is a schematic diagram illustrating another fault disk slot positioning process provided in the embodiment of the present disclosure;
fig. 5 is a schematic structural diagram of a device for locating a slot position of a failed disk according to an embodiment of the present disclosure;
fig. 6 is a schematic diagram of an electronic device provided in an embodiment of the present disclosure.
Detailed Description
The embodiments of the present disclosure are described in detail below with reference to the accompanying drawings.
The embodiments of the present disclosure are described below with specific examples, and other advantages and effects of the present disclosure will be readily apparent to those skilled in the art from the disclosure in the specification. It is to be understood that the described embodiments are merely illustrative of some, and not restrictive, of the embodiments of the disclosure. The disclosure may be embodied or carried out in various other specific embodiments, and various modifications and changes may be made in the details within the description without departing from the spirit of the disclosure. It is to be noted that the features in the following embodiments and examples may be combined with each other without conflict. All other embodiments, which can be derived by a person skilled in the art from the embodiments disclosed herein without making any creative effort, shall fall within the protection scope of the present disclosure.
It is noted that various aspects of the embodiments are described below within the scope of the appended claims. It should be apparent that the aspects described herein may be embodied in a wide variety of forms and that any specific structure and/or function described herein is merely illustrative. Based on the disclosure, one skilled in the art should appreciate that one aspect described herein may be implemented independently of any other aspects and that two or more of these aspects may be combined in various ways. For example, an apparatus may be implemented and/or a method practiced using any number of the aspects set forth herein. Additionally, such an apparatus may be implemented and/or such a method may be practiced using other structure and/or functionality in addition to one or more of the aspects set forth herein.
It should be noted that the drawings provided in the following embodiments are only for illustrating the basic idea of the present disclosure, and the drawings only show the components related to the present disclosure rather than the number, shape and size of the components in actual implementation, and the type, amount and ratio of the components in actual implementation may be changed arbitrarily, and the layout of the components may be more complicated.
In addition, in the following description, specific details are provided to facilitate a thorough understanding of the examples. However, it will be understood by those skilled in the art that the aspects may be practiced without these specific details.
The embodiment of the disclosure provides a fault disk slot positioning method. The method for locating a failed disk slot provided in this embodiment may be executed by a computing device, where the computing device may be implemented as software, or implemented as a combination of software and hardware, and the computing device may be integrally disposed in a server, a terminal device, or the like.
Referring to fig. 1, a method for locating a slot of a failed disk according to an embodiment of the present disclosure includes the following steps:
s101, acquiring equipment information including disk drive symbols according to different types of disks of the data center, wherein the equipment information of the different types of disks is different.
The data center is generally a data storage center formed by a plurality of disks, the disks of the data center need to be repaired in time due to huge data, and the failed disks (for example, SSD or HDD type hard disks) need to be repaired in time. Aiming at the problem, the corresponding of the drive letter and the slot position is realized by a specific design method, and the precision is improved to the available level.
For the above situation, the device information of the disk may be acquired according to different types of disks in the data center, and the disk existing in the data center may be identified by the device information of the disk.
As an application scenario, host paths and their associated drives of all the sas devices may be obtained. For the sas device, it may obtain class-sa _ host and class-sa _ phy information (corresponding/dev/sg ioctl issues corresponding sci command): including capacity, manufacturer, product name, serial number, type information (SSD or HDD), sa path, cage number, slot number.
As another application scenario, information of all non-sas scsi devices (ata devices) and their associated drives may be acquired. For the sci device (ata device) of non-sas, the ata number can be obtained from bus-sci-devices, and corresponding ata command is issued to corresponding/dev/sg through Ioctl to obtain the capacity, manufacturer, product name, serial number and type information (SSD or HDD).
As another application scenario, information of all nvme devices, associated disk drives and pci paths can be acquired. For the nvme device, a corresponding nvme ctl command is issued through an nvme _ ctl interface to acquire information such as capacity, manufacturer, product name and serial number.
S102, based on the acquired slot position identification information of all the disks, establishing a corresponding relation between the equipment information and the slot position identification information.
The disk is connected with data through the slot position, and the slot position can be any type of hard disk data interface. The slot is a connection part between the disk and the host system and is used for transmitting data between the hard disk cache and the host memory. Different types of disk slots determine the connection speed between the disk and the computer, and because the different types of disk slots influence the program running speed and the system performance, the different types of disk slots can be set based on different application scenes, for example, slot interfaces of IDE, SATA, SCSI and fiber channel types can be set.
All slot position information of the data center can be obtained from an operating system of the system by sending a request to a host system where the data center is located, and the slot position information contains specific position information of the slot position. All the slot positions can be uniquely marked by extracting the slot position identification information in the slot position information.
The device information of the disk comprises the drive letter of the disk and the disk data interface information, and the corresponding relation between the disk data interface and the slot identification information can be established by matching the disk data interface information with the slot identification information, so that the corresponding relation between the drive letter of the disk and the slot identification information is further established.
And S103, performing data processing on the read disk log through a preset fault model so as to judge the fault state of the disk.
In order to facilitate the judgment of the fault of the disk, the log data in the disk can be read through the system, the disk log data can be obtained by sorting the log data, and the data on the disk days is analyzed, so that whether the fault exists in the disk can be judged based on the analyzed data.
As an application scenario, a captured log file (e.g., a log file in SM2 format) of a disk may be converted into a JSON file, and a JSON character string and a corresponding character string array are generated by the JSON file. By analyzing the character string and the character string array, the disk parameter information such as SN (serial number) of the disk, hard disk Firmware, hard disk interface type, formatted sector size, voltage, RV vibration value, temperature, read-write Workload, irreparable read-write error and the like can be generated.
By comparing the analyzed disk parameter information with the preset parameter information in the fault model, whether the disk has the fault or not and what type of fault exists can be judged.
And S104, when the fault disk exists through the fault model, positioning the fault disk based on the established corresponding relation between the equipment information and the slot position identification information.
When a failure of the disk is found, the device information of the failed disk may be read, and the failed disk may be further located based on the location information in the slot information through a correspondence between the preset device information (e.g., a drive letter) and the slot identification information, so as to determine a specific location of the failed disk.
By the specific method designed by the disclosure, the correspondence between the disk identifier and the slot position is realized, the precision is improved to the available level, and the automatic positioning efficiency of the fault disk is improved.
According to a specific implementation manner of the embodiment of the present disclosure, the obtaining, according to different types of disks in a data center, device information including a disk drive identifier includes: and acquiring host paths and associated drive symbols of all the sas type disks. For the sas device, it may obtain class-sa _ host and class-sa _ phy information (corresponding/dev/sg ioctl issues corresponding sci command): including capacity, manufacturer, product name, serial number, type information (SSD or HDD), sa path, cage number, slot number.
According to a specific implementation manner of the embodiment of the present disclosure, the obtaining, according to different types of disks in a data center, device information including a disk drive identifier includes: and acquiring the equipment information and the associated drive identifier of all non-sas scsi type disks. For the sci device (ata device) of non-sas, the ata number can be obtained from bus-sci-devices, and corresponding ata command is issued to corresponding/dev/sg through Ioctl to obtain the capacity, manufacturer, product name, serial number and type information (SSD or HDD).
According to a specific implementation manner of the embodiment of the present disclosure, the obtaining, according to different types of disks in a data center, device information including a disk drive identifier includes: and acquiring the disk device information, the associated drive symbols and the pci paths of all the nvme type disk devices. For the nvme device, a corresponding nvme ctl command is issued through an nvme _ ctl interface to acquire information such as capacity, manufacturer, product name and serial number.
Referring to fig. 2, according to a specific implementation manner of the embodiment of the present disclosure, establishing a corresponding relationship between device information and slot identification information based on the obtained slot identification information of all disks includes:
s201, searching a disk drive letter corresponding to the slot position identification information.
The disk is connected with data through the slot position, and the slot position can be any type of hard disk data interface. The slot is a connection part between the disk and the host system and is used for transmitting data between the hard disk cache and the host memory. Different types of disk slots determine the connection speed between the disk and the computer, and because the different types of disk slots influence the program running speed and the system performance, the different types of disk slots can be set based on different application scenes, for example, slot interfaces of IDE, SATA, SCSI and fiber channel types can be set.
All slot position information of the data center can be obtained from an operating system of the system by sending a request to a host system where the data center is located, and the slot position information contains specific position information of the slot position. All the slot positions can be uniquely marked by extracting the slot position identification information in the slot position information.
S202, establishing a corresponding relation between the slot position identification information and the disk drive character corresponding to the slot position identification information.
The device information of the disk comprises the drive letter of the disk and the disk data interface information, and the corresponding relation between the disk data interface and the slot identification information can be established by matching the disk data interface information with the slot identification information, so that the corresponding relation between the drive letter of the disk and the slot identification information is further established.
Referring to fig. 3, according to a specific implementation manner of the embodiment of the present disclosure, before performing data processing on the read disk log through a preset fault model, the method further includes:
s301, obtaining log information on the disk in a preset time period.
The existence of the fault disk can be periodically judged, and for this reason, log information on the disk within a predetermined time period (for example, one day, one week, etc.) can be extracted, and the existence of the fault on the disk can be further analyzed through the log information.
And S302, merging the log information and the disk equipment information on the disk to form a disk log.
Because the data center has a plurality of magnetic disks, the extracted magnetic disks, the log information and the video information on the magnetic disks can be combined to form a magnetic disk log together.
By adding the disk device information into the disk log, the disk information can be identified, and a basis is provided for subsequent fault analysis based on the disk log.
Referring to fig. 4, according to a specific implementation manner of the embodiment of the present disclosure, the performing data processing on the read disk log through a preset fault model includes:
s401, carrying out data analysis on the disk log to obtain analysis parameters related to the disk log.
As an application scenario, a captured log file (e.g., a log file in SM2 format) of a disk may be converted into a JSON file, and a JSON character string and a corresponding character string array are generated by the JSON file. By analyzing the character string and the character string array, the disk parameter information such as SN (serial number) of the disk, hard disk Firmware, hard disk interface type, formatted sector size, voltage, RV vibration value, temperature, read-write Workload, irreparable read-write error and the like can be generated.
S402, comparing the model parameters in the fault model with the analysis parameters to obtain a comparison result.
By comparing the analyzed disk parameter information with the preset parameter information in the fault model, whether the disk has the fault and what type of fault exists can be judged, and a comparison result is formed.
And S403, judging whether the disk has a fault or not based on the comparison result.
Through the embodiment, corresponding analysis and comparison can be carried out based on the parameters on the disk, and the accuracy of fault diagnosis is improved.
According to a specific implementation manner of the embodiment of the present disclosure, when a failed disk is found by the failure model, after the failed disk is located based on the established correspondence between the device information and the slot identification information, the method further includes: and performing offline and repairing operations on the disk with the fault.
Corresponding to the above method embodiment, referring to fig. 5, an embodiment of the present disclosure further provides a failed disk slot positioning device 50, including:
the obtaining module 501 is configured to obtain device information including a disk identifier according to different types of disks in a data center, where the device information of the different types of disks is different.
The data center is generally a data storage center formed by a plurality of disks, the disks of the data center need to be repaired in time due to huge data, and the failed disks (for example, SSD or HDD type hard disks) need to be repaired in time. Aiming at the problem, the corresponding of the drive letter and the slot position is realized by a specific design method, and the precision is improved to the available level.
For the above situation, the device information of the disk may be acquired according to different types of disks in the data center, and the disk existing in the data center may be identified by the device information of the disk.
As an application scenario, host paths and their associated drives of all the sas devices may be obtained. For the sas device, it may obtain class-sa _ host and class-sa _ phy information (corresponding/dev/sg ioctl issues corresponding sci command): including capacity, manufacturer, product name, serial number, type information (SSD or HDD), sa path, cage number, slot number.
As another application scenario, information of all non-sas scsi devices (ata devices) and their associated drives may be acquired. For the sci device (ata device) of non-sas, the ata number can be obtained from bus-sci-devices, and corresponding ata command is issued to corresponding/dev/sg through Ioctl to obtain the capacity, manufacturer, product name, serial number and type information (SSD or HDD).
As another application scenario, information of all nvme devices, associated disk drives and pci paths can be acquired. For the nvme device, a corresponding nvme ctl command is issued through an nvme _ ctl interface to acquire information such as capacity, manufacturer, product name and serial number.
An establishing module 502 is configured to establish a corresponding relationship between the device information and the slot identification information based on the obtained slot identification information of all the disks.
The disk is connected with data through the slot position, and the slot position can be any type of hard disk data interface. The slot is a connection part between the disk and the host system and is used for transmitting data between the hard disk cache and the host memory. Different types of disk slots determine the connection speed between the disk and the computer, and because the different types of disk slots influence the program running speed and the system performance, the different types of disk slots can be set based on different application scenes, for example, slot interfaces of IDE, SATA, SCSI and fiber channel types can be set.
All slot position information of the data center can be obtained from an operating system of the system by sending a request to a host system where the data center is located, and the slot position information contains specific position information of the slot position. All the slot positions can be uniquely marked by extracting the slot position identification information in the slot position information.
The device information of the disk comprises the drive letter of the disk and the disk data interface information, and the corresponding relation between the disk data interface and the slot identification information can be established by matching the disk data interface information with the slot identification information, so that the corresponding relation between the drive letter of the disk and the slot identification information is further established.
The processing module 503 is configured to perform data processing on the read disk log through a preset fault model, so as to determine a fault state of the disk.
In order to facilitate the judgment of the fault of the disk, the log data in the disk can be read through the system, the disk log data can be obtained by sorting the log data, and the data on the disk days is analyzed, so that whether the fault exists in the disk can be judged based on the analyzed data.
As an application scenario, a captured log file (e.g., a log file in SM2 format) of a disk may be converted into a JSON file, and a JSON character string and a corresponding character string array are generated by the JSON file. By analyzing the character string and the character string array, the disk parameter information such as SN (serial number) of the disk, hard disk Firmware, hard disk interface type, formatted sector size, voltage, RV vibration value, temperature, read-write Workload, irreparable read-write error and the like can be generated.
By comparing the analyzed disk parameter information with the preset parameter information in the fault model, whether the disk has the fault or not and what type of fault exists can be judged.
A positioning module 504, configured to, when a failed disk is found by the failure model, position the failed disk based on the established correspondence between the device information and the slot identification information.
When a failure of the disk is found, the device information of the failed disk may be read, and the failed disk may be further located based on the location information in the slot information through a correspondence between the preset device information (e.g., a drive letter) and the slot identification information, so as to determine a specific location of the failed disk.
By the specific method designed by the disclosure, the correspondence between the disk identifier and the slot position is realized, the precision is improved to the available level, and the automatic positioning efficiency of the fault disk is improved.
The apparatus shown in fig. 5 may correspondingly execute the content in the above method embodiment, and details of the part not described in detail in this embodiment refer to the content described in the above method embodiment, which is not described again here.
Referring to fig. 6, an embodiment of the present disclosure also provides an electronic device 60, including:
at least one processor; and the number of the first and second groups,
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method for locating a failed disk slot in the above method embodiments.
The disclosed embodiments also provide a non-transitory computer-readable storage medium storing computer instructions for causing the computer to perform the foregoing method embodiments.
The disclosed embodiments also provide a computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions that, when executed by a computer, cause the computer to perform the method for locating a failed disk slot in the aforementioned method embodiments.
Referring now to FIG. 6, a schematic diagram of an electronic device 60 suitable for use in implementing embodiments of the present disclosure is shown. The electronic devices in the embodiments of the present disclosure may include, but are not limited to, mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), in-vehicle terminals (e.g., car navigation terminals), and the like, and fixed terminals such as digital TVs, desktop computers, and the like. The electronic device shown in fig. 6 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present disclosure.
As shown in fig. 6, the electronic device 60 may include a processing means (e.g., a central processing unit, a graphics processor, etc.) 601 that may perform various appropriate actions and processes in accordance with a program stored in a Read Only Memory (ROM)602 or a program loaded from a storage means 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data necessary for the operation of the electronic apparatus 60 are also stored. The processing device 601, the ROM 602, and the RAM 603 are connected to each other via a bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
Generally, the following devices may be connected to the I/O interface 605: input devices 606 including, for example, a touch screen, touch pad, keyboard, mouse, image sensor, microphone, accelerometer, gyroscope, etc.; output devices 607 including, for example, a Liquid Crystal Display (LCD), a speaker, a vibrator, and the like; storage 608 including, for example, tape, hard disk, etc.; and a communication device 609. The communication means 609 may allow the electronic device 60 to communicate with other devices wirelessly or by wire to exchange data. While the figures illustrate an electronic device 60 having various means, it is to be understood that not all illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method illustrated in the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network via the communication means 609, or may be installed from the storage means 608, or may be installed from the ROM 602. The computer program, when executed by the processing device 601, performs the above-described functions defined in the methods of the embodiments of the present disclosure.
It should be noted that the computer readable medium in the present disclosure can be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present disclosure, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In contrast, in the present disclosure, a computer readable signal medium may comprise a propagated data signal with computer readable program code embodied therein, either in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: electrical wires, optical cables, RF (radio frequency), etc., or any suitable combination of the foregoing.
The computer readable medium may be embodied in the electronic device; or may exist separately without being assembled into the electronic device.
The computer readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: acquiring at least two internet protocol addresses; sending a node evaluation request comprising the at least two internet protocol addresses to node evaluation equipment, wherein the node evaluation equipment selects the internet protocol addresses from the at least two internet protocol addresses and returns the internet protocol addresses; receiving an internet protocol address returned by the node evaluation equipment; wherein the obtained internet protocol address indicates an edge node in the content distribution network.
Alternatively, the computer readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: receiving a node evaluation request comprising at least two internet protocol addresses; selecting an internet protocol address from the at least two internet protocol addresses; returning the selected internet protocol address; wherein the received internet protocol address indicates an edge node in the content distribution network.
Computer program code for carrying out operations for aspects of the present disclosure may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + +, and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units described in the embodiments of the present disclosure may be implemented by software or hardware. Where the name of a unit does not in some cases constitute a limitation of the unit itself, for example, the first retrieving unit may also be described as a "unit for retrieving at least two internet protocol addresses".
It should be understood that portions of the present disclosure may be implemented in hardware, software, firmware, or a combination thereof.
The above description is only for the specific embodiments of the present disclosure, but the scope of the present disclosure is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present disclosure should be covered within the scope of the present disclosure. Therefore, the protection scope of the present disclosure shall be subject to the protection scope of the claims.
Claims (11)
1. A method for locating a slot of a failed disk is characterized by comprising the following steps:
acquiring equipment information including disk symbols of the disks according to different types of the disks of the data center, wherein the equipment information of the disks of different types is different;
establishing a corresponding relation between the equipment information and the slot position identification information based on the acquired slot position identification information of all the disks;
performing data processing on the read disk log through a preset fault model so as to judge the fault state of the disk;
and when the fault disk exists through the fault model, positioning the fault disk based on the established corresponding relation between the equipment information and the slot position identification information.
2. The method of claim 1, wherein the obtaining the device information including the drive letter according to different types of the data center disk comprises:
and acquiring host paths and associated drive symbols of all the sas type disks.
3. The method of claim 1, wherein the obtaining the device information including the drive letter according to different types of the data center disk comprises:
and acquiring the equipment information and the associated drive identifier of all non-sas scsi type disks.
4. The method of claim 1, wherein the obtaining the device information including the drive letter according to different types of the data center disk comprises:
and acquiring the disk device information, the associated drive symbols and the pci paths of all the nvme type disk devices.
5. The method according to claim 1, wherein the establishing a correspondence between the device information and the slot identification information based on the obtained slot identification information of all the disks comprises:
searching a disk drive symbol corresponding to the slot position identification information;
and establishing a corresponding relation between the slot position identification information and the disk drive letter corresponding to the slot position identification information.
6. The method according to claim 1, wherein before performing data processing on the read disk log through a preset fault model, the method further comprises:
acquiring log information on a magnetic disk within a preset time period;
and merging the log information and the disk equipment information on the disk to form a disk log.
7. The method according to claim 1, wherein the data processing of the read disk log through a preset fault model comprises:
performing data analysis on the disk log to obtain analysis parameters related to the disk log;
comparing the model parameters in the fault model with the analysis parameters to obtain a comparison result;
and judging whether the disk has a fault or not based on the comparison result.
8. The method according to claim 1, wherein when a failed disk is found by the failure model, after the failed disk is located based on the already established correspondence between the device information and the slot identification information, the method further comprises:
and performing offline and repairing operations on the disk with the fault.
9. A failed disk slot location apparatus, comprising:
the acquisition module is used for acquiring equipment information including disk drive symbols according to different types of disks of the data center, wherein the equipment information of the different types of disks is different;
the establishing module is used for establishing a corresponding relation between the equipment information and the slot position identification information based on the acquired slot position identification information of all the disks;
the processing module is used for carrying out data processing on the read disk logs through a preset fault model so as to judge the fault state of the disk;
and the positioning module is used for positioning the fault disk based on the established corresponding relation between the equipment information and the slot position identification information when the fault disk is found to exist through the fault model.
10. An electronic device, characterized in that the electronic device comprises:
at least one processor; and the number of the first and second groups,
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of locating a failed disk slot of any of claims 1-8.
11. A non-transitory computer readable storage medium storing computer instructions for causing a computer to perform the method for locating a failed disk slot of any of claims 1-8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911092449.4A CN110989926B (en) | 2019-11-11 | 2019-11-11 | Fault magnetic disc slot positioning method and device and electronic equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911092449.4A CN110989926B (en) | 2019-11-11 | 2019-11-11 | Fault magnetic disc slot positioning method and device and electronic equipment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110989926A true CN110989926A (en) | 2020-04-10 |
CN110989926B CN110989926B (en) | 2024-02-13 |
Family
ID=70083707
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911092449.4A Active CN110989926B (en) | 2019-11-11 | 2019-11-11 | Fault magnetic disc slot positioning method and device and electronic equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110989926B (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111813588A (en) * | 2020-06-01 | 2020-10-23 | 北京百卓网络技术有限公司 | Computer hard disk fault positioning method, device, equipment and storage medium |
CN111813589A (en) * | 2020-06-01 | 2020-10-23 | 北京百卓网络技术有限公司 | Distributed cluster fault positioning method, device, equipment and storage medium |
CN111949488A (en) * | 2020-08-14 | 2020-11-17 | 山东英信计算机技术有限公司 | Hard disk fault prediction method and system, electronic equipment and storage medium |
CN112130531A (en) * | 2020-09-04 | 2020-12-25 | 苏州浪潮智能科技有限公司 | Method and device for prompting and detecting installation position |
CN112346932A (en) * | 2020-11-05 | 2021-02-09 | 中国建设银行股份有限公司 | Method and device for positioning hidden bad disk, electronic equipment and computer storage medium |
CN113641555A (en) * | 2021-08-17 | 2021-11-12 | 中国联合网络通信集团有限公司 | Fault equipment positioning method, device, equipment and readable storage medium |
CN115061641A (en) * | 2022-08-16 | 2022-09-16 | 新华三信息技术有限公司 | Disk fault processing method, device, equipment and storage medium |
Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4630269A (en) * | 1983-05-16 | 1986-12-16 | Data General Corporation | Methods for diagnosing malfunctions in a disk drive |
US20050050401A1 (en) * | 2003-08-27 | 2005-03-03 | Kunihito Matsuki | Disk array system and fault information control method |
US20050268033A1 (en) * | 2004-05-27 | 2005-12-01 | Hiroshi Ogasawara | Disk array device and controlling method thereof |
US20070180293A1 (en) * | 2006-01-31 | 2007-08-02 | Fujitsu Limited | Data storage system, data storage control apparatus and fault location diagnosis method |
US20100138702A1 (en) * | 2008-11-28 | 2010-06-03 | Kabushiki Kaisha Toshiba | Information processing apparatus and sign of failure determination method |
CN103218180A (en) * | 2013-05-14 | 2013-07-24 | 北京百度网讯科技有限公司 | Disk positioning method and device |
US8719462B1 (en) * | 2013-10-16 | 2014-05-06 | Google Inc. | Systems and methods for distributed log file processing |
CN104699423A (en) * | 2015-03-13 | 2015-06-10 | 华为技术有限公司 | Method and device for binding disk drive in Linux system |
CN107239273A (en) * | 2017-05-02 | 2017-10-10 | 杭州沃趣科技股份有限公司 | A kind of disk drive binding method of server |
CN107247672A (en) * | 2017-05-03 | 2017-10-13 | 深圳市同泰怡信息技术有限公司 | Pass through the method for udev and expander relevant interface hyperdisks |
CN107577545A (en) * | 2016-07-05 | 2018-01-12 | 北京金山云网络技术有限公司 | A kind of failed disk detection and restorative procedure and device |
CN108008914A (en) * | 2016-10-27 | 2018-05-08 | 华为技术有限公司 | The method, apparatus and ARM equipment of disk management in a kind of ARM equipment |
US20180300212A1 (en) * | 2017-04-17 | 2018-10-18 | EMC IP Holding Company LLC | Method, device and computer readable storage media for rebuilding redundant array of independent disks |
CN109542708A (en) * | 2018-12-04 | 2019-03-29 | 浪潮电子信息产业股份有限公司 | A method of record backboard physics slot position and system drive corresponding relationship |
CN110096413A (en) * | 2019-05-08 | 2019-08-06 | 苏州浪潮智能科技有限公司 | A kind of disk localization method, system and electronic equipment and storage medium |
US20190324875A1 (en) * | 2018-04-20 | 2019-10-24 | EMC IP Holding Company LLC | Method and apparatus for failure recovery of storage device |
-
2019
- 2019-11-11 CN CN201911092449.4A patent/CN110989926B/en active Active
Patent Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4630269A (en) * | 1983-05-16 | 1986-12-16 | Data General Corporation | Methods for diagnosing malfunctions in a disk drive |
US20050050401A1 (en) * | 2003-08-27 | 2005-03-03 | Kunihito Matsuki | Disk array system and fault information control method |
US20050268033A1 (en) * | 2004-05-27 | 2005-12-01 | Hiroshi Ogasawara | Disk array device and controlling method thereof |
US20070180293A1 (en) * | 2006-01-31 | 2007-08-02 | Fujitsu Limited | Data storage system, data storage control apparatus and fault location diagnosis method |
US20100138702A1 (en) * | 2008-11-28 | 2010-06-03 | Kabushiki Kaisha Toshiba | Information processing apparatus and sign of failure determination method |
CN103218180A (en) * | 2013-05-14 | 2013-07-24 | 北京百度网讯科技有限公司 | Disk positioning method and device |
US8719462B1 (en) * | 2013-10-16 | 2014-05-06 | Google Inc. | Systems and methods for distributed log file processing |
CN104699423A (en) * | 2015-03-13 | 2015-06-10 | 华为技术有限公司 | Method and device for binding disk drive in Linux system |
CN107577545A (en) * | 2016-07-05 | 2018-01-12 | 北京金山云网络技术有限公司 | A kind of failed disk detection and restorative procedure and device |
CN108008914A (en) * | 2016-10-27 | 2018-05-08 | 华为技术有限公司 | The method, apparatus and ARM equipment of disk management in a kind of ARM equipment |
US20180300212A1 (en) * | 2017-04-17 | 2018-10-18 | EMC IP Holding Company LLC | Method, device and computer readable storage media for rebuilding redundant array of independent disks |
CN107239273A (en) * | 2017-05-02 | 2017-10-10 | 杭州沃趣科技股份有限公司 | A kind of disk drive binding method of server |
CN107247672A (en) * | 2017-05-03 | 2017-10-13 | 深圳市同泰怡信息技术有限公司 | Pass through the method for udev and expander relevant interface hyperdisks |
US20190324875A1 (en) * | 2018-04-20 | 2019-10-24 | EMC IP Holding Company LLC | Method and apparatus for failure recovery of storage device |
CN109542708A (en) * | 2018-12-04 | 2019-03-29 | 浪潮电子信息产业股份有限公司 | A method of record backboard physics slot position and system drive corresponding relationship |
CN110096413A (en) * | 2019-05-08 | 2019-08-06 | 苏州浪潮智能科技有限公司 | A kind of disk localization method, system and electronic equipment and storage medium |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111813588A (en) * | 2020-06-01 | 2020-10-23 | 北京百卓网络技术有限公司 | Computer hard disk fault positioning method, device, equipment and storage medium |
CN111813589A (en) * | 2020-06-01 | 2020-10-23 | 北京百卓网络技术有限公司 | Distributed cluster fault positioning method, device, equipment and storage medium |
CN111813588B (en) * | 2020-06-01 | 2024-03-19 | 北京百卓网络技术有限公司 | Computer hard disk fault positioning method, device, equipment and storage medium |
CN111949488A (en) * | 2020-08-14 | 2020-11-17 | 山东英信计算机技术有限公司 | Hard disk fault prediction method and system, electronic equipment and storage medium |
CN112130531A (en) * | 2020-09-04 | 2020-12-25 | 苏州浪潮智能科技有限公司 | Method and device for prompting and detecting installation position |
CN112346932A (en) * | 2020-11-05 | 2021-02-09 | 中国建设银行股份有限公司 | Method and device for positioning hidden bad disk, electronic equipment and computer storage medium |
CN113641555A (en) * | 2021-08-17 | 2021-11-12 | 中国联合网络通信集团有限公司 | Fault equipment positioning method, device, equipment and readable storage medium |
CN113641555B (en) * | 2021-08-17 | 2023-11-24 | 中国联合网络通信集团有限公司 | Fault equipment positioning method, device, equipment and readable storage medium |
CN115061641A (en) * | 2022-08-16 | 2022-09-16 | 新华三信息技术有限公司 | Disk fault processing method, device, equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN110989926B (en) | 2024-02-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110989926B (en) | Fault magnetic disc slot positioning method and device and electronic equipment | |
CN110764945B (en) | Crash log processing method, device, equipment and storage medium | |
CN104866542A (en) | POI data verification method and device | |
CN109672722B (en) | Data deployment method and device, computer storage medium and electronic equipment | |
WO2019227641A1 (en) | Application testing method, apparatus, terminal device and medium | |
CN110928559A (en) | Data burning method and device, electronic equipment and storage medium | |
CN110674047A (en) | Software testing method and device and electronic equipment | |
CN110659210A (en) | Information acquisition method and device, electronic equipment and storage medium | |
CN111694684B (en) | Abnormal construction method and device of storage device, electronic device and storage medium | |
CN110737509B (en) | Thermal migration processing method and device, storage medium and electronic equipment | |
CN112527678A (en) | Method, apparatus, device and storage medium for testing protocol | |
US20130117493A1 (en) | Reliable Memory Mapping In A Computing System | |
US20230025081A1 (en) | Model training method, failure determining method, electronic device, and program product | |
CN110569182B (en) | Crash rate calculation method and device, computer equipment and storage medium | |
CN115309334A (en) | Disk management method, device, equipment and storage medium | |
US9329856B2 (en) | Managing a code load | |
CN113296996A (en) | Service request processing method, related device and storage medium | |
CN108563552B (en) | Method, equipment and storage medium for recording peripheral operation behaviors | |
US10754643B2 (en) | Detecting software incompatibilities within a system stack | |
CN110928814A (en) | Method and device for detecting transmission abnormality of serial hard disk and electronic equipment | |
CN111262727A (en) | Service capacity expansion method, device, equipment and storage medium | |
CN113704020B (en) | Method and device for analyzing error field data of solid state disk | |
US8645774B2 (en) | Expedited memory drive self test | |
CN114781567B (en) | Detection equipment installation method and device, electronic equipment and readable storage medium | |
CN110568994B (en) | Logic block device access method and device and electronic device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |