CN108899059B - Detection method and equipment for solid state disk - Google Patents

Detection method and equipment for solid state disk Download PDF

Info

Publication number
CN108899059B
CN108899059B CN201810829395.4A CN201810829395A CN108899059B CN 108899059 B CN108899059 B CN 108899059B CN 201810829395 A CN201810829395 A CN 201810829395A CN 108899059 B CN108899059 B CN 108899059B
Authority
CN
China
Prior art keywords
controller
parameter
working
working parameter
solid state
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810829395.4A
Other languages
Chinese (zh)
Other versions
CN108899059A (en
Inventor
刘福东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN201810829395.4A priority Critical patent/CN108899059B/en
Publication of CN108899059A publication Critical patent/CN108899059A/en
Application granted granted Critical
Publication of CN108899059B publication Critical patent/CN108899059B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C29/00Checking stores for correct operation ; Subsequent repair; Testing stores during standby or offline operation
    • G11C29/04Detection or location of defective memory elements, e.g. cell constructio details, timing of test signals
    • G11C29/08Functional testing, e.g. testing during refresh, power-on self testing [POST] or distributed testing
    • G11C29/10Test algorithms, e.g. memory scan [MScan] algorithms; Test patterns, e.g. checkerboard patterns 

Abstract

The application discloses a detection method and equipment for a solid state disk, wherein the detection method comprises the following steps: receiving a first working parameter of the solid state disk sent by the first controller and a second working parameter of the solid state disk sent by the second controller; judging whether the difference value of the first working parameter and the second working parameter exceeds a preset threshold value or not; if the current working parameter does not exceed the preset threshold, judging whether the solid state disk is abnormal or not according to the historical working parameter, the first working parameter and the second working parameter; therefore, compared with the prior art that the operating parameters are collected by only one controller to judge the operating state of the SSD, the method provided by the application adds a redundancy mechanism, namely under the condition that the two operating parameters are normal, the operating condition of the SSD is judged, so that the judgment accuracy is improved, when the SSD is abnormal, a server can be timely notified to maintain, the stable operation of the storage device is ensured, and the loss of data in production and life is reduced.

Description

Detection method and equipment for solid state disk
Technical Field
The present application relates to the field of computer technologies, and in particular, to a method and an apparatus for detecting a solid state disk.
Background
With the continuous development of computer technology, people have higher and higher requirements for services provided by computers, and particularly with the emergence of large data, the requirements for computer storage space are also increased sharply, so that expanding the computer storage becomes an important means for solving the limited storage space of the computers.
In order to expand the external memory of the computer, usually, a Hard Disk Drive (HDD) or a Solid State Disk (SSD) may be used to expand the external memory. The HDD is a traditional common hard disk and mainly comprises a disk, a magnetic head, a disk rotating shaft, a control motor, a magnetic head controller and other devices; the SSD is composed of a control unit and a storage unit, wherein the storage unit is mainly made of flash memory particles, compared with the SSD, the SSD does not have any mechanical part inside, is shock-resistant and drop-resistant, and compared with an HDD, the SSD utilizes the flash memory particles to store data, and has the advantages of high read-write speed, low power consumption, no noise and the like. Although expensive in terms of price, with the advent of large-capacity flash memory particles, SDD is becoming cheaper and cheaper, making it more widely used in servers, memories, etc.
However, due to the characteristics of the SDD solid state disk, when a fault occurs, for example, current is too large or temperature is too high, data in the SDD cannot be recovered, and normal operation of devices such as a server is affected, so how to timely monitor and report the running state of the SDD to ensure reliable running of the SDD becomes an important problem to be solved urgently.
Disclosure of Invention
In view of this, embodiments of the present application provide a method and an apparatus for detecting a solid state disk, so as to detect a working state of the solid state disk in real time, so as to ensure reliable operation of the solid state disk, thereby providing a guarantee for normal operation of a server.
In a first aspect, an embodiment of the present application provides a method for detecting a solid state disk, where the solid state disk includes a first controller and a second controller, and the method includes:
receiving a first working parameter of the solid state disk sent by the first controller and a second working parameter of the solid state disk sent by the second controller;
judging whether the difference value of the first working parameter and the second working parameter exceeds a preset threshold value or not;
if not, judging whether the solid state disk is abnormal or not according to historical working parameters, the first working parameters and the second working parameters;
wherein the historical operating parameter, the first operating parameter, and the second operating parameter include at least one of temperature, current, and power.
Optionally, when the difference between the first operating parameter and the second operating parameter exceeds the preset threshold, the method further includes:
judging whether the first working parameter and/or the second working parameter is a null value;
when the first working parameter is a null value, determining that the first controller works abnormally;
and/or the presence of a gas in the gas,
and when the second working parameter is a null value, determining that the second controller works abnormally.
Optionally, the method further includes:
and when the first controller works abnormally and the second controller works normally, sending an abnormal signal to the second controller so that the second controller resets the first controller according to the abnormal signal.
Optionally, the method further includes:
and when the second controller works abnormally and the first controller works normally, sending an abnormal signal to the first controller so that the first controller resets the second controller according to the abnormal signal.
In a second aspect, an embodiment of the present application provides a detection apparatus for a solid state disk, where the detection apparatus includes:
the first controller is used for acquiring a first working parameter of the solid state disk and sending the first working parameter to the processor;
the second controller is used for acquiring a second working parameter of the solid state disk and sending the second working parameter to the processor;
the processor is configured to receive the first working parameter and the second working parameter, determine whether a difference between the first working parameter and the second working parameter exceeds a preset threshold, and determine whether the solid state disk is abnormal according to a historical working parameter, the first working parameter and the second working parameter when the difference does not exceed the preset threshold;
wherein the historical operating parameter, the first operating parameter, and the second operating parameter include at least one of temperature, current, and power.
Optionally, the processor is further configured to, when a difference between the first operating parameter and the second operating parameter exceeds the preset threshold, determine whether the first operating parameter and/or the second operating parameter is a null value;
when the first working parameter is a null value, determining that the first controller works abnormally;
and/or the presence of a gas in the gas,
and when the second working parameter is a null value, determining that the second controller works abnormally.
Optionally, when the first controller works abnormally and the second controller works normally, the processor is further configured to send an abnormal signal to the second controller;
and the second controller is used for resetting the first controller according to the abnormal signal.
Optionally, when the second controller works abnormally and the first controller works normally, the processor is further configured to send an abnormal signal to the first controller;
the first controller is used for resetting the first controller according to the abnormal signal.
Optionally, the apparatus further comprises: a monitor;
the monitor is used for monitoring the first controller and the second controller, and resetting the first controller and/or the second controller when the first controller and/or the second controller work abnormally.
Optionally, the monitor comprises a first monitor and a second monitor;
the first monitor is used for monitoring the first controller and resetting the first controller when the first controller works abnormally;
and the second monitor is used for monitoring the second controller and resetting the second controller when the second controller works abnormally.
Compared with the prior art, the method has the advantages that:
in the embodiment of the application, the solid state disk comprises a first controller and a second controller, the first controller and the second controller can both obtain working parameters of the solid state disk, and simultaneously receive the first working parameters sent by the first controller and the second working parameters sent by the second controller, when a difference value between the first working parameters and the second working parameters is within a certain preset range, the working parameters obtained by the first controller and the second controller are normal, and whether the solid state disk is abnormal or not is judged according to historical working parameters, the first working parameters and the second working parameters, so that the working state of the solid state disk can be monitored in real time, and the reliable operation of the solid state disk is ensured.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings needed to be used in the description of the embodiments of the present application will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments described in the present application, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a flowchart of a method for detecting a solid state disk according to an embodiment of the present disclosure;
fig. 2 is a flowchart for detecting SSD anomalies according to an embodiment of the present application;
fig. 3 is a structural diagram of a detection apparatus of a solid state disk according to an embodiment of the present application;
fig. 4 is a schematic diagram of an application scenario framework provided in the embodiment of the present application.
Detailed Description
In order to make the technical solutions of the present application better understood, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
The inventor finds that, in the conventional SSD monitoring method, a main control chip is used to obtain working parameters of the SSD, such as temperature, current, power, and other information, and then the obtained working parameters are uploaded to a host through a bus, and the host performs data analysis and determines the operating condition of the SSD according to the analysis result. However, when the main control chip is abnormal, the SSD working parameters cannot be obtained or the obtained working parameters are abnormal due to the main control chip itself, the host cannot determine the working condition of the SSD or the determination is inaccurate.
In order to solve the problem, in the embodiment of the application, two controllers of a solid state disk, namely a first controller and a second controller, are used for simultaneously acquiring working parameters of an SSD, when the working parameters respectively sent by the first controller and the second controller are received, whether a difference value of the two working parameters exceeds a preset threshold value is judged, if the difference value does not exceed the preset threshold value, indicating that the two working parameters are normal values, whether the SSD is abnormal is judged according to the two working parameters and historical working parameters, and therefore, compared with the prior art that the working parameters are acquired by only one controller to judge the running state of the SSD, a redundancy mechanism is added, namely, the running condition of the SSD is judged under the condition that the two working parameters are normal, the judgment accuracy is improved, so that a server can be timely notified for maintenance when the SSD is abnormal, the stable operation of the storage device is ensured, and the loss of data in production and life is reduced.
The following describes in detail a specific implementation manner of the method and the device for detecting a solid state disk in the embodiment of the present application by using an embodiment with reference to the accompanying drawings.
Exemplary method
Referring to fig. 1, which is a flowchart of a detection method for a solid state disk provided in an embodiment of the present application, where the solid state disk includes a first controller and a second controller, as shown in fig. 1, the method is applied to a processor, and specifically may include:
s101: and receiving the first working parameter of the solid state disk sent by the first controller and the second working parameter of the solid state disk sent by the second controller.
In this embodiment, both the first controller and the second controller may obtain the working parameters of the solid state disk, and send the obtained working parameters to the processor. The working parameters may be temperature, current, power, and the like.
It should be noted that the working parameters collected by the first controller and the second controller are the same, that is, when the first working parameter includes the temperature of the solid state disk, the second working parameter also includes the temperature of the solid state disk; when the first operating parameter includes the temperature and the current of the solid state disk, the second operating parameter also includes the temperature and the current of the solid state disk.
In practical application, the first controller may be a NAND Flash main control chip in the SSD, and the second controller may be an external Microcontroller Unit (MCU) of the SSD. In specific implementation, a temperature sensor and a current sensor may be added to the periphery of the SSD, and the main control chip may read the temperature or the current of the SSD during operation from the temperature sensor or the current sensor through an Inter-Integrated Circuit (IIC); the MCU can also read the operating temperature or current of the SSD from the temperature sensor or the current sensor through the IIC, and then the main control chip and the MCU can send the obtained current and temperature to the processor through a System Management Bus (SMBus).
S102: and judging whether the difference value of the first working parameter and the second working parameter exceeds a preset threshold value.
In this embodiment, when the processor receives the first working parameter and the second working parameter, the difference between the first working parameter and the second working parameter is first calculated, and then it is determined whether the difference exceeds a preset threshold. The preset threshold value can be set according to the running condition of the SSD.
It can be understood that, since the first operating parameter is a parameter such as temperature and current of the SSD obtained by the first controller during operation, and the second operating parameter is a parameter such as temperature and current of the SSD obtained by the second controller during operation, theoretically, parameter values of the two operating parameters should be equal, and in consideration of an error in obtaining and transmitting the operating parameters in practical application, when comparing the two operating parameters, a certain error value, that is, a preset threshold value, is allowed to occur.
When the difference value between the two working parameters is smaller than or equal to the preset threshold, it indicates that the difference between the two working parameters is small, and the difference can be used for judging the running condition of the SSD, so that when the difference value does not exceed the preset threshold, S103 is executed.
In practical application, in order to improve the accuracy of judgment, the first controller and the second controller can acquire the working parameters of the SSD for multiple times, and then the processor averages a plurality of first working parameters sent by the first controller to serve as a first average value; averaging a plurality of second working parameters sent by a second controller to serve as a second average value; and calculating a difference value between the first average value and the second average value, judging whether the difference value exceeds a preset threshold value, and if not, executing S103.
S103: and judging whether the solid state disk is abnormal or not according to the historical working parameters, the first working parameters and the second working parameters.
In this embodiment, when the difference between the first working parameter and the second working parameter does not exceed the preset threshold, it is indicated that the second working parameter and the first working parameter are normal values, and whether the solid state disk is abnormal is determined according to the historical working parameter, the first working parameter, and the second working parameter. The historical operating parameter, the first operating parameter and the second operating parameter at least comprise one of temperature, current and power.
In practical application, the historical operating parameters are operating parameters which are stored by the processor and transmitted by the first controller and the second controller in the past time, and the operating parameters represent parameters for the SSD to operate normally. That is, comparing the first working parameter and the second working parameter with historical working parameters, and if the first working parameter and the second working parameter are both consistent with the historical working parameters, indicating that the SSD operates normally; and if the first working parameter and the second working parameter are not consistent with the historical working parameter, indicating that the SSD operates abnormally, and informing the server of maintaining in time. In specific implementation, the first working parameter and the second working parameter may be respectively different from the historical working parameter, and when the difference values both satisfy the preset condition, the correspondence is indicated, otherwise, the correspondence is not indicated.
For convenience of understanding, for example, the first operating parameter includes a first temperature of 31 degrees, the second operating parameter includes a second temperature of 32 degrees, the preset threshold is plus or minus 2 degrees, the historical temperature is 34 degrees, the preset condition is less than or equal to plus or minus 3 degrees, and through calculation, the difference value between the first temperature and the second temperature does not exceed the preset threshold, and meanwhile, the difference values between the first temperature, the second temperature and the historical temperature meet the preset condition, the SSD operates normally; if the historical temperature 35, the difference value between the first temperature and the historical temperature is 4 degrees, and the preset condition is not met, the SSD works abnormally.
In practical application, when the SSD is judged to be working normally according to the first working parameter, the second working parameter and the historical working parameter, the historical working parameter may be updated by using the first working parameter and the second working parameter, so that the processor may update the stored working parameter, and perform subsequent judgment by using the updated historical working parameter. In a specific implementation, the working parameters may be stored for the first controller and the second controller, that is, for the first controller, corresponding to the first historical working parameter, and for the second controller, corresponding to the second historical working parameter, whether the solid state disk is abnormal is determined according to the first working parameter and the first historical working parameter, and whether the solid state disk is abnormal is determined according to the second working parameter and the second historical working parameter.
When the SSD works abnormally according to the first working parameter, the second working parameter and the historical working parameter, abnormal events are recorded, so that maintenance personnel can check and maintain according to the log.
In a possible implementation manner of the embodiment of the present application, when the first operating parameter and the second operating parameter exceed a preset threshold, the method further includes: judging whether the first working parameter and/or the second working parameter are null or not according to the historical working parameters; when the first working parameter is a null value, determining that the first controller works abnormally; and/or determining that the second controller works abnormally when the second working parameter is a null value.
In this embodiment, when the difference between the first working parameter and the second working parameter exceeds the preset threshold, it indicates that at least one of the two working parameters is abnormal, and in order to determine the abnormal working parameter, during the specific implementation, it is determined whether the first working parameter and the second working parameter are null values, and if the first working parameter is null value, it indicates that the first controller is abnormal in operation; similarly, if the second operating parameter is a null value, the second operating parameter is determined to be an abnormal value, indicating that the second controller is abnormal in operation.
In practical application, when any one of the two working parameters is null, the difference between the two working parameters is defined as infinite, so that the difference between the two working parameters exceeds a preset threshold, and when the difference exceeds the threshold, the working parameter is abnormal, so that the abnormal working parameter can be further judged.
When the first controller or the second controller is abnormal due to downtime, in order to ensure normal operation of the first controller or the second controller, the embodiment of the present application further provides a method for solving the abnormality of the controller, which specifically includes: when the first controller works abnormally and the second controller works normally, an abnormal signal is sent to the second controller, so that the second controller resets the first controller according to the abnormal signal; or when the second controller works abnormally and the first controller works normally, sending an abnormal signal to the first controller so that the first controller resets the second controller according to the abnormal signal.
In practical application, the first controller can be connected with a reset interface of the second controller through a General Input/Output (General Input Output) interface, the second controller is connected with the reset interface of the first controller through a GPIO interface, when only the first controller works abnormally, the processor sends an abnormal signal to the second controller, and the second controller sends a reset signal to the reset interface of the first controller through the GPIO interface according to the received abnormal signal so as to restart the first controller; similarly, when only the second controller works abnormally, the processor sends an abnormal signal to the first controller, and the first controller sends a reset signal to the reset interface of the second controller through the GPIO interface according to the received abnormal signal so as to restart the second controller. The abnormal signal may carry an identifier of the abnormal controller, so that another controller may reset the controller corresponding to the identifier according to the identifier.
In this embodiment, the first controller may also be connected to the reset interface of the second controller through another interface, which is not limited herein.
It can be understood that after the controller is restarted, whether the controller is recovered to normal or not can be judged according to the working parameters sent by the controller, and if the controller is recovered to normal, the controller is shut down when running; if the storage device is not normal, reporting an abnormal event to the server so as to inform a user of timely processing and avoid influencing the normal use of the storage device.
It can be seen from the above embodiments that, in the embodiment of the present application, the two controllers of the solid state disk, that is, the first controller and the second controller, are used to simultaneously acquire the working parameters of the SSD, and when the working parameters respectively acquired by the first controller and the second controller are received, it is determined whether the difference between the two working parameters exceeds the preset threshold, and if the difference does not exceed the preset threshold, it is determined whether the SSD is abnormal according to the two working parameters and the historical working parameters, and it can be seen that the method provided in the embodiment of the present application increases a redundancy mechanism compared with the prior art that only one controller is used to acquire the working parameters to determine the running state of the SSD, and determines the running condition of the SSD under the condition that the working parameters sent by the two controllers are both normal, thereby improving the accuracy of the determination, so that the server can be timely notified to perform maintenance when the SSD is abnormal, the stable operation of the storage device is ensured, and the loss of data in production and life is reduced.
In addition, when any one of the two controllers is abnormal, the other controller can be used for resetting the abnormal controller, so that a mechanism for processing the abnormal condition is added, the abnormal operation of the SSD caused by the downtime is prevented, and the running reliability of the SSD is improved. Referring to fig. 2, which is a flowchart for detecting an SSD exception according to an embodiment of the present disclosure, as shown in fig. 2, first receive a first working parameter sent by a first controller and a second working parameter sent by a second controller, determine whether a difference between the two working parameters exceeds a preset threshold, if not, determine an SSD running condition according to a historical working parameter, the first working parameter, and the second working parameter, and obtain the working parameter again; if the two working parameters exceed the preset threshold, judging whether the two working parameters have empty values, if not, recording abnormal working parameters, and acquiring data again; if the two working parameters are both null values, the two controllers can be reset by other equipment, and then data is acquired again; when the first parameter is null, the first controller is judged to be abnormal, an abnormal signal is sent to the second controller, the second controller resets the first controller, abnormal working parameters are recorded, and data is obtained again; and if the first working parameter is not a null value, sending an abnormal signal to the first controller, resetting the second controller by the first controller, recording the abnormal working parameter, and re-acquiring the data.
Exemplary devices
Based on the above method embodiment, the present application provides a detection device for a solid state disk, which will be described below with reference to the accompanying drawings.
Referring to fig. 3, which is a structural diagram of a detection apparatus for a solid state disk according to an embodiment of the present application, as shown in fig. 3, the apparatus includes:
the first controller 301 is configured to obtain a first working parameter of the solid state disk, and send the first working parameter to a processor;
the second controller 302 is configured to obtain a second working parameter of the solid state disk, and send the second working parameter to the processor;
the processor 303 is configured to receive the first working parameter and the second working parameter, determine whether a difference between the first working parameter and the second working parameter exceeds a preset threshold, and determine whether the solid state disk is abnormal according to a historical working parameter, the first working parameter, and the second working parameter when the difference does not exceed the preset threshold;
wherein the historical operating parameter, the first operating parameter, and the second operating parameter include at least one of temperature, current, and power.
Optionally, the processor is further configured to, when a difference between the first operating parameter and the second operating parameter exceeds the preset threshold, determine whether the first operating parameter and/or the second operating parameter is a null value;
when the first working parameter is a null value, determining that the first controller works abnormally;
and/or the presence of a gas in the gas,
and when the second working parameter is a null value, determining that the second controller works abnormally.
Optionally, when the first controller works abnormally and the second controller works normally, the processor is further configured to send an abnormal signal to the second controller;
the second controller is used for resetting the first controller according to the abnormal signal;
alternatively, the first and second electrodes may be,
when the second controller works abnormally and the first controller works normally, the processor is further used for sending an abnormal signal to the first controller;
the first controller is used for resetting the first controller according to the abnormal signal.
Optionally, the apparatus further comprises: a monitor;
the monitor is used for monitoring the first controller and the second controller, and resetting the first controller and/or the second controller when the first controller and/or the second controller work abnormally.
In this embodiment, the monitor can independently monitor the operating condition of the controller, and when the monitor monitors that the first controller is abnormal, the monitor automatically resets the first monitor, so that the monitor can flexibly monitor the operating condition of the controller. In specific implementation, the monitor may be a watchdog monitoring chip, the monitoring chip may monitor the working states of the first controller and the second controller in real time, and when any one of the controllers is monitored to be abnormal, the abnormal controller is reset to restart the abnormal controller.
Optionally, the monitor comprises a first monitor and a second monitor;
the first monitor is used for monitoring the first controller and resetting the first controller when the first controller works abnormally;
and the second monitor is used for monitoring the second controller and resetting the second controller when the second controller works abnormally.
In this embodiment, in order to facilitate monitoring of the operating states of the two controllers, a monitor may be respectively configured for the two controllers, where the first monitor monitors the first controller, and the second monitor monitors the second controller. When the monitor monitors that the corresponding controller is abnormal, the monitor resets the corresponding controller to restart the controller, so that the use of the storage device is prevented from being influenced by downtime.
It should be noted that, for specific implementation of each module in this embodiment, reference may be made to implementation of the method embodiment described in fig. 1, and details of this embodiment are not described herein again.
It can be seen from the above embodiments that, in the embodiment of the present application, two controllers of a solid state disk, that is, a first controller and a second controller, are used to simultaneously acquire working parameters of an SSD, and when receiving the working parameters respectively acquired by the first controller and the second controller, it is determined whether a difference between the two working parameters exceeds a preset threshold, and if the difference does not exceed the preset threshold, indicating that the two working parameters are normal values, it is determined whether the SSD is abnormal according to the two working parameters and historical working parameters, and it can be seen that the method provided in the embodiment of the present application increases a redundancy mechanism compared with the prior art in which only one controller is used to acquire the working parameters to determine an SSD operating state, and determines an SSD operating condition under the condition that the two working parameters are normal, improves the accuracy of the determination, so that a server can be timely notified to perform maintenance when the SSD is abnormal, the stable operation of the storage device is ensured, and the loss of data in production and life is reduced.
In addition, when any one of the two controllers is abnormal, the other controller can be used for resetting the abnormal controller, so that a mechanism for processing the abnormal condition is added, the abnormal operation of the SSD caused by the downtime is prevented, and the running reliability of the SSD is improved. Moreover, when the two controllers are abnormal, the monitor can be used for resetting the two controllers, and further guarantee is provided for timely handling abnormal conditions.
For convenience of understanding of the technical solution of the present application, a description will be given below with reference to a specific application scenario, referring to fig. 4, which is an exemplary diagram of an application scenario provided in an embodiment of the present application, as shown in fig. 4, a solid state disk includes a main controller and a microcontroller unit, where both controllers can obtain a temperature and a current when an SSD operates from a temperature sensor and a current sensor, each of the two controllers sends an obtained parameter value to a host, when the host determines that a parameter sent by one of the two controllers is a null value, it is determined that the controller corresponding to the null value is abnormal, and when the main controller is abnormal, the host sends an abnormal signal to the microcontroller unit, where the abnormal signal carries an identifier of the main controller, so that the microcontroller resets the main controller according to the identifier in the abnormal signal; when the microcontroller unit is abnormal, the host sends an abnormal signal to the main controller, and the abnormal signal carries the identifier of the microcontroller unit, so that the main controller resets the microcontroller unit according to the identifier in the abnormal signal; when main control unit and microcontroller unit are all unusual, watchdog 1 resets the microcontroller unit, and watchdog 2 resets main control unit to restart unusual controller, in order to guarantee that the host computer can acquire the working parameter of SSD, in order to guarantee the operational aspect that can real time monitoring SSD, improve the reliability of SSD operation.
The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description.
Those of skill would further appreciate that the various illustrative machines and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the various illustrative components and steps have been described above generally in terms of their functionality in order to clearly illustrate this interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. The terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The foregoing is merely a preferred embodiment of the present application and is not intended to limit the present application in any way. Although the present application has been described with reference to the preferred embodiments, it is not intended to limit the present application. Those skilled in the art can now make numerous possible variations and modifications to the disclosed embodiments, or modify equivalent embodiments, using the methods and techniques disclosed above, without departing from the scope of the claimed embodiments. Therefore, any simple modification, equivalent change and modification made to the above embodiments according to the technical essence of the present application still fall within the protection scope of the technical solution of the present application without departing from the content of the technical solution of the present application.

Claims (10)

1. The detection method of the solid state disk is characterized in that the solid state disk comprises a first controller and a second controller; the method comprises the following steps:
receiving a first working parameter of the solid state disk sent by the first controller and a second working parameter of the solid state disk sent by the second controller; the first operating parameter and the second operating parameter are the same;
judging whether the difference value of the first working parameter and the second working parameter exceeds a preset threshold value or not;
if not, judging whether the solid state disk is abnormal or not according to historical working parameters, the first working parameters and the second working parameters;
wherein the historical operating parameter, the first operating parameter, and the second operating parameter include at least one of temperature, current, and power.
2. The method of claim 1, wherein when the difference between the first operating parameter and the second operating parameter exceeds the preset threshold, the method further comprises:
judging whether the first working parameter and/or the second working parameter is a null value;
when the first working parameter is a null value, determining that the first controller works abnormally;
and/or the presence of a gas in the gas,
and when the second working parameter is a null value, determining that the second controller works abnormally.
3. The method of claim 2, further comprising:
and when the first controller works abnormally and the second controller works normally, sending an abnormal signal to the second controller so that the second controller resets the first controller according to the abnormal signal.
4. The method of claim 2, further comprising:
and when the second controller works abnormally and the first controller works normally, sending an abnormal signal to the first controller so that the first controller resets the second controller according to the abnormal signal.
5. The detection device for the solid state disk is characterized by comprising:
the first controller is used for acquiring a first working parameter of the solid state disk and sending the first working parameter to the processor;
the second controller is used for acquiring a second working parameter of the solid state disk and sending the second working parameter to the processor; the first operating parameter and the second operating parameter are the same;
the processor is configured to receive the first working parameter and the second working parameter, determine whether a difference between the first working parameter and the second working parameter exceeds a preset threshold, and determine whether the solid state disk is abnormal according to a historical working parameter, the first working parameter and the second working parameter when the difference does not exceed the preset threshold;
wherein the historical operating parameter, the first operating parameter, and the second operating parameter include at least one of temperature, current, and power.
6. The device of claim 5, wherein the processor is further configured to determine whether the first operating parameter and/or the second operating parameter is null when the difference between the first operating parameter and the second operating parameter exceeds the preset threshold;
when the first working parameter is a null value, determining that the first controller works abnormally;
and/or the presence of a gas in the gas,
and when the second working parameter is a null value, determining that the second controller works abnormally.
7. The apparatus of claim 6, wherein when the first controller is operating abnormally and the second controller is operating normally, the processor is further configured to send an exception signal to the second controller;
and the second controller is used for resetting the first controller according to the abnormal signal.
8. The apparatus of claim 6, wherein when the second controller is operating abnormally and the first controller is operating normally, the processor is further configured to send an exception signal to the first controller;
the first controller is used for resetting the second controller according to the abnormal signal.
9. The apparatus according to any one of claims 5 to 8, characterized in that it further comprises: a monitor;
the monitor is used for monitoring the first controller and the second controller, and resetting the first controller and/or the second controller when the first controller and/or the second controller work abnormally.
10. The apparatus of claim 9, wherein the monitor comprises a first monitor and a second monitor;
the first monitor is used for monitoring the first controller and resetting the first controller when the first controller works abnormally;
and the second monitor is used for monitoring the second controller and resetting the second controller when the second controller works abnormally.
CN201810829395.4A 2018-07-25 2018-07-25 Detection method and equipment for solid state disk Active CN108899059B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810829395.4A CN108899059B (en) 2018-07-25 2018-07-25 Detection method and equipment for solid state disk

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810829395.4A CN108899059B (en) 2018-07-25 2018-07-25 Detection method and equipment for solid state disk

Publications (2)

Publication Number Publication Date
CN108899059A CN108899059A (en) 2018-11-27
CN108899059B true CN108899059B (en) 2021-01-12

Family

ID=64352727

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810829395.4A Active CN108899059B (en) 2018-07-25 2018-07-25 Detection method and equipment for solid state disk

Country Status (1)

Country Link
CN (1) CN108899059B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109976674A (en) * 2019-03-29 2019-07-05 苏州浪潮智能科技有限公司 A kind of hard disk management method, system, equipment and computer readable storage medium
CN110045210A (en) * 2019-05-15 2019-07-23 深圳市英威腾电气股份有限公司 Functional safety detection method, device, functional safety module and detection system
CN113031552B (en) * 2021-03-09 2022-10-25 浙江菲达环保科技股份有限公司 Cooperative control method and system for environmental protection equipment behind furnace

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982836A (en) * 2012-11-21 2013-03-20 记忆科技(深圳)有限公司 Method for improving reliability of solid state disk and solid state disk of method
CN104731523A (en) * 2013-12-24 2015-06-24 国际商业机器公司 Method and controller for collaborative management of non-volatile hierarchical storage system
CN107919979A (en) * 2017-10-24 2018-04-17 新华三技术有限公司 Controller state processing method and processing device
CN107977169A (en) * 2017-12-25 2018-05-01 成都信息工程大学 A kind of high reliability solid state hard disc

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8737141B2 (en) * 2010-07-07 2014-05-27 Stec, Inc. Apparatus and method for determining an operating condition of a memory cell based on cycle information

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982836A (en) * 2012-11-21 2013-03-20 记忆科技(深圳)有限公司 Method for improving reliability of solid state disk and solid state disk of method
CN104731523A (en) * 2013-12-24 2015-06-24 国际商业机器公司 Method and controller for collaborative management of non-volatile hierarchical storage system
CN107919979A (en) * 2017-10-24 2018-04-17 新华三技术有限公司 Controller state processing method and processing device
CN107977169A (en) * 2017-12-25 2018-05-01 成都信息工程大学 A kind of high reliability solid state hard disc

Also Published As

Publication number Publication date
CN108899059A (en) 2018-11-27

Similar Documents

Publication Publication Date Title
CN109783262B (en) Fault data processing method, device, server and computer readable storage medium
US9747182B2 (en) System and method for in-service diagnostics based on health signatures
US9195561B2 (en) System and method for proactive management of an information handling system with in-situ measurement of end user actions
CN108899059B (en) Detection method and equipment for solid state disk
JP5736881B2 (en) Log collection system, apparatus, method and program
EP3321807B1 (en) Disk detection method and device
JP4573179B2 (en) Performance load abnormality detection system, performance load abnormality detection method, and program
WO2017125014A1 (en) Method and device for monitoring hard disk
WO2015023201A2 (en) Method and system for determining hardware life expectancy and failure prevention
CN105117301A (en) Memory warning method and apparatus
JP2016152011A (en) Failure prediction system of control device
CN111796959A (en) Host machine container self-healing method, device and system
US11032627B2 (en) Maintenance device, presentation system, and program
CN115878356A (en) Disk failure prediction method and device
CN114758714A (en) Hard disk fault prediction method and device, electronic equipment and storage medium
US8984333B2 (en) Automatic computer storage medium diagnostics
US20100011100A1 (en) Health Check System, Server Apparatus, Health Check Method, and Storage Medium
CN117251336A (en) Monitoring system, method and device for server hard disk backboard
CN115168168A (en) Server failure prediction method, system, device and medium
US20220091920A1 (en) Bmc, server system, device stability determination method, and non-transitory computer-readable recording medium
JP2012038362A (en) Hard disk failure sign detection method
US20210349775A1 (en) Method of data management and method of data analysis
CN112988442B (en) Method and equipment for transmitting fault information in server operation stage
CN112256535B (en) Hard disk alarm method, device, computer equipment and storage medium
CN111752800B (en) Monitoring method and monitoring device for computer system and application program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20201201

Address after: 215100 No. 1 Guanpu Road, Guoxiang Street, Wuzhong Economic Development Zone, Suzhou City, Jiangsu Province

Applicant after: SUZHOU LANGCHAO INTELLIGENT TECHNOLOGY Co.,Ltd.

Address before: 450018 Henan province Zheng Dong New District of Zhengzhou City Xinyi Road No. 278 16 floor room 1601

Applicant before: ZHENGZHOU YUNHAI INFORMATION TECHNOLOGY Co.,Ltd.

GR01 Patent grant
GR01 Patent grant