CN110704228A - Solid state disk exception handling method and system - Google Patents

Solid state disk exception handling method and system Download PDF

Info

Publication number
CN110704228A
CN110704228A CN201910930370.8A CN201910930370A CN110704228A CN 110704228 A CN110704228 A CN 110704228A CN 201910930370 A CN201910930370 A CN 201910930370A CN 110704228 A CN110704228 A CN 110704228A
Authority
CN
China
Prior art keywords
solid state
state disk
host system
data
task scheduling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910930370.8A
Other languages
Chinese (zh)
Other versions
CN110704228B (en
Inventor
束南山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhiyu Technology Co ltd
Original Assignee
To Reputation Technology (wuhan) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by To Reputation Technology (wuhan) Co Ltd filed Critical To Reputation Technology (wuhan) Co Ltd
Priority to CN201910930370.8A priority Critical patent/CN110704228B/en
Publication of CN110704228A publication Critical patent/CN110704228A/en
Application granted granted Critical
Publication of CN110704228B publication Critical patent/CN110704228B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/08Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F11/10Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F11/1008Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices
    • G06F11/1048Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices using arrangements adapted for a specific error detection or correction feature
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C7/00Arrangements for writing information into, or reading information out from, a digital store
    • G11C7/20Memory cell initialisation circuits, e.g. when powering up or down, memory clear, latent image memory
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a solid state disk exception handling method and a solid state disk exception handling system, which relate to the field of data storage. And after detecting that the operation of the solid state disk is abnormal, the firmware records the abnormal information of the solid state disk and enters a safety mode. And operating the solid state disk in the safety mode, and reporting abnormal information to a host system. And the host system receives an operation instruction input by a user and issues the operation instruction to the solid state disk. The solid state disk acquires an operating instruction from a host system, and only executes task scheduling which can be executed in a safe mode in task scheduling corresponding to the operating instruction.

Description

Solid state disk exception handling method and system
Technical Field
The invention relates to the field of data storage, in particular to a method and a system for processing exception of a solid state disk.
Background
The SSD is a data storage device with NAND FLASH chip as storage medium, and is connected to the computer system in the form of SATA or PCIe interface, so as to provide non-volatile data storage service for HOST.
In the use process of the solid state disk, due to factors such as working environment temperature, electromagnetic radiation, NAND FLASH chip self defects, firmware design defects and the like, various errors and abnormal conditions are inevitable. For example, when the operating temperature of the SSD is too high, a severe Bit Flip error occurs in the SRAM inside the SSD controller or the external DRAM; or when the connection between the SSD and the system is subjected to external serious electromagnetic interference, the data cannot be normally transmitted; or due to design defects of a firmware algorithm, some extreme conditions cannot be handled, and the like, the serious errors usually cause that the SSD firmware cannot normally run and cannot provide normal data access service for the computer system, and in a serious condition, the whole computer system does not respond and the system is down.
As shown in fig. 1, the main idea of the prior art solution is to stop the SSDController when the SSD firmware cannot normally run, or to try to resume the normal running of the SSD firmware by resetting the SSD Controller. The existing solution has the following conditions that cannot be handled:
1. due to the fact that the SSD Controller stops running, SSD firmware cannot interact with the host system, the whole computer system is down, and phenomena such as the blue screen of a Windows operating system and the like
2. The SSD firmware does not timely inform the host system of the abnormal conditions currently encountered by the SSD firmware, and the host system cannot acquire specific error information by other means
3. Due to the interruption of the interaction between the SSD and the host system, the host system cannot recover the data in the disk by other means, resulting in the loss of data stored on the SSD
4. If the SSD firmware has serious errors in the power-on starting process, the Watchdog Timer overtime restart mechanism can cause the SSD to be repeatedly restarted, and under the condition, the SSD can be recovered to be normally used only by being repaired and returned to the factory
For the above described abnormal situation, the existing solution can be summarized into two points, firstly, when the SSD firmware cannot handle the abnormality, the SSD firmware will directly enter the assertion mode, and its SSD Controller will stop running, and cannot respond to the request and query command of the user; and secondly, opening a Watchdog Timer of the SSD Controller, after a preset time period, if the SSD Controller fails to operate normally, resetting the Watchdog Timer by the firmware within a specified time period, and after the Watchdog Timer is overtime, enabling the SSD Controller to enter an interrupt processing mode or enabling the SSD system to automatically restart and reset.
Disclosure of Invention
Aiming at the defects in the prior art, the invention aims to provide a solid state disk exception handling method and system, which can improve the corresponding and handling capacity of the solid state disk under the exception condition.
To achieve the above object, in a first aspect, an embodiment of the present invention provides a method for handling an exception in a solid state disk
Setting a safety mode of the solid state disk, wherein the safety mode is configured that when a host system is in a running state, the solid state disk only executes necessary task scheduling and limited command processing;
after the data transmission abnormality of the solid state disk is detected, recording the abnormal information of the solid state disk and entering a safety mode;
operating the solid state disk in the safety mode, and reporting abnormal information to a host system;
the host system receives an operation instruction input by a user and issues the operation instruction to the solid state disk;
the solid state disk acquires an operating instruction from a host system, and only executes task scheduling and command processing which can be executed in a safe mode in task scheduling corresponding to the operating instruction.
In a preferred embodiment, when the task scheduling does not belong to the necessary task scheduling, the solid state disk returns a message that the execution cannot be performed to the host system.
As a preferred embodiment, the exception information includes a data exception category, and the data exception category includes a bit flip error and a transmission timeout.
As a preferred embodiment, the exception information includes current state information of the solid state disk, and the state information of the solid state disk includes temperature of the solid state disk and data throughput.
As a preferred embodiment, the commands capable of being processed include backing up all data in the solid state disk to other hard disks, and restoring factory settings.
As a preferred embodiment, the commands that can be processed in the secure mode include user-customized commands.
As a preferred embodiment, the user-customized command includes a priority transfer of the data of the fixed area to the backup.
In a second aspect, an embodiment of the present invention further provides a system for processing an exception in a solid state disk, where the system includes a security module, a detection module, a reporting module, a receiving module, and an execution module:
the system comprises a security module, a data processing module and a data processing module, wherein the security module is used for setting a security mode of the solid state disk, and the security mode is configured in such a way that when a host system is in a running state, the solid state disk only executes necessary task scheduling and limited command processing;
the detection module is used for recording abnormal information of the solid state disk and entering a safety mode after detecting that the data transmission of the solid state disk is abnormal;
the reporting module is used for operating the solid state disk in the safety mode and reporting abnormal information to the host system;
the receiving module is used for receiving an operation instruction input by a user through the host system and issuing the operation instruction to the solid state disk;
and the execution module is used for acquiring the operating instruction from the host system through the solid state disk and only executing task scheduling and command processing which can be executed in a safe mode in task scheduling corresponding to the operating instruction.
As a preferred embodiment, the exception information includes a data exception category and current solid state disk state information, the data exception category includes a bit flip error and transmission timeout, and the solid state disk state information includes a solid state disk temperature and a data throughput.
As a preferred embodiment, the commands capable of being processed include backing up all data in the solid state disk to other hard disks, and restoring factory settings.
Compared with the prior art, the invention has the advantages that: after the abnormal state of the hard disk occurs, the performance is more active and flexible. In an abnormal state, the operation of the host system can still be maintained for the commands capable of being processed, and the processing commands sent by a user or the host system are received while the operation of the host system is ensured, so that the current state or data of the solid state disk is processed in a targeted manner, and the data safety on the solid state disk is further ensured to a certain extent.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings corresponding to the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 is a flowchart illustrating a conventional processing procedure for an abnormal solid state disk;
FIG. 2 is a flowchart illustrating steps of a method for handling an exception in a solid state drive according to an embodiment of the present invention;
FIG. 3 is a flowchart illustrating steps of a method for handling an exception in a solid state drive according to another embodiment of the present invention;
fig. 4 is a schematic structural diagram of a solid state disk exception handling system according to an embodiment of the present invention.
Detailed Description
The noun explains:
ssd (solid State drive): solid state disk, data storage device consisting of several NAND FLASH arrays.
NAND FLASH: a non-volatile storage medium, also known as a NAND flash memory chip.
HOST: a computer host system running system software and applications.
SATA (Serial ATA: Serial Advanced Technology Attachment): serial ATA, one of the hard disk data transfer interfaces, has a maximum transfer rate of 6 Gb/s.
PCIe (peripheral Component Interconnect express): a high speed serial computer expansion bus.
SRAM (static Random Access memory): static random access memory, a volatile storage medium.
DRAM (dynamic Random Access memory): dynamic random access memory, a volatile storage medium.
Firmware: and program codes solidified on the embedded equipment are responsible for controlling the functions of the embedded equipment.
SSD Controller: a microprocessor chip running SSD firmware.
Bit Flip: and the bit is inverted, and the data bit in the storage unit is accidently jumped because of the external interference factor.
Watchdog Timer: a watchdog timer, a part of an embedded microprocessor chip, is used for forcibly resetting the whole microprocessor or enabling the processor to execute a preset program after the counter is overtime if the counter is not reset in a preset period.
Embodiments of the present invention will be described in further detail below with reference to the accompanying drawings.
The embodiment of the invention provides a method and a system for processing the exception of a solid state disk, which can ensure that a host system can continuously run and simultaneously can still selectively execute a task scheduling command issued by a chairman system by setting a safety mode, namely, the task scheduling command which can be executed in the safety mode can be executed, so that the solid state disk can be ensured to adjust the state and backup data in the safety mode, and further the safety and the stability of the host system and the data are ensured.
In order to achieve the technical effects, the general idea of the application is as follows:
setting a safety mode of the solid state disk, wherein the safety mode is configured that when a host system is in a running state, the solid state disk only executes necessary task scheduling and limited command processing;
after the data transmission abnormality of the solid state disk is detected, recording the abnormal information of the solid state disk and entering a safety mode;
operating the solid state disk in the safety mode, and reporting abnormal information to a host system;
the host system receives an operation instruction input by a user and issues the operation instruction to the solid state disk;
the solid state disk acquires an operating instruction from a host system, and only executes task scheduling and command processing which can be executed in a safe mode in task scheduling corresponding to the operating instruction.
In summary, in the conventional technical solution, the solid state disk is very susceptible to environmental influences, such as over-high temperature of the working environment and strong electromagnetic radiation of the working environment, except for the working environment; in addition, the solid state disk will also have an influence on its own defects. For example, chip bug, Bit Flip error of internal SRAM or external DRAM, designer error of firmware algorithm, etc. When the influence of these factors is serious, the SSD firmware cannot normally operate, and cannot provide a normal data access service for the host system, and when the influence is serious, the host system cannot respond to any command, and the host is down. As shown in fig. 1, after the solid state disk is initialized and self-scheduled with a task, the solid state disk enters an assertion mode or is restarted directly after an exception occurs, and the host system and the user cannot acquire any state of the current solid state disk and cannot complete a remedial measure for the solid state disk through task scheduling.
According to the invention, after the transmission abnormality of the solid state disk occurs, the solid state disk does not directly enter into an assertion mode or is restarted, but enters into a safe mode, and executes task scheduling which can be executed in the safe mode, so that the host system can be ensured to operate at the lowest limit, and the abnormal information is reported to the host system. The user or the host system can perform specific judgment according to the abnormal information and know the error of the solid state disk, so that the solid state disk can be pertinently sent out an operation instruction, and after the solid state disk receives task scheduling corresponding to the operation instruction, the data and the state of the solid state disk can be more pertinently processed, so that the data safety and the system stability are improved.
In order to better understand the technical solution, the following detailed description is made with reference to specific embodiments.
Referring to fig. 2 and fig. 3, an embodiment of the present invention provides a method for processing an exception in a solid state disk, including:
s1: and setting a safety mode of the solid state disk, wherein the safety mode is configured that when the host system is in a running state, the solid state disk only executes necessary task scheduling and limited command processing.
The security mode of the solid state disk is set, so that the stability of the host system can be improved. After the solid state disk is in the safety mode, task scheduling and command processing of the solid state disk are limited, only part of task scheduling commands and command processing can be executed, matching can be performed according to a list of executable task scheduling commands in the preset safety mode, and the solid state disk is guaranteed to run under the condition that resource and data throughput is small, namely the influence on the solid state disk is small. The task scheduling command and command processing at least includes task scheduling command and command processing for ensuring that the host system can continuously run, so that the host system can continuously run when the solid state disk runs.
It should be noted that the security mode may be set for task scheduling command and command processing, and this setting may be manually preset, for example, task scheduling and command processing of the targeted survival nature to the host system must be performed, and task scheduling and command processing of the non-required nature may be manually removed; besides specifying the specific task scheduling artificially, the task scheduling and command processing can be performed according to the importance level, for example, a higher importance level is set for the task scheduling and command processing which concern the host system can continue to run, and the security mode only or preferentially executes the task scheduling with a higher importance level. In addition to setting for task scheduling commands and command processing, the data position where the execution command of the host system is located can be monitored according to the partition where the task scheduling and command processing are located, and scheduling and command processing of data in the area can be guaranteed to be completed. The safety mode of the invention is a mode which ensures that the host system can continuously run in a running state by selecting the task scheduling and command processing in the solid state disk.
S2: and after the data transmission abnormality of the solid state disk is detected, recording the abnormal information of the solid state disk and entering a safety mode.
The abnormal information of the solid state disk is recorded and the solid state disk enters a safety mode, so that a host system and a user can know the state of the solid state disk. When the data transmission abnormality of the solid state disk is detected, the hard disk information of the solid state disk is recorded immediately, and the host system is in a safe mode, so that the host system can be prevented from directly going down due to the abnormality of the solid state disk to a large extent, and meanwhile, if the host system goes down, a user can also perform backtracking analysis by analyzing the abnormal information recorded by the solid state disk at that time. Compared with the traditional solid state disk which executes diagnosis and is automatically restarted, when the host system can continuously run, the storage and backup of the current task of the user and the management of the state of the solid state disk can be ensured as much as possible, when the host system cannot continuously run, the user can also know the abnormal reason by tracing the abnormal information.
Preferably, the exception information includes a data exception category, and the data exception category includes a bit flip error and a transmission timeout.
The abnormal information corresponds to the classified mode, and besides the user can discriminate, the host system can also preprocess according to the category, so that the processing can be carried out more quickly.
Optionally, the abnormal information includes current state information of the solid state disk, and the state information of the solid state disk includes a temperature of the solid state disk and a data throughput.
Except for the abnormal type, in order to enable a host system and a user to more accurately judge the reason of the current solid state disk abnormality, the abnormality information can also carry the state information of the current solid state disk, and the user judges according to the temperature and the data throughput of the solid state disk in the state information. If the solid state disk cannot normally complete the Data Flip at a high working temperature, the host system and the user can judge that the Data Flip is a working environment problem from the temperature of the hard disk carried in the abnormal information, and then corresponding measures are taken.
S3: and operating the solid state disk in the safety mode, and reporting abnormal information to a host system.
After the solid state disk enters the safety mode, the abnormal information is reported to the host system, so that the diagnosis of the host system and a user can be assisted, and the success rate of the rescue is improved. After the solid state disk enters the safety mode, the recorded abnormal information is reported to the host system, the host system and a user can know the principle of the current solid state disk abnormality according to the abnormal information, the hard disk is diagnosed through the abnormal information, and the reason of the solid state disk abnormality is tried to be found out. Therefore, the solid state disk enters a safety mode, and reporting of abnormal information is guaranteed under the condition that the host system is continuously operated, which is very helpful for the host system and a user to diagnose the abnormality of the solid state disk.
S4: and the host system receives an operation instruction input by a user and issues the operation instruction to the solid state disk.
The solid state disk can receive an operation instruction input by a user through the host system, so that the user can process the abnormity of the solid state disk. And the solid state disk enters a safety mode, and after the host system is maintained to run, a user diagnoses and knows the abnormal reason of the solid state disk through the abnormal information reported by the user. At this time, the user also wants to be able to adjust the current state of the solid state disk and the like through the operation instruction, so that the solid state disk can recover or avoid the current abnormal state, and thus the host can continue to operate. The hard disk provided by the invention can receive the operation instruction input by the user through the host system, so that the repair work of the solid state disk can be ensured to be intervened by the user to a certain extent, and the stability of the solid state disk, the safety of data on the solid state disk and the stability of the host system are improved to a greater extent.
For example, when a host used by a user enters a high-temperature environment, data transmission on the solid state disk is further influenced, the solid state disk immediately enters a safe mode, the operation of the host system is maintained, and the work of the user on the host system is guaranteed not to be lost. Meanwhile, the solid state disk records and reports abnormal information to the host system, the user and the host system know that the reason causing the current abnormality of the solid state disk is the environmental temperature according to the abnormal information, and then after the host is cooled, the user can command the solid state disk to exit the safety mode through the operation instruction and continue to operate. The user accesses the self-diagnosis and repair process of the solid state disk in the traditional technology according to the abnormal information and the operation instruction, so that the stability of the solid state disk, the safety of data on the solid state disk and the stability of a host system are improved to a greater extent.
As a preferred embodiment, the commands capable of being processed include backing up all data in the solid state disk to other hard disks, and restoring factory settings.
For users, the main application of the solid state disk is to store and ensure the correctness and safety of data. When the solid state disk is abnormal, it is dangerous to store the data, so that in the safety mode, the command to be processed should include the backup of all data in the solid state disk to other disks. Furthermore, when the solid state disk has a setting error, the user tends to return to the original factory setting, so that the solid state disk can return to the old state and then work normally.
Preferably, the commands that can be processed in the secure mode include user-customized commands.
A means for flexibly processing the abnormal condition of the SSD is provided for the host, customized design can be carried out according to customer requirements, and the competitiveness of SSD products in the market is improved. If the server uses the solid state disk, a damage immediate transfer command can be set, so that the data is preferentially transferred after the server is abnormal, and the server continues to serve by using the transferred data. That is, the user-customized command includes a backup of data of the fixed area by priority transmission.
S5: the solid state disk acquires an operating instruction from a host system, and only executes task scheduling and command processing which can be executed in a safe mode in task scheduling corresponding to the operating instruction.
The task scheduling in the security mode can be executed, so that the stability of the host system and the security of data can be ensured. After the host system and the user complete the initial diagnosis of the solid state disk according to the abnormal information, corresponding operation instructions may be performed on the solid state disk.
It should be noted that the following conditions may exist in the initial diagnosis:
1. host system and user do not know abnormality cause of solid state disk
2. The host system and the cause of the abnormality diagnosed by the user are wrong
3. The host system and the user execute other commands without taking care of the exception
4. The host system and the user diagnose the correct abnormality cause but the operation instruction input is wrong.
In the above situations, if the solid state disk in the secure mode directly performs the operation instruction under the above situations, a host system may be down, data of the solid state disk may be lost, and even the solid state disk may be damaged. Therefore, the operation instruction needs to be performed within a range that can be executed in the solid state disk security mode. Therefore, in the command processing, the solid state disk only runs limited command processing, preferably preset command processing, and the solid state disk confirms whether the solid state disk can run or not in a matching corresponding mode.
In a preferred embodiment, when the task scheduling does not belong to the necessary task scheduling, the solid state disk returns a message that the execution cannot be performed to the host system.
For a host system and a user to send an operating instruction for task scheduling to a solid state disk, the solid state disk may not be operated, or after the operation, data is directly damaged or enters an assertion mode, and finally the system goes down, at this time, information which cannot be executed is timely fed back to the host system and the user to inform that the host system and the user operating instruction cannot be completed, and the user and the host system can know that the current operating instruction cannot be completed, so that other operating instructions are selected to continue to complete the operation on the solid state disk.
As shown in fig. 4, an embodiment of the present invention further provides a system for processing an exception of a solid state disk, which includes a security module, a detection module, a reporting module, a receiving module, and an execution module:
the system comprises a security module, a data processing module and a data processing module, wherein the security module is used for setting a security mode of the solid state disk, and the security mode is configured in such a way that when a host system is in a running state, the solid state disk only executes necessary task scheduling and limited command processing;
the detection module is used for recording abnormal information of the solid state disk and entering a safety mode after detecting that the data transmission of the solid state disk is abnormal;
the reporting module is used for operating the solid state disk in the safety mode and reporting abnormal information to the host system;
the receiving module is used for receiving an operation instruction input by a user through the host system and issuing the operation instruction to the solid state disk;
and the execution module is used for acquiring the operating instruction from the host system through the solid state disk and only executing task scheduling and command processing which can be executed in a safe mode in task scheduling corresponding to the operating instruction.
Preferably, the exception information includes a data exception type and current solid state disk state information, the data exception type includes a bit flip error and transmission timeout, and the solid state disk state information includes a solid state disk temperature and a data throughput.
Further, the commands capable of being processed include backing up all data in the solid state disk to other hard disks and restoring factory settings.
Various modifications and specific examples in the foregoing method embodiments are also applicable to the system of the present embodiment, and the detailed description of the method is clear to those skilled in the art, so that the detailed description is omitted here for the sake of brevity.
Based on the same inventive concept, an embodiment of the present invention provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements a method for exception handling of a solid state disk, the method including:
setting a safety mode of the solid state disk, wherein the safety mode is configured that when a host system is in a running state, the solid state disk only executes necessary task scheduling;
after the data transmission abnormality of the solid state disk is detected, recording the abnormal information of the solid state disk and entering a safety mode;
operating the solid state disk in the safety mode, and reporting abnormal information to a host system;
the host system receives an operation instruction input by a user and issues the operation instruction to the solid state disk;
the solid state disk acquires an operating instruction from a host system, and only executes task scheduling which can be executed in a safe mode in task scheduling corresponding to the operating instruction.
Computer storage media for embodiments of the invention may employ any combination of one or more computer-readable media. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. The computer-readable storage medium may be, for example but not limited to: an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.
Based on the same inventive concept, an embodiment of the present invention further provides an electronic device, which includes a memory and a processor, where the memory stores a computer program running on the processor, and the processor executes the computer program to implement all or part of the method steps in the first embodiment.
The Processor may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components, etc. The general purpose processor may be a microprocessor or the processor may be any conventional processor or the like which is the control center for the computer device and which connects the various parts of the overall computer device using various interfaces and lines.
The memory may be used to store the computer programs and/or modules, and the processor may implement various functions of the computer device by running or executing the computer programs and/or modules stored in the memory and invoking data stored in the memory. The memory may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, video data, etc.) created according to the use of the cellular phone, etc. In addition, the memory may include high speed random access memory, and may also include non-volatile memory, such as a hard disk, a memory, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), at least one magnetic disk storage device, a Flash memory device, or other volatile solid state storage device.
Generally, the method and system for processing the exception of the solid state disk provided by the embodiment of the invention limit the operation of the hard disk through the preset security mode, and can improve the corresponding and processing capability of the solid state disk under the exception condition and improve the stability and data security of the solid state disk compared with the traditional technology and the like.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims (10)

1. A solid state disk exception handling method is characterized by comprising the following steps:
setting a safety mode of the solid state disk, wherein the safety mode is configured that when a host system is in a running state, the solid state disk only executes necessary task scheduling and limited command processing;
after the data transmission abnormality of the solid state disk is detected, recording the abnormal information of the solid state disk and entering a safety mode;
operating the solid state disk in the safety mode, and reporting abnormal information to a host system;
the host system receives an operation instruction input by a user and issues the operation instruction to the solid state disk;
the solid state disk acquires an operating instruction from a host system, and only executes task scheduling and command processing which can be executed in a safe mode in task scheduling corresponding to the operating instruction.
2. The method of claim 1, wherein: and when the task scheduling does not belong to the necessary task scheduling, the solid state disk returns a message which cannot be executed to the host system.
3. The method of claim 2, wherein: the exception information comprises data exception types, and the data exception types comprise bit flip errors and transmission overtime.
4. The method of claim 1, wherein: the abnormal information comprises the current state information of the solid state disk, and the state information of the solid state disk comprises the temperature of the solid state disk and the data throughput.
5. The method of claim 1, wherein: the commands capable of being processed comprise that all data in the solid state disk are backed up to other hard disks, and factory settings are restored.
6. The method of claim 1, wherein: the commands that can be processed in the secure mode include user-customized commands.
7. The method of claim 6, wherein: the user customization command comprises the prior transmission backup of the data of the fixed area.
8. The solid state disk exception handling system is characterized by comprising a security module, a detection module, a reporting module, a receiving module and an execution module:
the system comprises a security module, a data processing module and a data processing module, wherein the security module is used for setting a security mode of the solid state disk, and the security mode is configured in such a way that when a host system is in a running state, the solid state disk only executes necessary task scheduling and limited command processing;
the detection module is used for recording abnormal information of the solid state disk and entering a safety mode after detecting that the data transmission of the solid state disk is abnormal;
the reporting module is used for operating the solid state disk in the safety mode and reporting abnormal information to the host system;
the receiving module is used for receiving an operation instruction input by a user through the host system and issuing the operation instruction to the solid state disk;
and the execution module is used for acquiring the operating instruction from the host system through the solid state disk and only executing task scheduling and command processing which can be executed in a safe mode in task scheduling corresponding to the operating instruction.
9. The system of claim 8, wherein: the abnormal information comprises a data abnormal type and current solid state disk state information, the data abnormal type comprises a bit flip error and transmission overtime, and the solid state disk state information comprises solid state disk temperature and data throughput.
10. The system of claim 8, wherein: the commands capable of being processed comprise that all data in the solid state disk are backed up to other hard disks, and factory settings are restored.
CN201910930370.8A 2019-09-29 2019-09-29 Solid state disk exception handling method and system Active CN110704228B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910930370.8A CN110704228B (en) 2019-09-29 2019-09-29 Solid state disk exception handling method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910930370.8A CN110704228B (en) 2019-09-29 2019-09-29 Solid state disk exception handling method and system

Publications (2)

Publication Number Publication Date
CN110704228A true CN110704228A (en) 2020-01-17
CN110704228B CN110704228B (en) 2023-07-28

Family

ID=69197210

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910930370.8A Active CN110704228B (en) 2019-09-29 2019-09-29 Solid state disk exception handling method and system

Country Status (1)

Country Link
CN (1) CN110704228B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111625405A (en) * 2020-04-22 2020-09-04 深圳忆联信息系统有限公司 SSD terminal fault diagnosis method, system, computer device and storage medium
CN112506744A (en) * 2020-12-11 2021-03-16 浪潮电子信息产业股份有限公司 Method, device and equipment for monitoring running state of NVMe hard disk
CN113590416A (en) * 2021-06-30 2021-11-02 郑州云海信息技术有限公司 CPU system firmware assertion processing method, device, equipment and storage medium
CN114356227A (en) * 2021-12-17 2022-04-15 得一微电子股份有限公司 Solid state disk instruction processing method, device and system and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000222201A (en) * 1999-01-27 2000-08-11 Toshiba Corp Instruction executing method of information reproducing device, record medium recording program for allowing the device to execute instruction and disk reproducer
CN102096459A (en) * 2009-12-14 2011-06-15 腾讯科技(深圳)有限公司 Energy-saving method and device
CN103268147A (en) * 2013-06-04 2013-08-28 北京百度网讯科技有限公司 Method and device for hard disk power consumption management
CN106778369A (en) * 2016-11-09 2017-05-31 百望金赋科技有限公司 A kind of hard disk data access method, tax control server
CN106991031A (en) * 2017-03-07 2017-07-28 记忆科技(深圳)有限公司 A kind of implementation method of SMART information monitorings

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000222201A (en) * 1999-01-27 2000-08-11 Toshiba Corp Instruction executing method of information reproducing device, record medium recording program for allowing the device to execute instruction and disk reproducer
CN102096459A (en) * 2009-12-14 2011-06-15 腾讯科技(深圳)有限公司 Energy-saving method and device
CN103268147A (en) * 2013-06-04 2013-08-28 北京百度网讯科技有限公司 Method and device for hard disk power consumption management
CN106778369A (en) * 2016-11-09 2017-05-31 百望金赋科技有限公司 A kind of hard disk data access method, tax control server
CN106991031A (en) * 2017-03-07 2017-07-28 记忆科技(深圳)有限公司 A kind of implementation method of SMART information monitorings

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111625405A (en) * 2020-04-22 2020-09-04 深圳忆联信息系统有限公司 SSD terminal fault diagnosis method, system, computer device and storage medium
CN112506744A (en) * 2020-12-11 2021-03-16 浪潮电子信息产业股份有限公司 Method, device and equipment for monitoring running state of NVMe hard disk
CN112506744B (en) * 2020-12-11 2023-08-25 浪潮电子信息产业股份有限公司 Method, device and equipment for monitoring running state of NVMe hard disk
CN113590416A (en) * 2021-06-30 2021-11-02 郑州云海信息技术有限公司 CPU system firmware assertion processing method, device, equipment and storage medium
CN113590416B (en) * 2021-06-30 2023-09-15 郑州云海信息技术有限公司 CPU system firmware assertion processing method, device, equipment and storage medium
CN114356227A (en) * 2021-12-17 2022-04-15 得一微电子股份有限公司 Solid state disk instruction processing method, device and system and readable storage medium

Also Published As

Publication number Publication date
CN110704228B (en) 2023-07-28

Similar Documents

Publication Publication Date Title
CN110704228B (en) Solid state disk exception handling method and system
JP6530774B2 (en) Hardware failure recovery system
CN105938450B (en) The method and system that automatic debugging information is collected
JP6333410B2 (en) Fault processing method, related apparatus, and computer
TWI632462B (en) Switching device and method for detecting i2c bus
TWI470420B (en) Dubugging method and computer system using the smae
US20210255939A1 (en) System and method for improving detection and capture of a host system catastrophic failure
US10733077B2 (en) Techniques for monitoring errors and system performance using debug trace information
US20140095948A1 (en) Memory testing in a data processing system
US9436539B2 (en) Synchronized debug information generation
US10922162B2 (en) Capturing video data and serial data during an information handling system failure
CN117389790B (en) Firmware detection system, method, storage medium and server capable of recovering faults
US10514972B2 (en) Embedding forensic and triage data in memory dumps
US9411666B2 (en) Anticipatory protection of critical jobs in a computing system
US7415560B2 (en) Method of automatically monitoring computer system debugging routine
US20200250017A1 (en) Bios to os event communication
KR20130075807A (en) An atm with back-up hdd for booting and the booting method there of
TWI529525B (en) System and method for handling system failure
US9880893B2 (en) Failure interval determination
US20240012651A1 (en) Enhanced service operating system capabilities through embedded controller system health state tracking
CN108415788B (en) Data processing apparatus and method for responding to non-responsive processing circuitry
US20080195896A1 (en) Apparratus and method for universal programmable error detection and real time error detection
Sultania Monitoring and Failure Recovery of Cloud-Managed Digital Signage
CN114090467A (en) Chip management method, device, chip, processor and equipment
CN114356708A (en) Equipment fault monitoring method, device, equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 430000 west of 2-3 / F, No.2 factory building, Guannan Industrial Park, No.1 Gaoxin 2nd Road, Wuhan Donghu New Technology Development Zone, Wuhan City, Hubei Province

Patentee after: Zhiyu Technology Co.,Ltd.

Address before: 438600 West of Building 2-3, Guannan Industrial Park, No. 1 Gaoxin Second Road, Donghu New Technology Development Zone, Wuhan City, Hubei Province

Patentee before: EXASCEND TECHNOLOGY (WUHAN) CO.,LTD.