CN109032901A - Remote out-of-band SSD monitoring method, device, and controlled terminal - Google Patents
Remote out-of-band SSD monitoring method, device, and controlled terminal
- Publication number
- CN109032901A (application CN201810684778.7A)
- Authority
- CN
- China
- Prior art keywords
- hard disk
- ssd hard
- ssd
- accumulative
- disk failure
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3003—Monitoring arrangements specially adapted to the computing system or computing system component being monitored
- G06F11/3037—Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a memory, e.g. virtual memory, cache
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0751—Error or fault detection not based on redundancy
- G06F11/0754—Error or fault detection not based on redundancy by exceeding limits
- G06F11/076—Error or fault detection not based on redundancy by exceeding limits by exceeding a count or rate limit, e.g. word- or bit count limit
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0793—Remedial or corrective actions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Debugging And Monitoring (AREA)
Abstract
The embodiments of the present application provide a remote out-of-band SSD monitoring method, device, and controlled terminal. The method comprises: a BMC receives a fault signal sent by the BIOS and restarts the SSD; the cumulative number of SSD fault restarts is determined, and when that number exceeds a set threshold a fault alarm signal is sent, otherwise the count continues to accumulate. In the present application, the BMC can restart the SSD by driving a GPIO level. When the SSD fails, the BIOS sends a fault signal instructing the BMC to restart the SSD, so the SSD state is monitored accurately. An engineer is notified to inspect the site only when the cumulative number of SSD restarts exceeds a certain value, which reduces manual involvement, is practical, and saves considerable time and labor cost.
Description
Technical field
The present invention relates to the field of computer technology, and in particular to a remote out-of-band SSD monitoring method, device, and controlled terminal.
Background art
With the development of the semiconductor industry, flash memory has become increasingly dense and its cost has fallen steadily. Using techniques such as multi-channel parallelism, SSDs offer fast read/write speeds, low heat output, a wide operating temperature range, low power consumption, strong shock resistance, and high reliability owing to the absence of mechanical parts, and they are used ever more widely in high-end servers, storage systems, and portable machines. An SSD, or solid state drive (Solid State Disk or Solid State Drive), also called an electronic hard disk or solid-state electronic disk, is a storage device made up of a control unit and solid-state storage elements (DRAM or flash chips). It currently supports SATA, FC, and SAS interfaces and can replace an ordinary mechanical hard disk.
At present, SSD state is mainly obtained by communicating with the SSD directly over an I2C interface and dumping the corresponding registers. However, I2C may well fail to obtain the corresponding device address, so that communication fails; in that case the SSD still reads and writes normally, but its state cannot be monitored accurately. In addition, during use of the SSD it is inevitable that SSD faults, hangs, failures to detect the drive, or other SSD abnormalities will occur. All of these require an engineer to go to the machine room to check the actual situation; the problem can then be solved by replacing the SSD or simply by restarting it. In most cases a restart is all that is needed, yet an engineer still has to travel to the site, which wastes considerable manpower and money.
Therefore, how to ensure that SSD state can be monitored accurately, that the SSD restarts automatically when an SSD fault, hang, failure to detect the drive, or other SSD abnormality occurs after a server configuration change, and that an engineer is notified to go on site only when the problem cannot be resolved, is a problem in urgent need of a solution.
Summary of the invention
In view of the deficiencies of the prior art, the present invention provides a remote out-of-band SSD monitoring method, device, and controlled terminal that can monitor SSD state accurately, restart the SSD automatically upon an SSD fault, hang, failure to detect the drive, or other SSD abnormality, and notify an engineer to inspect and resolve the problem on site when the number of SSD faults exceeds a certain value.
In a first aspect, a remote out-of-band SSD monitoring method is provided, the method comprising:
a BMC receives a fault signal sent by the BIOS and restarts the SSD;
the cumulative number of SSD fault restarts is determined; when that number exceeds a set threshold, a fault alarm signal is sent; otherwise the number of SSD fault restarts continues to accumulate.
With reference to the first aspect, in a first possible implementation of the first aspect, the BMC receiving the fault signal sent by the BIOS and restarting the SSD comprises:
when the BMC receives from the BIOS a signal indicating an SSD fault, a hang, a failure to detect the drive, or another SSD abnormality, the BMC restarts the SSD.
With reference to the first aspect and its implementations above, in a second possible implementation of the first aspect, the BMC receiving the fault signal sent by the BIOS and restarting the SSD comprises:
the BMC receives the fault signal sent by the BIOS and restarts the SSD by pulling the PWDIS pin low.
With reference to the first aspect and its implementations above, in a third possible implementation of the first aspect, determining the cumulative number of SSD fault restarts, sending a fault alarm signal when that number exceeds the set threshold, and otherwise continuing to accumulate the number of SSD fault restarts comprises:
the cumulative number of SSD fault restarts is determined; when that number exceeds 10, a fault alarm signal is sent; otherwise the number of SSD fault restarts continues to accumulate.
With reference to the first aspect and its implementations above, in a fourth possible implementation of the first aspect, determining the cumulative number of SSD fault restarts, sending a fault alarm signal when that number exceeds the set threshold, and otherwise continuing to accumulate the number of SSD fault restarts comprises:
an LED controlled by the BMC is provided on the server as an SSD fault lamp; the cumulative number of SSD fault restarts is determined; when that number exceeds the set threshold, the SSD fault lamp issues the fault alarm signal; otherwise the number of SSD fault restarts continues to accumulate.
In a second aspect, a device for remote out-of-band SSD monitoring is provided, comprising:
a receiving unit, used for the BMC to receive a fault signal sent by the BIOS and restart the SSD;
a determination unit, used to determine the cumulative number of SSD fault restarts, send a fault alarm signal when that number exceeds a set threshold, and otherwise continue to accumulate the number of SSD fault restarts.
With reference to the second aspect, in a first possible implementation of the second aspect, the receiving unit is specifically used such that:
when the BMC receives from the BIOS a signal indicating an SSD fault, a hang, a failure to detect the drive, or another SSD abnormality, the BMC restarts the SSD.
With reference to the second aspect and its implementations above, in a second possible implementation of the second aspect, the receiving unit is specifically used such that:
the BMC receives the fault signal sent by the BIOS and restarts the SSD by pulling the PWDIS pin low.
With reference to the second aspect and its implementations above, in a third possible implementation of the second aspect, the determination unit is specifically used to:
determine the cumulative number of SSD fault restarts; when that number exceeds 10, send a fault alarm signal; otherwise continue to accumulate the number of SSD fault restarts.
With reference to the second aspect and its implementations above, in a fourth possible implementation of the second aspect, the determination unit is specifically used such that:
an LED controlled by the BMC is provided on the server as an SSD fault lamp; the cumulative number of SSD fault restarts is determined; when that number exceeds the set threshold, the SSD fault lamp issues the fault alarm signal; otherwise the number of SSD fault restarts continues to accumulate.
In a third aspect, a controlled terminal is provided, comprising:
a processor and a memory, wherein
the memory is used to store a computer program, and the processor is used to call and run the computer program from the memory, so that the terminal device performs the method described above.
In a fourth aspect, a computer storage medium is provided. Instructions are stored in the computer-readable storage medium which, when run on a computer, cause the computer to perform the method described in the aspects above.
In a fifth aspect, a computer program product comprising instructions is provided which, when run on a computer, causes the computer to perform the method described in the aspects above.
Therefore, in the embodiments of the present application, the BMC can restart the SSD by driving a GPIO level. When the SSD fails, the BIOS sends a fault signal instructing the BMC to restart the SSD, so the SSD state is monitored accurately. An engineer is notified to inspect the site only when the cumulative number of SSD restarts exceeds a certain value, which greatly reduces manual involvement, is practical, and saves considerable time and labor cost.
Brief description of the drawings
In order to explain the embodiments of the present invention or the technical solutions of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Evidently, a person of ordinary skill in the art can derive other drawings from these drawings without creative effort.
Fig. 1 is a schematic flowchart of the method of one embodiment of the present application.
Fig. 2 is a schematic flowchart of the method of another embodiment of the present application.
Fig. 3 is a schematic block diagram of the device of one embodiment of the present application.
Fig. 4 is a schematic structural diagram of a controlled terminal provided by an embodiment of the present invention.
Detailed description of the embodiments
To enable those skilled in the art to better understand the technical solutions of the present invention, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Evidently, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort shall fall within the scope of protection of the present invention.
The key terms used in the present application are explained below.
BIOS (Basic Input/Output System): a set of programs burned into a ROM chip on the computer's motherboard. It holds the computer's most important basic input/output routines, the power-on self-test program, and the system bootstrap program, and it can read and write specific system settings from CMOS. Its main function is to give the computer the lowest-level, most direct hardware configuration and control.
BMC (Baseboard Management Controller): a dedicated management controller on the server motherboard that runs independently of the host CPU and operating system. It monitors the state of the server hardware and provides out-of-band management functions such as remote power control and fault alarms.
GPIO (General Purpose Input Output): general-purpose input/output, sometimes provided by a bus expander. When a microcontroller or chipset does not have enough I/O ports, or when a system needs remote serial communication or control, GPIO products can provide additional control and monitoring functions.
PWDIS (Power Disable): a power-disable signal. It reuses the 3.3 V pin that goes unused on desktop SATA interfaces, taking Pin 3 as the power-disable signal.
It should be understood that "first" and "second" in this text are used only to refer to and distinguish different signals, instructions, and the like, and imply no order.
Fig. 1 is a schematic flowchart of the method of one embodiment of the present application. The method of Fig. 1 may be executed by a remote out-of-band SSD monitoring device.
As shown in Fig. 1, the method 100 includes:
Step 110: the BMC receives a fault signal sent by the BIOS and restarts the SSD;
Step 120: the cumulative number of SSD fault restarts is determined; when that number exceeds a set threshold, a fault alarm signal is sent; otherwise the number of SSD fault restarts continues to accumulate.
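Steps 110 and 120 can be sketched as a small counter that restarts the SSD on every fault signal and reports when the cumulative count passes the threshold. This is a minimal illustration, not the patented implementation: the class name `RestartMonitor`, the method `on_fault_signal`, and the `restart_ssd` callable are hypothetical names introduced here, and the default threshold of 10 is the example value given in the text.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class RestartMonitor:
    """Tracks the cumulative number of SSD fault restarts (step 120)."""

    threshold: int = 10          # example threshold taken from the text
    restart_count: int = 0
    alarm_raised: bool = False

    def on_fault_signal(self, restart_ssd: Callable[[], None]) -> bool:
        """Handle one fault signal from the BIOS (step 110).

        `restart_ssd` is a hypothetical callable that performs the restart.
        Returns True once the cumulative count has exceeded the threshold.
        """
        restart_ssd()                       # step 110: restart the SSD
        self.restart_count += 1             # step 120: accumulate
        if self.restart_count > self.threshold:
            self.alarm_raised = True        # step 120: send the alarm
        return self.alarm_raised
```

The comparison is strictly greater than, matching the wording "exceeds a set threshold"; whether the SSD should still be restarted after the alarm has been raised is not specified in the text, and this sketch keeps restarting.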
Optionally, as one embodiment of the present application, the BMC receiving the fault signal sent by the BIOS and restarting the SSD comprises:
when the BMC receives from the BIOS a signal indicating an SSD fault, a hang, a failure to detect the drive, or another SSD abnormality, the BMC restarts the SSD.
Optionally, as one embodiment of the present application, the BMC receiving the fault signal sent by the BIOS and restarting the SSD comprises:
the BMC receives the fault signal sent by the BIOS and restarts the SSD by pulling the PWDIS pin low.
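The PWDIS-based restart can be sketched as a pulse on a GPIO line. The text says only that the pin is pulled low; the hold time, the return of the line to its previous level afterwards, and the `FakeGpioPin` class (a stand-in for whatever GPIO driver a real BMC firmware exposes) are all assumptions made for illustration.

```python
import time


class FakeGpioPin:
    """Hypothetical stand-in for a BMC GPIO line; records every level written."""

    def __init__(self, level: int = 1):
        self.level = level
        self.history = [level]

    def write(self, level: int) -> None:
        self.level = level
        self.history.append(level)


def restart_ssd_via_pwdis(pin, hold_seconds: float = 0.0, sleep=time.sleep) -> None:
    """Pull the PWDIS line low, hold it, then restore its previous level.

    The hold time and the restore step are assumptions; the source text only
    states that the SSD is restarted by pulling the PWDIS pin low.
    """
    previous = pin.level
    pin.write(0)            # pull PWDIS low
    sleep(hold_seconds)     # keep it low for the (assumed) hold time
    pin.write(previous)     # return the line to its earlier level
```

Passing `sleep` as a parameter keeps the function testable without real delays; on hardware the hold time would have to come from the SSD vendor's specification.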
Optionally, as one embodiment of the present application, determining the cumulative number of SSD fault restarts, sending a fault alarm signal when that number exceeds the set threshold, and otherwise continuing to accumulate the number of SSD fault restarts comprises:
the cumulative number of SSD fault restarts is determined; when that number exceeds 10, a fault alarm signal is sent; otherwise the number of SSD fault restarts continues to accumulate.
Optionally, as one embodiment of the present application, determining the cumulative number of SSD fault restarts, sending a fault alarm signal when that number exceeds the set threshold, and otherwise continuing to accumulate the number of SSD fault restarts comprises:
an LED controlled by the BMC is provided on the server as an SSD fault lamp; the cumulative number of SSD fault restarts is determined; when that number exceeds the set threshold, the SSD fault lamp issues the fault alarm signal; otherwise the number of SSD fault restarts continues to accumulate.
Specifically, Fig. 2 shows a schematic diagram of the method of one embodiment of the present application.
(1) In hardware, a GPIO pin of the BMC is connected directly to the PWDIS pin of the Samsung SSD interface, and an I2C channel of the BMC is connected to the I2C interface of the Samsung SSD;
(2) the server is powered on, the BMC runs normally, and it sends an OEM command over I2C to enable the PWDIS function;
(3) when an SSD fault, hang, failure to detect the drive, or other SSD abnormality occurs, the BIOS sends a fault signal to the BMC, notifying the BMC that the SSD state threshold has been reached;
(4) the BMC restarts the SSD by pulling the PWDIS pin low, for a short-term recovery;
(5) the number of SSD fault restarts is accumulated and recorded;
(6) when the cumulative number of SSD fault restarts exceeds the set threshold, for example 10, the fault is regarded as unrecoverable, and the SSD fault lamp issues a fault alarm signal to call an engineer on site; otherwise the number of SSD fault restarts continues to accumulate.
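Steps (2) to (6) above can be walked through with a small simulation. It is a sketch under stated assumptions: the `Bmc` class, its method names, and the string log are hypothetical, the OEM command and the PWDIS pulse are represented only by log entries, and no real I2C or GPIO access takes place.

```python
class Bmc:
    """Hypothetical model of the BMC-side flow in steps (2) to (6)."""

    def __init__(self, threshold: int = 10):
        self.threshold = threshold     # example value from step (6)
        self.pwdis_enabled = False
        self.restart_count = 0
        self.fault_led_on = False
        self.log = []

    def power_on(self) -> None:
        # Step (2): the source enables PWDIS with an OEM command over I2C;
        # here that is only recorded.
        self.pwdis_enabled = True
        self.log.append("pwdis-enabled")

    def handle_bios_fault(self, reason: str) -> bool:
        # Step (3): the BIOS has reported a fault, hang, undetected drive, etc.
        if not self.pwdis_enabled:
            raise RuntimeError("PWDIS function has not been enabled")
        self.log.append(f"restart:{reason}")     # step (4): placeholder for the PWDIS pulse
        self.restart_count += 1                  # step (5): accumulate and record
        if self.restart_count > self.threshold:  # step (6): threshold exceeded
            self.fault_led_on = True             # fault lamp signals the alarm
            self.log.append("alarm")
        return self.fault_led_on
```

With the example threshold of 10, the first ten faults only restart the SSD and the eleventh lights the fault lamp.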
Therefore, in the embodiments of the present application, the BMC can restart the SSD by driving a GPIO level. When the SSD fails, the BIOS sends a fault signal instructing the BMC to restart the SSD, so the SSD state is monitored accurately. An engineer is notified to inspect the site only when the cumulative number of SSD restarts exceeds a certain value, which greatly reduces manual involvement, is practical, and saves considerable time and labor cost.
Fig. 3 shows a schematic block diagram of the device of one embodiment of the present application.
As shown in Fig. 3, the device 300 includes:
a receiving unit 310, used for the BMC to receive a fault signal sent by the BIOS and restart the SSD;
a determination unit 320, used to determine the cumulative number of SSD fault restarts, send a fault alarm signal when that number exceeds a set threshold, and otherwise continue to accumulate the number of SSD fault restarts.
Optionally, as one embodiment of the present application, the receiving unit 310 is specifically used such that:
when the BMC receives from the BIOS a signal indicating an SSD fault, a hang, a failure to detect the drive, or another SSD abnormality, the BMC restarts the SSD.
Optionally, as one embodiment of the present application, the receiving unit 310 is specifically used such that:
the BMC receives the fault signal sent by the BIOS and restarts the SSD by pulling the PWDIS pin low.
Optionally, as one embodiment of the present application, the determination unit 320 is specifically used to:
determine the cumulative number of SSD fault restarts; when that number exceeds 10, send a fault alarm signal; otherwise continue to accumulate the number of SSD fault restarts.
Optionally, as one embodiment of the present application, the determination unit 320 is specifically used such that:
an LED controlled by the BMC is provided on the server as an SSD fault lamp; the cumulative number of SSD fault restarts is determined; when that number exceeds the set threshold, the SSD fault lamp issues the fault alarm signal; otherwise the number of SSD fault restarts continues to accumulate.
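The division of device 300 into a receiving unit and a determination unit can be pictured as two cooperating objects. The sketch below is one possible logical partition only; the class names, the `restart_ssd` and `send_alarm` callables, and the `MonitoringDevice` wrapper are hypothetical and do not come from the text.

```python
from typing import Callable


class ReceivingUnit:
    """Counterpart of receiving unit 310: takes a fault signal and restarts the SSD."""

    def __init__(self, restart_ssd: Callable[[], None]):
        self._restart_ssd = restart_ssd

    def receive(self, fault_signal: str) -> None:
        self._restart_ssd()


class DeterminationUnit:
    """Counterpart of determination unit 320: counts restarts and raises the alarm."""

    def __init__(self, threshold: int, send_alarm: Callable[[int], None]):
        self.threshold = threshold
        self.count = 0
        self._send_alarm = send_alarm

    def record_restart(self) -> None:
        self.count += 1
        if self.count > self.threshold:
            self._send_alarm(self.count)


class MonitoringDevice:
    """Wires the two units together, as in the block diagram of device 300."""

    def __init__(self, threshold: int, restart_ssd, send_alarm):
        self.receiving_unit = ReceivingUnit(restart_ssd)
        self.determination_unit = DeterminationUnit(threshold, send_alarm)

    def on_fault(self, fault_signal: str) -> None:
        self.receiving_unit.receive(fault_signal)
        self.determination_unit.record_restart()
```

Injecting the restart and alarm actions as callables keeps the units independent of any particular hardware, so the fault lamp variant only needs a different `send_alarm`.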
Fig. 4 is a schematic structural diagram of a controlled terminal 400 provided by an embodiment of the present invention. The controlled terminal 400 may be used to perform the remote out-of-band SSD monitoring method provided by the embodiments of the present application.
The controlled terminal 400 may include a processor 410, a memory 420, and a communication unit 430. These components communicate over one or more buses. Those skilled in the art will understand that the server structure shown in the figure does not limit the present application: it may be a bus topology or a star topology, and it may include more or fewer components than shown, combine certain components, or arrange the components differently.
The memory 420 may be used to store the instructions executed by the processor 410. It may be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disc. When the instructions in the memory 420 are executed by the processor 410, the terminal 400 is able to perform some or all of the steps of the method embodiments above.
The processor 410 is the control center of the storage device. It connects the parts of the entire electronic device through various interfaces and lines, and performs the functions of the electronic device and/or processes data by running or executing the software programs and/or modules stored in the memory 420 and calling the data stored in the memory. The processor may consist of integrated circuits (Integrated Circuit, IC): for example, a single packaged IC, or several connected packaged ICs with the same or different functions. For example, the processor 410 may include only a central processing unit (Central Processing Unit, CPU). In the embodiments of the present application, the CPU may have a single computing core or multiple computing cores.
The communication unit 430 is used to establish a communication channel so that the storage device can communicate with other devices, receiving user data sent by other devices or sending user data to other devices.
The present application also provides a computer storage medium. The computer storage medium may store a program which, when executed, may include some or all of the steps of the embodiments provided by the present application. The storage medium may be a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or the like.
Those skilled in the art will clearly understand that the techniques in the embodiments of the present application can be implemented by software plus the necessary general-purpose hardware platform. On that understanding, the technical solutions in the embodiments of the present application, in essence or in the part that contributes over the prior art, can be embodied as a software product. The computer software product is stored in a storage medium, such as a USB flash drive, removable hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk, optical disc, or any other medium capable of storing program code, and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a second device, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present invention.
For parts that are the same or similar among the embodiments in this specification, the embodiments may be referred to one another. The terminal embodiment in particular is described relatively briefly because it is substantially similar to the method embodiment; for the relevant details, see the description of the method embodiment.
In the several embodiments provided in the present application, it should be understood that the disclosed systems, devices, and methods may be implemented in other ways. For example, the device embodiments described above are merely illustrative: the division into units is only a division by logical function, and other divisions are possible in an actual implementation; for instance, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. Furthermore, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through interfaces, devices, or units, and may be electrical, mechanical, or of other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected as actually needed to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit.
Although the present invention has been described in detail with reference to the drawings and in connection with preferred embodiments, the present invention is not limited thereto. Without departing from the spirit and essence of the present invention, a person of ordinary skill in the art may make various equivalent modifications or substitutions to the embodiments of the present invention, and all such modifications or substitutions shall fall within the scope of the present invention. Any change or substitution that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall likewise fall within the scope of protection of the present invention. Therefore, the scope of protection of the present invention shall be determined by the scope of protection of the claims.
Claims (10)
1. the monitoring method of the outer SSD of remote band a kind of characterized by comprising
BMC receives the fault-signal that BIOS is sent, and restarts SSD hard disk;
It determines that SSD hard disk failure is accumulative and restarts number, restart number more than given threshold when SSD hard disk failure is accumulative, send event
Hinder alarm signal, otherwise continues accumulative SSD hard disk failure and restart number.
2. the method according to claim 1, wherein the BMC receives the fault-signal that BIOS is sent, and restarting
SSD hard disk, comprising:
The BMC receives the SSD hard disk failure of BIOS transmission, crash, can not detect drive or other SSD abnormal conditions
When, restart SSD hard disk.
3. method according to claim 1 or 2, which is characterized in that the BMC receives the fault-signal that BIOS is sent, and
Restart SSD hard disk, comprising:
BMC receives the fault-signal that BIOS is sent and restarts SSD hard disk by dragging down PWDIS pin.
4. method according to claim 1, which is characterized in that the determining SSD hard disk failure is accumulative to restart number, when
SSD hard disk failure is accumulative to restart number more than given threshold, sends failure alarm signal, otherwise continues accumulative SSD hard disk failure
Restart number, comprising:
It determines that SSD hard disk failure is accumulative and restarts number, to restart number be more than 10 times when SSD hard disk failure is accumulative, transmission failure report
Otherwise alert signal continues accumulative SSD hard disk failure and restarts number.
5. method according to claim 1 to 4, which is characterized in that the accumulative weight of the determining SSD hard disk failure
Number is opened, number is restarted more than given threshold when SSD hard disk failure is accumulative, sends failure alarm signal, otherwise continue accumulative SSD
Hard disk failure restarts number, comprising:
The LED light controlled by BMC is set on the server, as SSD hard disk failure lamp, determines that SSD hard disk failure is accumulative and restarts
Number restarts number more than given threshold when SSD hard disk failure is accumulative, and SSD hard disk failure lamp sends failure alarm signal, otherwise
Continue accumulative SSD hard disk failure and restarts number.
6. a kind of for monitoring the device of SSD outside remote band characterized by comprising
Receiving unit, the receiving unit receives the fault-signal that BIOS is sent for BMC, and restarts SSD hard disk;
Determination unit, the determination unit restart number for determining that SSD hard disk failure is accumulative, when the accumulative weight of SSD hard disk failure
Number is opened more than given threshold, sends failure alarm signal, otherwise continues accumulative SSD hard disk failure and restarts number.
7. The device according to claim 6, wherein the receiving unit is specifically configured to:
Restart the SSD hard disk when the BMC receives from the BIOS a signal indicating an SSD hard disk failure, a crash, an undetectable drive, or another SSD abnormal condition.
8. The device according to claim 6 or 7, wherein the receiving unit is specifically configured to:
Have the BMC receive the fault signal sent by the BIOS and restart the SSD hard disk by pulling the PWDIS pin low.
9. The device according to any one of claims 6 to 8, wherein the determination unit is specifically configured to:
Use an LED lamp provided on the server and controlled by the BMC as an SSD hard disk fault lamp; determine the cumulative SSD hard disk failure restart count; when the count exceeds the set threshold, have the SSD hard disk fault lamp issue the fault alarm signal; otherwise continue accumulating the SSD hard disk failure restart count.
10. A controlled terminal, comprising:
A processor;
A memory for storing instructions executable by the processor;
Wherein the processor is configured to perform the method according to any one of claims 1 to 5.
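The flow recited in the claims (restart on a BIOS fault signal, count restarts, raise an alarm past a threshold) can be sketched as follows. This is only an illustrative model, not the patent's firmware: the class `SsdMonitor`, the callbacks `set_pwdis` and `set_fault_led`, and the choice to release PWDIS high again after pulling it low are all assumptions introduced here. The threshold of 10 is taken from claim 4.

```python
from dataclasses import dataclass
from typing import Callable

RESTART_THRESHOLD = 10  # value named in claim 4; other thresholds are possible


@dataclass
class SsdMonitor:
    """Hypothetical model of the BMC-side logic described in the claims."""

    set_pwdis: Callable[[bool], None]      # assumed pin driver: False = low, True = high
    set_fault_led: Callable[[bool], None]  # assumed driver for the SSD fault lamp
    threshold: int = RESTART_THRESHOLD
    restart_count: int = 0
    alarmed: bool = False
    last_reason: str = ""

    def restart_ssd(self) -> None:
        # The claims state that the SSD is restarted by pulling PWDIS low.
        # Driving the pin high again afterwards is an assumption of this sketch.
        self.set_pwdis(False)
        self.set_pwdis(True)

    def on_fault_signal(self, reason: str) -> bool:
        """Handle one fault signal from the BIOS.

        Returns True once the cumulative restart count has exceeded the
        threshold and the alarm has been raised; otherwise False, and the
        count simply keeps accumulating.
        """
        self.last_reason = reason
        self.restart_ssd()
        self.restart_count += 1
        if self.restart_count > self.threshold:
            self.alarmed = True
            self.set_fault_led(True)  # the fault lamp acts as the alarm signal
        return self.alarmed
```

With the default threshold, the first ten fault signals only restart the drive and increment the counter; the eleventh is the first to exceed the threshold and turn the fault lamp on. Whether the BMC should keep restarting the drive after the alarm is not specified in the claims, so the sketch keeps doing so.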
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810684778.7A (published as CN109032901A (en)) | 2018-06-28 | 2018-06-28 | A kind of monitoring method, device and the controlled terminal of the outer SSD of remote band |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109032901A (en) | 2018-12-18 |
Family
ID=65522045
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810684778.7A (Pending; CN109032901A (en)) | A kind of monitoring method, device and the controlled terminal of the outer SSD of remote band | 2018-06-28 | 2018-06-28 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109032901A (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104317712A (en) * | 2014-10-23 | 2015-01-28 | 浪潮电子信息产业股份有限公司 | Linux-based storage server fatigue testing method |
CN105302687A (en) * | 2015-12-10 | 2016-02-03 | 浪潮电子信息产业股份有限公司 | Hard disk sequence detection method based on linux |
CN106100884A (en) * | 2016-06-17 | 2016-11-09 | 国网辽宁省电力有限公司锦州供电公司 | The alarm method of supervisory control of substation equipment operation exception |
CN106295131A (en) * | 2016-07-29 | 2017-01-04 | 武汉联影医疗科技有限公司 | A kind of method and device supervising running software |
CN106844110A (en) * | 2017-02-09 | 2017-06-13 | 郑州云海信息技术有限公司 | A kind of method of failed disk data reconstruction in RAID groups |
CN107273257A (en) * | 2017-06-23 | 2017-10-20 | 郑州云海信息技术有限公司 | A kind of automatic method for carrying out the out of order inspection of hard disk under linux system |
CN107590017A (en) * | 2017-09-21 | 2018-01-16 | 联想(北京)有限公司 | The detection method and device of a kind of electronic equipment |
CN107833160A (en) * | 2017-11-24 | 2018-03-23 | 河北三川科技有限公司 | A kind of hotel occupancy advertisement plays fault self-detection method and system |
2018-06-28: CN application CN201810684778.7A filed, published as CN109032901A (en); status: active, Pending
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109710193A (en) * | 2018-12-27 | 2019-05-03 | 郑州云海信息技术有限公司 | A kind of system and method controlling PWDIS signal in M.3 SSD |
CN109710193B (en) * | 2018-12-27 | 2022-02-18 | 郑州云海信息技术有限公司 | System and method for controlling PWDIS signal in M.3 SSD |
CN111858262A (en) * | 2019-04-29 | 2020-10-30 | 安图斯科技股份有限公司 | Warning lamp control method and electronic device |
CN110347555A (en) * | 2019-07-09 | 2019-10-18 | 英业达科技有限公司 | Hard disk operating state determination method |
CN113013920A (en) * | 2021-03-08 | 2021-06-22 | 射阳汉鼎新能源科技有限公司 | Photovoltaic power generation intelligent control system |
CN114924929A (en) * | 2022-04-22 | 2022-08-19 | 苏州浪潮智能科技有限公司 | NVMe hard disk fault early warning method, system and computer equipment |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN109032901A (en) | A kind of monitoring method, device and the controlled terminal of the outer SSD of remote band | |
JP6530774B2 (en) | Hardware failure recovery system | |
CN105938450B (en) | The method and system that automatic debugging information is collected | |
CN107526665B (en) | Case management system and case management method | |
CN103473167B (en) | The fault display method of server and device | |
US8521929B2 (en) | Virtual serial port management system and method | |
CN110083494B (en) | Method and apparatus for managing hardware errors in a multi-core environment | |
EP2913754B1 (en) | Memory data migration computer and method | |
US10846159B2 (en) | System and method for managing, resetting and diagnosing failures of a device management bus | |
US8397053B2 (en) | Multi-motherboard server system | |
US10114688B2 (en) | System and method for peripheral bus device failure management | |
CN109032888A (en) | A kind of server start-up course fault detection method, device and controlled terminal | |
US9806959B2 (en) | Baseboard management controller (BMC) to host communication through device independent universal serial bus (USB) interface | |
CN102331959A (en) | Server system | |
US10922162B2 (en) | Capturing video data and serial data during an information handling system failure | |
JP2020053017A (en) | System and method for hybrid power source | |
CN109976926A (en) | Method, circuit, terminal and the storage medium of protection BMC renewal process are restarted in a kind of shielding | |
US9946552B2 (en) | System and method for detecting redundant array of independent disks (RAID) controller state from baseboard management controller (BMC) | |
US20080276121A1 (en) | Method and infrastructure for recognition of the resources of a defective hardware unit | |
CN115098294B (en) | Abnormal event processing method, electronic equipment and management terminal | |
US11126486B2 (en) | Prediction of power shutdown and outage incidents | |
CN115599617A (en) | Bus detection method and device, server and electronic equipment | |
CN109189638A (en) | A kind of GPU driving detection method, device, terminal and storage medium | |
US10664429B2 (en) | Systems and methods for managing serial attached small computer system interface (SAS) traffic with storage monitoring | |
US20200133681A1 (en) | Enabling software sensor power operation requests via baseboard management controller (bmc) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| RJ01 | Rejection of invention patent application after publication | Application publication date: 20181218 |