CN104346221A - Method and device for grading and dispatching management of server hardware equipment and server - Google Patents

Method and device for grading and dispatching management of server hardware equipment and server Download PDF

Info

Publication number
CN104346221A
CN104346221A CN201310335053.4A CN201310335053A CN104346221A CN 104346221 A CN104346221 A CN 104346221A CN 201310335053 A CN201310335053 A CN 201310335053A CN 104346221 A CN104346221 A CN 104346221A
Authority
CN
China
Prior art keywords
hardware device
server hardware
information
health category
multiple server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310335053.4A
Other languages
Chinese (zh)
Other versions
CN104346221B (en
Inventor
胡殿明
杨文君
胡光
魏伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201310335053.4A priority Critical patent/CN104346221B/en
Publication of CN104346221A publication Critical patent/CN104346221A/en
Application granted granted Critical
Publication of CN104346221B publication Critical patent/CN104346221B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The invention provides a method for grading server hardware equipment. The method comprises the following steps: obtaining status information of a plurality of server hardware equipment; calculating a health weight of the hardware equipment according to the status information of the hardware equipment and a preset model; determining health grade information corresponding to the plurality of server hardware equipment according to the health weight of the hardware equipment and a plurality of preset weight sections. The method is used for classifying the server hardware equipment according to the health level and marking the availability level under a large-scale application environment, so that the utilization rate of highly-available resources is increased, the failure rate and the fault cost of low-available resources are lowered, and the reliability of the server and a clustering system is improved; meanwhile, the method can support service grading and put the important service to the highly-available server, so as to improve the effectiveness on power consumption management of the server hardware equipment according to the health level. The invention further discloses a grading device for the server hardware equipment, a dispatching management method and a device for the server hardware equipment, and a server.

Description

Server hardware device grade classification, schedule management method and device, server
Technical field
The present invention relates to field of computer technology, particularly the schedule management method of a kind of rank division method of server hardware device and device, server hardware device and device, server.
Background technology
At present, in ultra-large data center, every ten thousand distributed type assemblies are configured with millions of server hardware device, and in specification, performance, in the life-span, degree of aging and running environment aspect difference obviously, but effectively distinguish use to it.Such as, for hard disk, all by the hard disk of operating system identification, all unify by as block device file to store data, like this based on not distinguishing that the unification of differentiation stores the mode of data as block device file, important data may be saved in the hard disk of incipient fault, also unessential data may be saved in the good hard disk of performance state, can not effectively utilize hard disk resources performance, meanwhile, before hard disk breaks down, can not transferring data timely and effectively, cause the loss of hard disc data, can not avoid risk in time.
Summary of the invention
Of the present inventionly be intended at least solve one of above-mentioned technological deficiency.
For this reason, the present invention's first object is to propose a kind of server hardware device rank division method, the method is by obtaining the status information of multiple server hardware device, and the healthy weight of multiple server hardware device is calculated according to the status information of multiple server hardware device and preset model, finally divide according to the healthy weight of multiple server hardware device and multiple default weight sector and determine the Health Category information that multiple server hardware device is corresponding.The present invention's second object is to propose a kind of server hardware device grade classification device.The present invention's the 3rd object is the schedule management method proposing a kind of server hardware device.The present invention's the 4th object is the dispatching managing device proposing a kind of server hardware device.The present invention's the 5th object is to propose a kind of server.
For achieving the above object, the server hardware device rank division method of embodiment according to a first aspect of the present invention, comprising: the status information obtaining multiple server hardware device; The healthy weight of described multiple server hardware device is calculated respectively according to the status information of described multiple server hardware device and preset model; And determine according to the healthy weight of described multiple server hardware device and multiple default weight sector the Health Category information that described multiple server hardware device is corresponding.
According to the server hardware device rank division method of the embodiment of the present invention, first the status information of multiple server hardware device is obtained, calculate according to the status information of multiple server hardware device and preset model the healthy weight that multiple server answers hardware device respectively again, finally determine according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.The method is by under large-scale application environment, Health Category classification is carried out to server hardware device thus reaches the object marking availability level, increase the utilization factor of High Availabitity resource on the one hand, reduce failure rate and the failure cost of low available resources, improve the Performance And Reliability of server and group system; On the other hand by server hardware grade classification ,important service is placed on the server of High Availabitity, the validity with server hardware device power managed and the high efficiency of executing the task.
For achieving the above object, the server hardware device grade classification device of embodiment according to a second aspect of the present invention, comprising: state information acquisition module, for obtaining the status information of multiple server hardware device; Computing module, for calculating the healthy weight of described multiple server hardware device respectively according to the status information of described multiple server hardware device and preset model; And grade classification module, for determining according to the healthy weight of described multiple server hardware device and multiple default weight sector the Health Category information that described multiple server hardware device is corresponding.
According to the server hardware device grade classification device of the embodiment of the present invention, first the status information of multiple server hardware device is obtained by state information acquisition module, calculate by computing module the healthy weight that multiple server answers hardware device according to the status information of multiple server hardware device and preset model respectively again, determine according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding eventually through grade classification module.This device is by under large-scale application environment, Health Category classification is carried out to server hardware device thus reaches the object marking availability level, increase the utilization factor of High Availabitity resource on the one hand, reduce failure rate and the failure cost of low available resources, improve the Performance And Reliability of server and group system; On the other hand by server hardware grade classification, important service is placed on the server of High Availabitity, the validity with server hardware device power managed and the high efficiency of executing the task.
For achieving the above object, the schedule management method of the server hardware device of embodiment according to a third aspect of the present invention, comprising: the status information obtaining multiple server hardware device; Health Category information corresponding to described multiple server hardware device is obtained respectively according to the status information of described multiple server hardware device; And the Health Category information corresponding according to described multiple server hardware device manages described multiple server hardware device.
According to the schedule management method of the server hardware device of the embodiment of the present invention, first the status information of multiple server hardware device is obtained, multiple server hardware device Health Category information corresponding according to the state information acquisition of multiple server hardware device again, finally corresponding according to multiple server hardware device Health Category information manages multiple server hardware device.The method is by multiple server hardware device divided rank, make full use of server hardware device resource, there is the high efficiency and ease for use that utilize server hardware device resource, and reduce the risk of server hardware device failure rate and loss of data, improve the reliability and security that server hardware device stores data.
For achieving the above object, the server hardware device dispatching managing device of embodiment according to a fourth aspect of the present invention, comprising: state information acquisition module, for obtaining the status information of multiple server hardware device; Health Category data obtaining module, for obtaining Health Category information corresponding to described multiple server hardware device respectively according to the status information of described multiple server hardware device; And administration module, for the Health Category information corresponding according to described multiple server hardware device, described multiple server hardware device is managed.
According to the server hardware device dispatching managing device of the embodiment of the present invention, first the status information of multiple server hardware device is obtained by state information acquisition module, obtain corresponding multiple server hardware device Health Category information according to the status information of multiple server hardware device by Health Category data obtaining module again, finally corresponding according to multiple server hardware device Health Category information is managed multiple server hardware device by administration module.This device is by multiple server hardware device divided rank, make full use of server hardware device resource, there is the high efficiency and ease for use that utilize server hardware device resource, and reduce the risk of server hardware device failure rate and loss of data, improve the reliability and security that server hardware device stores data.
For achieving the above object, the server of embodiment according to a fifth aspect of the present invention, comprises the server hardware device dispatching managing device described in above-described embodiment.
According to the server of the embodiment of the present invention, first the status information of multiple server hardware device is obtained by the state information acquisition module of server hardware device dispatching managing device, corresponding multiple server hardware device Health Category information are obtained by the Health Category data obtaining module of server hardware device dispatching managing device again according to the status information of multiple server hardware device, the final Health Category information corresponding according to multiple server hardware device is managed multiple server hardware device by the administration module of server hardware device dispatching managing device.This server makes full use of the server hardware device resource of server hardware device management devices, there is the high efficiency and ease for use that utilize server hardware device resource, and reduce the risk of server hardware device failure rate and loss of data, improve the reliability and security of server.
The aspect that the present invention adds and advantage will part provide in the following description, and part will become obvious from the following description, or be recognized by practice of the present invention.
Accompanying drawing explanation
Of the present invention and/or additional aspect and advantage will become obvious and easy understand from the following description of the accompanying drawings of embodiments, wherein:
Fig. 1 is the process flow diagram of server hardware device rank division method according to an embodiment of the invention;
Fig. 2 is the process flow diagram of server hardware device rank division method in accordance with another embodiment of the present invention;
Fig. 3 is the structured flowchart of server hardware device grade classification device according to an embodiment of the invention;
Fig. 4 is the structured flowchart of server hardware device grade classification device in accordance with another embodiment of the present invention;
Fig. 5 is the process flow diagram of the schedule management method of server hardware device according to an embodiment of the invention;
Fig. 6 is the process flow diagram of the schedule management method of server hardware device in accordance with another embodiment of the present invention;
Fig. 7 is the structured flowchart of the dispatching managing device of server hardware device according to an embodiment of the invention;
Fig. 8 is the structured flowchart of the dispatching managing device of server hardware device in accordance with another embodiment of the present invention; And
Fig. 9 is the structured flowchart of server according to an embodiment of the invention.
Embodiment
Be described below in detail embodiments of the invention, the example of described embodiment is shown in the drawings, and wherein same or similar label represents same or similar element or has element that is identical or similar functions from start to finish.Being exemplary below by the embodiment be described with reference to the drawings, only for explaining the present invention, and can not limitation of the present invention being interpreted as.On the contrary, embodiments of the invention comprise fall into attached claims spirit and intension within the scope of all changes, amendment and equivalent.
In describing the invention, it is to be appreciated that term " first ", " second " etc. are only for describing object, and instruction or hint relative importance can not be interpreted as.In describing the invention, it should be noted that, unless otherwise clearly defined and limited, term " is connected ", " connection " should be interpreted broadly, such as, can be fixedly connected with, also can be removably connect, or connect integratedly; Can be mechanical connection, also can be electrical connection; Can be directly be connected, also indirectly can be connected by intermediary.For the ordinary skill in the art, concrete condition above-mentioned term concrete meaning in the present invention can be understood.In addition, in describing the invention, except as otherwise noted, the implication of " multiple " is two or more.
Describe and can be understood in process flow diagram or in this any process otherwise described or method, represent and comprise one or more for realizing the module of the code of the executable instruction of the step of specific logical function or process, fragment or part, and the scope of the preferred embodiment of the present invention comprises other realization, wherein can not according to order that is shown or that discuss, comprise according to involved function by the mode while of basic or by contrary order, carry out n-back test, this should understand by embodiments of the invention person of ordinary skill in the field.
Below with reference to the accompanying drawings schedule management method according to the server hardware device rank division method of the embodiment of the present invention and device, server hardware device and device, server are described.
At present, in ultra-large data center, every ten thousand distributed type assemblies are configured with millions of server hardware device, and in specification, performance, in the life-span, degree of aging and running environment aspect difference obviously, but effectively distinguish use to it.Such as, for hard disk, all by the hard disk of operating system identification, all unify by as block device file to store data, like this based on not distinguishing that the unification of differentiation stores the mode of data as block device file, important data may be saved in the hard disk of incipient fault, also unessential data may be saved in the good hard disk of performance state, can not effectively utilize hard disk resources performance, simultaneously, before hard disk breaks down, can not transferring data timely and effectively, cause the loss of hard disc data, can not avoid risk in time, and when running environment changes, initiatively do not protect the hard disc data of server, lack good ease for use.
For this reason, the present invention proposes a kind of server hardware device rank division method, comprise the steps: the status information obtaining multiple server hardware device; The healthy weight of multiple server hardware device is calculated respectively according to the status information of multiple server hardware device and preset model; And determine according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.
Fig. 1 is the process flow diagram of server hardware device rank division method according to an embodiment of the invention.
As shown in Figure 1, server hardware device rank division method, comprises the steps:
S101, obtains the status information of multiple server hardware device.
In one embodiment of the invention, status information comprises one or more in server hardware device temperature information, servo-information, head information, medium information, motor information, IO error message and life information.Thus, improve the diversity of status information.
Particularly, for SATA hard disc, status information comprises configuration specification information, temperature information, life information, the failure message of SATA hard disc, and load information etc.Such as, hard disc magnetic head is the critical component that hard disk reads data, its Main Function is exactly the magnetic information be stored on hard disc is converted into electric signal outwards transmit, its principle of work is then utilize the resistance value of special material can along with the principle of changes of magnetic field is to read and write the data on disc, the quality of hard disc magnetic head decides the storage density of hard disc to a great extent, and for example, dangerous power-off number of times hard disk is to the failure message of hard disk.
Be understandable that, obtain the example being only the status information obtaining server hardware device for the status information of SATA hard disc, server hardware device in the status information of the acquisition server hardware device in the embodiment of the present invention is not limited to the above-mentioned citing for SATA hard disc, can also be other server hardware device.
In one embodiment of the invention, server hardware device is CPU, internal memory, network interface card, power supply, hard disk and controller.Thus, improve the diversity of server hardware device.
S102, calculates the healthy weight of multiple server hardware device respectively according to the status information of multiple server hardware device and preset model.
In one embodiment of the invention, preset model is for obtain by machine learning.Thus, improve the accuracy obtaining preset model.
S103, determines according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.
In one embodiment of the invention, Health Category information comprises the first Health Category information to the 5th Health Category information, and the first Health Category information to the multiple default weight sector that the 5th Health Category information is corresponding is respectively the first weight sector to the 5th weight sector.Thus, improve accuracy and the ease for use of Health Category information.
In one embodiment of the invention, the first weight sector is [0,0.05]; Second weight sector be (0.05,0.2]; 3rd weight sector be (0.2,0.5]; 4th weight sector be (0.5,0.8]; And the 5th weight sector be (0.8,1].Thus, the accuracy of the healthy weight judging server hardware device is improve by weight sector.
According to the server hardware device rank division method of the embodiment of the present invention, first the status information of multiple server hardware device is obtained, calculate according to the status information of multiple server hardware device and preset model the healthy weight that multiple server answers hardware device respectively again, finally determine according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.The method is by under large-scale application environment, Health Category classification is carried out to server hardware device thus reaches the object marking availability level, increase the utilization factor of High Availabitity resource on the one hand, reduce failure rate and the failure cost of low available resources, improve the Performance And Reliability of server and group system; On the other hand by server hardware grade classification, important service is placed on the server of High Availabitity, the validity with server hardware device power managed and the high efficiency of executing the task.
Fig. 2 is the process flow diagram of server hardware device rank division method in accordance with another embodiment of the present invention.
As shown in Figure 2, server hardware device rank division method, comprises the steps:
S201, obtains the status information of multiple server hardware device.
In one embodiment of the invention, status information comprises one or more in server hardware device temperature information, servo-information, head information, medium information, motor information, IO error message and life information.Thus, improve the diversity of status information.
Particularly, for SATA hard disc, status information comprises configuration specification information, temperature information, life information, the failure message of SATA hard disc, and load information etc.Such as, hard disc magnetic head is the critical component that hard disk reads data, its Main Function is exactly the magnetic information be stored on hard disc is converted into electric signal outwards transmit, its principle of work is then utilize the resistance value of special material can along with the principle of changes of magnetic field is to read and write the data on disc, the quality of hard disc magnetic head decides the storage density of hard disc to a great extent, and for example, dangerous power-off number of times hard disk is to the failure message of hard disk.
Be understandable that, obtain the example being only the status information obtaining server hardware device for the status information of SATA hard disc, server hardware device in the status information of the acquisition server hardware device in the embodiment of the present invention is not limited to the above-mentioned citing for SATA hard disc, can also be other server hardware device.
In one embodiment of the invention, server hardware device is CPU, internal memory, network interface card, power supply, hard disk and controller.Thus, improve the diversity of server hardware device.
S202, calculates the healthy weight of multiple server hardware device respectively according to the status information of multiple server hardware device and preset model.
In one embodiment of the invention, preset model is for obtain by machine learning.Thus, improve the accuracy obtaining preset model.
S203, determines according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.
In one embodiment of the invention, Health Category information comprises the first Health Category information to the 5th Health Category information, and the first Health Category information to the multiple default weight sector that the 5th Health Category information is corresponding is respectively the first weight sector to the 5th weight sector.Thus, improve accuracy and the ease for use of Health Category information.
In one embodiment of the invention, the first weight sector is [0,0.05]; Second weight sector be (0.05,0.2]; 3rd weight sector be (0.2,0.5]; 4th weight sector be (0.5,0.8]; And the 5th weight sector be (0.8,1].Thus, the accuracy of the healthy weight judging server hardware device is improve by weight sector.
S204, carries out on-line checkingi by server hardware device On-line Fault Detection instrument to multiple server hardware device.
S205, the Health Category information corresponding to multiple server hardware device according to on-line checkingi result corrects.
According to the server hardware device rank division method of the embodiment of the present invention, first the status information of multiple server hardware device is obtained, calculate according to the status information of multiple server hardware device and preset model the healthy weight that multiple server answers hardware device respectively again, finally determine according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.The method is by under large-scale application environment, Health Category classification is carried out to server hardware device thus reaches the object marking availability level, increase the utilization factor of High Availabitity resource on the one hand, reduce failure rate and the failure cost of low available resources, improve the Performance And Reliability of server and group system; On the other hand by server hardware grade classification, important service is placed on the server of High Availabitity, the validity with server hardware device power managed and the high efficiency of executing the task.
To achieve these goals, the invention allows for a kind of server hardware device grade classification device.
A kind of server hardware device grade classification device, comprising: state information acquisition module, for obtaining the status information of multiple server hardware device; Computing module, for calculating the healthy weight of multiple server hardware device respectively according to the status information of multiple server hardware device and preset model; And grade classification module, for determining according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.
Fig. 3 is the structured flowchart according to one embodiment of the invention server hardware device grade classification device.
As shown in Figure 3, server hardware device grade classification device 30, comprising: state information acquisition module 310, computing module 320 and grade classification module 330.
In one embodiment of the invention, state information acquisition module 310 is for obtaining the status information of multiple server hardware device.
In one embodiment of the invention, status information comprises one or more in server hardware device temperature information, servo-information, head information, medium information, motor information, IO error message and life information.Thus, improve the diversity of status information.
Particularly, for SATA hard disc, status information comprises configuration specification information, temperature information, life information, the failure message of SATA hard disc, and load information etc.Such as, hard disc magnetic head is the critical component that hard disk reads data, its Main Function is exactly the magnetic information be stored on hard disc is converted into electric signal outwards transmit, its principle of work is then utilize the resistance value of special material can along with the principle of changes of magnetic field is to read and write the data on disc, the quality of hard disc magnetic head decides the storage density of hard disc to a great extent, and for example, dangerous power-off number of times hard disk is to the failure message of hard disk.
Be understandable that, obtain the example being only the status information obtaining server hardware device for the status information of SATA hard disc, server hardware device in the status information of the acquisition server hardware device in the embodiment of the present invention is not limited to the above-mentioned citing for SATA hard disc, can also be other server hardware device.
In one embodiment of the invention, server hardware device is CPU, internal memory, network interface card, power supply, hard disk and controller.Thus, improve the diversity of server hardware device.
In one embodiment of the invention, computing module 320 is for calculating the healthy weight of multiple server hardware device respectively according to the status information of multiple server hardware device and preset model.Thus, improve the accuracy of the healthy weight obtaining server hardware device.
In one embodiment of the invention, preset model is for obtain by machine learning.Thus, improve the accuracy obtaining preset model.
In one embodiment of the invention, grade classification module 330 is for determining according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.Thus, the high efficiency and ease for use that utilize multiple server hardware device is improve by grade classification module.
In one embodiment of the invention, Health Category information comprises the first Health Category information to the 5th Health Category information, and the first Health Category information to the multiple default weight sector that the 5th Health Category information is corresponding is respectively the first weight sector to the 5th weight sector.Thus, improve accuracy and the ease for use of Health Category information.
In one embodiment of the invention, the first weight sector is [0,0.05]; Second weight sector be (0.05,0.2]; 3rd weight sector be (0.2,0.5]; 4th weight sector be (0.5,0.8]; And the 5th weight sector be (0.8,1].Thus, the accuracy of the healthy weight judging server hardware device is improve by weight sector.
According to the server hardware device grade classification device of the embodiment of the present invention, first the status information of multiple server hardware device is obtained by state information acquisition module, calculate by computing module the healthy weight that multiple server answers hardware device according to the status information of multiple server hardware device and preset model respectively again, determine according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding eventually through grade classification module.This device is by under large-scale application environment, Health Category classification is carried out to server hardware device thus reaches the object marking availability level, increase the utilization factor of High Availabitity resource on the one hand, reduce failure rate and the failure cost of low available resources, improve the Performance And Reliability of server and group system; On the other hand by server hardware grade classification, important service is placed on the server of High Availabitity, the validity with server hardware device power managed and the high efficiency of executing the task.
Fig. 4 is the structured flowchart of server hardware device grade classification device according to a further embodiment of the invention.
As shown in Figure 4, server hardware device grade classification device 30, also comprises: on-line monitoring module 340 and correction module 350.
In one embodiment of the invention, on-line checkingi module 340 is for carrying out on-line checkingi by server hardware device On-line Fault Detection instrument to multiple server hardware device; And correction module 350 corrects for the Health Category information corresponding to multiple server hardware device according to on-line checkingi result.Thus, improve the accuracy by detecting correction server hardware device Health Category information in real time and ease for use.
According to the server hardware device grade classification device of the embodiment of the present invention, first the status information of multiple server hardware device is obtained by state information acquisition module, calculate by computing module the healthy weight that multiple server answers hardware device according to the status information of multiple server hardware device and preset model respectively again, determine according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding eventually through grade classification module.This device is by under large-scale application environment, Health Category classification is carried out to server hardware device thus reaches the object marking availability level, increase the utilization factor of High Availabitity resource on the one hand, reduce failure rate and the failure cost of low available resources, improve the Performance And Reliability of server and group system; On the other hand by server hardware grade classification, important service is placed on the server of High Availabitity, the validity with server hardware device power managed and the high efficiency of executing the task.
After server hardware device rank division method, management and running can be carried out to server hardware device, therefore the invention allows for a kind of schedule management method of server hardware device.
A schedule management method for server hardware device, comprises the steps: the status information obtaining multiple server hardware device; Health Category information corresponding to multiple server hardware device is obtained respectively according to the status information of multiple server hardware device; And the Health Category information corresponding according to multiple server hardware device manages multiple server hardware device.
Fig. 5 is the process flow diagram of the schedule management method of server hardware device according to an embodiment of the invention.
As shown in Figure 5, the schedule management method of server hardware device, comprises the steps:
S501, obtains the status information of multiple server hardware device.
In one embodiment of the invention, status information comprises one or more in server hardware device temperature information, servo-information, head information, medium information, motor information, IO error message and life information.Thus, improve the diversity of status information.
Particularly, for SATA hard disc, status information comprises configuration specification information, temperature information, life information, the failure message of SATA hard disc, and load information etc.Such as, hard disc magnetic head is the critical component that hard disk reads data, its Main Function is exactly the magnetic information be stored on hard disc is converted into electric signal outwards transmit, its principle of work is then utilize the resistance value of special material can along with the principle of changes of magnetic field is to read and write the data on disc, the quality of hard disc magnetic head decides the storage density of hard disc to a great extent, and for example, dangerous power-off number of times hard disk is to the failure message of hard disk.
Be understandable that, obtain the example being only the status information obtaining server hardware device for the status information of SATA hard disc, server hardware device in the status information of the acquisition server hardware device in the embodiment of the present invention is not limited to the above-mentioned citing for SATA hard disc, can also be other server hardware device.
In one embodiment of the invention, server hardware device is CPU, internal memory, network interface card, power supply, hard disk and controller.Thus, improve the diversity of server hardware device.
S502, obtains Health Category information corresponding to multiple server hardware device respectively according to the status information of multiple server hardware device.
In one embodiment of the invention, Health Category information comprises the first Health Category information to the 5th Health Category information, the first Health Category information to the 5th Health Category information respectively corresponding multiple default weight sector be respectively the first weight sector to the 5th weight sector.Thus, improve accuracy and the ease for use of Health Category information.
In one embodiment of the invention, the first weight sector is [0,0.05]; Second weight sector be (0.05,0.2]; 3rd weight sector be (0.2,0.5]; 4th weight sector be (0.5,0.8]; And the 5th weight sector be (0.8,1].Thus, the accuracy of the healthy weight judging server hardware device is improve by weight sector.
In one embodiment of the invention, the Health Category information obtaining multiple server hardware device respectively corresponding according to the status information of multiple server hardware device specifically comprises the following steps: first ,the healthy weight of multiple server hardware device is calculated respectively according to the status information of multiple server hardware device and preset model, wherein, preset model is obtained by machine learning, particularly, by the acquisition to multiple server hardware device status information, analysis obtains corresponding formula or calculates sample, constantly verifies obtain preset model further by machine; Then, determine according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.Thus, improve the accuracy obtaining Health Category information corresponding to server hardware device.
S503, the Health Category information corresponding according to multiple server hardware device manages multiple server hardware device.
According to the schedule management method of the server hardware device of the embodiment of the present invention, first by obtaining the status information of multiple server hardware device, multiple server hardware device Health Category information corresponding according to the state information acquisition of multiple server hardware device again, finally corresponding according to multiple server hardware device Health Category information manages multiple server hardware device.The method is by multiple server hardware device divided rank, make full use of server hardware device resource, there is the high efficiency and ease for use that utilize server hardware device resource, and reduce the risk of server hardware device failure rate and loss of data, improve the reliability and security that server hardware device stores data.
Fig. 6 is the process flow diagram of the schedule management method of server hardware device in accordance with another embodiment of the present invention.
As shown in Figure 6, the schedule management method of server hardware device, also comprises the steps:
S504, carries out on-line checkingi by server hardware device On-line Fault Detection instrument to multiple server hardware device.
S505, the Health Category information corresponding to multiple server hardware device according to on-line checkingi result corrects.
According to the schedule management method of the server hardware device of the embodiment of the present invention, first by obtaining the status information of multiple server hardware device, multiple server hardware device Health Category information corresponding according to the state information acquisition of multiple server hardware device again, then by server hardware device On-line Fault Detection instrument, on-line checkingi is carried out to multiple server hardware device, and the Health Category information corresponding to multiple server hardware device according to on-line checkingi result corrects, the final Health Category information corresponding according to multiple server hardware device manages multiple server hardware device.The method is by multiple server hardware device divided rank, make full use of server hardware device resource, there is the high efficiency and ease for use that utilize server hardware device resource, simultaneously, the Health Category information corresponding to server hardware device corrects, improve the accuracy of the class information of the server hardware device of acquisition, and reduce the risk of server hardware device failure rate and loss of data, improve the reliability and security that server hardware device stores data.
In order to make the advantage of embodiment of the present invention method more obvious, illustrate below.
First obtain the status information of multiple server hardware device, wherein, for SATA hard disc, status information comprises configuration specification information, temperature information, life information, the failure message of SATA hard disc, and load information etc.Such as, hard disc magnetic head is the critical component that hard disk reads data, its Main Function is exactly the magnetic information be stored on hard disc is converted into electric signal outwards transmit, its principle of work is then utilize the resistance value of special material can along with the principle of changes of magnetic field is to read and write the data on disc, the quality of hard disc magnetic head decides the storage density of hard disc to a great extent, and for example, dangerous power-off number of times hard disk is to the failure message of hard disk.
Be understandable that, obtain the example being only the status information obtaining server hardware device for the status information of SATA hard disc, server hardware device in the status information of the acquisition server hardware device in the embodiment of the present invention is not limited to the above-mentioned citing for SATA hard disc, can also be other server hardware device.
Further, again by utilizing the status information of hard disk, the preset model obtained to machine learning training carries out request prediction acquisition, then the healthy weight of multiple hard disk is calculated respectively according to the status information of multiple hard disk and preset model, the Health Category information that multiple hard disk is corresponding is determined according to the healthy weight of multiple hard disk and multiple default weight sector, wherein, default weight sector is divided into five default weight sub-ranges.
Further, by hard disk failure on-line checkingi instrument, on-line checkingi is carried out to multiple hard disk; And the Health Category information corresponding to multiple hard disk according to on-line checkingi result corrects.
As shown in Table 1, five weight sector lists:
Grade Concise and to the point descriptive grade Hard disk accounting Preset weight sector
FA Occurred or fault detected 5% [0,0.05]
FB Very likely occur in the recent period or minor failure 15% (0.05,0.2]
FC Obviously aging or hydraulic performance decline 30% (0.2,0.5]
GB Slightly aging or performance is slightly fallen 30% (0.5,0.8]
GA Performance, reliability are good 20% (0.8,1]
Table one
Particularly, Health Category information comprises the first Health Category information to the 5th Health Category information, respectively corresponding FA, FB, FC, GB and GA, the first Health Category information to the 5th Health Category information respectively corresponding multiple default weight sector be respectively the first weight sector to the 5th weight sector.Wherein, hard disk accounting is the ratio of hard disk shared by each Health Category information in the server hardware device of certain data center, and the hard disk health status in the server hardware device of this data center is very clear.
Further, the default weight sector of hard disk is [0,1], more close to 0 value, shows that disk state information is poorer, namely more close to fault; Otherwise more close to 1, then show that disk state information is better.The default weight sector Further Division of hard disk is 5 default weight sub-ranges, is respectively [0,0.05], (0.05,0.2] (and 0.2,0.5] (0.5,0.8] and (0.8,1], acquiescence is corresponding FA, FB, FC, GB and GA respectively.
Particularly, in the comparatively ideal situation of data, according to passing through machine learning, positive example with negative example at 0.5 place by strict cutting, strict positive example interval (0.8 is obtained again according to preset model, 1] and strict negative example interval [0,0.05], corresponding hard disk performance and reliability well and occurred or by the bad hard disk of hard disk failure on-line checkingi tool detection to fault respectively.Further, according to the preset model obtained by machine learning, wherein, by the acquisition to multiple disk state information, analysis obtains corresponding formula or calculates sample, constantly verify further by machine and obtain preset model, the stricter positive example interval obtained by the result that preset model calculates (0.8,1] positive example accuracy rate, the accuracy rate of namely good hard disk is 0.9938, the negative routine accuracy rate of strict negative example interval [0,0.05], the accuracy rate of namely bad hard disk is 0.9992.As shown in Table 2:
Threshold value Positive example accuracy rate Negative routine accuracy rate
(0.8,1] 0.9938 -
[0,0.05] - 0.9992
Table two
And relatively positive example interval (0.5,0.8] the then corresponding hard disk slightly good hard disk that slightly falls of aging or hard disk performance.Again according to practical application scene and sample mark, the model adopted can the look-ahead ratio that goes out the hard disk probably broken down in the recent period be 15%, therefore relatively negative example interval is (0.05,0.2] there is the possible hard disk of incipient fault in correspondence, and remaining interval (0.2,0.5] then aging the or hard disk performance of corresponding hard disk declines, but the hard disk that short-term can not break down, namely in like manner according to practical application scene and sample mark, the model of employing can look-ahead to go out the ratio that obviously aging hard disk or hard disk performance decline be 30%.
Finally by hard disk failure on-line checkingi instrument, on-line checkingi is carried out to multiple hard disk again, and the Health Category information corresponding to hard disk according to on-line checkingi result corrects, wherein, hard disk failure on-line checkingi instrument carries out on-line checkingi hard disk to hard disk respectively and whether there is fault, whether readyly comprise detection hard disk, detect hard disk and whether there is bad sector, detect hard disk whether hydraulic performance decline, detect hard drive internal temperature whether higher and detect hard disk whether close to one or more of guarantee period.
Particularly, the Health Category information corresponding to hard disk respectively according to the on-line checkingi result to above-mentioned situation corrects, namely as shown in above-mentioned table one: five the Health Category information corresponding to hard disk correct, and correction rule is as follows:
For FA: if the Health Category information correction mistake that last time is corresponding to hard disk according to on-line checkingi result, then keep the result corrected; Otherwise when fault not detected by hard disk failure on-line checkingi instrument, and when reading performance higher than 80MB/s by the hard disk of hard disk failure on-line checkingi tool detection, if life information is less than 5000 hours, then be corrected to GB, namely hard disk slightly aging or performance slightly fall, some unessential data can be preserved; If life information is less than 10000 hours, be then corrected to FC, namely hard disk obviously aging or hydraulic performance decline, can preserve some unessential data.
For FB: if the Health Category information correction mistake that last time is corresponding to hard disk according to on-line checkingi result, then keep the result corrected; If the temperature information of hard disk is more than 52 degrees Celsius; Or life information was more than 20000 hours, and reads performance by the hard disk of hard disk failure on-line checkingi tool detection and be less than 40MB/s, be then corrected to FA, namely occurred or fault detected, maintenance of can rolling off the production line immediately; When fault not detected by hard disk failure on-line checkingi instrument, and when reading performance higher than 80MB/s by the hard disk of hard disk failure on-line checkingi tool detection, if life information is less than 5000 hours, be then corrected to GB, namely hard disk slightly aging or performance slightly fall; If life information is less than 10000 hours, be then corrected to FC, namely hard disk obviously aging or hydraulic performance decline, can preserve some unessential data.
For FC: if hard disk temperature information is more than 52 degrees Celsius; Or temperature information is more than 43 degrees Celsius, and reads performance by the hard disk of hard disk failure on-line checkingi tool detection and be less than 80MB/s; Or life information was more than 20000 hours, and reads performance by the hard disk of hard disk failure on-line checkingi tool detection and be less than 40MB/s; Then be corrected to FB, namely hard disk very likely occurs or minor failure in the recent period, needs migration data early, avoids hard disk failure to cause loss of data; If life information is less than 5000 hours, and read performance higher than 80MB/s by the hard disk of hard disk failure on-line checkingi tool detection, be then corrected to GB, namely hard disk slightly aging or performance slightly fall, some unessential data can be preserved.
For GB: if hard disk temperature information is more than 52 degrees Celsius; Or temperature information is more than 43 degrees Celsius, and reads performance by the hard disk of hard disk failure on-line checkingi tool detection and be less than 80MB/s, be then corrected to FC, namely hard disk obviously aging or hydraulic performance decline, can preserve some unessential data; If life information was more than 20000 hours, and read performance by the hard disk of hard disk failure on-line checkingi tool detection and be less than 40MB/s, be then corrected to FB, namely hard disk very likely occurs or minor failure in the recent period, need migration data early, avoid fault to cause loss of data.
For GA: if hard disk temperature information is more than 52 degrees Celsius, be then corrected to FC, namely hard disk obviously aging or hydraulic performance decline, can preserve some unessential data; If temperature information is more than 43 degrees Celsius, and reads performance by the hard disk of hard disk failure on-line checkingi tool detection and be less than 80MB/s; Or life information was more than 20000 hours, and reads performance by the hard disk of hard disk failure on-line checkingi tool detection and be less than 40MB/s, be then corrected to GB, namely hard disk slightly aging or performance slightly fall, some unessential data can be preserved.
This method is according to the status information of hard disk and the healthy weight being calculated hard disk by the preset model that machine learning obtains respectively; And the Health Category information of hard disk is determined according to the healthy weight of hard disk and five weight sector presetting, further, by hard disk failure on-line checkingi instrument, on-line checkingi is carried out to hard disk, the Health Category information corresponding to hard disk according to on-line checkingi result is done and is corrected further, to make full use of hard disk resources, and there is the high efficiency and ease for use that utilize hard disk resources, and reduce the risk of hard disk failure rate and loss of data, improve the reliability and security storing data.
Be understandable that, because server hardware device can be CPU, internal memory, network interface card, hard disk and controller etc., the citing of above-mentioned SATA hard disc is only a kind of example of the schedule management method realizing server hardware device.Spread to thus, such as, on CPU is intensive, IO is intensive, the service of communications-intensive is placed on High Availabitity by the schedule management method of server hardware device CPU, IO and network, thus management and running more efficiently and application can also be realized.
In order to realize above-mentioned example, the present invention also proposes a kind of server hardware device dispatching managing device.
A kind of server hardware device dispatching managing device, comprising: state information acquisition module, for obtaining the status information of multiple server hardware device; Health Category data obtaining module, for obtaining Health Category information corresponding to multiple server hardware device respectively according to the status information of multiple server hardware device; And administration module, for the Health Category information corresponding according to multiple server hardware device, multiple server hardware device is managed.
Fig. 7 is the structured flowchart according to one embodiment of the invention server hardware device dispatching managing device.
As shown in Figure 7, server hardware device dispatching managing device 70, comprising: state information acquisition module 710, Health Category data obtaining module 720 and administration module 730.
Particularly, state information acquisition module 710, for obtaining the status information of multiple server hardware device.
In one embodiment of the invention, status information comprises one or more in server hardware device temperature information, servo-information, head information, medium information, motor information, IO error message and life information.Thus, improve the diversity of status information.
Particularly, for SATA hard disc, status information comprises configuration specification information, temperature information, life information, the failure message of SATA hard disc, and load information etc.Such as, hard disc magnetic head is the critical component that hard disk reads data, its Main Function is exactly the magnetic information be stored on hard disc is converted into electric signal outwards transmit, its principle of work is then utilize the resistance value of special material can along with the principle of changes of magnetic field is to read and write the data on disc, the quality of hard disc magnetic head decides the storage density of hard disc to a great extent, and for example, dangerous power-off number of times hard disk is to the failure message of hard disk.
Be understandable that, obtain the example being only the status information obtaining server hardware device for the status information of SATA hard disc, server hardware device in the status information of the acquisition server hardware device in the embodiment of the present invention is not limited to the above-mentioned citing for SATA hard disc, can also be other server hardware device.
In one embodiment of the invention, server hardware device is CPU, internal memory, network interface card, power supply, hard disk and controller.Thus, improve the diversity of server hardware device.
Health Category data obtaining module 720, for obtaining Health Category information corresponding to multiple server hardware device respectively according to the status information of multiple server hardware device.
In one embodiment of the invention, Health Category information comprises the first Health Category information to the 5th Health Category information, the first Health Category information to the 5th Health Category information respectively corresponding multiple default weight sector be respectively the first weight sector to the 5th weight sector.Thus, improve accuracy and the ease for use of Health Category information.
In one embodiment of the invention, the first weight sector is [0,0.05]; Second weight sector be (0.05,0.2]; 3rd weight sector be (0.2,0.5]; 4th weight sector be (0.5,0.8]; And the 5th weight sector be (0.8,1].Thus, improve the accuracy of the healthy weight judging server hardware device.
In one embodiment of the invention, Health Category information module 720 calculates the healthy weight of multiple server hardware device respectively according to the status information of multiple server hardware device and preset model, wherein, preset model is obtained by machine learning, particularly, by the acquisition to multiple server hardware device status information, analysis obtains corresponding formula or calculates sample, constantly verify further by machine and obtain preset model, and determine according to the healthy weight of multiple server hardware device and multiple default weight sector the Health Category information that multiple server hardware device is corresponding.Thus, improve the accuracy obtaining Health Category information corresponding to server hardware device.
Administration module 730, manages multiple server hardware device for the Health Category information corresponding according to multiple server hardware device.
According to the server hardware device dispatching managing device of the embodiment of the present invention, first the status information of multiple server hardware device is obtained by state information acquisition module, obtain corresponding multiple server hardware device Health Category information according to the status information of multiple server hardware device by Health Category data obtaining module again, finally corresponding according to multiple server hardware device Health Category information is managed multiple server hardware device by administration module.This device is by multiple server hardware device divided rank, make full use of server hardware device resource, there is the high efficiency and ease for use that utilize server hardware device resource, and reduce the risk of server hardware device failure rate and loss of data, improve the reliability and security that server hardware device stores data.
Fig. 8 is the structured flowchart of server hardware device dispatching managing device according to a further embodiment of the invention.
As shown in Figure 8, server hardware device dispatching managing device 70, also comprises: on-line checkingi module 740 and correction module 750.
On-line checkingi module 740, for carrying out on-line checkingi by server hardware device On-line Fault Detection instrument to multiple server hardware device; And correction module 750, correct for the Health Category information corresponding to multiple server hardware device according to on-line checkingi result.Thus, improve the accuracy by detecting correction server hardware device Health Category information in real time and ease for use.
According to the server hardware device dispatching managing device of the embodiment of the present invention, first the status information of multiple server hardware device is obtained by state information acquisition module, corresponding multiple server hardware device Health Category information are obtained by Health Category data obtaining module again according to the status information of multiple server hardware device, then server hardware device On-line Fault Detection instrument carries out on-line checkingi by on-line checkingi module to multiple server hardware device, and corrected by the Health Category information that correction module is corresponding to multiple server hardware device according to on-line checkingi result, the final Health Category information corresponding according to multiple server hardware device is managed multiple server hardware device by administration module.This device is by multiple server hardware device divided rank, make full use of server hardware device resource, there is the high efficiency and ease for use that utilize server hardware device resource, simultaneously, the Health Category information corresponding to server hardware device corrects, improve the accuracy of the class information of the server hardware device of acquisition, and reduce the risk of server hardware device failure rate and loss of data, improve the reliability and security that server hardware device stores data.
Fig. 9 is the structured flowchart according to one embodiment of the invention server.
As shown in Figure 9, server 90, comprises the server hardware device dispatching managing device 70 of above-described embodiment.
According to the server of the embodiment of the present invention, first the status information of multiple server hardware device is obtained by the state information acquisition module of server hardware device dispatching managing device, corresponding multiple server hardware device Health Category information are obtained by the Health Category data obtaining module of server hardware device dispatching managing device again according to the status information of multiple server hardware device, the final Health Category information corresponding according to multiple server hardware device is managed multiple server hardware device by the administration module of server hardware device dispatching managing device.This server makes full use of the server hardware device resource of server hardware device management devices, there is the high efficiency and ease for use that utilize server hardware device resource, and reduce the risk of server hardware device failure rate and loss of data, improve the reliability and security of server.
Describe and can be understood in process flow diagram or in this any process otherwise described or method, represent and comprise one or more for realizing the module of the code of the executable instruction of the step of specific logical function or process, fragment or part, and the scope of the preferred embodiment of the present invention comprises other realization, wherein can not according to order that is shown or that discuss, comprise according to involved function by the mode while of basic or by contrary order, carry out n-back test, this should understand by embodiments of the invention person of ordinary skill in the field.In flow charts represent or in this logic otherwise described and/or step, such as, the sequencing list of the executable instruction for realizing logic function can be considered to, may be embodied in any computer-readable medium, for instruction execution system, device or equipment (as computer based system, comprise the system of processor or other can from instruction execution system, device or equipment instruction fetch and perform the system of instruction) use, or to use in conjunction with these instruction execution systems, device or equipment.With regard to this instructions, " computer-readable medium " can be anyly can to comprise, store, communicate, propagate or transmission procedure for instruction execution system, device or equipment or the device that uses in conjunction with these instruction execution systems, device or equipment.The example more specifically (non-exhaustive list) of computer-readable medium comprises following: the electrical connection section (electronic installation) with one or more wiring, portable computer diskette box (magnetic device), random-access memory (ram), ROM (read-only memory) (ROM), erasablely edit ROM (read-only memory) (EPROM or flash memory), fiber device, and portable optic disk ROM (read-only memory) (CDROM).In addition, computer-readable medium can be even paper or other suitable media that can print described program thereon, because can such as by carrying out optical scanning to paper or other media, then carry out editing, decipher or carry out process with other suitable methods if desired and electronically obtain described program, be then stored in computer memory.
Should be appreciated that each several part of the present invention can realize with hardware, software, firmware or their combination.In the above-described embodiment, multiple step or method can with to store in memory and the software performed by suitable instruction execution system or firmware realize.Such as, if realized with hardware, the same in another embodiment, can realize by any one in following technology well known in the art or their combination: the discrete logic with the logic gates for realizing logic function to data-signal, there is the special IC of suitable combinational logic gate circuit, programmable gate array (PGA), field programmable gate array (FPGA) etc.
Those skilled in the art are appreciated that realizing all or part of step that above-described embodiment method carries is that the hardware that can carry out instruction relevant by program completes, described program can be stored in a kind of computer-readable recording medium, this program perform time, step comprising embodiment of the method one or a combination set of.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing module, also can be that the independent physics of unit exists, also can be integrated in a module by two or more unit.Above-mentioned integrated module both can adopt the form of hardware to realize, and the form of software function module also can be adopted to realize.If described integrated module using the form of software function module realize and as independently production marketing or use time, also can be stored in a computer read/write memory medium.
The above-mentioned storage medium mentioned can be ROM (read-only memory), disk or CD etc.
In the description of this instructions, specific features, structure, material or feature that the description of reference term " embodiment ", " some embodiments ", " example ", " concrete example " or " some examples " etc. means to describe in conjunction with this embodiment or example are contained at least one embodiment of the present invention or example.In this manual, identical embodiment or example are not necessarily referred to the schematic representation of above-mentioned term.And the specific features of description, structure, material or feature can combine in an appropriate manner in any one or more embodiment or example.
Although illustrate and describe embodiments of the invention above, be understandable that, above-described embodiment is exemplary, can not be interpreted as limitation of the present invention, those of ordinary skill in the art can change above-described embodiment within the scope of the invention when not departing from principle of the present invention and aim, revising, replacing and modification.Scope of the present invention is by claims extremely equivalency.

Claims (31)

1. a server hardware device rank division method, is characterized in that, comprising:
Obtain the status information of multiple server hardware device;
The healthy weight of described multiple server hardware device is calculated respectively according to the status information of described multiple server hardware device and preset model; And
The Health Category information that described multiple server hardware device is corresponding is determined according to the healthy weight of described multiple server hardware device and multiple default weight sector.
2. the method for claim 1, is characterized in that, described preset model is for obtain by machine learning.
3. the method for claim 1, it is characterized in that, described Health Category information comprises the first Health Category information to the 5th Health Category information, and described first Health Category information is respectively the first weight sector to the 5th weight sector to described multiple default weight sector that the 5th Health Category information is corresponding.
4. method as claimed in claim 3, it is characterized in that, described first weight sector is [0,0.05]; Described second weight sector be (0.05,0.2]; Described 3rd weight sector be (0.2,0.5]; Described 4th weight sector be (0.5,0.8]; And described 5th weight sector be (0.8,1].
5. the method as described in any one of claim 1-4, is characterized in that, determines also to comprise the Health Category information that described multiple server hardware device is corresponding according to the healthy weight of described multiple server hardware device and multiple default weight sector:
By server hardware device On-line Fault Detection instrument, on-line checkingi is carried out to described multiple server hardware device; And
The Health Category information corresponding to described multiple server hardware device according to on-line checkingi result corrects.
6. the method for claim 1, is characterized in that, described status information comprises configuration specification information, temperature information, life information, the failure message of server hardware device, and one or more in load information.
7. the method for claim 1, is characterized in that, described server hardware device is CPU, internal memory, network interface card, power supply, hard disk and controller.
8. a server hardware device grade classification device, is characterized in that, comprising:
State information acquisition module, for obtaining the status information of multiple server hardware device;
Computing module, for calculating the healthy weight of described multiple server hardware device respectively according to the status information of described multiple server hardware device and preset model; And
Grade classification module, for determining according to the healthy weight of described multiple server hardware device and multiple default weight sector the Health Category information that described multiple server hardware device is corresponding.
9. device as claimed in claim 8, it is characterized in that, described preset model is for obtain by machine learning.
10. device as claimed in claim 8, it is characterized in that, described Health Category information comprises the first Health Category information to the 5th Health Category information, and described first Health Category information is respectively the first weight sector to the 5th weight sector to described multiple default weight sector that the 5th Health Category information is corresponding.
11. devices as claimed in claim 10, is characterized in that, described first weight sector is [0,0.05]; Described second weight sector be (0.05,0.2]; Described 3rd weight sector be (0.2,0.5]; Described 4th weight sector be (0.5,0.8]; And described 5th weight sector be (0.8,1].
12. devices as described in any one of claim 8-11, is characterized in that, also comprise:
On-line checkingi module, for carrying out on-line checkingi by server hardware device On-line Fault Detection instrument to described multiple server hardware device; And
Correction module, corrects for the Health Category information corresponding to described multiple server hardware device according to on-line checkingi result.
13. devices as claimed in claim 8, it is characterized in that, described status information comprises configuration specification information, temperature information, life information, the failure message of server hardware device, and one or more in load information.
14. devices as claimed in claim 8, it is characterized in that, described server hardware device is CPU, internal memory, network interface card, power supply, hard disk and controller.
The schedule management method of 15. 1 kinds of server hardware device, is characterized in that, comprising:
Obtain the status information of multiple server hardware device;
Health Category information corresponding to described multiple server hardware device is obtained respectively according to the status information of described multiple server hardware device; And
The Health Category information corresponding according to described multiple server hardware device manages described multiple server hardware device.
16. methods as claimed in claim 15, is characterized in that, the Health Category information that the described status information according to described multiple server hardware device obtains described multiple server hardware device corresponding respectively comprises further:
The healthy weight of described multiple server hardware device is calculated respectively according to the status information of described multiple server hardware device and preset model; And
The Health Category information that described multiple server hardware device is corresponding is determined according to the healthy weight of described multiple server hardware device and multiple default weight sector.
17. methods as claimed in claim 16, it is characterized in that, described preset model is for obtain by machine learning.
18. methods as claimed in claim 15, it is characterized in that, described Health Category information comprises the first Health Category information to the 5th Health Category information, and described first Health Category information is respectively the first weight sector to the 5th weight sector to described multiple default weight sector that the 5th Health Category information is corresponding.
19. methods as claimed in claim 18, is characterized in that, described first weight sector is [0,0.05]; Described second weight sector be (0.05,0.2]; Described 3rd weight sector be (0.2,0.5]; Described 4th weight sector be (0.5,0.8]; And described 5th weight sector be (0.8,1].
20. methods as described in any one of claim 15-19, is characterized in that, after the described status information according to multiple server hardware device obtains Health Category information corresponding to described multiple server hardware device respectively, also comprise:
By server hardware device On-line Fault Detection instrument, on-line checkingi is carried out to described multiple server hardware device; And
The Health Category information corresponding to described multiple server hardware device according to on-line checkingi result corrects.
21. methods as claimed in claim 15, it is characterized in that, described status information comprises configuration specification information, temperature information, life information, the failure message of server hardware device, and one or more in load information.
22. methods as claimed in claim 15, it is characterized in that, described server hardware device is CPU, internal memory, network interface card, power supply, hard disk and controller.
23. 1 kinds of server hardware device dispatching managing devices, is characterized in that, comprising:
State information acquisition module, for obtaining the status information of multiple server hardware device;
Health Category data obtaining module, for obtaining Health Category information corresponding to described multiple server hardware device respectively according to the status information of described multiple server hardware device; And
Administration module, manages described multiple server hardware device for the Health Category information corresponding according to described multiple server hardware device.
24. devices as claimed in claim 23, it is characterized in that, described Health Category data obtaining module calculates the healthy weight of described multiple server hardware device respectively according to the status information of described multiple server hardware device and preset model, and determines according to the healthy weight of described multiple server hardware device and multiple default weight sector the Health Category information that described multiple server hardware device is corresponding.
25. devices as claimed in claim 24, it is characterized in that, described preset model is for obtain by machine learning.
26. devices as claimed in claim 23, it is characterized in that, described Health Category information comprises the first Health Category information to the 5th Health Category information, and described first Health Category information is to corresponding first weight sector of the 5th Health Category information difference to the 5th weight sector.
27. devices as claimed in claim 26, is characterized in that, described first weight sector is [0,0.05]; Described second weight sector be (0.05,0.2]; Described 3rd weight sector be (0.2,0.5]; Described 4th weight sector be (0.5,0.8]; And described 5th weight sector be (0.8,1].
28. devices as described in any one of claim 23-27, is characterized in that, also comprise:
On-line checkingi module, for carrying out on-line checkingi by server hardware device On-line Fault Detection instrument to described multiple server hardware device; And
Correction module, corrects for the Health Category information corresponding to described multiple server hardware device according to on-line checkingi result.
29. devices as claimed in claim 23, it is characterized in that, described status information comprises configuration specification information, temperature information, life information, the failure message of server hardware device, and one or more in load information.
30. devices as claimed in claim 23, it is characterized in that, described server hardware device is CPU, internal memory, network interface card, power supply, hard disk and controller.
31. 1 kinds of servers, is characterized in that, comprise the server hardware device dispatching managing device as described in any one of claim 23-30.
CN201310335053.4A 2013-08-02 2013-08-02 Server hardware device grade classification, schedule management method and device, server Active CN104346221B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310335053.4A CN104346221B (en) 2013-08-02 2013-08-02 Server hardware device grade classification, schedule management method and device, server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310335053.4A CN104346221B (en) 2013-08-02 2013-08-02 Server hardware device grade classification, schedule management method and device, server

Publications (2)

Publication Number Publication Date
CN104346221A true CN104346221A (en) 2015-02-11
CN104346221B CN104346221B (en) 2018-05-08

Family

ID=52501906

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310335053.4A Active CN104346221B (en) 2013-08-02 2013-08-02 Server hardware device grade classification, schedule management method and device, server

Country Status (1)

Country Link
CN (1) CN104346221B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016180049A1 (en) * 2015-05-14 2016-11-17 中兴通讯股份有限公司 Storage management method and distributed file system
CN106155807A (en) * 2015-04-15 2016-11-23 阿里巴巴集团控股有限公司 A kind of method and apparatus realizing scheduling of resource
CN107436812A (en) * 2017-07-28 2017-12-05 北京深思数盾科技股份有限公司 A kind of method and device of linux system performance optimization
CN107766346A (en) * 2016-08-15 2018-03-06 中国联合网络通信集团有限公司 Distributed file system file access method and device
CN107918560A (en) * 2016-10-14 2018-04-17 郑州云海信息技术有限公司 A kind of server apparatus management method and device
WO2018077285A1 (en) * 2016-10-31 2018-05-03 腾讯科技(深圳)有限公司 Machine learning model training method and apparatus, server and storage medium
CN108228840A (en) * 2018-01-05 2018-06-29 北京盛世博创信息技术有限公司 Environment monitoring control method, device, terminal and computer readable storage medium
CN108628231A (en) * 2018-07-05 2018-10-09 郑州云海信息技术有限公司 Apparatus monitoring method and device in cloud data center
CN110955587A (en) * 2019-11-29 2020-04-03 北京金山云网络技术有限公司 Method and device for determining equipment to be replaced
CN112506725A (en) * 2020-12-04 2021-03-16 苏州浪潮智能科技有限公司 Method, device and equipment for judging grade of repaired solid state disk and readable medium
CN115840627A (en) * 2022-12-19 2023-03-24 上海伯镭智能科技有限公司 Mine car task scheduling method and device based on Internet of vehicles

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050154860A1 (en) * 2004-01-13 2005-07-14 International Business Machines Corporation Method and data processing system optimizing performance through reporting of thread-level hardware resource utilization
CN103095488A (en) * 2012-12-14 2013-05-08 北京思特奇信息技术股份有限公司 Condition monitoring system and condition monitoring method for self-service terminal peripheral hardware
CN103139007A (en) * 2011-12-05 2013-06-05 阿里巴巴集团控股有限公司 Method and system for detecting application server performance
CN103200050A (en) * 2013-04-12 2013-07-10 北京百度网讯科技有限公司 Server hardware state monitoring method and server hardware state monitoring system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050154860A1 (en) * 2004-01-13 2005-07-14 International Business Machines Corporation Method and data processing system optimizing performance through reporting of thread-level hardware resource utilization
CN103139007A (en) * 2011-12-05 2013-06-05 阿里巴巴集团控股有限公司 Method and system for detecting application server performance
CN103095488A (en) * 2012-12-14 2013-05-08 北京思特奇信息技术股份有限公司 Condition monitoring system and condition monitoring method for self-service terminal peripheral hardware
CN103200050A (en) * 2013-04-12 2013-07-10 北京百度网讯科技有限公司 Server hardware state monitoring method and server hardware state monitoring system

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10789100B2 (en) 2015-04-15 2020-09-29 Alibaba Group Holding Limited System, apparatus and method for resource provisioning
CN106155807A (en) * 2015-04-15 2016-11-23 阿里巴巴集团控股有限公司 A kind of method and apparatus realizing scheduling of resource
CN106293492A (en) * 2015-05-14 2017-01-04 中兴通讯股份有限公司 A kind of memory management method and distributed file system
CN106293492B (en) * 2015-05-14 2021-08-20 中兴通讯股份有限公司 Storage management method and distributed file system
WO2016180049A1 (en) * 2015-05-14 2016-11-17 中兴通讯股份有限公司 Storage management method and distributed file system
CN107766346A (en) * 2016-08-15 2018-03-06 中国联合网络通信集团有限公司 Distributed file system file access method and device
CN107918560A (en) * 2016-10-14 2018-04-17 郑州云海信息技术有限公司 A kind of server apparatus management method and device
WO2018077285A1 (en) * 2016-10-31 2018-05-03 腾讯科技(深圳)有限公司 Machine learning model training method and apparatus, server and storage medium
US11531841B2 (en) 2016-10-31 2022-12-20 Tencent Technology (Shenzhen) Company Limited Machine learning model training method and apparatus, server, and storage medium
US11861478B2 (en) 2016-10-31 2024-01-02 Tencent Technology (Shenzhen) Company Limited Machine learning model training method and apparatus, server, and storage medium
CN107436812A (en) * 2017-07-28 2017-12-05 北京深思数盾科技股份有限公司 A kind of method and device of linux system performance optimization
CN108228840A (en) * 2018-01-05 2018-06-29 北京盛世博创信息技术有限公司 Environment monitoring control method, device, terminal and computer readable storage medium
CN108628231A (en) * 2018-07-05 2018-10-09 郑州云海信息技术有限公司 Apparatus monitoring method and device in cloud data center
CN110955587A (en) * 2019-11-29 2020-04-03 北京金山云网络技术有限公司 Method and device for determining equipment to be replaced
CN112506725A (en) * 2020-12-04 2021-03-16 苏州浪潮智能科技有限公司 Method, device and equipment for judging grade of repaired solid state disk and readable medium
CN112506725B (en) * 2020-12-04 2023-01-06 苏州浪潮智能科技有限公司 Method, device and equipment for judging grade of repaired solid state disk and readable medium
CN115840627A (en) * 2022-12-19 2023-03-24 上海伯镭智能科技有限公司 Mine car task scheduling method and device based on Internet of vehicles
CN115840627B (en) * 2022-12-19 2023-09-29 上海伯镭智能科技有限公司 Mine car task scheduling method and device based on Internet of vehicles

Also Published As

Publication number Publication date
CN104346221B (en) 2018-05-08

Similar Documents

Publication Publication Date Title
CN104346221A (en) Method and device for grading and dispatching management of server hardware equipment and server
US10519960B2 (en) Fan failure detection and reporting
US10147048B2 (en) Storage device lifetime monitoring system and storage device lifetime monitoring method thereof
US6219597B1 (en) Process and device for aiding the maintenance of a complex system, especially an aircraft
RU2757436C2 (en) Device and method for monitoring indications of malfunction from vehicle, computer-readable media
CN103218180B (en) Disk localization method and positioner
CN103617110A (en) Server device condition maintenance system
CN104350435A (en) Embedded prognostics on PLC platforms for equipment condition monitoring, diagnosis and time-to-failure/service prediction
CN103514068A (en) Method for automatically locating internal storage faults
US20110093157A1 (en) System and method for selecting a maintenance operation
US8286034B2 (en) Accurate fault status tracking of variable access sensors
CN108205424A (en) Data migration method, device and electronic equipment based on disk
CN103389124A (en) Method and system for sensor testing
US10749758B2 (en) Cognitive data center management
JP2020052714A (en) Monitoring system and monitoring method
CN111309502A (en) Solid state disk service life prediction method
WO2019049523A1 (en) Risk assessment device, risk assessment system, risk assessment method, and risk assessment program
CN114462820A (en) Bearing state monitoring and health management system performance testing and optimizing method and system
JP2001125626A (en) Plant equipment managing device
US20240037831A1 (en) Datacenter dashboard with temporal features
CN109669796A (en) A kind of prediction technique and device of disk failure
CN106197854A (en) Method and device for judging water inflow of battery pack
WO2019049521A1 (en) Risk evaluation device, risk evaluation system, risk evaluation method, risk evaluation program, and data structure
CN104345858A (en) Method and device for managing power consumption of server hardware equipment, and server
CN110083470B (en) Disk analysis method, apparatus and computer readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant