CN106445754A - Method and system for inspecting cluster health status and cluster server - Google Patents

Method and system for inspecting cluster health status and cluster server Download PDF

Info

Publication number
CN106445754A
CN106445754A CN201610822574.6A CN201610822574A CN106445754A CN 106445754 A CN106445754 A CN 106445754A CN 201610822574 A CN201610822574 A CN 201610822574A CN 106445754 A CN106445754 A CN 106445754A
Authority
CN
China
Prior art keywords
cluster
health status
detection
test result
status
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610822574.6A
Other languages
Chinese (zh)
Inventor
马四腾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd filed Critical Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201610822574.6A priority Critical patent/CN106445754A/en
Publication of CN106445754A publication Critical patent/CN106445754A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2205Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing using arrangements specific to the hardware being tested
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/26Functional testing

Abstract

The invention discloses a method for inspecting cluster health status comprising following steps: setting detection indexes of cluster health status, wherein the detection indexes comprise a device performance detection index and a cluster environment status detection index; collecting status information corresponding to the detection indexes; detecting according to the status information and by means of detection scripts corresponding to the cluster environment status detection indexes, and determining the health status of the cluster environment status according to detection results; testing according to the status information and by means of performance detection programs and/ or application performance detection programs, and determining the cluster health status according to test results; the method can carried out comprehensive health status inspection on a cluster through detecting aspects such as the cluster service status, hardware performance indexes and application compatibility; it is convenient for technicians to carry out malfunction elimination on a cluster system; the invention discloses a system and a server for inspecting cluster health status which have above beneficial effects.

Description

A kind of method checking cluster health status, system and cluster server
Technical field
The present invention relates to field of computer technology, particularly to a kind of method checking cluster health status, system and collection Group's server.
Background technology
At present, the development with computer technology and being increasingly widely applied, more and more depends on computer skill The application system of art has come into work and the life of people.Although as the speed development to make rapid progress for the computer technology, single The Performance And Reliability of platform computer is become better and better, but has the requirement of much reality to be that single computer is unapproachable. Such as a lot of industries, such as molecule power, fluid dynamic etc. is required for high-performance calculation as background support.High-performance calculation collection As a total system, its framework great majority is to build up cluster by a lot of server groups to use to group, because it needs to provide by force Big computing capability, server is combined for up to a hundred easily, and number of servers is many, and overall fault rate also can rise, firmly Part fault is easy to be found, but how Check System level fault is it is simply that a problem.
Content of the invention
It is an object of the invention to provide a kind of method checking cluster health status, system and server, can be by inspection Survey cluster service state, hardware performance index, the aspect such as application compatibility cluster is done with omnibearing health status inspection;Just In technical staff, malfunction elimination is carried out to group system.
For solving above-mentioned technical problem, the present invention provides a kind of method checking cluster health status, including:
The Testing index of setting cluster health status, wherein, described Testing index includes equipment performance Testing index and collection Group rings border state-detection index;
Gather the corresponding status information of described Testing index;
According to described status information, examined using the corresponding detection script of each described cluster environment state-detection index Survey, and judge the health status of cluster environment state according to testing result;
According to described status information, utility detection program and/or application performance detection program are tested, according to survey Test result judges cluster health status.
Wherein, according to described status information, utility detection program and/or application performance detection program are tested, According to test result, including:
When judging the health status of cluster environment state as health, according to described status information, utility detects journey Sequence and/or application performance detection program are tested, and according to test result, judge cluster health status.
Wherein, the method also includes:
Described status information and/or testing result and/or test result are preserved to journal file.
Wherein, utility detection program is tested, and judges cluster health status according to test result, including:
Utility detection program carries out the test of single node benchmark;
When test result is less than performance detection threshold value, cluster health status are unhealthy;
When test result is not less than performance detection threshold value, cluster health status are health.
Wherein, tested using application performance detection program, judged that cluster health status include according to test result:
Create the running environment of predetermined application;
In each running environment, little example calculating is carried out according to corresponding statess information, obtains test result;
When test result is less than application performance detection threshold value, cluster health status are unhealthy;
When test result is not less than application performance detection threshold value, cluster health status are health.
The present invention also provides a kind of system checking cluster health status, including:
Setup module, for arranging the Testing index of cluster health status, wherein, described Testing index includes equipment performance Testing index and cluster environment state-detection index;
Acquisition module, for gathering the corresponding status information of described Testing index;
Cluster environment state detection module, for according to described status information, using each described cluster environment state-detection The corresponding detection script of index is detected, and judges the health status of cluster environment state according to testing result;
Cluster performance detection module, for according to described status information, utility detection program and/or application performance inspection Ranging sequence is tested, and judges cluster health status according to test result.
Wherein, this system also includes:
Preserving module, for preserving described status information and/or testing result and/or test result to journal file In.
Wherein, described cluster performance detection module, including:Single node benchmark test cell, for utility inspection Ranging sequence carries out the test of single node benchmark;When test result is less than performance detection threshold value, cluster health status are not to be good for Health;When test result is not less than performance detection threshold value, cluster health status are health.
Wherein, described cluster performance detection module, including:Application performance detector unit, for creating the fortune of predetermined application Row environment;In each running environment, little example calculating is carried out according to corresponding statess information, obtains test result;Work as test result During less than application performance detection threshold value, cluster health status are unhealthy;When test result is not less than application performance detection threshold value When, cluster health status are health.
The present invention also provides a kind of cluster server, including:Inspection cluster health status according to any of the above-described System.
A kind of method checking cluster health status provided by the present invention, including:The detection of setting cluster health status Index, wherein, described Testing index includes equipment performance Testing index and cluster environment state-detection index;Gather described detection The corresponding status information of index;According to described status information, using the corresponding detection of each described cluster environment state-detection index Script is detected, and judges the health status of cluster environment state according to testing result;According to described status information, usability Can detect that program and/or application performance detection program are tested, cluster health status are judged according to test result;
It can be seen that, the method can be by detecting cluster service state, hardware performance index, and it is right that the aspect such as application compatibility is come Cluster does omnibearing health status inspection;It is easy to technical staff and malfunction elimination is carried out to group system;The invention provides one Plant the system checking cluster health status and server, there is above-mentioned beneficial effect, will not be described here.
Brief description
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing Have technology description in required use accompanying drawing be briefly described it should be apparent that, drawings in the following description be only this Inventive embodiment, for those of ordinary skill in the art, on the premise of not paying creative work, can also basis The accompanying drawing providing obtains other accompanying drawings.
The flow chart of the method for the inspection cluster health status that Fig. 1 is provided by the embodiment of the present invention;
The structured flowchart of the system of the inspection cluster health status that Fig. 2 is provided by the embodiment of the present invention.
Specific embodiment
The core of the present invention is to provide a kind of method checking cluster health status, system and server, can be by inspection Survey cluster service state, hardware performance index, the aspect such as application compatibility cluster is done with omnibearing health status inspection;Just In technical staff, malfunction elimination is carried out to group system.
Purpose, technical scheme and advantage for making the embodiment of the present invention are clearer, below in conjunction with the embodiment of the present invention In accompanying drawing, the technical scheme in the embodiment of the present invention is clearly and completely described it is clear that described embodiment is The a part of embodiment of the present invention, rather than whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art The every other embodiment being obtained under the premise of not making creative work, broadly falls into the scope of protection of the invention.
Refer to Fig. 1, the flow chart of the method for the inspection cluster health status that Fig. 1 is provided by the embodiment of the present invention;Should Method can include:
S100, the Testing index of setting cluster health status, wherein, described Testing index includes equipment performance Testing index With cluster environment state-detection index;
Specifically, Testing index here will be set according to the actual demand of user, not to this monitoring index Particular content is defined, and user can also change according to the actual requirements, adaptation is carried out to Testing index;For example Increase, delete, the operation such as modification Testing index.
Here the point detecting health status is needed to be Testing index in High-Performance Computing Cluster to be analyzed, such as some basis clothes Business, such as:Whether NFS carry is normal, and whether NIS service is normal, and whether machine network is the state of UNICOM;Some machines for another example Performance is related, such as:Cpu performance, internal memory performance, network performance, application performance etc..
User can choose the point of the health status needing detection, to be configured, this configuration user by way of combination Can modify, and cluster health status inspection can be carried out when logging in every time in order to improve the effect of cluster health status Survey, further for being easy to technical staff, group system fault or timely understanding group system shape are excluded according to testing result State, can be to export testing result to user in the form of reporting.User can be for further processing according to the report of output.
Check the time of report further for saving technical staff, can be by some testing results with various specifically lively Form be indicated.The information of only output abnormality can also save user time further.
Due to can detect to system when starting shooting each time, user can sum up collection according to each testing result The long term state of group's system, so that user predicts in time or investigates group system fault according to historical data, can every time Monitoring result record in daily record so that for future reference.
S110, the collection corresponding status information of described Testing index;
Specifically, acquisition operations can be obtained by sending instruction to server OS each in cluster, including section Point title, CPU, internal memory, the status information of the index such as network, optionally, these status informations can be preserved to journal file.
S120, according to described status information, entered using the corresponding detection script of each described cluster environment state-detection index Row detection, and the health status of cluster environment state are judged according to testing result;
Specifically, by creating a series of scripts, health inspection is carried out to the cluster environment configuration of each node in cluster Survey, wherein can include ssh no cryptographic acess between node, NIS services, NFS services, nodal directory carry situation, and each section The script informations such as the consistency check of point configuration, optionally, these information and corresponding testing result are preserved to daily record literary composition Part.Can be determined that the health status of cluster environment state according to these testing results, specific decision rule can be according to user Demand carries out actual setting, and user can consider requirement to cluster environment state facilities and health status etc. to set Decision rule.
Here it is healthy and unhealthy it is also possible to cluster environment that result of determination can only comprise cluster environment state State demarcation Health Category.Optionally, these testing results can be preserved to journal file.
S130, according to described status information, utility detection program and/or application performance detection program are tested, Cluster health status are judged according to test result.
Specifically, performance detection program and application performance detection program user can select all to be detected here, also may be used Only to carry out one of which detection.And user can set performance detection program and application performance inspection according to the actual demand of itself The actual content of ranging sequence.Optionally, corresponding test result can be preserved to journal file.
Optionally, utility detection program is tested, and judges cluster health status according to test result, including:
Utility detection program carries out the test of single node benchmark;
When test result is less than performance detection threshold value, cluster health status are unhealthy;
When test result is not less than performance detection threshold value, cluster health status are health.
Specifically, single node benchmark detection, such as detects HPL (the High Performance of cpu performance Linpack, a kind of benchmark for measuring CPU floating-point operation performance), the STREAM of detection internal memory performance is one kind For measuring the benchmark of memory bandwidth performance, by the CPU collecting, memory information, can calculate The corresponding theoretical value of benchmark, defining a threshold value according to percentage ratio is performance detection threshold value, is typically set to 80% (this is an empirical value) is not defined to specific performance detection threshold value certainly here, and results of calculation is made with threshold value Contrast, higher than threshold value be by be cluster health status be health, less than not by be cluster health status be unhealthy, and Testing result can be shown.User can also carry out cluster health status grade and set that can to set different grades corresponding Threshold value.Here, when user only carries out performance detection, this testing result is cluster health status result, if user also need to into During the detection of row application performance, this result is the performance detection health status of cluster, and the health status of final cluster also need to consider The result of application performance detection.
Optionally, tested using application performance detection program, judged that cluster health status include according to test result:
Create the running environment of predetermined application;
In each running environment, little example calculating is carried out according to corresponding statess information, obtains test result;
When test result is less than application performance detection threshold value, cluster health status are unhealthy;
When test result is not less than application performance detection threshold value, cluster health status are health.
Specifically, according to different application types, create the running environment of typical case's application, provide little example to be calculated, And an empirical data is set for threshold value, judge by comparison threshold value whether cluster is examined by health when running application Survey.Here, when user only carries out application performance detection, this testing result is cluster health status result, if user also needs to When carrying out performance detection, this result is that the application performance of cluster detects health status, and the health status of final cluster also need to examine Consider the result of performance detection.
Wherein, test result here can be the synthesis when test result of single application or multiple application Test result.User can also carry out the setting of cluster health status grade and can set the corresponding threshold value of different grades.
If when two kinds of user detection is all carried out, cluster health status can be judged as being good for when every kind of detection be all health Health.Can also be that other decision rules are determined according to user configured detection content.
Further for improving cluster health status detection speed, can judge the health status of cluster environment state as When healthy, then execution step S130.
In the group system tentatively put up, implement the method, cluster health degree is checked, can be by configuring File is customizing detection content, general, carries out comprehensive health degree inspection, checks that after finishing, the method can be by testing result Export in journal file, and point out not pass through item, so that attendant discovers problems and solve them it is ensured that cluster is normally steady Fixed operation.But when S120 has been detected by mistake, the detection that the time of can saving no longer carries out step S130.
Wherein, the result that each step in S110 to S130 obtains can be shown to user, and user can be according to aobvious The result judgement shown is the need of the detection proceeding cluster monitoring state.And the process showing can make user more preferable Solution detection procedure.
Based on technique scheme, the method for inspection cluster health status provided in an embodiment of the present invention, collected by detection Group's service state, hardware performance index, the aspect such as application compatibility cluster is done with omnibearing health status inspection, simultaneously defeated Go out examining report, to solve the problems, such as the investigation of group system level fault.
Check that the system of cluster health status and cluster server are introduced to provided in an embodiment of the present invention below, under The system of inspection cluster health status of literary composition description and cluster server and the above-described method checking cluster health status Can be mutually to should refer to.
Refer to Fig. 2, the structured flowchart of the system of the inspection cluster health status that Fig. 2 is provided by the embodiment of the present invention; This system can include:
Setup module 100, for arranging the Testing index of cluster health status, wherein, described Testing index includes equipment Performance detection index and cluster environment state-detection index;
Acquisition module 200, for gathering the corresponding status information of described Testing index;
Cluster environment state detection module 300, for according to described status information, being examined using each described cluster environment state Survey the corresponding detection script of index to be detected, and judge the health status of cluster environment state according to testing result;
Cluster performance detection module 400, for according to described status information, utility detection program and/or application Can detect that program is tested, cluster health status are judged according to test result.
Based on above-described embodiment, this system also includes:
Preserving module, for preserving described status information and/or testing result and/or test result to journal file In.
Based on above-mentioned any embodiment, described cluster performance detection module 400, including:The test of single node benchmark is single Unit, carries out the test of single node benchmark for utility detection program;When test result is less than performance detection threshold value, Cluster health status are unhealthy;When test result is not less than performance detection threshold value, cluster health status are health.
Based on above-mentioned any embodiment, described cluster performance detection module 400, including:Application performance detector unit, is used for Create the running environment of predetermined application;In each running environment, little example calculating is carried out according to corresponding statess information, is tested Result;When test result is less than application performance detection threshold value, cluster health status are unhealthy;Answer when test result is not less than During with performance detection threshold value, cluster health status are health.
Based on above-mentioned any embodiment, this system also includes:
Display module, for entering described status information and/or testing result and/or test result and cluster health status Row display.
The embodiment of the present invention also provides a kind of cluster server, including:Inspection collection according to above-mentioned any embodiment The system of group's health status.
In description, each embodiment is described by the way of going forward one by one, and what each embodiment stressed is real with other Apply the difference of example, between each embodiment identical similar portion mutually referring to.For device disclosed in embodiment Speech, because it corresponds to the method disclosed in Example, so description is fairly simple, referring to method part illustration in place of correlation ?.
Professional further appreciates that, in conjunction with the unit of each example of the embodiments described herein description And algorithm steps, can with electronic hardware, computer software or the two be implemented in combination in, in order to clearly demonstrate hardware and The interchangeability of software, generally describes composition and the step of each example in the above description according to function.These Function to be executed with hardware or software mode actually, the application-specific depending on technical scheme and design constraint.Specialty Technical staff can use different methods to each specific application realize described function, but this realization should Think beyond the scope of this invention.
The step of the method in conjunction with the embodiments described herein description or algorithm can directly be held with hardware, processor The software module of row, or the combination of the two is implementing.Software module can be placed in random access memory (RAM), internal memory, read-only deposit Reservoir (ROM), electrically programmable ROM, electrically erasable ROM, depositor, hard disk, moveable magnetic disc, CD-ROM or technology In known any other form of storage medium in field.
Above the method checking cluster health status provided by the present invention, system and cluster server are carried out in detail Introduce.Specific case used herein is set forth to the principle of the present invention and embodiment, the explanation of above example It is only intended to help and understand the method for the present invention and its core concept.It should be pointed out that the ordinary skill people for the art Member for, under the premise without departing from the principles of the invention, the present invention can also be carried out some improve and modify, these improve and Modify and also fall in the protection domain of the claims in the present invention.

Claims (10)

1. a kind of method checking cluster health status is it is characterised in that include:
The Testing index of setting cluster health status, wherein, described Testing index includes equipment performance Testing index and collection group rings Border state-detection index;
Gather the corresponding status information of described Testing index;
According to described status information, detected using the corresponding detection script of each described cluster environment state-detection index, and Judge the health status of cluster environment state according to testing result;
According to described status information, utility detection program and/or application performance detection program are tested, according to test knot Fruit judges cluster health status.
2. method according to claim 1 is it is characterised in that according to described status information, utility detection program and/ Or application performance detection program tested, according to test result, including:
When judging the health status of cluster environment state as health, according to described status information, utility detection program and/ Or application performance detects that program is tested, according to test result, judge cluster health status.
3. method according to claim 2 is it is characterised in that also include:
Described status information and/or testing result and/or test result are preserved to journal file.
4. the method according to any one of claim 1-3 it is characterised in that utility detection program tested, root Judge cluster health status according to test result, including:
Utility detection program carries out the test of single node benchmark;
When test result is less than performance detection threshold value, cluster health status are unhealthy;
When test result is not less than performance detection threshold value, cluster health status are health.
5. the method according to any one of claim 1-3 is it is characterised in that surveyed using application performance detection program According to test result, examination, judges that cluster health status include:
Create the running environment of predetermined application;
In each running environment, little example calculating is carried out according to corresponding statess information, obtains test result;
When test result is less than application performance detection threshold value, cluster health status are unhealthy;
When test result is not less than application performance detection threshold value, cluster health status are health.
6. a kind of system checking cluster health status is it is characterised in that include:
Setup module, for arranging the Testing index of cluster health status, wherein, described Testing index includes equipment performance detection Index and cluster environment state-detection index;
Acquisition module, for gathering the corresponding status information of described Testing index;
Cluster environment state detection module, for according to described status information, using each described cluster environment state-detection index Corresponding detection script is detected, and judges the health status of cluster environment state according to testing result;
Cluster performance detection module, for according to described status information, utility detection program and/or application performance detect journey Sequence is tested, and judges cluster health status according to test result.
7. system according to claim 6 is it is characterised in that also include:
Preserving module, for preserving described status information and/or testing result and/or test result to journal file.
8. the system according to claim 6 or 7 is it is characterised in that described cluster performance detection module, including:Single node Benchmark test cell, carries out the test of single node benchmark for utility detection program;When test result is less than During performance detection threshold value, cluster health status are unhealthy;When test result is not less than performance detection threshold value, cluster health shape State is health.
9. the system according to claim 6 or 7 is it is characterised in that described cluster performance detection module, including:Application Energy detector unit, for creating the running environment of predetermined application;In each running environment, little calculation is carried out according to corresponding statess information Example calculates, and obtains test result;When test result is less than application performance detection threshold value, cluster health status are unhealthy;When When test result is not less than application performance detection threshold value, cluster health status are health.
10. a kind of cluster server is it is characterised in that include:Inspection cluster health according to any one of claim 6-9 The system of state.
CN201610822574.6A 2016-09-13 2016-09-13 Method and system for inspecting cluster health status and cluster server Pending CN106445754A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610822574.6A CN106445754A (en) 2016-09-13 2016-09-13 Method and system for inspecting cluster health status and cluster server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610822574.6A CN106445754A (en) 2016-09-13 2016-09-13 Method and system for inspecting cluster health status and cluster server

Publications (1)

Publication Number Publication Date
CN106445754A true CN106445754A (en) 2017-02-22

Family

ID=58167848

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610822574.6A Pending CN106445754A (en) 2016-09-13 2016-09-13 Method and system for inspecting cluster health status and cluster server

Country Status (1)

Country Link
CN (1) CN106445754A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107391330A (en) * 2017-07-24 2017-11-24 郑州云海信息技术有限公司 Computing power method of testing and system under a kind of Itanium platform
CN107608857A (en) * 2017-09-25 2018-01-19 郑州云海信息技术有限公司 A kind of SAN storage health status inspection method, device and readable storage medium storing program for executing
CN108616421A (en) * 2018-04-13 2018-10-02 郑州云海信息技术有限公司 A kind of condition detection method of multi-node cluster, device and equipment
CN110032486A (en) * 2019-03-06 2019-07-19 平安科技(深圳)有限公司 Server test method, device, computer equipment and storage medium
CN110290012A (en) * 2019-07-03 2019-09-27 浪潮云信息技术有限公司 The detection recovery system and method for RabbitMQ clustering fault
CN110737560A (en) * 2019-10-22 2020-01-31 北京百度网讯科技有限公司 service state detection method, device, electronic equipment and medium
CN110798336A (en) * 2019-09-25 2020-02-14 苏州浪潮智能科技有限公司 Method and device for environmental inspection of large data platform deployment server
CN111400117A (en) * 2020-03-12 2020-07-10 山东汇贸电子口岸有限公司 Method for automatically testing Ceph cluster
CN112783745A (en) * 2021-02-02 2021-05-11 无锡车联天下信息技术有限公司 Cluster data monitoring method, device, system and storage medium
CN114138642A (en) * 2021-11-26 2022-03-04 苏州浪潮智能科技有限公司 Method, device and equipment for automatically selecting adaptive function according to environment state

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6701463B1 (en) * 2000-09-05 2004-03-02 Motorola, Inc. Host specific monitor script for networked computer clusters
CN101373447A (en) * 2008-08-20 2009-02-25 上海超级计算中心 System and method for detecting health degree of computer cluster
CN103294579A (en) * 2013-06-09 2013-09-11 浪潮电子信息产业股份有限公司 Method for testing high-performance computing cluster application performance
CN103746829A (en) * 2013-12-20 2014-04-23 中国科学院计算技术研究所 Cluster-based fault perception system and method thereof
CN104954189A (en) * 2015-07-07 2015-09-30 上海斐讯数据通信技术有限公司 Automatic server cluster detecting method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6701463B1 (en) * 2000-09-05 2004-03-02 Motorola, Inc. Host specific monitor script for networked computer clusters
CN101373447A (en) * 2008-08-20 2009-02-25 上海超级计算中心 System and method for detecting health degree of computer cluster
CN103294579A (en) * 2013-06-09 2013-09-11 浪潮电子信息产业股份有限公司 Method for testing high-performance computing cluster application performance
CN103746829A (en) * 2013-12-20 2014-04-23 中国科学院计算技术研究所 Cluster-based fault perception system and method thereof
CN104954189A (en) * 2015-07-07 2015-09-30 上海斐讯数据通信技术有限公司 Automatic server cluster detecting method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
杨晓云,王建桥译: "《Linux自学通》", 31 August 1998, 北京:机械工业出版社;西蒙与舒斯特国际出版公司 *
魏红,曾忠平: "《Red Hat Linux实用宝典》", 31 May 2008, 北京:中国铁道出版社 *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107391330B (en) * 2017-07-24 2020-10-20 苏州浪潮智能科技有限公司 Method and system for testing computer performance under Itanium platform
CN107391330A (en) * 2017-07-24 2017-11-24 郑州云海信息技术有限公司 Computing power method of testing and system under a kind of Itanium platform
CN107608857A (en) * 2017-09-25 2018-01-19 郑州云海信息技术有限公司 A kind of SAN storage health status inspection method, device and readable storage medium storing program for executing
CN108616421A (en) * 2018-04-13 2018-10-02 郑州云海信息技术有限公司 A kind of condition detection method of multi-node cluster, device and equipment
CN110032486A (en) * 2019-03-06 2019-07-19 平安科技(深圳)有限公司 Server test method, device, computer equipment and storage medium
CN110032486B (en) * 2019-03-06 2022-08-09 平安科技(深圳)有限公司 Server testing method and device, computer equipment and storage medium
CN110290012A (en) * 2019-07-03 2019-09-27 浪潮云信息技术有限公司 The detection recovery system and method for RabbitMQ clustering fault
CN110798336A (en) * 2019-09-25 2020-02-14 苏州浪潮智能科技有限公司 Method and device for environmental inspection of large data platform deployment server
CN110737560A (en) * 2019-10-22 2020-01-31 北京百度网讯科技有限公司 service state detection method, device, electronic equipment and medium
CN110737560B (en) * 2019-10-22 2023-10-20 北京百度网讯科技有限公司 Service state detection method and device, electronic equipment and medium
CN111400117A (en) * 2020-03-12 2020-07-10 山东汇贸电子口岸有限公司 Method for automatically testing Ceph cluster
CN111400117B (en) * 2020-03-12 2023-07-11 山东汇贸电子口岸有限公司 Method for automatically testing Ceph cluster
CN112783745A (en) * 2021-02-02 2021-05-11 无锡车联天下信息技术有限公司 Cluster data monitoring method, device, system and storage medium
CN114138642A (en) * 2021-11-26 2022-03-04 苏州浪潮智能科技有限公司 Method, device and equipment for automatically selecting adaptive function according to environment state
CN114138642B (en) * 2021-11-26 2023-08-29 苏州浪潮智能科技有限公司 Method, device and equipment for automatically selecting adaptation function according to environment state

Similar Documents

Publication Publication Date Title
CN106445754A (en) Method and system for inspecting cluster health status and cluster server
US9588834B1 (en) Methods and apparatus for improved fault analysis
CN109586952A (en) Method of server expansion, device
CN104375912B (en) The measuring method and device of mobile terminal interim card
CN104346221B (en) Server hardware device grade classification, schedule management method and device, server
CN107678908B (en) Log recording method and device, computer equipment and storage medium
CN103262048A (en) Operation management device, operation management method, and program
US10402298B2 (en) System and method for comprehensive performance and availability tracking using passive monitoring and intelligent synthetic transaction generation in a transaction processing system
CN105302697B (en) A kind of running state monitoring method and system of density data model database
US20160259714A1 (en) Production sampling for determining code coverage
CN108347352A (en) The diagnostic method of information system and equipment performance in a kind of electric system
CN108809760A (en) The control method and device in sampling period in sampled-data system
CN111242430A (en) Power equipment supplier evaluation method and device
CN104112003B (en) The method and system that the performance of game terminal is detected
CN109858097A (en) A kind of spacecraft single machine test assessment methods of sampling
US20070168751A1 (en) Quantitative measurement of the autonomic capabilities of computing systems
CN113515402A (en) Fault information classification method and device for engineering equipment and engineering equipment
CN106878109A (en) Server detection method and server system
US20100131497A1 (en) Method for determining which of a number of test cases should be run during testing
CN112433908B (en) Method, system, device and medium for determining interval time of detection server
CN107018039A (en) The method and apparatus of test server clustering performance bottleneck
CN106155866A (en) A kind of method and device of monitoring CPU core frequency
CN110232026A (en) AssetBundle resource detection method and system
CN113033845B (en) Construction method and device for power transmission resource co-construction and sharing
CN109144816A (en) A kind of node health degree detection method and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170222

RJ01 Rejection of invention patent application after publication