CN111984498A - Server cluster monitoring and management system - Google Patents
Server cluster monitoring and management system Download PDFInfo
- Publication number
- CN111984498A CN111984498A CN202010725476.7A CN202010725476A CN111984498A CN 111984498 A CN111984498 A CN 111984498A CN 202010725476 A CN202010725476 A CN 202010725476A CN 111984498 A CN111984498 A CN 111984498A
- Authority
- CN
- China
- Prior art keywords
- server
- information
- management system
- server cluster
- management
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012544 monitoring process Methods 0.000 title claims abstract description 63
- 239000000758 substrate Substances 0.000 claims abstract description 23
- 238000004458 analytical method Methods 0.000 claims abstract description 7
- 238000000034 method Methods 0.000 claims description 11
- 238000004519 manufacturing process Methods 0.000 claims description 5
- 238000012545 processing Methods 0.000 claims description 5
- 238000013480 data collection Methods 0.000 claims description 2
- 238000012423 maintenance Methods 0.000 abstract description 15
- 238000001514 detection method Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000007405 data analysis Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000013079 data visualisation Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3058—Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/16—Constructional details or arrangements
- G06F1/20—Cooling means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3065—Monitoring arrangements determined by the means or processing involved in reporting the monitored data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/324—Display of status information
- G06F11/325—Display of status information by lamps or LED's
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Human Computer Interaction (AREA)
- Computing Systems (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention provides a server cluster monitoring and management system, which comprises: a data acquisition module: a temperature sensor, a voltage sensor, a current sensor, an E2PROM and a substrate management controller are arranged in a server, and corresponding data information is collected; a fault warning module: judging the collected data information, and when the data information exceeds a preset threshold value, carrying out fault warning by the substrate management controller through a preset judging program; a log management module: and recording the operation of the baseboard management controller and the generated fault information, forming a log, periodically uploading the log to a server cluster management system, and performing data backup and analysis. The invention realizes the centralized management of the server cluster through the server cluster monitoring and management, monitors the hardware states of all the servers in real time, realizes the graphical display of the health states of the server cluster according to real-time data, and greatly improves the operation and maintenance efficiency of the servers.
Description
Technical Field
The invention relates to the technical field of management control, in particular to a server cluster monitoring and management system. And more particularly, to an out-of-band monitoring and intelligent management system for cabinets.
Background
At present, with the vigorous development of emerging technologies such as cloud computing, containers and the internet of things, enterprises have higher and higher requirements on the performance of servers, the quantity of the requirements on computing nodes in the servers is higher and higher, the clustering degree of the servers is higher and higher, and the difficult problems in the aspect of server operation and maintenance are brought. Meanwhile, the traditional server operation and maintenance does not have a unified server operation and maintenance management system, and the state of each server cannot be monitored in real time and adjusted in time. Aiming at the problems, the invention discloses a BMC-based server cluster monitoring and management system which can monitor physical data of a server in real time, make adjustment, realize quick positioning of faults and store the data of the server in operation so as to provide data for inquiring the functional performance of the server.
Patent document CN110611669A (application number: 201910860860.5) discloses an intelligent operation and maintenance management system based on internet of things, which includes a server, a GPS module installed inside a monitoring case for positioning, a temperature and humidity sensor for detecting the temperature and humidity of the environment inside the monitoring case, a current sensor for detecting current, a voltage sensor for detecting voltage, a water sensor for detecting whether the inside of the monitoring case is soaked in water, and an equipment state detection module for detecting the state of working equipment inside the monitoring case, wherein a data acquisition module for collecting detection data is arranged inside the monitoring case, and the data acquisition module sends the detection data to the server through a wireless communication module. The patent has no quick fault location, only completes the data acquisition of each server, does not perform the fault location of the server, and still has difficulty in the operation and maintenance of the server. Lack of management of server clusters: the server cluster monitoring is not mentioned, and the requirement of a user on the server cluster monitoring cannot be met, and the scheme provided by the patent comprises a server cluster monitoring and managing module and meets the requirement of the user.
Patent document CN110224879A (application number: 201910630563.1) discloses a node configuration method for a whole rack server, including: the CMC obtains a target configuration information file; sending the target configuration information file to the BMC of the target computing node to configure the target computing node; the CMC is connected with the BMC of the computing nodes in the whole cabinet server, and the computing nodes comprise target computing nodes. The patent has fewer functional modules: the method only realizes the quick configuration of the server BMC and belongs to one branch of server management. Lack of management of server clusters: the server cluster monitoring is not mentioned, and the requirement of a user on the server cluster monitoring cannot be met, and the scheme provided by the patent comprises a server cluster monitoring and managing module and meets the requirement of the user.
Disclosure of Invention
Aiming at the defects in the prior art, the invention aims to provide a server cluster monitoring and management system.
The server cluster monitoring and management system provided by the invention comprises:
a data acquisition module: a temperature sensor, a voltage sensor, a current sensor, an E2PROM and a substrate management controller are arranged in a server, and corresponding data information is collected;
a fault warning module: judging the collected data information, and when the data information exceeds a preset threshold value, carrying out fault warning by the substrate management controller through a preset judging program;
a log management module: and recording the operation of the baseboard management controller and the generated fault information, forming a log, periodically uploading the log to a server cluster monitoring and management system, and performing data backup and analysis.
Preferably, the method further comprises the following steps:
a single server display module: a baseboard management controller of the server provides an independent server state control interface outwards through a network to control a single server.
Preferably, the method further comprises the following steps:
a server management module: the collected data information, fault information and log information are managed and controlled uniformly, and the functions of server information real-time monitoring, fault alarm quick positioning and log information analysis and processing are provided for the outside.
Preferably, the fault warning module includes: lightening a fault warning lamp of the server, reporting fault warning information to a server cluster monitoring and management system, and positioning the fault through the server;
when the substrate management controller generates fault alarm cancellation information, the fault alarm cancellation information is reported to the server cluster monitoring and management system, whether other fault alarms exist in the server or not is judged, and if no other fault alarms exist, the fault alarm lamp of the server is turned off.
Preferably, the temperature sensor, the voltage sensor, the current sensor and the baseboard management controller transmit data through an I2C bus;
the E2PROM and the substrate management controller are communicated through an RS232 serial port;
the baseboard management controller and the external device communicate with each other through a network.
Preferably, the baseboard management controllers of all the servers collect all the information in the unit through the I2C bus and the RS232 serial port, transmit the information to the management system through the network, and monitor the information of all the servers through the management system.
Preferably, the management system accesses the baseboard management controllers of all the servers through the network, and performs on-off and restart on the servers through the baseboard management controllers of the servers, and updates the baseboard management control managers.
Preferably, the temperature sensor acquires temperature information of the server, the voltage sensor acquires voltage information of the server, and the current sensor acquires current information of the server;
the method comprises the steps that a baseboard management controller obtains data of each sensor in a server;
the substrate management controller is connected with the E2PROM to obtain server information stored in the E2PROM, wherein the server information comprises a manufacturer, a production date and a model.
Preferably, the baseboard management controller is connected to the fan of the server through PWM to control the rotating speed and monitor the state of the fan; and the baseboard management controller acquires the server state information through the GPIO.
Preferably, the baseboard management controller of each server has a network address, and the information of each server is uploaded to the server cluster monitoring and management system through the network according to the network address, so that the server cluster monitoring and management system collects and monitors real-time data of all the servers.
Compared with the prior art, the invention has the following beneficial effects:
1. the invention realizes the centralized management of the server cluster through the server cluster monitoring and management, monitors the hardware states of all the servers in real time, realizes the graphical display of the health states of the server cluster according to real-time data, and can greatly improve the operation and maintenance efficiency of the servers;
2. the invention rapidly positions the server fault: by the aid of the fault warning lamp of the server and the instant and active uploading of fault warning information, the fault of the server can be accurately and quickly positioned, and the maintenance efficiency of server hardware is improved;
3. the invention intelligently analyzes the server data: through long-time data collection and analysis, the running state of the server can be evaluated and intelligently researched and judged, and the running efficiency of the server is improved; the real-time data monitoring can monitor the voltage, the current and the temperature of all modules in real time, and monitor information such as manufacturer, production date, model and the like of the server;
4. the intelligent control is carried out, the rotating speed of the fan can be intelligently adjusted according to the temperature, and the remote server startup and shutdown and restart functions are realized; and the system is used for uniformly monitoring, so that the state of the server cluster can be monitored, a corresponding state diagram is generated, and the health condition of the server cluster is displayed.
Drawings
Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments with reference to the following drawings:
FIG. 1 is a schematic diagram of a server cluster monitoring and management system;
FIG. 2 is a flow chart of information collection for a server cluster monitoring and management system;
FIG. 3 is a flow chart of server cluster monitoring and management system fault alarm;
FIG. 4 is a flow chart of a server cluster monitoring and management system canceling a fault alarm
FIG. 5 is a flow chart of server cluster monitoring and management system log collection;
fig. 6 is a schematic diagram of a server cluster structure.
Detailed Description
The present invention will be described in detail with reference to specific examples. The following examples will assist those skilled in the art in further understanding the invention, but are not intended to limit the invention in any way. It should be noted that it would be obvious to those skilled in the art that various changes and modifications can be made without departing from the spirit of the invention. All falling within the scope of the present invention.
Example 1:
as shown in fig. 1, the server cluster monitoring and management system provided by the present invention includes:
a data acquisition module: a temperature sensor, a voltage sensor, a current sensor, an E2PROM and a substrate management controller are arranged at corresponding positions in each server, as shown in figure 2;
data are transmitted among the temperature sensor, the voltage sensor, the current sensor and the substrate controller through an I2C bus; the E2PROM and the substrate controller are communicated through an RS232 serial port; the baseboard management controller communicates with the outside through a network.
Monitoring: all information in the baseboard management controllers of all servers are collected through an I2C bus and an RS232 serial port and transmitted to the management system through the network, and the management system can monitor the information of all servers.
Management: the management system can access the substrate controllers of all the servers through a network, and can perform functions of starting up, shutting down and restarting the servers through the substrate controllers of the servers, updating the firmware of the substrate control manager and the like.
The temperature sensor acquires temperature information of the server, the voltage sensor acquires voltage information of the server, and the current sensor acquires current information of the server.
The baseboard management controller can acquire data of each sensor in the server;
the substrate management controller is connected with the E2PROM to obtain server information stored in the substrate management controller, such as manufacturers, production dates, models and the like; the substrate management controller can be connected to the fan of the server through PWM to realize the rotation speed control and the state monitoring of the fan; and the substrate controller acquires the server state information through the GPIO.
The baseboard management controller of each server has a network address, and the information of each server is uploaded to the server cluster management system through the network in a unified manner, so that the server cluster monitoring and management system collects and monitors the real-time data of all the servers.
A fault warning module: the baseboard management controllers of the servers collect information such as temperature, voltage, current and fan rotating speed to judge, judgment programs are built in the baseboard management controllers, when the information such as the temperature, the voltage, the current and the fan state exceeds set threshold values, the baseboard management controllers can generate fault alarms through the judgment programs, the fault alarm lamps of the servers are lightened, the fault alarm information is reported to the server cluster management system timely and actively, the rapid hardware fault location of the server system is achieved, and support is provided for operation and maintenance of the servers, and the method is shown in figure 3.
When the baseboard management controller generates the failure alarm cancellation information, the failure alarm cancellation information is also reported to the server cluster monitoring and management system instantly and actively, and whether the server has other failure alarms or not is judged, and the failure alarm lamp of the server is turned off if the server has no failure alarms, as shown in fig. 4.
A log management module: the baseboard management controller of the server can record the operation (power on and power off, fan speed regulation, module replacement and the like) of the server and the generated fault information of a user to form a log, and can periodically upload the log to the server cluster management system to provide data for follow-up operations such as data analysis and fault tracing.
A single server display module: a substrate management controller of the server provides an independent server state control interface outwards through a network, and accurate and rapid control of a single server is achieved.
A server management module: the system provides a uniform server management module, performs uniform management and control on the acquired information, fault information and log information of each server, and provides functions of server information real-time monitoring, fault alarm rapid positioning, log information analysis and processing and the like for the outside. And providing strong support for the operation and maintenance of the server cluster, as shown in FIG. 5.
Example 2:
the server cluster monitoring and management system provided by the invention comprises:
as shown in fig. 6, a plurality of cabinets are arranged in the server cluster, a plurality of servers are arranged in each cabinet, each server is provided with a baseboard management controller, and the BMC is connected to the switch through a network and further connected to the server operation and maintenance terminal, so that the BMC connection of the server operation and maintenance terminal to each server is realized.
The BMC of the server realizes data acquisition, fault alarm, log collection and server state display in the server; and then, the server cluster monitoring and management system in the server operation and maintenance terminal collects data in each server through a network, stores the data in a database, performs data visualization processing, and displays the hardware health state of the server cluster in a centralized manner.
The data acquisition module acquires temperature information, voltage information and current information of the server through the sensor; server information such as manufacturer, production date, model and the like stored in the server E2PROM is obtained through accessing the server E2 PROM; the method comprises the steps that a fan connected to a server through PWM obtains fan state and rotating speed information; and then the data is uploaded to a server operation and maintenance terminal through a network.
The fault alarm module judges the acquired data, and when the information such as temperature, voltage, current, fan state and the like exceeds a set threshold value, a fault alarm can be generated through a judgment program, a fault alarm lamp of the server is lightened, and the fault alarm information is immediately and actively reported to the server cluster management system. And judging according to the acquired data, when the fault alarm cancellation information is generated, immediately and actively reporting the fault alarm cancellation information to a server cluster monitoring and management system, judging whether the server has other fault alarms or not, and turning off the fault alarm lamp of the server if the server does not have other fault alarms.
The log management module forms logs by recording operation information (power on/off, fan speed regulation, module replacement and the like) and fault information of the server, and periodically uploads the logs to the server cluster management system.
A server management module: the server cluster monitoring and management system is deployed in the server operation and maintenance terminal, unified management and control is performed on the collected information, fault information and log information of the server, and functions of server information real-time monitoring, fault alarm rapid positioning, log information analysis and processing and the like are provided for the outside. And the visual display of the health state of the server cluster and the quick positioning of the fault information are realized.
Those skilled in the art will appreciate that, in addition to implementing the systems, apparatus, and various modules thereof provided by the present invention in purely computer readable program code, the same procedures can be implemented entirely by logically programming method steps such that the systems, apparatus, and various modules thereof are provided in the form of logic gates, switches, application specific integrated circuits, programmable logic controllers, embedded microcontrollers and the like. Therefore, the system, the device and the modules thereof provided by the present invention can be considered as a hardware component, and the modules included in the system, the device and the modules thereof for implementing various programs can also be considered as structures in the hardware component; modules for performing various functions may also be considered to be both software programs for performing the methods and structures within hardware components.
The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes or modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention. The embodiments and features of the embodiments of the present application may be combined with each other arbitrarily without conflict.
Claims (10)
1. A server cluster monitoring and management system, comprising:
a data acquisition module: a temperature sensor, a voltage sensor, a current sensor, an E2PROM and a substrate management controller are arranged in a server, and corresponding data information is collected;
a fault warning module: judging the collected data information, and when the data information exceeds a preset threshold value, carrying out fault warning by the substrate management controller through a preset judging program;
a log management module: and recording the operation of the baseboard management controller and the generated fault information, forming a log, periodically uploading the log to a server cluster monitoring and management system, and performing data backup and analysis.
2. The server cluster monitoring and management system of claim 1, further comprising:
a single server display module: a baseboard management controller of the server provides an independent server state control interface outwards through a network to control a single server.
3. The server cluster monitoring and management system of claim 1, further comprising:
a server management module: the collected data information, fault information and log information are managed and controlled uniformly, and the functions of server information real-time monitoring, fault alarm quick positioning and log information analysis and processing are provided for the outside.
4. The server cluster monitoring and management system of claim 1, wherein the fault alert module comprises: lightening a fault warning lamp of the server, reporting fault warning information to a server cluster monitoring and management system, and positioning the fault through the server;
when the substrate management controller generates fault alarm cancellation information, the fault alarm cancellation information is reported to the server cluster monitoring and management system, whether other fault alarms exist in the server or not is judged, and if no other fault alarms exist, the fault alarm lamp of the server is turned off.
5. The server cluster monitoring and management system of claim 1, wherein the temperature sensors, voltage sensors, current sensors and baseboard management controllers communicate data therebetween via an I2C bus;
the E2PROM and the substrate management controller are communicated through an RS232 serial port;
the baseboard management controller and the external device communicate with each other through a network.
6. The server cluster monitoring and management system of claim 1, wherein the baseboard management controllers of all servers collect all information in the unit through I2C bus and RS232 serial port, transmit to the management system through network, and monitor all server information through the management system.
7. The server cluster monitoring and management system of claim 1, wherein the management system accesses the baseboard management controllers of all servers through a network, and performs the startup, shutdown and restart of the servers through the baseboard management controllers of the servers, and performs the update of the baseboard management control managers.
8. The server cluster monitoring and management system of claim 1, wherein a temperature sensor obtains temperature information of the server, a voltage sensor obtains voltage information of the server, and a current sensor obtains current information of the server;
the method comprises the steps that a baseboard management controller obtains data of each sensor in a server;
the substrate management controller is connected with the E2PROM to obtain server information stored in the E2PROM, wherein the server information comprises a manufacturer, a production date and a model.
9. The server cluster monitoring and management system of claim 1, wherein the baseboard management controller is connected to the fans of the servers through PWM to perform speed control and status monitoring of the fans; and the baseboard management controller acquires the server state information through the GPIO.
10. The server cluster monitoring and management system of claim 1, wherein the baseboard management controller of each server has a network address, and information of each server is uploaded to the server cluster monitoring and management system through a network according to the network address, so as to realize real-time data collection and monitoring of all servers by the server cluster monitoring and management system.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010725476.7A CN111984498B (en) | 2020-07-24 | 2020-07-24 | Server cluster monitoring and management system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010725476.7A CN111984498B (en) | 2020-07-24 | 2020-07-24 | Server cluster monitoring and management system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111984498A true CN111984498A (en) | 2020-11-24 |
CN111984498B CN111984498B (en) | 2024-08-13 |
Family
ID=73438860
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010725476.7A Active CN111984498B (en) | 2020-07-24 | 2020-07-24 | Server cluster monitoring and management system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111984498B (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112988505A (en) * | 2021-02-08 | 2021-06-18 | 深圳阿帕云计算有限公司 | Cloud computing-based general real-time big data monitoring and early warning system |
CN113014667A (en) * | 2021-03-18 | 2021-06-22 | 山东英信计算机技术有限公司 | Configuration method, device and equipment of rack server and storage medium |
CN113032218A (en) * | 2021-03-26 | 2021-06-25 | 山东英信计算机技术有限公司 | Server fault detection method, system and computer readable storage medium |
CN113051137A (en) * | 2021-04-22 | 2021-06-29 | 北京计算机技术及应用研究所 | Design method of extensible server remote health management system |
CN113448811A (en) * | 2021-05-31 | 2021-09-28 | 山东英信计算机技术有限公司 | Method, device, equipment and readable medium for lighting fault lamp of server system |
CN113505165A (en) * | 2021-07-05 | 2021-10-15 | 曙光信息产业(北京)有限公司 | Sensor data processing method and device, computer equipment and storage medium |
CN113535471A (en) * | 2021-06-28 | 2021-10-22 | 浙江大华技术股份有限公司 | Cluster server |
CN113590413A (en) * | 2021-06-29 | 2021-11-02 | 浪潮商用机器有限公司 | UNIX server, and UNIX server fault early warning method and device |
CN113805950A (en) * | 2021-08-30 | 2021-12-17 | 超越科技股份有限公司 | Method for managing server by cluster management system |
CN113852519A (en) * | 2021-11-26 | 2021-12-28 | 北京国电通网络技术有限公司 | Server working state monitoring equipment, method, device and computer readable medium |
CN114398229A (en) * | 2021-12-30 | 2022-04-26 | 中国舰船研究院 | Server cabinet management system and information query method |
CN117349128A (en) * | 2023-12-05 | 2024-01-05 | 杭州沃趣科技股份有限公司 | Fault monitoring method, device and equipment of server cluster and storage medium |
WO2024119843A1 (en) * | 2022-12-05 | 2024-06-13 | 苏州元脑智能科技有限公司 | Data acquisition method and apparatus and computer device |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070168049A1 (en) * | 2006-01-13 | 2007-07-19 | Dell Products L.P. | System and method for the automated generation of events within a server environment |
CN103236949A (en) * | 2013-04-27 | 2013-08-07 | 北京搜狐新媒体信息技术有限公司 | Monitoring method, device and system for server cluster |
CN106844162A (en) * | 2017-02-25 | 2017-06-13 | 郑州云海信息技术有限公司 | Storage server cabinet management system and method based on BMC |
CN108536066A (en) * | 2018-04-24 | 2018-09-14 | 华东理工大学 | A kind of intelligent terminal control system towards laboratory safety |
CN109116812A (en) * | 2017-06-22 | 2019-01-01 | 上海智建电子工程有限公司 | Intelligent power distribution cabinet, energy conserving system and method based on SparkStreaming |
-
2020
- 2020-07-24 CN CN202010725476.7A patent/CN111984498B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070168049A1 (en) * | 2006-01-13 | 2007-07-19 | Dell Products L.P. | System and method for the automated generation of events within a server environment |
CN103236949A (en) * | 2013-04-27 | 2013-08-07 | 北京搜狐新媒体信息技术有限公司 | Monitoring method, device and system for server cluster |
CN106844162A (en) * | 2017-02-25 | 2017-06-13 | 郑州云海信息技术有限公司 | Storage server cabinet management system and method based on BMC |
CN109116812A (en) * | 2017-06-22 | 2019-01-01 | 上海智建电子工程有限公司 | Intelligent power distribution cabinet, energy conserving system and method based on SparkStreaming |
CN108536066A (en) * | 2018-04-24 | 2018-09-14 | 华东理工大学 | A kind of intelligent terminal control system towards laboratory safety |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112988505A (en) * | 2021-02-08 | 2021-06-18 | 深圳阿帕云计算有限公司 | Cloud computing-based general real-time big data monitoring and early warning system |
CN113014667A (en) * | 2021-03-18 | 2021-06-22 | 山东英信计算机技术有限公司 | Configuration method, device and equipment of rack server and storage medium |
CN113032218B (en) * | 2021-03-26 | 2022-07-29 | 山东英信计算机技术有限公司 | Server fault detection method, system and computer readable storage medium |
CN113032218A (en) * | 2021-03-26 | 2021-06-25 | 山东英信计算机技术有限公司 | Server fault detection method, system and computer readable storage medium |
CN113051137A (en) * | 2021-04-22 | 2021-06-29 | 北京计算机技术及应用研究所 | Design method of extensible server remote health management system |
CN113051137B (en) * | 2021-04-22 | 2024-03-26 | 北京计算机技术及应用研究所 | Design method of extensible server remote health management system |
CN113448811B (en) * | 2021-05-31 | 2023-03-14 | 山东英信计算机技术有限公司 | Method, device, equipment and readable medium for lighting fault lamp of server system |
CN113448811A (en) * | 2021-05-31 | 2021-09-28 | 山东英信计算机技术有限公司 | Method, device, equipment and readable medium for lighting fault lamp of server system |
CN113535471A (en) * | 2021-06-28 | 2021-10-22 | 浙江大华技术股份有限公司 | Cluster server |
CN113590413A (en) * | 2021-06-29 | 2021-11-02 | 浪潮商用机器有限公司 | UNIX server, and UNIX server fault early warning method and device |
CN113590413B (en) * | 2021-06-29 | 2024-05-10 | 浪潮商用机器有限公司 | UNIX server, and UNIX server fault early warning method and device |
CN113505165A (en) * | 2021-07-05 | 2021-10-15 | 曙光信息产业(北京)有限公司 | Sensor data processing method and device, computer equipment and storage medium |
CN113805950A (en) * | 2021-08-30 | 2021-12-17 | 超越科技股份有限公司 | Method for managing server by cluster management system |
CN113852519A (en) * | 2021-11-26 | 2021-12-28 | 北京国电通网络技术有限公司 | Server working state monitoring equipment, method, device and computer readable medium |
CN114398229A (en) * | 2021-12-30 | 2022-04-26 | 中国舰船研究院 | Server cabinet management system and information query method |
WO2024119843A1 (en) * | 2022-12-05 | 2024-06-13 | 苏州元脑智能科技有限公司 | Data acquisition method and apparatus and computer device |
CN117349128A (en) * | 2023-12-05 | 2024-01-05 | 杭州沃趣科技股份有限公司 | Fault monitoring method, device and equipment of server cluster and storage medium |
CN117349128B (en) * | 2023-12-05 | 2024-03-22 | 杭州沃趣科技股份有限公司 | Fault monitoring method, device and equipment of server cluster and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN111984498B (en) | 2024-08-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111984498B (en) | Server cluster monitoring and management system | |
US10539932B2 (en) | Machine diagnostics based on overall system energy state | |
CN104052631A (en) | Equipment information collecting method, device and system | |
CN102571441B (en) | Whole machine cabinet intelligent management, system and device | |
CN102495785A (en) | Centralized management method and device for servers of whole equipment cabinet | |
CN110851322A (en) | Hardware equipment abnormity monitoring method, server and computer readable storage medium | |
CN111158328A (en) | Equipment operation and maintenance system based on industrial Internet of things | |
CN103325225A (en) | Embedded wireless computer room monitoring system and monitoring method therefor | |
CN103200050A (en) | Server hardware state monitoring method and server hardware state monitoring system | |
CN112685195A (en) | Unattended machine room management method, server and system based on micro-service technology | |
CN106100884A (en) | The alarm method of supervisory control of substation equipment operation exception | |
CN104637265A (en) | Dispatch-automated multilevel integration intelligent watching alarming system | |
CN104898525A (en) | Data acquisition device, data acquisition system and data acquisition method | |
CN112234707B (en) | High-energy synchrotron radiation light source magnet power failure recognition system | |
CN110320503B (en) | X-band weather radar standard output controller system | |
CN114510378A (en) | Parameter backup method and device for air conditioning unit and electronic equipment | |
CN103546332A (en) | Server monitoring method and system | |
CN202275310U (en) | Intelligent monitor system of cold storage warehouse assembly | |
JP3643930B2 (en) | Management operation system for networked infrastructure equipment and management operation method using the system | |
CN114510391A (en) | Monitoring management system integrating infrastructure | |
CN204389937U (en) | Transit equipment overall process monitoring operation device | |
CN113176982A (en) | Device and method for realizing VPX architecture-based chassis management system | |
CN111509855A (en) | Multifunctional digital Internet switching power supply and control method thereof | |
CN112698601A (en) | Wisdom monitored control system that raises pigs | |
CN111835902A (en) | Device testing method and device, electronic device and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |