CN116149954A - Intelligent operation and maintenance system and method for server - Google Patents

Intelligent operation and maintenance system and method for server Download PDF

Info

Publication number
CN116149954A
CN116149954A CN202310217735.9A CN202310217735A CN116149954A CN 116149954 A CN116149954 A CN 116149954A CN 202310217735 A CN202310217735 A CN 202310217735A CN 116149954 A CN116149954 A CN 116149954A
Authority
CN
China
Prior art keywords
server
unit
data
module
terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202310217735.9A
Other languages
Chinese (zh)
Inventor
谢才权
冯亮
曾新平
曾志
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Zhongyuan Computer Technology Co ltd
Original Assignee
Shanghai Zhongyuan Computer Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Zhongyuan Computer Technology Co ltd filed Critical Shanghai Zhongyuan Computer Technology Co ltd
Priority to CN202310217735.9A priority Critical patent/CN116149954A/en
Publication of CN116149954A publication Critical patent/CN116149954A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3024Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a central processing unit [CPU]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3089Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • G06F9/5022Mechanisms to release resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Hardware Redundancy (AREA)

Abstract

The invention provides an intelligent operation and maintenance system and a method thereof, and relates to the technical field of operation and maintenance implementation and management. The intelligent operation and maintenance system of the server comprises a background server operation monitoring module, wherein the background server operation monitoring module is used for monitoring the operation state of the server in real time; the server terminal management module is connected with the background server operation monitoring module through a data link signal; the server distributed operation alarm module is connected with the background server operation monitoring module through a data link signal. By setting the independent server distributed operation alarm module, the operation states of a plurality of server terminal machine rooms in the whole system can be monitored, meanwhile, an administrator can freely set the alarm strategy of a single server, a polynomial combined solution is intelligently provided for the system administrator, and the operation and maintenance efficiency of the whole server system is improved.

Description

Intelligent operation and maintenance system and method for server
Technical Field
The invention relates to the technical field of operation and maintenance implementation and management, in particular to an intelligent operation and maintenance system of a server and a method thereof.
Background
For most enterprises, the internal data of the enterprise is stored in the relevant server host, the server is an instance of a computer program or device, and is mainly used for managing network resources, processing requests and transmitting the data to another computer through the internet or a local network, a user can set the server to control access to the network, some servers are also dedicated to specific tasks, commonly called dedicated tasks, and many current servers are shared servers, which puts high demands on daily maintenance and management work of the server operation and maintenance management system, and how to ensure the safety, continuous and stable operation of the enterprise data in the informatization system is a main problem of the current server operation and maintenance management system.
The daily inspection of the server system is very important, including whether the service port is normally operated, whether the utilization rate of CPU memory and the like is normal, any faults may be represented before occurrence, but the current enterprise server is generally connected with each other through a bus in the operation process, when one of the servers suddenly breaks down, the whole system can be firstly switched to the standby equipment to restore service, then the subsequent fault checking operation is carried out, a great deal of time is required for carrying out the operation, a certain requirement is provided for the response speed of the standby server, and meanwhile, the problems of high system maintenance cost, low maintenance efficiency and the like are caused by adopting a bus connection mode.
Disclosure of Invention
(one) solving the technical problems
Aiming at the defects of the prior art, the invention provides an intelligent operation and maintenance system and method for a server, which solve the problem that the whole server system is affected when sudden faults are caused by the fact that the current enterprise server is generally connected through a bus and cannot monitor and maintain multiple groups of servers in a partitioning way.
(II) technical scheme
In order to achieve the above purpose, the invention is realized by the following technical scheme: a server intelligent operation and maintenance system, the server intelligent operation and maintenance system comprising:
the background server operation monitoring module is used for monitoring the operation state of the server in real time;
the server terminal management module is connected with the background server operation monitoring module through a data link signal;
the server distributed operation alarm module is connected with the background server operation monitoring module through a data link signal;
the server cache data intelligent cleaning module is connected with the server terminal management module through an Ethernet;
the server preprocessing and upgrading module is connected with the server terminal management module through a data link signal;
and the server standby resource management module is respectively connected with the background server operation monitoring module and the server terminal management module through data link signals.
Preferably, the background server operation monitoring module comprises an information data display terminal and a server room distribution monitoring unit, the server room distribution monitoring unit comprises an access right detection unit, a database security detection unit, a room environment detection unit and a port access recording unit, the information data display terminal is used for performing data visual display, the access right detection unit is used for performing record detection of external access right, the database security detection unit is used for recording a retrieval list of a server database and marking, the room environment detection unit is used for detecting environment information in a room, and the port access recording unit is used for monitoring the starting state of each server port.
Preferably, the server terminal management module includes a terminal data receiving unit, a line configuration unit, a main control configuration unit, a data configuration unit, a firewall management unit and an instruction execution unit, where the line configuration unit, the main control configuration unit, the data configuration unit and the firewall management unit are all connected with the instruction execution unit through network signals, the terminal data receiving unit is used for receiving ethernet data signals, the line configuration unit is used for configuring a connection line of the system unit, the main control configuration unit is used for configuring a server maintenance plan data report, the data configuration unit is used for compressing data to be transmitted, the firewall management unit is used for configuring and managing a server firewall in the system, and the instruction execution unit is used for generating execution command signals and transmitting the execution command signals to the server cache data intelligent cleaning module.
Preferably, the server distributed operation alarm module comprises a hierarchical line access unit, an administrator control terminal and a server burst fault simulation experiment unit, wherein the hierarchical line access unit is connected with the administrator control terminal through a data link signal, the output end of the administrator control terminal is electrically connected with the input end of the server burst fault simulation experiment unit, the output end of the server burst fault simulation experiment unit is sequentially connected with a server fault simulation generation unit, an experiment log generation unit and a system maintenance strategy production unit through a data link signal, the hierarchical line access unit is used for being connected with a plurality of groups of server terminals in a line mode, the server burst fault simulation experiment unit is used for carrying out calculation experiment analysis on emergency processing capacity of a server fault, the server fault simulation generation unit is used for carrying out server fault simulation experiment, the experiment log generation unit is used for generating report information of experimental data and sending the report information to a background server operation monitoring module, and the system maintenance strategy production unit is used for carrying out backup processing on system experiment logs.
Preferably, the administrator control terminal is sequentially connected with a server firewall detection unit, a server running state alarm unit and a hardware device monitoring terminal, the server firewall detection unit is used for detecting the running state of a system firewall, the server running state alarm unit is used for carrying out rapid alarm processing when a server fails, and the hardware device monitoring terminal is used for setting the running state of the hardware of the server system.
Preferably, the server cache data intelligent cleaning module comprises a server port opening and closing unit, a server storage space calculation and recombination unit, a cold data recovery and cleaning unit and an information confirmation unit, wherein the server port opening and closing unit is connected with the server storage space calculation and recombination unit and the cold data recovery and cleaning unit through data link signals, the output ends of the server storage space calculation and recombination unit and the cold data recovery and cleaning unit are electrically connected with the input end of the information confirmation unit, the server port opening and closing unit is used for controlling the opening and closing of a server connection port, the server storage space calculation and recombination unit is used for calculating the data storage allowance of a server, the cold data recovery and cleaning unit is used for cleaning silent data inside the server, and the information confirmation unit is used for a manager to secondarily confirm system instructions.
Preferably, the server preprocessing upgrade module comprises an upgrade package pre-downloading unit, an upgrade period setting unit, a historical data coverage unit and a system rollback processing unit, wherein the upgrade package pre-downloading unit is connected with the upgrade period setting unit through an ethernet network, the historical data coverage unit is connected with the historical data coverage unit through a data link signal, the upgrade package pre-downloading unit is used for pre-downloading system firmware, the upgrade period setting unit is used for setting a period of system updating, and the historical data coverage unit is used for performing coverage replacement of historical data.
Preferably, the server standby resource management module comprises a secondary line access terminal and a data transceiver unit, the output end of the secondary line access terminal is sequentially connected with a server resource access management and control unit, a data backup resource repository and a server intelligent scheduling unit, the output ends of the server resource access management and control unit, the data backup resource repository and the server intelligent scheduling unit are electrically connected with the input end of the data transceiver unit, the secondary line access terminal is used for accessing a server standby line, the server resource access management and control unit is used for managing and controlling external access data, the data backup resource repository is used for storing backup data of a server, the server intelligent scheduling unit is used for scheduling the standby server, and the data transceiver unit is used for transmitting data information.
Preferably, the server terminal management module is used for performing connection management of multiple groups of servers, the server distributed operation alarm module is used for performing system alarm processing, the server cache data intelligent cleaning module is used for screening cache data in the servers, the server preprocessing upgrading module is used for updating and upgrading the systems, and the server standby resource management module is used for performing management and calling of standby server resources.
An intelligent operation and maintenance method for a server comprises the following method steps:
s1, an operation and maintenance manager firstly sets a system line, data and a firewall preset value through a server terminal management module, and transmits a control command to a server distributed operation alarm module through an instruction execution unit;
s2, starting a background server operation monitoring module to operate, monitoring the operation state of the background server in real time, connecting with a plurality of groups of servers through a grading line access unit in a server distributed operation alarm module, and carrying out a server fault emergency treatment experiment every 6-10 hours by a server sudden fault simulation experiment unit to verify the emergency treatment capability of the system;
s3, an operation and maintenance manager selectively closes part of ports according to the access time period by starting a server port opening and closing unit in the server cache data intelligent cleaning module, simultaneously calculates and processes the storage capacity of the server through a server storage space calculation and recombination unit, and cleans silent data through a cold data recovery cleaning unit;
s4, after the calculation of the server capacity and the cleaning of cold data are completed, an operation and maintenance manager sets a system updating period through an updating period setting unit in the server preprocessing updating module, and pre-downloads a system updating packet through an updating packet pre-downloading unit;
s5, the auxiliary line access terminal inside the server standby resource management module is connected with a plurality of groups of servers, the server resource access control unit is used for controlling external access instructions, the server intelligent scheduling unit is used for intelligent calling of the servers, and the data receiving and transmitting unit is used for receiving connection information of the background server operation monitoring module and the server terminal management module.
(III) beneficial effects
The invention provides an intelligent operation and maintenance system and method for a server. The beneficial effects are as follows:
1. the invention can monitor the running state of a plurality of server terminal machine rooms in the whole system by arranging the independent server distributed running alarm modules, can enable an administrator to freely set the alarm strategy of a single server, intelligently provides a polynomial combined solution for the system administrator, and improves the running and maintenance efficiency of the whole server system.
2. According to the invention, the server burst fault simulation experiment unit is arranged, so that a simulation experiment can be carried out on faults of the whole server system in a time-sharing section, an experiment log is generated and sent to the system maintenance strategy generation unit to generate a corresponding solution strategy, and therefore operation and maintenance management and control are carried out according to different planning data, and the operation stability and emergency processing capacity of the whole server system are improved.
3. According to the invention, the server cache data intelligent cleaning module is arranged, so that temporarily unused service ports in the server can be deactivated, the occupation of server system resources is reduced, the potential safety hazard of the server ports is reduced, meanwhile, the redundant storage capacity of the server can be effectively expanded, the occupation of silent cold data to the server space is reduced, and the overall performance of the server is improved.
Drawings
FIG. 1 is a schematic diagram of the overall architecture of the system of the present invention;
FIG. 2 is a schematic diagram of a background server operation monitoring module architecture according to the present invention;
FIG. 3 is a schematic diagram of a server terminal management module architecture according to the present invention;
FIG. 4 is a schematic diagram of a distributed operation alarm module architecture of a server according to the present invention;
FIG. 5 is a schematic diagram of a server burst fault simulation experiment unit architecture according to the present invention;
FIG. 6 is a schematic diagram of an intelligent cleaning module architecture for server cache data according to the present invention;
FIG. 7 is a schematic diagram of a server preprocessing upgrade module architecture according to the present invention;
FIG. 8 is a schematic diagram of a standby resource management module architecture of a server according to the present invention.
The method comprises the steps that 1, a background server runs a monitoring module; 101. an information data display terminal; 102. the server room distributes the monitoring units; 103. an access right detection unit; 104. a database security detection unit; 105. a machine room environment detection unit; 106. a port access recording unit; 2. a server terminal management module; 201. a terminal data receiving unit; 202. a line configuration unit; 203. a main control configuration unit; 204. a data configuration unit; 205. a firewall management unit; 206. an instruction execution unit; 3. the server distributes running alarm module; 301. a hierarchical line access unit; 302. an administrator controls the terminal; 303. a server firewall detection unit; 304. a server running state alarm unit; 305. a hardware equipment monitoring terminal; 306. the server burst fault simulation experiment unit; 307. a server fault simulation generation unit; 308. an experiment log generation unit; 309. a system maintenance strategy production unit; 4. the server cache data intelligent cleaning module; 401. a server port opening and closing unit; 402. a server storage space calculation reorganization unit; 403. a cold data recovery and cleaning unit; 404. an information confirmation unit; 5. the server preprocesses the upgrade module; 501. an upgrade package pre-downloading unit; 502. an update period setting unit; 503. a history data overlay unit; 504. a system rollback processing unit; 6. a server standby resource management module; 601. a secondary line access terminal; 602. a server resource access control unit; 603. a data backup resource repository; 604. the server intelligent scheduling unit; 605. and a data receiving and transmitting unit.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Examples:
as shown in fig. 1, a server intelligent operation and maintenance system includes: the background server operation monitoring module 1 is used for monitoring the operation state of the server in real time; the server terminal management module 2 is connected with the background server operation monitoring module 1 through a data link signal; the server distributed operation alarm module 3 is connected with the background server operation monitoring module 1 through a data link signal; the server cache data intelligent cleaning module 4 is connected with the server terminal management module 2 through an Ethernet; the server preprocessing and upgrading module 5 is connected with the server terminal management module 2 through a data link signal by the server preprocessing and upgrading module 5; the server standby resource management module 6 is connected with the background server operation monitoring module 1 and the server terminal management module 2 through data link signals respectively.
Specifically, the server terminal management module 2 is used for performing connection management of multiple groups of servers, the server distributed operation alarm module 3 is used for performing system alarm processing, the server cache data intelligent cleaning module 4 is used for screening cache data in the server, the server preprocessing upgrading module 5 is used for updating and upgrading the system, and the server standby resource management module 6 is used for performing management and calling of standby server resources.
Further, as shown in fig. 1, the background server operation monitoring module 1 includes an information data display terminal 101 and a server room distribution monitoring unit 102, and the server room distribution monitoring unit 102 includes an access right detecting unit 103, a database security detecting unit 104, a room environment detecting unit 105, and a port access recording unit 106.
Specifically, the information data display terminal 101 is configured to perform data visual display, the access right detection unit 103 is configured to perform record detection of external access rights, the database security detection unit 104 is configured to record a retrieval list of a server database, perform marking, the machine room environment detection unit 105 is configured to detect environment information in the machine room, and the port access recording unit 106 is configured to monitor an enabling state of each server port.
Further, as shown in fig. 3, the server terminal management module 2 includes a terminal data receiving unit 201, a line configuration unit 202, a master control configuration unit 203, a data configuration unit 204, a firewall management unit 205, and an instruction execution unit 206, and the line configuration unit 202, the master control configuration unit 203, the data configuration unit 204, and the firewall management unit 205 are all connected to the instruction execution unit 206 through network signals.
Specifically, the terminal data receiving unit 201 is configured to receive an ethernet data signal, the line configuration unit 202 is configured to configure a connection line of a system unit, the master control configuration unit 203 is configured to configure a server maintenance plan data report, the data configuration unit 204 is configured to compress data to be transmitted, the firewall management unit 205 is configured to manage a firewall of a server in the system, and the instruction execution unit 206 is configured to generate an execution command signal and transmit the execution command signal to the server cache data intelligent cleaning module 4.
Further, as shown in fig. 4 to 5, the server distributed operation alarm module 3 includes a hierarchical line access unit 301, an administrator control terminal 302 and a server burst fault simulation experiment unit 306, where the hierarchical line access unit 301 is connected to the administrator control terminal 302 through a data link signal, an output end of the administrator control terminal 302 is electrically connected to an input end of the server burst fault simulation experiment unit 306, and an output end of the server burst fault simulation experiment unit 306 is sequentially connected to the server fault simulation generation unit 307, the experiment log generation unit 308 and the system maintenance policy production unit 309 through a data link signal, and by setting an independent server distributed operation alarm module 3, an administrator can freely set an alarm policy of a single server, and intelligently provide a multiple combined solution for the system administrator, so that the operation and maintenance efficiency of the whole server system is improved.
Specifically, the hierarchical line access unit 301 is configured to be in line connection with multiple groups of server terminals, the server burst fault simulation experiment unit 306 is configured to perform calculation experiment analysis on the emergency processing capability of the server fault, the server fault simulation generation unit 307 is configured to perform operation and maintenance management on the whole server system according to different planning data, and the system maintenance policy production unit 309 is configured to perform backup processing on the system experiment log.
Further, the administrator control terminal 302 is sequentially connected to a server firewall detection unit 303, a server running state alarm unit 304, and a hardware device monitoring terminal 305.
Specifically, the server firewall detection unit 303 is configured to detect an operation state of a system firewall, the server operation state alarm unit 304 is configured to perform a rapid alarm processing when a server fails, and the hardware device monitoring terminal 305 is configured to set an operation state of hardware of the server system.
Further, as shown in fig. 6, the server cache data intelligent cleaning module 4 includes a server port opening and closing unit 401, a server storage space calculating and reorganizing unit 402, a cold data recovery cleaning unit 403 and an information confirmation unit 404, where the server port opening and closing unit 401 is connected with the server storage space calculating and reorganizing unit 402 and the cold data recovery cleaning unit 403 through data link signals, the output ends of the server storage space calculating and reorganizing unit 402 and the cold data recovery cleaning unit 403 are electrically connected with the input end of the information confirmation unit 404, and by setting the server cache data intelligent cleaning module 4, temporarily unused service ports in a server can be deactivated, occupation of server system resources is reduced, potential safety hazards of the server ports are reduced, meanwhile by setting the cold data recovery cleaning unit 403, redundant storage capacity of the server can be effectively expanded, occupation of silent cold data on the server space is reduced, and overall performance of the server is improved.
Specifically, the server port opening and closing unit 401 is configured to control opening and closing of a server connection port, the server storage space calculating and reorganizing unit 402 is configured to calculate a data storage margin of a server, the cold data recovery cleaning unit 403 is configured to clean silent data inside the server, and the information confirmation unit 404 is configured to secondarily confirm a system instruction by an administrator.
Further, as shown in fig. 7, the server preprocessing upgrade module 5 includes an upgrade package pre-download unit 501, an update period setting unit 502, a history data overlay unit 503, and a system rollback processing unit 504, the upgrade package pre-download unit 501 is connected to the update period setting unit 502 through an ethernet, and the history data overlay unit 503 is connected to the history data overlay unit 503 through a data link signal.
Specifically, the upgrade package pre-download unit 501 is configured to perform pre-download of system firmware, the update period setting unit 502 is configured to set a period of system update, and the history data overlay unit 503 is configured to perform overlay replacement of history data.
Further, as shown in fig. 8, the server standby resource management module 6 includes a secondary line access terminal 601 and a data transceiver unit 605, and the output end of the secondary line access terminal 601 is sequentially connected to the server resource access management and control unit 602, the data backup resource repository 603 and the server intelligent scheduling unit 604, and the output ends of the server resource access management and control unit 602, the data backup resource repository 603 and the server intelligent scheduling unit 604 are electrically connected to the input end of the data transceiver unit 605.
Specifically, the secondary line access terminal 601 is used for accessing a standby line of a server, the server resource access management and control unit 602 is used for managing and controlling external access data, the data backup resource repository 603 is used for storing backup data of the server, the server intelligent scheduling unit 604 is used for performing scheduling processing of the standby server, and the data transceiver unit 605 is used for transmitting data information.
An intelligent operation and maintenance method for a server comprises the following method steps:
s1, an operation and maintenance manager firstly sets a system line, data and a firewall preset value through a server terminal management module 2, and transmits a control command to a server distributed operation alarm module 3 through an instruction execution unit 206;
s2, starting a background server operation monitoring module 1 to operate, monitoring the operation state of a background server in real time, connecting with a plurality of groups of servers through a grading line access unit 301 in a server distributed operation alarm module 3, and simultaneously carrying out a server fault emergency treatment experiment on a server sudden fault simulation experiment unit 306 every 6-10 hours, and verifying the emergency treatment capability of a system;
s3, an operation and maintenance manager selectively closes part of ports according to access time intervals by starting a server port opening and closing unit 401 in the server cache data intelligent cleaning module 4, simultaneously calculates and processes the storage capacity of the server through a server storage space calculation and recombination unit 402, and cleans silent data through a cold data recovery cleaning unit 403;
s4, after the calculation of the server capacity and the cleaning of cold data are completed, an operation and maintenance manager sets a system update period through an update period setting unit 502 in the server preprocessing and upgrading module 5, and pre-downloads a system upgrade package through an upgrade package pre-downloading unit 501;
s5, the auxiliary line access terminal 601 inside the server standby resource management module 6 is connected with a plurality of groups of servers, the server resource access control unit 602 is used for controlling external access instructions, the server intelligent scheduling unit 604 is used for intelligent calling of the servers, and the data transceiver unit 605 is used for receiving connection information of the background server operation monitoring module 1 and the server terminal management module 2.
Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made therein without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (10)

1. The intelligent operation and maintenance system of the server is characterized in that: the intelligent operation and maintenance system of the server comprises:
the system comprises a background server operation monitoring module (1), wherein the background server operation monitoring module (1) is used for monitoring the operation state of a server in real time;
the server terminal management module (2), the said server terminal management module (2) is connected with the backstage server and operates the monitoring module (1) through the signal of data link;
the server distributed operation alarm module (3), the server distributed operation alarm module (3) is connected with the background server operation monitoring module (1) through a data link signal;
the server cache data intelligent cleaning module (4) is connected with the server terminal management module (2) through an Ethernet;
the server preprocessing and upgrading module (5), wherein the server preprocessing and upgrading module (5) is linked with the server terminal management module (2) through a data link signal;
and the server standby resource management module (6), wherein the server standby resource management module (6) is respectively connected with the background server operation monitoring module (1) and the server terminal management module (2) through data link signals.
2. The server intelligent operation and maintenance system according to claim 1, wherein: the background server operation monitoring module (1) comprises an information data display terminal (101) and a server room distribution monitoring unit (102), the server room distribution monitoring unit (102) comprises an access right detection unit (103), a database security detection unit (104), a room environment detection unit (105) and a port access recording unit (106), the information data display terminal (101) is used for carrying out data visual display, the access right detection unit (103) is used for carrying out external access right recording detection, the database security detection unit (104) is used for recording a retrieval list of a server database and carrying out marking, the room environment detection unit (105) is used for detecting environment information inside a computer room, and the port access recording unit (106) is used for monitoring the starting state of each server port.
3. The server intelligent operation and maintenance system according to claim 1, wherein: the server terminal management module (2) comprises a terminal data receiving unit (201), a line configuration unit (202), a main control configuration unit (203), a data configuration unit (204), a firewall management unit (205) and an instruction execution unit (206), wherein the line configuration unit (202), the main control configuration unit (203), the data configuration unit (204) and the firewall management unit (205) are all connected with the instruction execution unit (206) through network signals, the terminal data receiving unit (201) is used for receiving Ethernet data signals, the line configuration unit (202) is used for configuring a connection line of a system unit, the main control configuration unit (203) is used for configuring a server maintenance plan data report, the data configuration unit (204) is used for performing compression processing on data to be transmitted, the firewall management unit (205) is used for performing configuration management on a server firewall in the system, and the instruction execution unit (206) is used for generating execution command signals and transmitting the execution command signals to the server cache data intelligent cleaning module (4).
4. The server intelligent operation and maintenance system according to claim 1, wherein: the distributed operation alarm module (3) of the server comprises a hierarchical line access unit (301), an administrator control terminal (302) and a server burst fault simulation experiment unit (306), wherein the hierarchical line access unit (301) is connected with the administrator control terminal (302) through a data link signal, the output end of the administrator control terminal (302) is electrically connected with the input end of the server burst fault simulation experiment unit (306), the output end of the server burst fault simulation experiment unit (306) is sequentially connected with a server fault simulation generation unit (307), an experiment log generation unit (308) and a system maintenance strategy production unit (309) through data link signals, the hierarchical line access unit (301) is used for being connected with a plurality of groups of server terminals in a line mode, the server burst fault simulation experiment unit (306) is used for carrying out calculation experiment analysis of server fault emergency processing capacity, the server fault simulation experiment generation unit (307) is used for carrying out server fault simulation experiment, the backup experiment log generation unit (308) is used for generating report information of experiment data and sending the report information to a background server operation monitoring module (1), and the system maintenance strategy production unit (309) is used for carrying out system maintenance strategy processing.
5. The intelligent operation and maintenance system of claim 4, wherein: the manager control terminal (302) is sequentially connected with the server firewall detection unit (303), the server running state alarm unit (304) and the hardware equipment monitoring terminal (305), the server firewall detection unit (303) is used for detecting the running state of a system firewall, the server running state alarm unit (304) is used for carrying out rapid alarm processing when a server breaks down, and the hardware equipment monitoring terminal (305) is used for setting the running state of hardware of a server system.
6. The server intelligent operation and maintenance system according to claim 1, wherein: the intelligent server cache data cleaning module (4) comprises a server port opening and closing unit (401), a server storage space calculation and recombination unit (402), a cold data recovery and cleaning unit (403) and an information confirmation unit (404), wherein the server port opening and closing unit (401) is connected with the server storage space calculation and recombination unit (402) and the cold data recovery and cleaning unit (403) through data link signals, the output ends of the server storage space calculation and recombination unit (402) and the cold data recovery and cleaning unit (403) are electrically connected with the input end of the information confirmation unit (404), the server port opening and closing unit (401) is used for controlling the opening and closing of a server connection port, the server storage space calculation and recombination unit (402) is used for calculating the data storage allowance of a server, the cold data recovery and cleaning unit (403) is used for cleaning silent data inside the server, and the information confirmation unit (404) is used for a manager to secondarily confirm system instructions.
7. The server intelligent operation and maintenance system according to claim 6, wherein: the server preprocessing upgrade module (5) comprises an upgrade package pre-downloading unit (501), an upgrade period setting unit (502), a historical data coverage unit (503) and a system rollback processing unit (504), wherein the upgrade package pre-downloading unit (501) is connected with the upgrade period setting unit (502) through an Ethernet, the historical data coverage unit (503) is connected with the historical data coverage unit (503) through a data link signal, the upgrade package pre-downloading unit (501) is used for pre-downloading system firmware, the upgrade period setting unit (502) is used for setting a period of system updating, and the historical data coverage unit (503) is used for performing coverage replacement of historical data.
8. The server intelligent operation and maintenance system according to claim 1, wherein: the server standby resource management module (6) comprises a secondary line access terminal (601) and a data receiving and transmitting unit (605), wherein the output end of the secondary line access terminal (601) is sequentially connected with a server resource access management and control unit (602), a data standby resource repository (603) and a server intelligent scheduling unit (604), the output ends of the server resource access management and control unit (602), the data standby resource repository (603) and the server intelligent scheduling unit (604) are electrically connected with the input end of the data receiving and transmitting unit (605), the secondary line access terminal (601) is used for accessing a server standby line, the server resource access management and control unit (602) is used for managing and controlling external access data, the data standby resource repository (603) is used for storing backup data of a server, the server intelligent scheduling unit (604) is used for scheduling standby servers, and the data receiving and transmitting unit (605) is used for transmitting data information.
9. The server intelligent operation and maintenance system according to claim 1, wherein: the server terminal management module (2) is used for carrying out connection management and control of a plurality of groups of servers, the server distributed operation alarm module (3) is used for system alarm processing, the server cache data intelligent cleaning module (4) is used for screening cache data in the server, the server preprocessing upgrading module (5) is used for updating and upgrading the system, and the server standby resource management module (6) is used for carrying out management and calling of standby server resources.
10. An intelligent operation and maintenance method for a server is characterized in that: a server intelligent operation and maintenance system applied to any one of claims 1-9, comprising the following method steps:
s1, an operation and maintenance manager firstly sets a system line, data and a firewall preset value through a server terminal management module (2), and transmits a control command to a server distributed operation alarm module (3) through an instruction execution unit (206);
s2, starting a background server operation monitoring module (1) to operate, monitoring the operation state of a background server in real time, connecting with a plurality of groups of servers through a grading line access unit (301) in a server distributed operation alarm module (3), and simultaneously carrying out a server fault emergency treatment experiment on a server burst fault simulation experiment unit (306) every 6-10 hours, and verifying the emergency treatment capability of the system;
s3, an operation and maintenance manager selectively closes part of ports according to access time intervals by starting a server port opening and closing unit (401) in the server cache data intelligent cleaning module (4), simultaneously calculates and processes the storage capacity of the server through a server storage space calculation and recombination unit (402), and cleans silent data through a cold data recovery cleaning unit (403);
s4, after the calculation of the server capacity and the cleaning of cold data are completed, an operation and maintenance manager sets a system update period through an update period setting unit (502) in a server preprocessing and upgrading module (5), and pre-downloads a system upgrade package through an upgrade package pre-downloading unit (501);
s5, a plurality of groups of servers are connected through a secondary line access terminal (601) in the server standby resource management module (6), external access instructions are managed and controlled through a server resource access management and control unit (602), meanwhile, intelligent calling of the servers is conducted through a server intelligent scheduling unit (604), and connection information of the background server operation monitoring module (1) and the server terminal management module (2) is received through a data receiving and transmitting unit (605).
CN202310217735.9A 2023-03-07 2023-03-07 Intelligent operation and maintenance system and method for server Withdrawn CN116149954A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310217735.9A CN116149954A (en) 2023-03-07 2023-03-07 Intelligent operation and maintenance system and method for server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310217735.9A CN116149954A (en) 2023-03-07 2023-03-07 Intelligent operation and maintenance system and method for server

Publications (1)

Publication Number Publication Date
CN116149954A true CN116149954A (en) 2023-05-23

Family

ID=86360059

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310217735.9A Withdrawn CN116149954A (en) 2023-03-07 2023-03-07 Intelligent operation and maintenance system and method for server

Country Status (1)

Country Link
CN (1) CN116149954A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116582369A (en) * 2023-07-13 2023-08-11 北京点聚信息技术有限公司 Willingness authentication method for online subscription

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116582369A (en) * 2023-07-13 2023-08-11 北京点聚信息技术有限公司 Willingness authentication method for online subscription
CN116582369B (en) * 2023-07-13 2023-09-12 北京点聚信息技术有限公司 Willingness authentication method for online subscription

Similar Documents

Publication Publication Date Title
US11397647B2 (en) Hot backup system, hot backup method, and computer device
CN1770707B (en) Apparatus and method for quorum-based power-down of unresponsive servers in a computer cluster
CN102355368B (en) Fault processing method of network equipment and system
CN105430327A (en) NVR cluster backup method and device
CN105450472A (en) Method and device for automatically acquiring states of physical components of servers
CN108199901B (en) Hardware repair reporting method, system, device, hardware management server and storage medium
Wang et al. Reliability modeling and analysis for linear consecutive-k-out-of-n: F retrial systems with two maintenance activities
CN111431754A (en) Fault analysis method and system for power distribution and utilization communication network
CN116149954A (en) Intelligent operation and maintenance system and method for server
WO2024113780A1 (en) Fc link management method, apparatus and device, and readable storage medium
CN110119325A (en) Server failure processing method, device, equipment and computer readable storage medium
CN117391675B (en) Data center infrastructure operation and maintenance management method
CN111897697A (en) Server hardware fault repairing method and device
CN110291505A (en) Reduce the recovery time of application
CN113448947B (en) Method and device for distributed deployment operation and maintenance of mongo database
CN105119765B (en) A kind of Intelligent treatment fault system framework
CN117453036A (en) Method, system and device for adjusting power consumption of equipment in server
WO2023125702A1 (en) Cloud management method and system for battery swapping station, server, and storage medium
CN112541011A (en) Power grid terminal fault processing method and device based on RPA and AI
CN116074180A (en) Fault location method, fault repair method, device and storage medium
CN105550065A (en) Database server communication management method and device
CN115292004A (en) Fault emergency method and device, electronic equipment and storage medium
CN113946633A (en) Operation and maintenance system and method based on agile data warehouse
CN112202594A (en) Nuclear power station server fault emergency processing system and method
CN105550094A (en) Automatic state monitoring method of high-availability system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20230523