CN114816943A - Enterprise intelligent cloud operation and maintenance system - Google Patents

Enterprise intelligent cloud operation and maintenance system Download PDF

Info

Publication number
CN114816943A
CN114816943A CN202210764810.9A CN202210764810A CN114816943A CN 114816943 A CN114816943 A CN 114816943A CN 202210764810 A CN202210764810 A CN 202210764810A CN 114816943 A CN114816943 A CN 114816943A
Authority
CN
China
Prior art keywords
module
alarm
management
script
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210764810.9A
Other languages
Chinese (zh)
Inventor
周培勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Huasheng Ruiying Technology Co ltd
Ruiyingyun Technology Tianjin Co ltd
Original Assignee
Beijing Huasheng Ruiying Technology Co ltd
Ruiyingyun Technology Tianjin Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Huasheng Ruiying Technology Co ltd, Ruiyingyun Technology Tianjin Co ltd filed Critical Beijing Huasheng Ruiying Technology Co ltd
Priority to CN202210764810.9A priority Critical patent/CN114816943A/en
Publication of CN114816943A publication Critical patent/CN114816943A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/302Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a software system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/542Event management; Broadcasting; Multicasting; Notifications

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • Software Systems (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides an enterprise intelligent cloud operation and maintenance system which comprises a dashboard, an intelligent alarm module, an operation management module, a system management module, a user management module, a workbench module and a configuration management module, wherein the intelligent alarm module comprises an alarm analysis gathering unit, an alarm management unit, an integrated management unit, an alarm configuration management unit, an audit management unit and a report management unit; the invention can quickly access various alarm information, realize alarm and noise reduction through automatic de-weighting, rule compression and algorithm noise reduction, help the operation and maintenance team reduce alarms and avoid alarm storms; the expansibility is good, the operating efficiency is high, the operation is simple and convenient, and the visual display can be carried out by summarizing operation and maintenance and warning information.

Description

Enterprise intelligent cloud operation and maintenance system
Technical Field
The invention relates to the technical field of operation and maintenance systems, in particular to an enterprise intelligent cloud operation and maintenance system.
Background
The operation and maintenance run through the life cycle of the product, and an automatic and intelligent system is needed to help an operation and maintenance engineer to complete service delivery and service quality guarantee for users at the lowest cost and the highest speed. The operation and maintenance system is mainly developed after an operation and maintenance system research and development engineer understands service requirements, in various operation, maintenance and development processes of a computer system, development, communication, cooperation and integration among technical operation and quality assurance departments need to be promoted, and operation states also need to be monitored and maintained.
Disclosure of Invention
The object of the present invention is to solve at least one of the technical drawbacks mentioned.
Therefore, an object of the present invention is to provide an enterprise intelligent cloud operation and maintenance system, so as to solve the problems mentioned in the background art and overcome the disadvantages in the prior art.
In order to achieve the above object, an embodiment of an aspect of the present invention provides an enterprise intelligent cloud operation and maintenance system, including an intelligent alarm module and a dashboard; the instrument panel is used for displaying machine information overview governed by the system; the intelligent alarm module is used for receiving alarm information and managing the alarm information, and comprises:
step D1, creating a notification group; the created notification group includes a management group and an operation and maintenance group.
Step D2, personnel information is created for personnel to receive alarms and data notifications for the monitoring tool.
And D3, distributing the personnel to the created notification group to complete the creation of the notification group.
And D4, collecting alarm information of the monitoring tool.
Step D5, assign the same monitoring tool alert information to a notification group.
And D6, judging whether the notification group meets the notification conditions, if not, discarding the alarm information, and if so, executing the step D7.
D7, judging whether the event ID information of the alarm is in the record of the alarm, if yes, filing the alarm information in the list of the same event, if not, creating an alarm information; and informing according to the informing mode and recording in the system.
Preferably, the intelligent warning module further comprises:
and D31, creating a scheduling list, selecting the notification group and creating scheduling information.
And D32, distributing a notification tool for the notification group, and receiving the alarm information notification.
In any of the above schemes, preferably, the system further comprises a job management module, and the job management module comprises a script execution module and a job execution module.
In any of the above solutions, preferably, the script execution module is configured to execute the script on the machine configured to manage the database by manually writing or locally uploading or introducing a common script, and includes:
step S1, input script name.
And step S2, selecting an operating system, wherein the operating system comprises a Windows system and a Linux system.
And step S3, selecting script sources, wherein the script sources comprise manual input scripts, local scripts and public scripts.
Step S4, executing the selected script.
In any of the above aspects, preferably, the job execution module for managing the job execution task includes:
step A1, create job, add basic information.
Step A2, adding at least one step and a script to the created job to complete the created job.
Step A3, a task is generated.
Step A4, start timing rules.
And A5, selecting a timing expression generation mode and generating an expression, wherein the expression generation mode comprises a custom adjustment mode and a manual input mode.
A6, selecting operation to generate timing task; and closing or opening the timing task, and checking the execution condition in the execution history unit when the timing task is opened.
In any of the above schemes, it is preferable that the system further includes a system management module, and the system management module is configured to monitor operation states of all modules of the system; the system management module comprises a login record management unit and a login log management unit, wherein the login record management unit is used for counting and displaying login records of a user, and the login log management unit is used for displaying API (application program interface) request records of the user.
In any of the above schemes, it is preferable that the system further includes a user management module, and the user management module is configured to manage user account information and permissions; the user management module comprises user list management, authority list management, menu component management and role management.
In any of the above solutions, it is preferable that the system further includes a workbench module, and the workbench module is configured to manage basic information of the system; the workbench module comprises a backlog unit, a shortcut navigation unit, a message notification unit and an information document and manual management unit.
In any of the above schemes, it is preferable that the system further includes a configuration management module, where the configuration management module is configured to manage and provide configuration data of the operation and maintenance scene; the configuration management module comprises a host management module, a host adding module, an operation auditing module and a field configuration module.
In any of the above schemes, preferably, the intelligent alarm module includes an alarm analysis and summary unit, an alarm management unit, an integrated management unit, an alarm configuration management unit, an audit management unit, and a report management unit.
Compared with the prior art, the invention has the advantages and beneficial effects that:
1. the enterprise intelligent cloud operation and maintenance system can be quickly accessed with various alarm information, help operation and maintenance teams reduce alarms, and avoid alarm storms; meanwhile, through functions of assignment, scheduling, notification and the like, the alarm process management is quickly realized, the operation and maintenance team is helped to respond to the alarm more quickly, the alarm is recovered, and the alarm management capability is improved.
2. The enterprise intelligent cloud operation and maintenance system is high in operation efficiency, good in expandability, comprehensive in monitoring and simple and convenient to operate.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
fig. 1 is a structural block diagram of an enterprise intelligent cloud operation and maintenance system according to an embodiment of the present invention.
Fig. 2 is a flowchart illustrating steps of an intelligent alarm module in an enterprise intelligent cloud operation and maintenance system according to an embodiment of the present invention.
Fig. 3 is a flowchart illustrating steps of a script execution module in an enterprise intelligent cloud operation and maintenance system according to an embodiment of the present invention.
Fig. 4 is a flowchart illustrating steps of an operation execution module in an enterprise intelligent cloud operation and maintenance system according to an embodiment of the present invention.
Fig. 5 is a display diagram of an instrument panel in the enterprise intelligent cloud operation and maintenance system according to the embodiment of the invention.
Fig. 6 is a diagram showing a script execution module of an enterprise intelligent cloud operation and maintenance system according to an embodiment of the present invention.
Fig. 7 is a diagram illustrating an execution result of a script execution module of an enterprise intelligent cloud operation and maintenance system according to an embodiment of the present invention.
Fig. 8 is a diagram illustrating an operation template of an operation execution module of the enterprise intelligent cloud operation and maintenance system according to an embodiment of the present invention.
Fig. 9 is a diagram illustrating an intelligent alarm module of an enterprise intelligent cloud operation and maintenance system according to an embodiment of the invention.
Fig. 10 is an alarm log graph derived by an intelligent alarm module of an enterprise intelligent cloud operation and maintenance system according to an embodiment of the present invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.
As shown in fig. 1, fig. 2, and fig. 5, an enterprise intelligent cloud operation and maintenance system according to an embodiment of the present invention includes an intelligent alarm module 2 and a dashboard 3 module. The instrument panel 3 module is used for displaying machine information overview managed by the system; the intelligent alarm module 2 is used for receiving alarm information and managing the alarm information, and comprises:
and D1, creating a notification group, wherein the created notification group comprises a management group and an operation and maintenance group.
Step D2, personnel information is created for personnel to receive alarms and data notifications for the monitoring tool.
And D3, distributing the personnel to the created notification group to complete the creation of the notification group.
And D4, collecting alarm information of the monitoring tool.
Step D5, assign the same monitoring tool alert information to a notification group.
And D6, judging whether the notification group meets the notification conditions, if not, discarding the alarm information, and if so, executing the step D7. The notification condition may be set to be within the time of the shift schedule, etc.
And D7, judging whether the event ID information of the alarm is in the record of the alarm, if so, filing the alarm information in a list of the same event, and if not, creating an alarm information. And informing according to the informing mode and recording in the system. The notification mode comprises pushing the alarm information to an alarm module for notification, and performing notification in modes of mails, telephones, WeChats and the like.
The instrument panel 3 module is used for system information, and displays machine information, host distribution condition, manufacturer distribution, operation information and system overview information of the whole enterprise intelligent cloud operation and maintenance system in various icon forms. The method is visual and convenient, and a user can conveniently know the whole situation.
Further, the intelligent alarm module 2 further includes:
and D31, creating a scheduling list, selecting the notification group and creating scheduling information.
And D32, distributing a notification tool for the notification group, and receiving the alarm information notification. The notification tool comprises an enterprise WeChat, a mailbox and an announcement voice, and the notification function is realized respectively.
The intelligent alarm module 2 is composed of a plurality of sub-applications, can achieve the user-defined requirement in the sub-applications, and can integrate various automatic operation and maintenance tools into the module. The monitoring tools are specifically Zabbix tools and HTTPS detection tools, and can be added as required. The intelligent alarm module 2 can be quickly accessed with various alarm information, alarm noise reduction is realized through automatic weight removal, regular compression and algorithm noise reduction, the operation and maintenance team is helped to reduce alarms, and alarm storms are avoided; the automatic deduplication and the regular compression are specifically implemented by sorting data in a background database actually or sporadically and irregularly only when the data are read, and filing the same events into one and returning the same events according to the alarm occurrence time, the solution condition and whether the same events belong to the same event; the algorithm noise reduction happens when the notification is carried out, the system does not send the notification to the same event every time, but only sends the notification once, so that more resource space is saved, and the processing efficiency of the system is improved. Meanwhile, through functions of assignment, scheduling, notification and the like, the alarm process management is quickly realized, the operation and maintenance team is helped to respond to the alarm more quickly, the alarm is recovered, and the alarm management capability is improved. Can accomplish in time to report an emergency and ask for help the user in time to avoid the risk, promote the security of system, maintainability is strong.
Specifically, as shown in fig. 9, the intelligent alarm module 2 further includes an alarm analysis and summary unit 10, an alarm management unit 11, an integrated management unit 12, an alarm configuration management unit 13, an audit management unit 14, and a report management unit 15.
The intelligent alarm module 2 comprises an admin manager and user group members, wherein the user group members are divided into a group administrator and operation and maintenance personnel, the admin manager shares all permissions of the tool, in the user group members, the group administrator has the permissions of alarm checking, distribution and the like of the group, and common personnel only have the permission of alarm data checking of the group. admin may add or delete members to different groups. When the integrated monitoring tool adds the tool, a default user group source is generated, and a management group and an operation and maintenance group can be added under the source.
The alarm analysis and summary unit 10 summarizes the alarm information collected by the accessed monitoring tools, and displays the data of each alarm source including events, alarm conditions, alarm classifications, levels, states, team efficiency and the like by summarizing the alarm information. The alarm management unit 11 includes my alarm, assigned alarm, unassigned alarm and intelligent noise reduction, and the alarm management unit 11 manages alarm information of each stage, and all alarms are data of a certain alarm source. The integrated management unit 12 mainly accesses a third-party monitoring tool for the intelligent alarm module 2, and integrates a monitoring tool, a ChatOps automation tool and an ITSM integration tool respectively. The alarm configuration management unit 13 is used to inform designated personnel in what manner, the scheduling of the personnel, the assigned notification groups for the personnel to receive the alarm notifications and to which notification groups the alarms formed for the accessed tools are assigned, which may configure the assignment policy, specify how the alarms are assigned, the notification policy, i.e. in what manner the notifications are sent, wherein the notification manner requires the scheduling management to be configured in the integrated management unit 12, the number of shifts for the personnel, the notification groups, i.e. the member groups of each alarm source. The auditing management unit 14 is specifically configured to display delayed application information under the module when the monitoring or operation and maintenance staff applies for delay, where the delayed application information includes approved information, information that has not passed auditing, historical auditing information, and non-auditing information; my alarm may be deferred, and deferred operations may be sent to the group of administrators, who may review the members' deferred applications on the page. The report management unit 15 is specifically an alarm module that generates an alarm report, i.e., an alarm log, at each day, monday, in month, and at the beginning of month, as shown in fig. 10, and summarizes and displays the alarm information monthly, weekly, in month, and monthly.
Specifically, the system further comprises a job management module 28, and the job management module 28 comprises a script execution module 6 and a job execution module 7.
Specifically, as shown in fig. 3, the script execution module 6 is configured to execute a script on a machine configured to manage a database by manually writing or locally uploading or introducing a common script, and includes:
step S1, input script name.
And step S2, selecting an operating system, wherein the operating system comprises a Windows system and a Linux system.
And step S3, selecting script sources, wherein the script sources comprise manual input scripts, local scripts and public scripts.
Step S4, executing the selected script.
The operation management module 28 supports a series of basic operation and maintenance scenarios such as script execution and setting of a timing task, and also supports an automatic operation process by combining fragmented scripts or tasks through a process scheduling capability, where each script or operation is an atomic node, so as to implement automatic scheduling for Linux and Windows.
The script execution module 6 is used for executing scripts on one or more machines of the same operating system, simultaneously supports public scripts supporting an execution script library, can execute the scripts on a certain machine of the CMDB in a manual writing, local uploading or public script introducing mode, can select a linux server and check available physical memory of the machine, and specifically comprises the following steps: firstly, as shown in fig. 6, a linux machine is added to implement script execution of a single machine, secondly, script content is input, memory information of the machine is checked, and clicking is immediately executed, as shown in fig. 7, an execution result of the script can be seen in the execution history unit 27. The common script is oriented to all services of the whole job management module 28, meets the script usage scenario providing common requirements, and is mainly used for displaying all scripts recorded by individuals and creating storage scripts.
Further, as shown in fig. 4, the job execution module 7 is configured to manage job execution tasks, and includes:
step A1, create a job, add basic information.
Step A2, adding at least one step and a script to the created job to complete the created job.
Step A3, a task is generated.
Step A4, start timing rules.
And A5, selecting a timing expression generation mode and generating an expression, wherein the expression generation mode comprises a custom adjustment mode and a manual input mode.
Step A6, select job, generate timed task 26. The timed task 26 is turned off or on, and when the timed task 26 is turned on, the execution is checked in the execution history unit 27.
The job execution module 7 comprises a new job unit 24, a my job unit 25, a timed task unit 26 and an execution history unit 27, wherein the new job unit 24 linearly links the execution logics of a plurality of scripts together to form a job, each script execution becomes an atomic node and supports a common script of an execution script library, the my job unit 25 is used for displaying all jobs, the timed task unit 26 is used for automatically adjusting the jobs, the timed task unit 26 is used for displaying and executing the tasks, and the execution history unit 27 is used for recording the job execution condition under the job management module 28. The operation execution module 7 combines multiple script execution or file distribution steps involved in the operation and maintenance operation scenario into one operation template through the process arrangement capability, as shown in fig. 8, and this operation template includes the scenario-related common logic as much as possible. Besides basic information, a core part of a job is composed of a plurality of steps, and each step contains at least one script. Script execution may be performed on one or more machines through the orchestration capabilities of the job execution module 7.
Taking a script in rapid execution as an example, checking available physical memories and CPU loads of two machines, checking that a physical memory script is recorded in a public script library, and introducing the script in a mode of using the public script when creating operation; firstly, filling in basic information of the job, wherein the basic information of the job comprises a job name, a job type and a description of the job, selecting two servers needing to be operated, manually inputting a script for checking CPU load, and finally saving the script. And finally, selecting a script for viewing the available internal memory from the public script library.
The job execution module 7 summarizes and displays the execution scheme list of all jobs, and can execute a certain job immediately or at regular time. When clicking to execute immediately, the relevant script in the operation can be executed immediately, and the timed execution prompts the creation of a timed task. The job execution module 7 also comprises search, fuzzy search supporting job names and time screening; the method can edit the job, support secondary editing of the job, and execute the job immediately or at regular time for the job operation. When the timing task is executed on the job, the job execution module 7 sets the job execution scheme to be the timing task executed once or periodically, supports the second-level timing task and the custom expression, and can create the timing task if the job needs to be executed periodically, and the timing rule can be manually input or can be customized. The format of the expression needs to be satisfied, and the specific format requirements are as follows: seconds are set, the allowable value of the seconds is "0-59", and the allowable special characters are "character," - "character," character, and "#" character. Set minutes, the allowable values for the minutes are: "0-59", allowing special characters to be the "," character "," - "character", and "#" character. The hours are set, the allowable values for the hours are "0-23", and the allowable special characters are the "character," - "character," character "-" and "#" characters. The date is set, the allowed value of the date is ' 1-31 ', and the allowed special characters are ' characters ', ' question marks ' characters, ' characters ', characters ' L ', characters ' W ' and ' characters ' C '. The month is set, and the allowed characters of the month are "1-12" or "JAN-DEC", and the allowed special characters are "character", "character" - "," - "character, and" # "character. The week is set with the allowed characters of the week being "1-7" or "SUN-SAT", and the allowed special characters being the characters of "," character "," question mark "character", "character"/"," character "L", "character", "C", and "#". The year is set, the allowed characters of the year are "1970-.
The job execution module 7 may also perform fuzzy search for task names, time filtering, and may also pause or start tasks. The expression is set in terms of seconds-minutes-hours-dates-months-weeks-years, representing all, weeks and dates can only be selected as one out of two.
The job execution module 7 also includes a history query execution module, and the job execution module 7 records all script execution, file distribution, job and timed task operation information initiated by a job system page or API calling mode, and provides task redo and log backtracking capabilities. The job execution module 7 includes the following records and operations: the ID content record and the ID of each historical execution job are unique, and if the ID is definitely known, the corresponding job can be directly located through ID search. The job name in the setting is renameable, and the same job can be executed multiple times. A people record is initiated, recording which user is performing the current job. And checking the current state to check whether the job is executed successfully. The start end time, start time and end time, and the specific time at which the job is executed are set. Recording a starting mode, wherein the operation is executed periodically or immediately, recording the total consumed time of the operation, and checking an execution log of the operation.
Optionally, the system further includes a system management module 1, where the system management module 1 is configured to monitor operation states of all modules of the system. The system management module 1 comprises a login record management unit 8 and a login log management unit 9, wherein the login record management unit 8 is used for counting and displaying login records of a user, and the login log management unit 9 is used for displaying API request records of the user. The system management module 1 is mainly used by an administrator to monitor the entire system. The system management module 1 comprises a module detection and login record management unit 8 and a system log; the module detection specifically comprises the steps of monitoring all modules in the system, and starting and stopping a certain module at any time; the login record management unit 8 is used for recording login records of system users; the system log mainly records all request logs of the system, such as: GET logs, POST logs, DELETE logs, PUT logs, etc., including operational information, are recorded.
Specifically, the system further comprises a user management module, and the user management module is used for managing user account information and authority. The user management module comprises user list management, authority list management, menu component management and role management.
The user management module is mainly used by an administrator, the administrator can give the use right of the secondary module to other users, the user list management is used for showing all users in the system, and functions of newly building users and resetting passwords are provided. The administrator creates a user, sets an initial password, and the user can modify the password after logging in. The administrator may reset the password for the user. The user management mainly comprises functions of searching users, creating users, deleting users, closing users, resetting passwords and the like.
The authority list management is used for configuring and managing the authority of each interface in detail, adding API authority and starting, defaults are not required to be modified, and the authority list management can be used in role management. Role management in conjunction with the actual scene creates roles, giving different roles accessible menus and interfaces. The role management also comprises the searching, editing, closing and deleting of the role; the user-defined role names, the user-defined empowerment components, the menu and the authority management are managed, empowerment design can be searched in the list, and visual operation of a user is facilitated.
Menu component management embodies all components in the development process as functional modules seen by the navigation bar, such as: user management, system management and some function buttons, such as: editing and deleting buttons, defaulting without modification, endowing corresponding menu functions to different roles in role management by an administrator, adding the components or menus to a menu component list when developing one component or menu, and enabling the role to access the corresponding menu only.
Specifically, the system further comprises a workbench module 4, and the workbench module 4 is used for managing system work basic information. The workbench module 4 includes a backlog unit 16 management, a shortcut navigation unit 17, a message notification unit 18, an information document and manual management unit 19.
Optionally, the system further includes a configuration management module 5, where the configuration management module 5 is configured to manage and provide configuration data of the operation and maintenance scenario. The configuration management module 5 includes a host management module 20, a host addition module 21, an operation audit module 22, and a field configuration module 23.
The host management module 20 is configured to display system resource information of all machines, derive a machine list, and display custom fields, except seven system-level custom fields, namely application, appower, dc, owner, rack, and role, all of which are system-level default fields and user-defined fields. If an identifier is needed to record a certain machine, the hyperlink of the hostname field is clicked in the detailed information of the machine, so that the detailed information of the machine can be more visually checked, and the needed identifier is added in the tag field.
The host adding module 21 is configured to add hosts into the system in an SSH manner, perform connectivity test on hosts connected in the SSH manner, and determine whether the hosts are online currently, and the host management module 20 further includes functions of fuzzy search, data refreshing, field screening, derivation, and the like, and does not need a complete search value through the fuzzy search function, and can use fuzzy search if we want to search hosts whose host names include OMP; all hosts that include the OMP's key are retrieved. The current page data can be manually refreshed through a data refreshing function, and the required fields can be screened through a field screening function; the export function may export the current data out of Excel format.
The operation auditing module 22 is used for recording the operation records of all users of the system in the host management module 20. Specifically, the corresponding change operation can be located through ID search, the machine IP of a certain user performing the current operation and the tagging operation is recorded, the specific content of the operation is recorded, and the specific time for performing the operation is tagged. The ID of each history operation is unique, and if the ID is clearly known, the corresponding change operation can be directly located through ID search.
The field configuration module 23 is used for adding fields for hosts in host management for personalized management, and the added fields include system level custom fields and user custom fields. When the field provided by the host management can not meet the requirement of the user and needs other fields to record all host resources, a needed field can be newly added in the module. The name of a field is not repeatable and cannot be added to an already existing field. Meanwhile, the English name of the field must also meet the basic requirement of the database field, namely, only the combination of underline and English letters can be used.
To illustrate by way of a practical example, a room is divided into A, B, C, D four areas, and when a field is needed to mark that the server is located in a room, the field can be created in the following manner.
Calling the host management module 20 to find the corresponding field, calling the field configuration module 23 when data needs to be added to the field, clicking the hyperlink of the field to enter an editing mode, and at the moment, performing batch and single editing to add machine area information to a certain machine.
In the host management module 20, if a field is needed to help us to filter some values, the field configuration module 23 is called to find the field click edit. The sorting function may also be used when the values of the fields are of the numeric type, although any value will be sorted, sorting data of the numeric type: when this custom field is not needed, the unneeded field may be edited at field configuration module 23 and then the display on the CMDB canceled.
The field configuration module 23 further includes: the add field function: adding a custom field according to the actual scene requirement, and having a searching function: a field is quickly searched.
The enterprise intelligent cloud operation and maintenance system is high in operation efficiency, high in safety, timely in alarming, good in expandability, comprehensive in monitoring and simple and convenient to set.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
It will be understood by those skilled in the art that the present invention includes any combination of the summary and detailed description of the invention described above and those illustrated in the accompanying drawings, which is not intended to be limited to the details and which, for the sake of brevity of this description, does not describe every aspect which may be formed by such combination. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made in the above embodiments by those of ordinary skill in the art without departing from the principle and spirit of the present invention. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (10)

1. An enterprise intelligent cloud operation and maintenance system is characterized by comprising an intelligent alarm module and an instrument panel; the instrument panel is used for displaying machine information overview governed by the system; the intelligent alarm module is used for receiving alarm information and managing the alarm information, and comprises:
step D1, creating a notification group, wherein the created notification group comprises a management group and an operation and maintenance group;
step D2, creating personnel information, wherein the personnel information is used for receiving the alarm and data notification of the monitoring tool;
d3, distributing personnel to the created notification group to complete the creation of the notification group;
d4, collecting alarm information of the monitoring tool;
step D5, assigning the same monitoring tool alarm information to a notification group;
d6, judging whether the notification group meets the notification condition, if not, discarding the alarm information, if so, executing the step D7;
d7, judging whether the event ID information of the alarm is in the record of the alarm, if yes, filing the alarm information in the list of the same event, if not, creating an alarm information; and informing according to the informing mode and recording in the system.
2. The enterprise intelligent cloud operation and maintenance system of claim 1, wherein said intelligent alarm module further comprises:
step D31, creating a scheduling list, namely selecting the notification group and creating scheduling information;
and D32, distributing a notification tool for the notification group, and receiving the alarm information notification.
3. The enterprise intelligent cloud operation and maintenance system of claim 1, further comprising a job management module, wherein the job management module comprises a script execution module and a job execution module.
4. The enterprise intelligent cloud operation and maintenance system of claim 3, wherein the script execution module is configured to execute a script on a machine of the configuration management database by manually writing or locally uploading or introducing a public script, and the script execution module comprises:
step S1, inputting script name;
step S2, selecting an operating system, wherein the operating system comprises a Windows system and a Linux system;
step S3, selecting script sources, wherein the script sources comprise a manual input script, a local script and a public script;
step S4, executing the selected script.
5. The enterprise smart cloud operation and maintenance system of claim 3, wherein said job execution module for managing job execution tasks comprises:
step A1, creating a job and adding basic information;
step A2, adding at least one step and a script in the created job to complete the created job;
step A3, generating a task;
step A4, starting a timing rule;
a5, selecting a timing expression generation mode and generating an expression, wherein the expression generation mode comprises a custom adjustment mode and a manual input mode;
a6, selecting operation to generate timing task; and closing or opening the timing task, and checking the execution condition in an execution history unit when the timing task is opened.
6. The enterprise intelligent cloud operation and maintenance system of claim 1, further comprising a system management module, wherein the system management module is configured to monitor operation states of all modules of the system; the system management module comprises a login record management unit and a login log management unit, wherein the login record management unit is used for counting and displaying login records of a user, and the login log management unit is used for displaying API (application program interface) request records of the user.
7. The enterprise intelligent cloud operation and maintenance system of claim 1, further comprising a user management module, wherein the user management module is configured to manage user account information and permissions; the user management module comprises user list management, authority list management, menu component management and role management.
8. The enterprise intelligent cloud operation and maintenance system of claim 1, further comprising a workbench module, wherein the workbench module is configured to manage system work basic information; the workbench module comprises a backlog unit, a shortcut navigation unit, a message notification unit and an information document and manual management unit.
9. The enterprise smart cloud operation and maintenance system of claim 1, further comprising a configuration management module, wherein the configuration management module is configured to manage and provide configuration data of operation and maintenance scenarios; the configuration management module comprises a host management module, a host adding module, an operation auditing module and a field configuration module.
10. The enterprise intelligent cloud operation and maintenance system of claim 1, wherein the intelligent alarm module comprises an alarm analysis summarizing unit, an alarm management unit, an integrated management unit, an alarm configuration management unit, an audit management unit and a report management unit.
CN202210764810.9A 2022-07-01 2022-07-01 Enterprise intelligent cloud operation and maintenance system Pending CN114816943A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210764810.9A CN114816943A (en) 2022-07-01 2022-07-01 Enterprise intelligent cloud operation and maintenance system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210764810.9A CN114816943A (en) 2022-07-01 2022-07-01 Enterprise intelligent cloud operation and maintenance system

Publications (1)

Publication Number Publication Date
CN114816943A true CN114816943A (en) 2022-07-29

Family

ID=82523453

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210764810.9A Pending CN114816943A (en) 2022-07-01 2022-07-01 Enterprise intelligent cloud operation and maintenance system

Country Status (1)

Country Link
CN (1) CN114816943A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116708157A (en) * 2023-08-07 2023-09-05 北京鹰速光电科技有限公司 Computer security operation and maintenance service system

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106487574A (en) * 2016-04-01 2017-03-08 国家计算机网络与信息安全管理中心 Automatic operating safeguards monitoring system
CN107358400A (en) * 2017-06-06 2017-11-17 安徽赛福贝特信息技术有限公司 A kind of data management service system based on cloud platform
CN108829558A (en) * 2018-05-22 2018-11-16 郑州云海信息技术有限公司 A kind of intelligent operation management method and system of data center's alarm
CN109214704A (en) * 2018-09-26 2019-01-15 广东电网有限责任公司 A kind of distributed intelligence operation platform, method, apparatus and readable storage medium storing program for executing
CN110138611A (en) * 2019-05-15 2019-08-16 重庆天蓬网络有限公司 Automate O&M method and system
CN110569140A (en) * 2019-08-29 2019-12-13 网宿科技股份有限公司 operation and maintenance method and device
CN110955457A (en) * 2019-10-11 2020-04-03 广西电网有限责任公司信息中心 Batch automatic operation and maintenance management method for operating systems
CN111026509A (en) * 2019-12-12 2020-04-17 广州西麦科技股份有限公司 Method, system and storage medium for operation and maintenance remote host to execute operation
CN111666189A (en) * 2020-06-12 2020-09-15 中信银行股份有限公司 Method and system for declaratively visually configuring Prometheus monitoring alarm
CN113010277A (en) * 2020-11-20 2021-06-22 云智慧(北京)科技有限公司 Multi-condition triggering automatic operation system and method based on automatic operation and maintenance
CN114189423A (en) * 2021-12-08 2022-03-15 兴业银行股份有限公司 Intelligent inquiry alarm system, method and medium with comprehensive compatibility and expansion

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106487574A (en) * 2016-04-01 2017-03-08 国家计算机网络与信息安全管理中心 Automatic operating safeguards monitoring system
CN107358400A (en) * 2017-06-06 2017-11-17 安徽赛福贝特信息技术有限公司 A kind of data management service system based on cloud platform
CN108829558A (en) * 2018-05-22 2018-11-16 郑州云海信息技术有限公司 A kind of intelligent operation management method and system of data center's alarm
CN109214704A (en) * 2018-09-26 2019-01-15 广东电网有限责任公司 A kind of distributed intelligence operation platform, method, apparatus and readable storage medium storing program for executing
CN110138611A (en) * 2019-05-15 2019-08-16 重庆天蓬网络有限公司 Automate O&M method and system
CN110569140A (en) * 2019-08-29 2019-12-13 网宿科技股份有限公司 operation and maintenance method and device
CN110955457A (en) * 2019-10-11 2020-04-03 广西电网有限责任公司信息中心 Batch automatic operation and maintenance management method for operating systems
CN111026509A (en) * 2019-12-12 2020-04-17 广州西麦科技股份有限公司 Method, system and storage medium for operation and maintenance remote host to execute operation
CN111666189A (en) * 2020-06-12 2020-09-15 中信银行股份有限公司 Method and system for declaratively visually configuring Prometheus monitoring alarm
CN113010277A (en) * 2020-11-20 2021-06-22 云智慧(北京)科技有限公司 Multi-condition triggering automatic operation system and method based on automatic operation and maintenance
CN114189423A (en) * 2021-12-08 2022-03-15 兴业银行股份有限公司 Intelligent inquiry alarm system, method and medium with comprehensive compatibility and expansion

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
朱兵颖: "新安县提黄灌区信息化建设初探", 《陕西水利》 *
王等: "信息系统报警平台的研发", 《计算机光盘软件与应用》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116708157A (en) * 2023-08-07 2023-09-05 北京鹰速光电科技有限公司 Computer security operation and maintenance service system

Similar Documents

Publication Publication Date Title
US11178029B2 (en) Systems and methods of specifying service level criteria
US10812335B2 (en) Data insights for performance analytics
US8601323B2 (en) Advanced management of runtime errors
CN110278097B (en) Server operation and maintenance system and method based on Android system
US20060004830A1 (en) Agent-less systems, methods and computer program products for managing a plurality of remotely located data storage systems
US10970114B2 (en) Systems and methods for task scheduling
US20040088141A1 (en) Automatically identifying replacement times for limited lifetime components
US20030233366A1 (en) Database monitoring system with formatted report information delivery
US10169723B2 (en) Distributed policy distribution for compliance functionality
US9104706B2 (en) Meta-directory control and evaluation of events
US20050187940A1 (en) Systems, methods and computer program products for managing a plurality of remotely located data storage systems
US20160247246A1 (en) Plan visualization
CN111190794A (en) Operation and maintenance monitoring and management system
CN105183625A (en) Log data processing method and apparatus
CA2509859A1 (en) System and method for auditing a network
CN110888786A (en) Operation and maintenance monitoring system
CN115841310A (en) Construction method of plan flow model, event processing method and device
CN111260251A (en) Operation and maintenance service management platform and operation method thereof
CN114816943A (en) Enterprise intelligent cloud operation and maintenance system
US11615358B2 (en) Data insights for performance analytics
US8832715B2 (en) Limiting execution of event-responses with use of policies
US10171442B2 (en) Predicting a need for and creating temporary access to a computer component in infrastructure information technology
CN114862277A (en) Enterprise hybrid cloud management system
US20070260983A1 (en) Method for providing a summary of user activities
US20220382580A1 (en) System and method for digital automation governance

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20220729

RJ01 Rejection of invention patent application after publication