Disclosure of Invention
In view of the foregoing, it is desirable to provide a method, an apparatus, a computer device and a storage medium for managing a service, which can achieve collection efficiency.
A method of traffic management, the method comprising:
acquiring initial operation information of a management equipment system in a multi-thread multi-concurrency mode based on the operation state and the data scale of the management equipment system, and preprocessing the initial operation information to obtain available operation information of the management equipment system; the initial operation information comprises configuration information, performance information, logs and service information of each device in the management device system, and the preprocessing comprises at least one of data structure standardization processing, merging and pressing processing and filtering processing;
analyzing the available operation information received through a preset interface according to a preset analysis strategy to obtain analysis results of multiple dimensions of the management equipment system; wherein the analysis results for multiple dimensions comprise: at least one of a resource statistical report, a created performance inspection report, an alarm statistical report, a network topology view, equipment performance bottleneck information and fault information;
and outputting the analysis result to a client.
In one embodiment, the analyzing the available operation information received through the preset interface according to a preset analysis policy to obtain analysis results of multiple dimensions of the management device system includes:
receiving the available operation information through the preset interface;
creating a resource statistical report according to configuration information and performance information in the available operation information; the resource statistical form comprises hardware resources and relevant information of the hardware resources;
obtaining items and proportions of the configuration items according to the date and the matching conditions of different types of resource attributes;
and displaying the resource configuration information of each device in the management device system according to the items and the proportion of the configuration items.
In one embodiment, the analyzing the available operation information received through the preset interface according to a preset analysis policy to obtain analysis results of multiple dimensions of the management device system further includes:
acquiring the created performance inspection report according to the resource utilization rate of each monitored object obtained by the date and different conditions for the log and the service information in the available operation information;
obtaining a created performance inspection report form of each device in the management device system according to dates and different conditions, and obtaining the running state of each device in the management device system;
and if the running state is the fault state, creating an alarm statistical report based on the fault state.
In one embodiment, the method further comprises:
and sending the equipment information corresponding to the fault state to the client.
In one embodiment, the analyzing the available operation information received through the preset interface according to a preset analysis policy to obtain analysis results of multiple dimensions of the management device system further includes:
acquiring key node parameters of each device in the management device system and the connection relation of the key nodes;
generating a network topology view according to the key node parameters and the connection relation of the key nodes; wherein, the primitives in the network topology view comprise: the system comprises a server, network equipment, safety equipment, storage equipment, middleware and a database, wherein service software is deployed on equipment corresponding to primitives in the network topology view.
In one embodiment, the analyzing the available operation information received through the preset interface according to a preset analysis policy to obtain analysis results of multiple dimensions of the management device system includes:
generating equipment performance bottleneck information according to configuration information and performance information in the available operation information;
and generating risk early warning information aiming at the possible faults of the management equipment system based on the equipment performance bottleneck information.
In one embodiment, the method further comprises:
and sorting according to the risk early warning information from high to low according to a preset risk grade, and sending the sorted risk early warning information to the client.
A traffic management apparatus, the apparatus comprising:
the acquisition layer module is used for acquiring initial operation information of the management equipment system in a multi-thread multi-concurrency mode based on the operation state and the data scale of the management equipment system and preprocessing the operation information to obtain available operation information of the management equipment system; the initial operation information comprises configuration information, performance information, logs and service information of each device in the management device system, and the preprocessing comprises at least one of data structure standardization processing, merging and pressing processing and filtering processing;
the application layer module is used for analyzing the available operation information received through a preset interface according to a preset analysis strategy to obtain analysis results of multiple dimensions of the management equipment system and outputting the analysis results to a client; wherein the analysis results for multiple dimensions comprise: the resource statistics report form, the created performance inspection report form, the alarm statistics report form, the network topology view, the equipment performance bottleneck information and the fault information.
In one embodiment, the communication mode of the acquisition layer module and the application layer module is hot plug; the script of the acquisition layer module is in the form of at least one of shell, bat, python, go and powershell; the deployment mode of the acquisition layer module is any one of single machine deployment, cluster deployment or cross-network segment deployment; the acquisition layer module is a module supporting a micro-service architecture.
In one embodiment, the acquisition layer module is configured to interface with an http/web service interface, an ftp, an ssh, a database, or a message middleware, acquire initial operation information of a management device system in a multi-thread multi-concurrency manner based on an operation state and a data scale of the management device system, and preprocess the operation information to obtain available operation information of the management device system.
A computer device comprising a memory and a processor, the memory storing a computer program, the processor implementing the following steps when executing the computer program:
acquiring initial operation information of a management equipment system in a multi-thread multi-concurrency mode based on the operation state and the data scale of the management equipment system, and preprocessing the initial operation information to obtain available operation information of the management equipment system; the initial operation information comprises configuration information, performance information, logs and service information of each device in the management device system, and the preprocessing comprises at least one of data structure standardization processing, merging and pressing processing and filtering processing;
analyzing the available operation information received through a preset interface according to a preset analysis strategy to obtain analysis results of multiple dimensions of the management equipment system; wherein the analysis results for multiple dimensions comprise: at least one of a resource statistical report, a created performance inspection report, an alarm statistical report, a network topology view, equipment performance bottleneck information and fault information;
and outputting the analysis result to a client.
A computer-readable storage medium, on which a computer program is stored which, when executed by a processor, carries out the steps of:
acquiring initial operation information of a management equipment system in a multi-thread multi-concurrency mode based on the operation state and the data scale of the management equipment system, and preprocessing the initial operation information to obtain available operation information of the management equipment system; the initial operation information comprises configuration information, performance information, logs and service information of each device in the management device system, and the preprocessing comprises at least one of data structure standardization processing, merging and pressing processing and filtering processing;
analyzing the available operation information received through a preset interface according to a preset analysis strategy to obtain analysis results of multiple dimensions of the management equipment system; wherein the analysis results for multiple dimensions comprise: at least one of a resource statistical report, a created performance inspection report, an alarm statistical report, a network topology view, equipment performance bottleneck information and fault information;
and outputting the analysis result to a client.
According to the service management method, the service management device, the computer equipment and the storage medium, the management equipment system obtains the initial operation information of the management equipment system in a multi-thread multi-concurrency mode based on the operation state and the data scale of the management equipment system, preprocesses the initial operation information to obtain the available operation information of the management equipment system, analyzes the available operation information received through the preset interface according to a preset analysis strategy to obtain analysis results of multiple dimensions of the management equipment system, and then outputs the analysis results to the client. According to the method, the number of threads and the number of concurrencies are determined in a self-adaptive mode based on the running state and the data scale of the method, so that the obtained number of threads and the obtained number of concurrencies can be matched with the running state and the data scale of the management equipment system, the speed and the efficiency of obtaining initial running information by data can be greatly improved, and the efficiency and the resource utilization rate are further balanced. Meanwhile, initial operation information is preprocessed, so that storage and analysis of invalid data can be avoided, space is saved, data processing amount in the analysis process is reduced, and acquisition efficiency of analysis results is improved. In addition, the analysis result is sent to the corresponding client, and the client can perform corresponding processing according to the analysis result, so that the closed loop of service processing is realized, the service flow is perfected, the service management efficiency is improved, and the effectiveness and the integrity are improved.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
The service management method provided by the application can be applied to the application environment shown in fig. 1. Wherein, the acquisition server 102 deployed by the acquisition layer module communicates with the server 104 of the application layer module through the network. The collection server 102 and the server 104 deployed by the application layer module may be implemented by separate servers or a server cluster composed of a plurality of servers.
Those skilled in the art will appreciate that the architecture shown in fig. 1 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
The following describes the technical solutions of the present application and how to solve the above technical problems with specific examples. The following several specific embodiments may be combined with each other, and details of the same or similar concepts or processes may not be repeated in some embodiments. Embodiments of the present application will be described below with reference to the accompanying drawings.
It should be noted that the execution subject of the method embodiments described below may be a service management apparatus, and the apparatus may be implemented as part of or all of the computer device in a software, hardware, or a combination of software and hardware. The following method embodiments take the execution subject as an example of a management device system.
Fig. 2 is a flowchart illustrating a service management method according to an embodiment. The embodiment relates to a specific process for efficiently acquiring data and performing service management by a management equipment system according to the running state and the data scale of the management equipment system. As shown in fig. 2, includes:
s10, acquiring initial operation information of the management equipment system in a multi-thread multi-concurrency mode based on the operation state and the data scale of the management equipment system, and preprocessing the initial operation information to obtain available operation information of the management equipment system; the initial operation information includes configuration information, performance information, logs and service information of each device in the management device system, and the preprocessing includes at least one of data structure standardization processing, merging and suppressing processing and filtering processing.
Specifically, the management device system may obtain its own operating state and data scale, determine the number of threads and the number of concurrencies that match the own operating state and data scale, and obtain the initial operating information of the management device system in a multi-thread multi-concurrency manner based on the number of threads and the number of concurrencies. And then, managing the equipment system and preprocessing the acquired initial operation information to obtain the available operation information of the management equipment system. The initial operation information includes configuration information, performance information, logs and service information of each device in the management device system. Wherein the configuration information includes the preprocessing, and may include at least one of a data structure normalization processing, a merging throttle processing, and a filtering processing on the initial operation information. The data structure is unified by adopting the data structure standardization processing, so that the data structure is convenient to read, store, identify and call; the data can be classified according to categories and compressed for storage by adopting merging and pressing treatment, so that the storage space is saved; by adopting the filtering processing, useless information can be filtered out, only useful information is reserved, and the data volume is further reduced.
It should be noted that, the number of threads and the number of concurrencies that match the running state and the data size of the user are determined, which may be determined by using a correspondence between the running state and the data size that are set in advance and the number of threads and the number of concurrencies. Generally, the number of threads and the number of concurrencies can be smaller under the condition of good running state, and the number of threads and the number of concurrencies can be larger under the condition of bad running state; the number of threads and the number of concurrencies may be small when the data size is small, and the number of threads and the number of concurrencies may be large when the data size is large.
S20, analyzing the available operation information received through a preset interface according to a preset analysis strategy to obtain analysis results of multiple dimensions of the management equipment system; wherein the analysis results for multiple dimensions comprise: the resource statistics report form, the created performance inspection report form, the alarm statistics report form, the network topology view, the equipment performance bottleneck information and the fault information.
Specifically, the management device system may receive the available operation information through a preset interface according to a preset analysis policy, and analyze the available operation information, so as to obtain analysis results of multiple dimensions of the management device system. It should be noted that the available operation information may include one or more of configuration information, performance information, logs and service information, and the available operation information is analyzed based on a preset analysis policy, so that analysis results of multiple dimensions can be obtained. The analysis results of the multiple dimensions may include one or more of a resource statistics report, a created performance inspection report, an alarm statistics report, a network topology view, device performance bottleneck information, and information failure. It should be noted that the analysis policy may be a correspondence between different configuration information, performance information, logs, and service information and different analysis results of different dimensions. The corresponding relationship may be obtained empirically, or obtained by learning available operation information of the learning history and corresponding processing results, which is not limited in this embodiment.
And S30, outputting the analysis result to the client.
Specifically, after the management device system obtains the analysis result, the management device system may send the analysis result to the corresponding client according to the type or the authority of the client, so that the client performs corresponding processing according to the analysis result, thereby implementing a closed loop of service processing.
In this embodiment, the management device system obtains initial operation information of the management device system in a multi-thread and multi-concurrency manner based on its own operation state and data scale, preprocesses the initial operation information to obtain available operation information of the management device system, analyzes the available operation information received through the preset interface according to a preset analysis policy to obtain analysis results of multiple dimensions of the management device system, and then outputs the analysis results to the client. According to the method, the number of threads and the number of concurrencies are determined in a self-adaptive mode based on the running state and the data scale of the method, so that the obtained number of threads and the obtained number of concurrencies can be matched with the running state and the data scale of the management equipment system, the speed and the efficiency of obtaining initial running information by data can be greatly improved, and the efficiency and the resource utilization rate are further balanced. Meanwhile, initial operation information is preprocessed, so that storage and analysis of invalid data can be avoided, space is saved, data processing amount in the analysis process is reduced, and acquisition efficiency of analysis results is improved. In addition, the analysis result is sent to the corresponding client, and the client can perform corresponding processing according to the analysis result, so that the closed loop of service processing is realized, the service flow is perfected, the service management efficiency is improved, and the effectiveness and the integrity are improved.
Optionally, on the basis of the foregoing embodiment, a possible implementation manner of step S20 may be as shown in fig. 3, and includes:
s211, receiving the available operation information through the preset interface.
S212, creating a resource statistical report according to configuration information and performance information in the available operation information; the resource statistical form comprises hardware resources and relevant information of the hardware resources.
And S213, obtaining the items and the proportions of the configuration items according to the date and the matching conditions of the resource attributes of different types.
S214, displaying the resource configuration information of each device in the management device system according to the items and the proportion of the configuration items.
Specifically, the management device system can receive the available operation information through a preset interface, and it should be noted that the preset interface is a predefined interface, so that the modification of the policy in the initial operation information acquisition process does not affect the subsequent processing process, and the iteration is facilitated. The management equipment system can create a resource statistical report according to the configuration information and the performance information; the resource statistical report includes hardware resources and related information of the hardware resources, and optionally, the resource statistical report may count a CPU usage rate, a memory usage rate, a disk space usage rate, a network card bandwidth usage rate, and the like of the device corresponding to each IP according to the application system dimension; the relevant statistics of the hardware resources in the resource statistics report may include the types and corresponding quantities of the number of CPUs, the total memory, the number of hard disks, the motherboard, the slots, the power supplies, and the like according to the device type and the statistics. Then, the management device system obtains the items and the proportions of the configuration items according to the matching conditions of different dates and different types of resource attributes in the created resource statistics report, for example, the items and the proportions may be obtained according to the time of resource entry or automatic data acquisition, the device type (security device, network device, server, accessory, etc.), and statistics related capacity proportions, for example, statistics of the overall memory usage of each device in the statistics period.
On the basis of an embodiment, another possible implementation manner of step S20 may be as shown in fig. 4, and includes:
s221, acquiring the created performance inspection report according to the log and the service information in the available operation information and the resource utilization rate of each device obtained according to dates and different conditions.
S222, obtaining a created performance inspection report of each device in the management device system according to dates and different conditions, and obtaining the running state of each device in the management device system.
And S223, if the running state is the fault state, creating an alarm statistical report based on the fault state.
Specifically, the management device system obtains and creates a performance inspection report form by using the log and the service information in the pair of available operation information according to the resource utilization rate of each monitored object obtained by the date and different conditions. And meanwhile, a created performance inspection report of each device in the management device system is obtained according to the date and different conditions, and the running state of each device in the management device system is obtained. It should be noted that the operating state may be a curve or a set of data based on time. Alternatively, the operating conditions may include excellent, good, medium, poor, etc.; busy, normal and idle, etc. may also be included; fault conditions, normal conditions or wind and risk conditions may also be included. And then when the running state of the management equipment system is the fault state, creating an alarm statistical report based on the fault state to record the fault state or provide a data basis for future fault analysis.
In this embodiment, the management device system obtains a created performance inspection report for the log and the service information in the available operation information according to the resource utilization rate of each device obtained by the date and different conditions, obtains the created performance inspection report for each device in the management device system according to the date and different conditions, obtains the operation state of each device in the management device system, and creates an alarm statistical report based on the fault state when the operation state is the fault state, thereby implementing unified recording and management of the alarm information.
Optionally, on the basis of the foregoing embodiments, the method may further include sending device information corresponding to the fault state to the client. The management equipment system sends the equipment information corresponding to the fault state to the client, so that the client can conveniently pay attention to, process and track the fault equipment, the closed loop of a service management process is further improved, and the effectiveness of service management is improved.
On the basis of an embodiment, another possible implementation manner of step S20 may be as shown in fig. 5, and includes:
s231, obtaining key node parameters of each device in the management device system and the connection relation of the key nodes.
S232, generating a network topology view according to the key node parameters and the connection relation of the key nodes; wherein, the primitives in the network topology view comprise: the system comprises a server, network equipment, safety equipment, storage equipment, middleware and a database, wherein service software is deployed on equipment corresponding to primitives in the network topology view.
Specifically, the management device system can obtain the key node and key node parameters of each device, and the connection relationship between the key nodes. And the management equipment system can generate a network topology view according to the key node parameters and the connection relation of the key nodes. It should be noted that, the primitives in the network topology view include: the system comprises a server, network equipment, security equipment, storage equipment, middleware, a database and business software deployed on equipment corresponding to the primitives.
In this embodiment, the management device system obtains the key node parameters of each device in the management device system and the connection relationship between the key nodes, and generates a network topology view according to the key node parameters and the connection relationship between the key nodes, so that each device in the management device system can be visually displayed, and the device can be conveniently checked and troubleshoot faults of devices corresponding to different primitives.
On the basis of an embodiment, another possible implementation manner of step S20 may be as shown in fig. 6, and includes:
s241, generating equipment performance bottleneck information according to configuration information and performance information in the available operation information;
and S242, generating risk early warning information aiming at the possible faults of the management equipment system based on the equipment performance bottleneck information.
Specifically, the management device system generates device performance bottleneck information according to configuration information and performance information in the available operation information, and generates risk early warning information for a fault that may occur in the management device system based on the device performance bottleneck information. It should be noted that the device performance bottleneck information can represent critical information of whether each device in the management device system is normal or not, so that the management device system can generate risk early warning information for a possible fault of the management device system based on the device performance bottleneck information, and perform risk early warning on a device which has not failed, thereby predicting a risk and reducing a failure rate.
Optionally, the management device system sorts the risk early warning information according to preset risk levels from high to low, and sends the sorted risk early warning information to the client, so that the client can preferentially process the fault corresponding to the risk early warning information with a high risk level, and normal operation of the management device system is further ensured.
It should be understood that although the various steps in the flow charts of fig. 2-6 are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least some of the steps in fig. 2-6 may include multiple steps or multiple stages, which are not necessarily performed at the same time, but may be performed at different times, which are not necessarily performed in sequence, but may be performed in turn or alternately with other steps or at least some of the other steps.
In one embodiment, as shown in fig. 7, there is provided a traffic management apparatus, including:
the acquisition layer module 100 is configured to acquire initial operation information of a management device system in a multi-thread and multi-concurrency manner based on an operation state and a data scale of the management device system, and preprocess the operation information to obtain available operation information of the management device system; the initial operation information comprises configuration information, performance information, logs and service information of each device in the management device system, and the preprocessing comprises at least one of data structure standardization processing, merging and pressing processing and filtering processing;
the application layer module 200 is configured to analyze the available operation information received through a preset interface according to a preset analysis policy to obtain analysis results of multiple dimensions of the management device system, and output the analysis results to a client; wherein the analysis results for multiple dimensions comprise: the resource statistics report form, the created performance inspection report form, the alarm statistics report form, the network topology view, the equipment performance bottleneck information and the fault information.
In an embodiment, the application layer module 200 is specifically configured to receive the available operation information through the preset interface; creating a resource statistical report according to configuration information and performance information in the available operation information; the resource statistical form comprises hardware resources and relevant information of the hardware resources; obtaining items and proportions of the configuration items according to the date and the matching conditions of different types of resource attributes; and displaying the resource configuration information of each device in the management device system according to the items and the proportion of the configuration items.
In an embodiment, the application layer module 200 is specifically configured to obtain the created performance inspection report for the log and the service information in the available running information according to the date and the resource utilization rate of each monitored object obtained under different conditions; obtaining a created performance inspection report form of each device in the management device system according to dates and different conditions, and obtaining the running state of each device in the management device system; and if the running state is the fault state, creating an alarm statistical report based on the fault state.
In an embodiment, the application layer module 200 is specifically configured to send the device information corresponding to the fault state to the client.
In an embodiment, the application layer module 200 is specifically configured to obtain a key node parameter of each device in the management device system and a connection relationship between key nodes; generating a network topology view according to the key node parameters and the connection relation of the key nodes; wherein, the primitives in the network topology view comprise: the system comprises a server, network equipment, safety equipment, storage equipment, middleware and a database, wherein service software is deployed on equipment corresponding to primitives in the network topology view.
In an embodiment, the application layer module 200 is specifically configured to generate device performance bottleneck information according to configuration information and performance information in the available operation information; and generating risk early warning information aiming at the possible faults of the management equipment system based on the equipment performance bottleneck information.
In an embodiment, the application layer module 200 is further configured to sort the risk early warning information according to a preset risk level from high to low, and send the sorted risk early warning information to the client.
In one embodiment, the communication mode between the acquisition layer module 100 and the application layer module 200 is hot-plug, so that modification and iterative update can be flexibly performed according to the actual service situation, that is, the acquisition layer module is upgraded, and the whole management device system is not affected.
In one embodiment, the script of acquisition layer module 100 is implemented in the form of at least one of shell, bat, python, go, and powershell.
In one embodiment, the deployment mode of the acquisition layer module 100 is any one of standalone deployment, cluster deployment or cross-network segment deployment to adapt to the specific form of the managed system.
In one embodiment, the acquisition layer module 100 is a module that supports a microservice architecture.
In one embodiment, the acquisition layer module 100 is specifically configured to interface with an http/web service interface, an ftp, an ssh, a database, or a message middleware, acquire initial operation information of a management device system in a multi-thread multi-concurrency manner based on an operation state and a data scale of the management device system, and preprocess the operation information to obtain available operation information of the management device system.
In an embodiment, the service management apparatus may further include a historical trend analysis module, a resource margin analysis module, an alarm correlation analysis module, a configuration change analysis module, a topology change analysis module, an implementation performance analysis module, a data configuration module, a performance data module, a state data module, a log data module, and a service data module, as shown in fig. 8.
In one embodiment, the service management apparatus may further include a configuration change analysis module, a real-time performance analysis module, a historical trend analysis module, a resource margin analysis module, a topology change analysis module, an alarm correlation analysis module, and an analysis total control module, where the configuration change analysis module, the real-time performance analysis module, the historical trend analysis module, the resource margin analysis module, the topology change analysis module, and the alarm correlation analysis module are all based on an acquisition function of the IT integrated monitoring system, and receive configuration, performance, log, and service information of acquired IT basic resources; the configuration change analysis module receives configuration information of IT basic resources, inputs the configuration information into the analysis master control module for storage, realizes unified management on configuration information of IT software and hardware equipment, and quickly feeds back the configuration condition of an IT basic architecture to a client, and has the main functions of: resource type maintenance, resource attribute maintenance, resource operation life track management and resource information inquiry and statistics; the real-time performance analysis module receives the real-time performance of the IT basic resource in real time, expresses the change of the real-time performance in a chart in a line graph form and uploads the change to the analysis master control module in real time, the real-time performance analysis module comprises a hardware management module and a performance management module, the hardware management module is used for uniformly managing the information of the IT hardware equipment and feeding the working state of the hardware back to a client to display the hardware performance bottleneck and fault analysis, and the main functions of the real-time performance analysis module comprise: hardware performance query, server start-stop management and log query analysis; the performance management module manages the performance states of the IT software and hardware equipment in a unified way, and quickly feeds performance bottlenecks and fault analysis back to the client, and the main functions of the performance management module comprise: the method comprises the steps of resource performance query and performance topN query, wherein the performance topN query supports ranking display of a certain performance index of a resource according to the type of equipment and the dimension of a service system. The alarm correlation analysis module is a centralized platform for uniformly triggering, notifying, displaying, processing and inquiring alarm information, provides real-time alarm for users, and has the main functions of: alarm level maintenance, alarm type maintenance, alarm behavior maintenance, alarm rule maintenance, alarm information processing, query, statistics and display; the alarm correlation analysis module uploads alarm data to the analysis master control module for storage; the analysis master control module comprises a file storage module, a distributed computation framework and a search engine, wherein the file storage module stores real-time data of the configuration change analysis module and the real-time performance analysis module and stamps a timestamp, the search engine is accessed into the file storage module to provide a data search function, the distributed computation framework provides Spark/R high-speed computation, and the analysis master control module is provided with a parallelized high-performance statistical algorithm library, supports a parallelized statistical algorithm and a basic algorithm library, supports a parallelized basic algorithm, and supports processing of a large data set.
The historical trend analysis module, the resource margin analysis module and the topology change analysis module are all arranged in a distributed computing framework,
the historical trend analysis module and the resource margin analysis module analyze data in the file storage module, create a resource statistical report, obtain the summary of the number and proportion of configuration items according to the date and different resource attribute matching conditions, and display the resource configuration conditions of various hardware, software and applications in the IT system;
creating a performance inspection report, obtaining the resource utilization condition of each monitored object according to dates and different conditions, and inspecting the operation condition of the IT resources in real time; meanwhile, an alarm statistical report is created, the functions of inquiring, counting and analyzing the current alarm and the historical alarm are realized, and according to the alarm detail and the statistical report of different time granularities of day, week and month,
the topology change analysis module reflects the function and logic association relation information among the service system hardware equipment, the basic software and the service software function module in a graphical mode, and the primitives comprise: the system comprises a server, network equipment, safety equipment, storage equipment, middleware, a database and service software, wherein the network structure of a non-monitoring area is represented by adopting network cloud, and the running information and the alarm information of equipment nodes are displayed by reflecting the relation between software and hardware function connection and logic association by using the connection lines among primitives.
The method comprises the steps of configuring a change analysis module, a real-time performance analysis module and an alarm correlation analysis module to realize unified monitoring and management of the equipment state, the performance and the service state of a service system, creating a resource statistical report and a performance inspection report through a historical trend analysis module, a resource margin analysis module and a topology change analysis module, reflecting the function and logic correlation relation information among hardware equipment, basic software and service software of the service system in a graphical mode, realizing the functions of operation information data mining, efficiency evaluation, reliability analysis, risk management, auxiliary decision making and the like on the basis, guaranteeing safe and stable operation of the service, and effectively improving the operation and maintenance level and efficiency of an operation and maintenance organization.
The acquisition server can deploy a module of an acquisition layer, is responsible for data acquisition work, and performs configuration, such as configuration information of a central processing unit, configuration information of a storage space and the like, and has the following performances: such as central processing unit utilization rate, disk space utilization rate and the like, logs: such as logs of service applications, and service information, such as service flow of the OA system, for collection and standardized processing, and may further include operations of merging and suppressing log-like information, filtering, preprocessing, and the like. The acquisition layer can adopt protocols such as SNMP, TCP, Syslog, SSH, WMI and the like to ensure the safety of data in the operation and construction processes of the system. The application server is used for receiving the data sent by the acquisition server, and analyzing the data to obtain an analysis result. The database server can be responsible for uniformly deploying the unified information base and the data service bus, and is responsible for data storage and communication with the middleware. The database server can also improve the data transmission efficiency through the service bus. The unified information base is responsible for storing monitoring information, management information, analysis result information and basic parameters required by the system of the system and providing data support for the business health supervision system, and the monitoring information comprises configuration information, performance information, state information and alarm information. For example, when a disk fails, if there may be a large probability of failure in the service line of the system to which the corresponding server belongs, an alarm is given. The management information comprises user information and equipment management information, and the analysis result mainly comprises analysis report information of days, weeks, months, seasons and years which are generated regularly; the data service bus means that the monitoring system adopts the data service bus to realize data communication among the modules and provide data to an external system. The data service bus follows an SOA architecture system, supports various operating system platforms and various technical protocols, has the extensible characteristic, provides functions of a page service WebService and a database adapter, and has the function of adapter extension. The application server deploys a module of a processing layer, which is responsible for data analysis and processing work, and the processing layer completes the data analysis work and mainly comprises: configuration change analysis, real-time performance analysis, historical trend analysis, resource margin analysis, topology change analysis and alarm correlation analysis; the front-end server (Web server) is responsible for issuing and managing application and man-machine interaction work with users, the Web server comprises a display layer, the display layer provides a man-machine interaction interface of a Web browsing mode for the users, the users visually look up and maintain related asset management information, performance management information, alarm management information, service management information, network management information and comprehensive analysis type reports through various graphical components provided by a system, the setting work of equipment monitoring strategies and user authorities is realized through the system maintenance interface, the asset management information realizes unified management of IT soft and hardware equipment configuration information and is used for the users to know the configuration condition of an IT basic architecture; the performance management information realizes the unified management of the performance states of the IT software and hardware equipment and is used for discovering performance bottlenecks and analyzing faults by users; the alarm management information is a centralized platform for uniformly triggering, notifying, displaying, processing and inquiring alarm information and is used for a user to find and process an alarm; the business management information reflects the logic association relation between the IT basic framework of the business system and the business in a graphical mode, and shows departments influenced by business faults from the perspective of the business; the network management information reflects physical and logical association relations among network nodes in a graphical mode, and provides a network fault positioning analysis environment for a user from the view point of a network; the comprehensive analysis type report supports a user to define a daily operation and maintenance report, report statistics is carried out according to a day, a week, a month, a season, a year or a defined time interval, the generation time, the content, the format and the printing time of the defined report are defined, image-text mixed arrangement is supported, the report supports a curve, a bar graph and a pie graph display form, and report file export is supported: HTML, Excel, PDF and printing function, the system maintenance interface comprises: monitoring strategy setting, user authority setting, operation log query and function menu maintenance. Based on the above, the system monitoring of the whole process can be realized, and the safe operation of the managed system and the effective solution of the fault problem are ensured.
For the specific definition of the service management device, reference may be made to the above definition of the service management method, which is not described herein again. The modules in the service management device can be implemented wholly or partially by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
In one embodiment, a computer device is provided, which may be a server, and its internal structure diagram may be as shown in fig. 9. The computer device includes a processor, a memory, and a network interface connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, a computer program, and a database. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The database of the computer device is used to store available operational information. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to implement a business management method.
Those skilled in the art will appreciate that the architecture shown in fig. 9 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
In one embodiment, a computer device is provided, comprising a memory and a processor, the memory having a computer program stored therein, the processor implementing the following steps when executing the computer program:
acquiring initial operation information of a management equipment system in a multi-thread multi-concurrency mode based on the operation state and the data scale of the management equipment system, and preprocessing the initial operation information to obtain available operation information of the management equipment system; the initial operation information comprises configuration information, performance information, logs and service information of each device in the management device system, and the preprocessing comprises at least one of data structure standardization processing, merging and pressing processing and filtering processing;
analyzing the available operation information received through a preset interface according to a preset analysis strategy to obtain analysis results of multiple dimensions of the management equipment system; wherein the analysis results for multiple dimensions comprise: at least one of a resource statistical report, a created performance inspection report, an alarm statistical report, a network topology view, equipment performance bottleneck information and fault information;
and outputting the analysis result to a client.
In one embodiment, the processor, when executing the computer program, further performs the steps of:
receiving the available operation information through the preset interface;
creating a resource statistical report according to configuration information and performance information in the available operation information; the resource statistical form comprises hardware resources and relevant information of the hardware resources;
obtaining items and proportions of the configuration items according to the date and the matching conditions of different types of resource attributes;
and displaying the resource configuration information of each device in the management device system according to the items and the proportion of the configuration items.
In one embodiment, the processor, when executing the computer program, further performs the steps of:
acquiring the created performance inspection report according to the resource utilization rate of each monitored object obtained by the date and different conditions for the log and the service information in the available operation information;
obtaining a created performance inspection report form of each device in the management device system according to dates and different conditions, and obtaining the running state of each device in the management device system;
and if the running state is the fault state, creating an alarm statistical report based on the fault state.
In one embodiment, the processor, when executing the computer program, further performs the steps of:
and sending the equipment information corresponding to the fault state to the client.
In one embodiment, the processor, when executing the computer program, further performs the steps of:
acquiring key node parameters of each device in the management device system and the connection relation of the key nodes;
generating a network topology view according to the key node parameters and the connection relation of the key nodes; wherein, the primitives in the network topology view comprise: the system comprises a server, network equipment, safety equipment, storage equipment, middleware and a database, wherein service software is deployed on equipment corresponding to primitives in the network topology view.
In one embodiment, the processor, when executing the computer program, further performs the steps of:
generating equipment performance bottleneck information according to configuration information and performance information in the available operation information;
and generating risk early warning information aiming at the possible faults of the management equipment system based on the equipment performance bottleneck information.
In one embodiment, the processor, when executing the computer program, further performs the steps of:
and sorting according to the risk early warning information from high to low according to a preset risk grade, and sending the sorted risk early warning information to the client.
It should be clear that, in the embodiments of the present application, the process of executing the computer program by the processor is consistent with the process of executing the steps in the above method, and specific reference may be made to the description above.
In one embodiment, a computer-readable storage medium is provided, having a computer program stored thereon, which when executed by a processor, performs the steps of:
acquiring initial operation information of a management equipment system in a multi-thread multi-concurrency mode based on the operation state and the data scale of the management equipment system, and preprocessing the initial operation information to obtain available operation information of the management equipment system; the initial operation information comprises configuration information, performance information, logs and service information of each device in the management device system, and the preprocessing comprises at least one of data structure standardization processing, merging and pressing processing and filtering processing;
analyzing the available operation information received through a preset interface according to a preset analysis strategy to obtain analysis results of multiple dimensions of the management equipment system; wherein the analysis results for multiple dimensions comprise: at least one of a resource statistical report, a created performance inspection report, an alarm statistical report, a network topology view, equipment performance bottleneck information and fault information;
and outputting the analysis result to a client.
In one embodiment, the computer program when executed by the processor further performs the steps of:
receiving the available operation information through the preset interface;
creating a resource statistical report according to configuration information and performance information in the available operation information; the resource statistical form comprises hardware resources and relevant information of the hardware resources;
obtaining items and proportions of the configuration items according to the date and the matching conditions of different types of resource attributes;
and displaying the resource configuration information of each device in the management device system according to the items and the proportion of the configuration items.
In one embodiment, the computer program when executed by the processor further performs the steps of:
acquiring the created performance inspection report according to the resource utilization rate of each monitored object obtained by the date and different conditions for the log and the service information in the available operation information;
obtaining a created performance inspection report form of each device in the management device system according to dates and different conditions, and obtaining the running state of each device in the management device system;
and if the running state is the fault state, creating an alarm statistical report based on the fault state.
In one embodiment, the computer program when executed by the processor further performs the steps of:
and sending the equipment information corresponding to the fault state to the client.
In one embodiment, the computer program when executed by the processor further performs the steps of:
acquiring key node parameters of each device in the management device system and the connection relation of the key nodes;
generating a network topology view according to the key node parameters and the connection relation of the key nodes; wherein, the primitives in the network topology view comprise: the system comprises a server, network equipment, safety equipment, storage equipment, middleware and a database, wherein service software is deployed on equipment corresponding to primitives in the network topology view.
In one embodiment, the computer program when executed by the processor further performs the steps of:
generating equipment performance bottleneck information according to configuration information and performance information in the available operation information;
and generating risk early warning information aiming at the possible faults of the management equipment system based on the equipment performance bottleneck information.
In one embodiment, the computer program when executed by the processor further performs the steps of:
and sorting according to the risk early warning information from high to low according to a preset risk grade, and sending the sorted risk early warning information to the client.
It should be clear that, in the embodiments of the present application, the process executed by the processor by the computer program is consistent with the execution process of each step in the above method, and specific reference may be made to the description above.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database or other medium used in the embodiments provided herein can include at least one of non-volatile and volatile memory. Non-volatile Memory may include Read-Only Memory (ROM), magnetic tape, floppy disk, flash Memory, optical storage, or the like. Volatile Memory can include Random Access Memory (RAM) or external cache Memory. By way of illustration and not limitation, RAM can take many forms, such as Static Random Access Memory (SRAM) or Dynamic Random Access Memory (DRAM), among others.
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.