CN111884853A - Cloud environment automatic resource management method and system - Google Patents

Publication number
CN111884853A
Authority
CN
China
Prior art keywords
resource
model
cloud environment
performance
alarm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010741682.7A
Other languages
Chinese (zh)
Inventor
郭巧枝
颜亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Cloud Information Technology Co Ltd
Original Assignee
Inspur Cloud Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Cloud Information Technology Co Ltd
Priority to CN202010741682.7A
Publication of CN111884853A
Legal status: Pending

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00: Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/04: Network management architectures or arrangements
    • H04L41/06: Management of faults, events, alarms or notifications
    • H04L41/0631: Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H04L41/0677: Localisation of faults
    • H04L41/08: Configuration management of networks or network elements
    • H04L41/0876: Aspects of the degree of configuration automation
    • H04L41/0886: Fully automatic configuration
    • H04L41/12: Discovery or management of network topologies
    • H04L41/14: Network analysis or design
    • H04L41/145: Network analysis or design involving simulating, designing, planning or modelling of a network
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00: Administration; Management
    • G06Q10/10: Office automation; Time management
    • G06Q10/103: Workflow collaboration or project management

Abstract

The invention discloses an automatic resource management method and system for a cloud environment, belonging to the technical field of cloud environments. In an engine-driven manner, the method automatically acquires resources, resource relationships and resource usage data in the cloud environment and displays them on visual pages; the engines include a resource engine, a performance engine and an alarm engine. The system comprises an engine driving part, an information management part, a visual display part and an intelligent analysis part. The engine driving part comprises a resource engine module, a performance engine module and an alarm engine module; correspondingly, the information management part comprises a resource attribute information management module, a resource performance information management module and a resource alarm information management module; the visual display part comprises a topology alarm module and a report module. The invention enables operation and maintenance personnel in a cloud center to grasp resource usage and possible optimization directions more intuitively, thereby greatly improving operation and maintenance efficiency.

Description

Cloud environment automatic resource management method and system
Technical Field
The invention relates to the technical field of cloud environments, and in particular to an automatic resource management method and system for a cloud environment.
Background
In cloud center management, the large number of cloud center resources leads to repetitive, routine operations such as fault location and analysis. Most traditional management systems manage basic resource data in a non-automatic or semi-automatic way and do not draw clear collaboration boundaries among basic data, performance, alarms and reports. As a result, the completeness and accuracy of the basic data cannot be guaranteed, usage information such as performance, alarms and reports cannot be viewed directly or indirectly, and directions for optimizing the use of cloud center resources cannot be obtained directly, which greatly hinders the management work of cloud center operation and maintenance personnel.
Disclosure of Invention
The technical task of the invention is to address the above defects by providing an automatic resource management method and system for a cloud environment, so that operation and maintenance personnel in a cloud center can grasp resource usage and possible optimization directions more intuitively, greatly improving operation and maintenance efficiency.
The technical solution adopted by the invention to solve the technical problem is as follows:
An automatic resource management method for a cloud environment automatically acquires resources, resource relationships and resource usage data in the cloud environment, and displays them on visual pages, in an engine-driven manner;
the engines include a resource engine, a performance engine and an alarm engine.
In this method, the engines automatically acquire basic resource data, performance data and alarm data respectively. Resource, performance and alarm models are configured and generated according to the engine specifications, model instance data can be acquired automatically in the cloud environment through technologies such as QGA, IPMI, LLDP and SNMP, and resources and their usage are displayed in visual forms such as topologies and reports, thereby meeting the management requirements of operation and maintenance personnel.
Preferably, the resource engine, the performance engine and the alarm engine are configured to generate their respective business models; model instance data are acquired in the cloud environment through technologies such as QGA, IPMI, LLDP and SNMP; alarm data are generated according to configured rules; the data are then displayed according to actual requirements; and finally, intelligent analysis collects and comprehensively analyzes the management experience of operation and maintenance personnel to form automated operation and maintenance optimization suggestions.
Intelligent analysis comprehensively analyzes the resources, summarizes the operation and maintenance experience accumulated in the cloud environment, and automatically generates resource usage suggestions. This helps operation and maintenance personnel carry out daily resource management efficiently, solves the problem of repetitive, routine location and analysis work caused by the large number of resources in a cloud center, and helps maximize the value of the personnel's experience.
Specifically, the resource engine defines the resource models and resource model relationships to be managed in the cloud environment, including model classification, model definition, model attributes, model relationship classification, model relationship definition, model relationship attributes, attribute classification, and automatic configuration of resource model and resource model relationship instance data;
the performance engine defines the performance models of the resources to be managed in the cloud environment, including model definition, model attributes, and automatic configuration of performance model instance data;
the alarm engine defines the alarm models of the resources to be managed in the cloud environment, including model definition, model attributes, and automatic configuration of performance model instance data.
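As a rough illustration of how the three engines' model definitions might be represented, the following Python sketch uses plain data classes. The class names, field names and sample values are all assumptions made for illustration; the patent does not specify a schema.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class ModelAttribute:
    name: str
    category: str         # attribute classification (hypothetical values)
    auto_collected: bool  # True if an engine is expected to fill it in

@dataclass
class ResourceModel:
    name: str
    category: str         # model classification, e.g. "hardware" or "virtual"
    attributes: List[ModelAttribute] = field(default_factory=list)

@dataclass
class RelationModel:
    name: str
    category: str         # model relationship classification
    source: str           # name of the resource model on the "from" side
    target: str           # name of the resource model on the "to" side

@dataclass
class PerformanceModel:
    resource_model: str   # the resource model this performance model describes
    metrics: List[str]

@dataclass
class AlarmModel:
    resource_model: str
    metric: str
    threshold: float
    level: str

# Hypothetical sample definitions.
host = ResourceModel("physical_host", "hardware",
                     [ModelAttribute("serial_number", "basic", True)])
vm = ResourceModel("virtual_machine", "virtual",
                   [ModelAttribute("owner", "basic", False)])
runs_on = RelationModel("runs_on", "hosting", source=vm.name, target=host.name)
vm_perf = PerformanceModel(vm.name, ["cpu_usage", "memory_usage"])
vm_alarm = AlarmModel(vm.name, "cpu_usage", threshold=90.0, level="critical")
```

Keeping relationships as their own model type, rather than as attributes of a resource, mirrors the text's separate "model relationship" classification, definition and attributes.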
Furthermore, resource attribute information management defines the resource instances and resource relationship instances to be managed in the cloud environment, including adding, modifying, deleting and basic querying of instances;
resource performance information management defines the performance data of the resource instances to be managed in the cloud environment, including viewing indicators in multiple forms such as lists, pie charts, bar charts and line charts, with the data acquired automatically by the performance engine;
resource alarm information management defines the alarm data of the resource instances to be managed in the cloud environment, delivered through multiple channels such as the management system, email, WeChat and telephone; performance indicator data are filtered in real time or periodically according to configured alarm rules (thresholds, levels and the like), and key resource and performance information is extracted to form alarm content for viewing.
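The rule-based filtering of performance indicator data described above could look roughly like the following sketch. The rule fields, the metric name and the "highest matching threshold wins" policy are assumptions for illustration, not details taken from the patent.

```python
from dataclasses import dataclass

@dataclass
class AlarmRule:
    metric: str
    threshold: float
    level: str

@dataclass
class Sample:
    resource_id: str
    metric: str
    value: float

def evaluate(samples, rules):
    """Return one alarm per sample, using the highest threshold the sample reaches."""
    alarms = []
    for s in samples:
        matching = [r for r in rules
                    if r.metric == s.metric and s.value >= r.threshold]
        if matching:
            rule = max(matching, key=lambda r: r.threshold)
            # Extract the key resource and performance information as alarm content.
            alarms.append({"resource_id": s.resource_id, "metric": s.metric,
                           "value": s.value, "level": rule.level})
    return alarms

# Hypothetical rules and samples.
rules = [AlarmRule("cpu_usage", 90.0, "critical"),
         AlarmRule("cpu_usage", 75.0, "warning")]
samples = [Sample("vm-1", "cpu_usage", 95.0),
           Sample("vm-2", "cpu_usage", 80.0),
           Sample("vm-3", "cpu_usage", 40.0)]
alarms = evaluate(samples, rules)
```

The same function could be called on each incoming sample for real-time filtering or on a batch for periodic filtering; delivery through a management system, email, WeChat or telephone is left out of the sketch.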
Furthermore, the topology alarm function defines resource usage overviews for different scenarios in the cloud environment, including the resources, resource relationships and alarm impact in application scenarios such as the networking topology from a whole-network perspective, the tenant topology from a tenant perspective, and the virtualization topology of the platform layer; topology resource nodes support drilling up and drilling down to view more information in the different scenarios.
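One way to work out the alarm impact on a topology is a breadth-first traversal over resource relationships. The sketch below assumes a simple "child depends on parent" edge list with made-up resource IDs; the patent does not describe the traversal it uses.

```python
from collections import deque

# Hypothetical dependency edges: (dependent resource, resource it relies on).
edges = [("vm-1", "host-1"), ("vm-2", "host-1"),
         ("app-1", "vm-1"), ("vm-3", "host-2")]

def build_dependents(edges):
    """Map each resource to the resources that depend on it directly."""
    dependents = {}
    for child, parent in edges:
        dependents.setdefault(parent, []).append(child)
    return dependents

def impacted(resource_id, dependents):
    """Return (direct, indirect) sets of resources affected by an alarm on resource_id."""
    direct = set(dependents.get(resource_id, []))
    indirect = set()
    queue = deque(direct)
    while queue:
        node = queue.popleft()
        for nxt in dependents.get(node, []):
            if nxt != resource_id and nxt not in direct and nxt not in indirect:
                indirect.add(nxt)
                queue.append(nxt)
    return direct, indirect

direct, indirect = impacted("host-1", build_dependents(edges))
```

Separating the direct and indirect sets lets a topology view highlight the first ring of affected nodes differently from nodes reached only by drilling down further.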
Further, the report function defines report data, from different angles, for the resources to be managed in the cloud environment, including common fixed reports and customized reports for individual requirements.
Further, comprehensive resource usage analysis defines suggested optimization measures for resource management by operation and maintenance personnel in the cloud environment, including comprehensive data analysis results for performance monitoring and tracking during resource operation, fault location, analysis and handling, and resource consolidation, optimization and adjustment.
The invention also claims an automatic resource management system for a cloud environment, comprising an engine driving part, an information management part, a visual display part and an intelligent analysis part, wherein the engine driving part comprises a resource engine module, a performance engine module and an alarm engine module; correspondingly, the information management part comprises a resource attribute information management module, a resource performance information management module and a resource alarm information management module;
the visual display part comprises a topology alarm module and a report module;
the intelligent analysis part comprises a resource usage comprehensive analysis module;
the resource engine module, the performance engine module and the alarm engine module are used to configure and generate their respective business models; the system acquires model instance data and generates alarm data according to configured rules; the data are then displayed through the topology alarm module and the report module according to actual requirements; and finally, the resource usage comprehensive analysis module collects and comprehensively analyzes the management experience of operation and maintenance personnel to form automated operation and maintenance optimization suggestions.
In this system, the engines automatically acquire basic resource data, performance data and alarm data respectively. Resource, performance and alarm models are configured and generated according to the engine specifications, model instance data can be acquired automatically in the cloud environment through technologies such as QGA, IPMI, LLDP and SNMP, and resources and their usage are displayed in visual forms such as topologies and reports, thereby meeting the management requirements of operation and maintenance personnel;
meanwhile, intelligent analysis comprehensively analyzes the resources, summarizes the operation and maintenance experience accumulated in the cloud environment, and automatically generates resource usage suggestions, which helps operation and maintenance personnel carry out daily resource management efficiently, solves the problem of repetitive, routine location and analysis work caused by the large number of resources in a cloud center, and helps maximize the value of the personnel's experience.
Preferably, the resource engine module is used to define the resource models and resource model relationships to be managed in the cloud environment, including model classification, model definition, model attributes, model relationship classification, model relationship definition, model relationship attributes, attribute classification, and automatic configuration of resource model and resource model relationship instance data;
the performance engine module is used to define the performance models of the resources to be managed in the cloud environment, including model definition, model attributes, and automatic configuration of performance model instance data;
the alarm engine module is used to define the alarm models of the resources to be managed in the cloud environment, including model definition, model attributes, and automatic configuration of performance model instance data;
the resource attribute information management module is used to define the resource instances and resource relationship instances to be managed in the cloud environment, including adding, modifying, deleting and basic querying of instances;
the resource performance information management module is used to define the performance data of the resource instances to be managed in the cloud environment, including viewing indicators in multiple forms such as lists, pie charts, bar charts and line charts, with the data acquired automatically by the performance engine;
the resource alarm information management module is used to define the alarm data of the resource instances to be managed in the cloud environment, delivered through multiple channels such as the management system, email, WeChat and telephone; performance indicator data are filtered in real time or periodically according to configured alarm rules (thresholds, levels and the like), and key resource and performance information is extracted to form alarm content for viewing;
the topology alarm module is used to define resource usage overviews for different scenarios in the cloud environment, including the resources, resource relationships and alarm impact in application scenarios such as the networking topology from a whole-network perspective, the tenant topology from a tenant perspective, and the virtualization topology of the platform layer; topology resource nodes support drilling up and drilling down to view more information in the different scenarios;
the report module is used to define report data, from different angles, for the resources to be managed in the cloud environment, including common fixed reports and customized reports for individual requirements;
the resource usage comprehensive analysis module is used to define suggested optimization measures for resource management by operation and maintenance personnel in the cloud environment, including comprehensive data analysis for performance monitoring and tracking during resource operation, fault location, analysis and handling, and resource consolidation, optimization and adjustment.
Through engine configuration, the system automatically acquires and updates basic resource information in the cloud environment using technologies such as QGA, IPMI, LLDP and SNMP;
through engine configuration, it automatically generates performance indicator information from the basic resource information using technologies such as QGA, IPMI, LLDP and SNMP;
and through engine configuration, it automatically generates alarm notifications through multiple channels such as the management system, email, WeChat and telephone, based on the basic resource information and the performance indicator information.
The comprehensive resource usage analysis of the system outputs suggested resource optimization and management measures, which are intelligently analyzed, summarized and refined on the basis of basic information, performance, alarms and the management experience of operation and maintenance personnel.
The invention also claims a computer readable medium having stored thereon computer instructions which, when executed by a processor, cause the processor to perform the above-described method.
Compared with the prior art, the automatic resource management method and system for a cloud environment have the following beneficial effects:
the method or system acquires cloud environment resources and resource relationships in an automatically updated manner, so that operation and maintenance personnel can grasp the complete and accurate state of resources in the cloud center more intuitively;
driven by the engines, resource changes trigger the generation of performance indicators and alarm notifications, so that operation and maintenance personnel can grasp the overall operating state of the resources in a more timely manner;
the method or system combines alarms with topology: by showing alarm counts in the topology graph in real time, it lets operation and maintenance personnel see more intuitively which resources are directly or indirectly affected by an alarm in the whole-network, tenant and virtualization scenarios;
by comprehensively analyzing the day-to-day resource management experience of operation and maintenance personnel, optimized management suggestions can be output intelligently, improving their daily work efficiency.
Drawings
Fig. 1 is an architecture diagram of a cloud environment automated resource management method according to an embodiment of the present invention.
Detailed Description
The invention is further described below with reference to the drawing and specific embodiments.
Operation and maintenance personnel of a cloud center want to see at a glance which hardware and virtual resources exist in the cloud center and how they are related, as well as the performance overview, capacity expansion needs, migration needs and the like of running resources. They also want to see whether alarms have been raised on resources and which directly or indirectly related resources those alarms affect. After accumulating a body of experience, they further hope to view optimization suggestions for the use of cloud center resources.
An embodiment of the invention provides an automatic resource management method for a cloud environment, which automatically acquires resources, resource relationships and resource usage data in the cloud environment, and displays them on visual pages, in an engine-driven manner.
The engines include a resource engine, a performance engine and an alarm engine. The three engines are configured to generate their respective business models; model instance data are acquired in the cloud environment through technologies such as QGA, IPMI, LLDP and SNMP; alarm data are generated according to configured rules; the data are then displayed according to actual requirements; and finally, intelligent analysis collects and comprehensively analyzes the management experience of operation and maintenance personnel to form automated operation and maintenance optimization suggestions.
In this method, the engines automatically acquire basic resource data, performance data and alarm data respectively. Resource, performance and alarm models are configured and generated according to the engine specifications, model instance data can be acquired automatically in the cloud environment through technologies such as QGA, IPMI, LLDP and SNMP, and resources and their usage are displayed in visual forms such as topologies and reports, thereby meeting the management requirements of operation and maintenance personnel.
Intelligent analysis comprehensively analyzes the resources, summarizes the operation and maintenance experience accumulated in the cloud environment, and automatically generates resource usage suggestions. This helps operation and maintenance personnel carry out daily resource management efficiently, solves the problem of repetitive, routine location and analysis work caused by the large number of resources in a cloud center, and helps maximize the value of the personnel's experience.
The resource engine is mainly used to define the resource models and resource model relationships to be managed in the cloud environment, including model classification, model definition, model attributes, model relationship classification, model relationship definition, model relationship attributes, attribute classification, and automatic configuration of resource model and resource model relationship instance data. The automatic configuration of resource model and resource model relationship instance data ensures that instance data are acquired automatically; technologies such as QGA, IPMI, LLDP and SNMP are used to ensure the uniformity, completeness and accuracy of the basic data in the cloud environment.
The performance engine is mainly used to define the performance models of the resources to be managed in the cloud environment, including model definition, model attributes, and automatic configuration of performance model instance data. Performance models and resource models are in a 1:1 relationship, and instance data are acquired automatically using technologies such as QGA, IPMI, LLDP and SNMP.
The alarm engine is mainly used to define the alarm models of the resources to be managed in the cloud environment, including model definition, model attributes, and automatic configuration of performance model instance data. The alarm data are derived from the raw resource and performance data.
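The 1:1 pairing of performance models with resource models, together with protocol-based collection, can be sketched as a small registry. The collector functions below are stubs that return made-up values; they stand in for real QGA or SNMP calls, which the patent does not detail, and every name here is hypothetical.

```python
from typing import Callable, Dict

def collect_via_snmp(target: str) -> Dict[str, float]:
    # Stub: a real collector would query the device over SNMP.
    return {"if_in_octets": 1024.0}

def collect_via_qga(target: str) -> Dict[str, float]:
    # Stub: a real collector would talk to the guest agent inside the VM.
    return {"cpu_usage": 42.0}

# Collectors keyed by protocol name.
COLLECTORS: Dict[str, Callable[[str], Dict[str, float]]] = {
    "snmp": collect_via_snmp,
    "qga": collect_via_qga,
}

# Exactly one performance model per resource model (the 1:1 relationship).
PERFORMANCE_MODELS = {
    "switch": {"protocol": "snmp", "metrics": ["if_in_octets"]},
    "virtual_machine": {"protocol": "qga", "metrics": ["cpu_usage"]},
}

def collect(resource_model: str, target: str) -> Dict[str, float]:
    """Collect only the metrics declared by the resource model's performance model."""
    model = PERFORMANCE_MODELS[resource_model]
    raw = COLLECTORS[model["protocol"]](target)
    return {m: raw[m] for m in model["metrics"] if m in raw}

vm_metrics = collect("virtual_machine", "vm-1")
```

Keying the performance model by resource model name makes the 1:1 relationship structural: adding a resource model without a matching entry fails at lookup time instead of silently collecting nothing.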
Resource attribute information management defines the resource instances and resource relationship instances to be managed in the cloud environment, including adding, modifying, deleting and basic querying of instances; for data that the engine configuration updates automatically, no other way of modifying the data is allowed.
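Reading the restriction above as "attributes that an engine updates automatically cannot be edited by hand", instance management might be sketched as follows. This reading, the class name and the attribute names are assumptions; the patent gives no implementation.

```python
class EngineManagedError(Exception):
    """Raised when a manual edit targets an attribute the engine updates automatically."""

class ResourceStore:
    def __init__(self, engine_managed):
        self._engine_managed = set(engine_managed)
        self._instances = {}

    def add(self, resource_id, attrs):
        self._instances[resource_id] = dict(attrs)

    def modify(self, resource_id, attrs, from_engine=False):
        # Manual callers may not touch engine-managed attributes.
        locked = self._engine_managed & set(attrs)
        if locked and not from_engine:
            raise EngineManagedError(f"engine-managed attributes: {sorted(locked)}")
        self._instances[resource_id].update(attrs)

    def delete(self, resource_id):
        del self._instances[resource_id]

    def query(self, **filters):
        """Return IDs of instances whose attributes match every filter."""
        return [rid for rid, a in self._instances.items()
                if all(a.get(k) == v for k, v in filters.items())]

store = ResourceStore(engine_managed=["ip_address"])
store.add("vm-1", {"owner": "ops", "ip_address": "10.0.0.5"})
store.modify("vm-1", {"owner": "team-a"})  # manual edit of a free attribute
try:
    store.modify("vm-1", {"ip_address": "10.0.0.9"})  # manual edit is rejected
    rejected = False
except EngineManagedError:
    rejected = True
store.modify("vm-1", {"ip_address": "10.0.0.9"}, from_engine=True)  # engine update
```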
Resource performance information management defines the performance data of the resource instances to be managed in the cloud environment, including viewing indicators in multiple forms such as lists, pie charts, bar charts and line charts, with the data acquired automatically by the performance engine.
Resource alarm information management defines the alarm data of the resource instances to be managed in the cloud environment, delivered through multiple channels such as the management system, email, WeChat and telephone; performance indicator data are filtered in real time or periodically according to configured alarm rules (thresholds, levels and the like), and key resource and performance information is extracted to form alarm content for viewing.
The topology alarm function defines resource usage overviews for different scenarios in the cloud environment, including the resources, resource relationships and alarm impact in application scenarios such as the networking topology from a whole-network perspective, the tenant topology from a tenant perspective, and the virtualization topology of the platform layer; topology resource nodes support drilling up and drilling down to view more information in the different scenarios. Its main data come from resource attribute information management and resource alarm information management.
The report function defines report data, from different angles, for the resources to be managed in the cloud environment, including common fixed reports and customized reports for individual requirements. Its main data come from resource attribute information management and resource performance information management.
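The difference between a fixed report and a customized report can be illustrated by joining attribute data with performance data and grouping by a chosen attribute. The sample data, the metric and the report name below are made up for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical attribute data (from resource attribute information management).
resources = {
    "vm-1": {"tenant": "a", "host": "host-1"},
    "vm-2": {"tenant": "a", "host": "host-2"},
    "vm-3": {"tenant": "b", "host": "host-1"},
}
# Hypothetical performance data (from resource performance information management).
cpu_usage = {"vm-1": 80.0, "vm-2": 40.0, "vm-3": 30.0}

def report(group_by):
    """Average CPU usage per value of the chosen resource attribute."""
    groups = defaultdict(list)
    for rid, attrs in resources.items():
        groups[attrs[group_by]].append(cpu_usage[rid])
    return {key: mean(values) for key, values in groups.items()}

# A fixed report is a predefined grouping; a customized report picks its own.
FIXED_REPORTS = {"cpu_by_tenant": lambda: report("tenant")}

tenant_report = FIXED_REPORTS["cpu_by_tenant"]()
host_report = report("host")
```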
Comprehensive resource usage analysis defines suggested optimization measures for resource management by operation and maintenance personnel in the cloud environment, including comprehensive data analysis results for performance monitoring and tracking during resource operation, fault location, analysis and handling, and resource consolidation, optimization and adjustment. Its main data come from resource attribute information management, resource performance information management and resource alarm information management.
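The patent does not say how suggestions are derived. One simple possibility is to encode operation and maintenance experience as explicit rules over performance and alarm data, as in the sketch below. The thresholds and suggestion wording are purely illustrative assumptions.

```python
def suggest(resource_id, avg_cpu, alarm_count):
    """Return optimization suggestions for one resource from simple experience rules."""
    suggestions = []
    if avg_cpu >= 85.0:
        # Sustained high load: candidate for capacity expansion.
        suggestions.append(f"{resource_id}: consider capacity expansion")
    elif avg_cpu <= 10.0 and alarm_count == 0:
        # Idle and healthy: candidate for consolidation.
        suggestions.append(f"{resource_id}: consider consolidating or reclaiming")
    if alarm_count >= 5:
        # Frequent alarms: candidate for fault analysis.
        suggestions.append(f"{resource_id}: investigate recurring alarms")
    return suggestions

busy = suggest("vm-1", avg_cpu=92.0, alarm_count=6)
idle = suggest("vm-2", avg_cpu=5.0, alarm_count=0)
normal = suggest("vm-3", avg_cpu=50.0, alarm_count=1)
```

Rules of this kind are easy for operation and maintenance personnel to review and extend, which fits the text's emphasis on capturing their accumulated experience.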
The method addresses automated operation and maintenance management of resources across their full life cycle, covering basic information, performance and alarms. Through visualization such as topologies and reports, operation and maintenance personnel can grasp the resource usage of the cloud center more intuitively and obtain further optimization measures through analysis, which greatly improves operation and maintenance efficiency. Personnel can understand the overall resource profile of the cloud center, check overall resource usage in real time, and notice abnormal conditions promptly so as to respond accurately to emergencies.
An embodiment of the invention also provides an automatic resource management system for a cloud environment, comprising an engine driving part, an information management part, a visual display part and an intelligent analysis part, wherein the engine driving part comprises a resource engine module, a performance engine module and an alarm engine module; correspondingly, the information management part comprises a resource attribute information management module, a resource performance information management module and a resource alarm information management module;
the visual display part comprises a topology alarm module and a report module;
the intelligent analysis part comprises a resource usage comprehensive analysis module.
The resource engine module, the performance engine module and the alarm engine module are used to configure and generate their respective business models; the system acquires model instance data and generates alarm data according to configured rules; the data are then displayed through the topology alarm module and the report module according to actual requirements; and finally, the resource usage comprehensive analysis module collects and comprehensively analyzes the management experience of operation and maintenance personnel to form automated operation and maintenance optimization suggestions.
Through engine configuration, the system automatically acquires and updates basic resource information in the cloud environment using technologies such as QGA, IPMI, LLDP and SNMP;
through engine configuration, it automatically generates performance indicator information from the basic resource information using technologies such as QGA, IPMI, LLDP and SNMP;
through engine configuration, it automatically generates alarm notifications through multiple channels such as the management system, email, WeChat and telephone, based on the basic resource information and the performance indicator information;
the comprehensive resource usage analysis of the system outputs suggested resource optimization and management measures, which are intelligently analyzed, summarized and refined on the basis of basic information, performance, alarms and the management experience of operation and maintenance personnel.
In this system, the engines automatically acquire basic resource data, performance data and alarm data respectively. Resource, performance and alarm models are configured and generated according to the engine specifications, model instance data can be acquired automatically in the cloud environment through technologies such as QGA, IPMI, LLDP and SNMP, and resources and their usage are displayed in visual forms such as topologies and reports, thereby meeting the management requirements of operation and maintenance personnel;
meanwhile, intelligent analysis comprehensively analyzes the resources, summarizes the operation and maintenance experience accumulated in the cloud environment, and automatically generates resource usage suggestions, which helps operation and maintenance personnel carry out daily resource management efficiently, solves the problem of repetitive, routine location and analysis work caused by the large number of resources in a cloud center, and helps maximize the value of the personnel's experience.
The resource engine module is mainly used to define the resource models and resource model relationships that need to be managed in the cloud environment. It covers model classification, model definition, model attributes, model relationship classification, model relationship definition, model relationship attributes, attribute classification, and the automatic configuration of instance data for resource models and resource model relationships. The automatic configuration ensures that instance data is acquired automatically, and technologies such as QGA, IPMI, LLDP and SNMP are used to ensure the uniformity, integrity and accuracy of the basic data in the cloud environment.
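As an illustration of what such model and relationship definitions could look like, the following is a minimal sketch. The class names (`ResourceModel`, `ModelRelation`, `ResourceEngine`), the `collector` label and the sample models are assumptions made for this example; the patent does not specify a data structure.

```python
from dataclasses import dataclass, field


@dataclass
class ResourceModel:
    """A resource model: classification, definition and attributes."""
    name: str                       # model definition, e.g. "host"
    category: str                   # model classification, e.g. "compute"
    attributes: dict[str, type] = field(default_factory=dict)
    collector: str = "snmp"         # hypothetical label for the acquisition technology


@dataclass
class ModelRelation:
    """A relationship between two resource models."""
    name: str                       # relation definition, e.g. "runs_on"
    category: str                   # relation classification
    source: str                     # name of the source resource model
    target: str                     # name of the target resource model


class ResourceEngine:
    """Registry of resource models and the relations between them."""

    def __init__(self) -> None:
        self.models: dict[str, ResourceModel] = {}
        self.relations: list[ModelRelation] = []

    def define_model(self, model: ResourceModel) -> None:
        self.models[model.name] = model

    def define_relation(self, relation: ModelRelation) -> None:
        # A relation may only connect models that are already defined.
        for endpoint in (relation.source, relation.target):
            if endpoint not in self.models:
                raise KeyError(f"unknown resource model: {endpoint}")
        self.relations.append(relation)

    def check_instance(self, model_name: str, instance: dict) -> list[str]:
        """Return the attributes that are missing or have the wrong type."""
        model = self.models[model_name]
        return [
            attr for attr, expected in model.attributes.items()
            if not isinstance(instance.get(attr), expected)
        ]


engine = ResourceEngine()
engine.define_model(ResourceModel("host", "compute", {"serial": str, "cpu_cores": int}, "ipmi"))
engine.define_model(ResourceModel("vm", "compute", {"uuid": str}, "qga"))
engine.define_relation(ModelRelation("runs_on", "deployment", "vm", "host"))

# An acquired instance that lacks the cpu_cores attribute is reported as incomplete.
problems = engine.check_instance("host", {"serial": "SN-001"})
```

Checking acquired instances against the model is one plausible way to support the integrity and accuracy goals stated above; the actual acquisition through QGA, IPMI, LLDP or SNMP is outside the scope of this sketch.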
The performance engine module is mainly used to define the performance models of the resources that need to be managed in the cloud environment. It covers model definition, model attributes and the automatic configuration of performance model instance data. Each performance model corresponds 1:1 to a resource model, and instance data is acquired automatically using technologies such as QGA, IPMI, LLDP and SNMP.
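The 1:1 relationship between performance models and resource models can be enforced at definition time. The sketch below is one possible way to do so; the `PerformanceEngine` class, its method names and the metric names are hypothetical and not taken from the patent.

```python
class PerformanceEngine:
    """Keeps at most one performance model per resource model (1:1)."""

    def __init__(self, resource_models):
        self._resource_models = set(resource_models)
        self._metrics_by_model = {}

    def define(self, resource_model, metrics):
        # The performance model must belong to a known resource model ...
        if resource_model not in self._resource_models:
            raise KeyError(f"unknown resource model: {resource_model}")
        # ... and a resource model may not receive a second performance model.
        if resource_model in self._metrics_by_model:
            raise ValueError(f"performance model already defined for: {resource_model}")
        self._metrics_by_model[resource_model] = tuple(metrics)

    def metrics_for(self, resource_model):
        return self._metrics_by_model[resource_model]

    def uncovered(self):
        """Resource models that do not yet have a performance model."""
        return sorted(self._resource_models - set(self._metrics_by_model))


perf = PerformanceEngine(["host", "vm"])
perf.define("host", ["cpu_util", "mem_util", "fan_speed"])
missing = perf.uncovered()
```

Here `missing` lists the resource models that still need a performance model before the mapping is complete.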
The alarm engine module is mainly used to define the alarm models of the resources that need to be managed in the cloud environment. It covers model definition, model attributes and the automatic configuration of performance model instance data; the alarm data is derived from the raw resource and performance data.
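To make the derivation of alarm data from performance data concrete, the following sketch filters performance samples against configured threshold rules and keeps the most severe matching level. The rule fields, the two level names and the sample values are assumptions for illustration only.

```python
from dataclasses import dataclass

# Hypothetical alarm levels, ordered from least to most severe.
LEVELS = ("warning", "critical")


@dataclass(frozen=True)
class AlarmRule:
    metric: str        # performance index the rule applies to
    threshold: float   # the rule fires when the value reaches this threshold
    level: str         # one of LEVELS


def evaluate(rules, samples):
    """Derive alarm records from performance samples using the configured rules."""
    alarms = []
    for sample in samples:
        matching = [
            rule for rule in rules
            if rule.metric == sample["metric"] and sample["value"] >= rule.threshold
        ]
        if not matching:
            continue
        # When several rules match, report only the most severe level.
        worst = max(matching, key=lambda rule: LEVELS.index(rule.level))
        alarms.append({
            "resource": sample["resource"],
            "metric": sample["metric"],
            "value": sample["value"],
            "level": worst.level,
        })
    return alarms


rules = [
    AlarmRule("cpu_util", 80.0, "warning"),
    AlarmRule("cpu_util", 95.0, "critical"),
]
samples = [
    {"resource": "vm-1", "metric": "cpu_util", "value": 97.5},
    {"resource": "vm-2", "metric": "cpu_util", "value": 85.0},
    {"resource": "vm-3", "metric": "cpu_util", "value": 40.0},
]
alarms = evaluate(rules, samples)
```

Running `evaluate` on a schedule would correspond to periodic filtering, while calling it on each incoming sample would correspond to real-time filtering.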
The resource attribute information management module is mainly used to define the resource instances and resource relationship instances that need to be managed in the cloud environment. It covers adding, modifying and deleting instances and basic query operations; where the engine configuration supports automatic updating, the data may not be modified by any other means.
The resource performance information management module is mainly used to define the performance data of the resource instances that need to be managed in the cloud environment. It supports viewing multiple indexes in multiple forms, such as lists, pie charts, bar charts and line charts, and the data is acquired automatically according to the performance engine;
the resource alarm information management module is mainly used to define the alarm data of the resource instances that need to be managed in the cloud environment. It filters performance index data in real time or periodically according to the configured alarm rules (thresholds, levels and the like), extracts the key resource and performance information to form alarm content for viewing, and delivers alarms through channels such as the management system, e-mail, WeChat and telephone;
the topology alarm module is mainly used to define resource usage overviews for different scenarios in the cloud environment. It covers the resources, resource relationships and alarm impact in application scenarios such as the networking topology from the whole-network perspective, the tenant topology from the tenant perspective and the virtualization topology at the platform layer, and topology resource nodes support drilling up and drilling down to view more information in each scenario; its main data comes from resource attribute information management and resource alarm information management respectively.
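The alarm impact shown in a topology view can be approximated by walking the resource relationships downward from the alarmed node. The sketch below uses a breadth-first traversal over (parent, child) pairs; the function name, the pair representation and the node names are hypothetical.

```python
from collections import deque


def impacted_resources(relations, alarmed):
    """Return the resources that depend, directly or indirectly, on the alarmed one.

    relations is a list of (parent, child) pairs, meaning the child depends on
    the parent, for example a virtual machine that runs on a host.
    """
    children = {}
    for parent, child in relations:
        children.setdefault(parent, []).append(child)

    impacted = []
    visited = {alarmed}
    queue = deque([alarmed])
    while queue:
        node = queue.popleft()
        for child in children.get(node, []):
            if child not in visited:
                visited.add(child)
                impacted.append(child)
                queue.append(child)
    return impacted


relations = [
    ("switch-1", "host-1"),
    ("host-1", "vm-1"),
    ("host-1", "vm-2"),
    ("switch-1", "host-2"),
]
host_impact = impacted_resources(relations, "host-1")
switch_impact = impacted_resources(relations, "switch-1")
```

The same child lookup can serve drilling down from a node, and the reversed pairs can serve drilling up.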
The report module is mainly used to define report data, from different angles, for the resources that need to be managed in the cloud environment. The report data includes commonly used fixed reports and customized reports for individual requirements; its main data comes from resource attribute information management and resource performance information management respectively.
The resource usage comprehensive analysis module is mainly used to define optimization suggestions for the resource management work of operation and maintenance personnel in the cloud environment. It covers the comprehensive analysis of data from performance monitoring and tracking, from fault locating, analysis and handling, and from resource consolidation, optimization and adjustment during resource operation; its main data comes from resource attribute information management, resource performance information management and resource alarm information management respectively.
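The patent does not describe how suggestions are computed. Purely as an illustration, the sketch below combines performance samples and an alarm count into textual suggestions; the function name, the thresholds and the wording of the suggestions are all assumptions.

```python
from statistics import mean


def suggest(resource, cpu_samples, alarm_count, low=20.0, high=80.0, alarm_limit=3):
    """Combine performance and alarm data into optimization suggestions.

    The thresholds are illustrative defaults, not values from the patent.
    cpu_samples must be non-empty, otherwise statistics.mean raises an error.
    """
    suggestions = []
    average = mean(cpu_samples)
    if average < low:
        suggestions.append(
            f"{resource}: average CPU {average:.1f}% is low; consider consolidating or downsizing"
        )
    elif average > high:
        suggestions.append(
            f"{resource}: average CPU {average:.1f}% is high; consider scaling up or rebalancing"
        )
    if alarm_count >= alarm_limit:
        suggestions.append(
            f"{resource}: {alarm_count} alarms in the period; review for a recurring fault"
        )
    return suggestions


idle = suggest("vm-1", [10.0, 12.0, 8.0], alarm_count=0)
noisy = suggest("vm-2", [90.0, 92.0, 88.0], alarm_count=5)
```

In practice the rules would be expected to encode the management experience of operation and maintenance personnel rather than fixed thresholds.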
The system realizes automatic acquisition, visual page display and intelligent analysis of resources, resource relationships and resource usage data in a cloud environment in an engine-driven manner. Through configuration, modules such as the resource engine, the performance engine, the alarm engine, resource attribute information management, resource performance information management, resource alarm information management, topology alarm, report and resource usage comprehensive analysis together provide automatic acquisition, display and analysis of the resource data. As a result, operation and maintenance personnel in a cloud center can grasp resource usage and the directions for optimization more intuitively, operation and maintenance efficiency is greatly improved, the overall resource profile of the cloud center can be understood as a whole, the overall usage of resources can be viewed in real time, and abnormal conditions can be noticed promptly so that an accurate emergency response can be made.
An embodiment of the present invention further provides a computer-readable medium, where a computer instruction is stored on the computer-readable medium, and when the computer instruction is executed by a processor, the processor is enabled to execute the cloud environment automated resource management method in the foregoing embodiment of the present invention. Specifically, a system or an apparatus equipped with a storage medium on which software program codes that realize the functions of any of the above-described embodiments are stored may be provided, and a computer (or a CPU or MPU) of the system or the apparatus is caused to read out and execute the program codes stored in the storage medium.
In this case, the program code itself read from the storage medium can realize the functions of any of the above-described embodiments, and thus the program code and the storage medium storing the program code constitute a part of the present invention.
Examples of the storage medium for supplying the program code include a floppy disk, a hard disk, a magneto-optical disk, an optical disk (e.g., CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM, DVD-RW, DVD + RW), a magnetic tape, a nonvolatile memory card, and a ROM. Alternatively, the program code may be downloaded from a server computer via a communications network.
Further, it should be clear that the functions of any one of the above-described embodiments may be implemented not only by executing the program code read out by the computer, but also by causing an operating system or the like operating on the computer to perform a part or all of the actual operations based on instructions of the program code.
Further, it is to be understood that the program code read out from the storage medium is written to a memory provided in an expansion board inserted into the computer or to a memory provided in an expansion unit connected to the computer, and then causes a CPU or the like mounted on the expansion board or the expansion unit to perform part or all of the actual operations based on instructions of the program code, thereby realizing the functions of any of the above-described embodiments.
While the invention has been shown and described in detail in the drawings and in the preferred embodiments, it is not intended to limit the invention to the embodiments disclosed, and it will be apparent to those skilled in the art that various combinations of the code auditing means in the various embodiments described above may be used to obtain further embodiments of the invention, which are also within the scope of the invention.

Claims (10)

1. A cloud environment automatic resource management method, characterized in that automatic acquisition and visual page display of resources, resource relationships and resource usage data in a cloud environment are realized in an engine-driven manner;
the engines include a resource engine, a performance engine and an alarm engine.
2. The cloud environment automatic resource management method according to claim 1, wherein the resource engine, the performance engine and the alarm engine are configured to generate their respective business models; model instance data is acquired and alarm data is generated according to configured rules, and the results are then displayed according to actual requirements; and finally, the management experience of operation and maintenance personnel is collected and comprehensively analyzed through intelligent analysis to form automated operation and maintenance optimization suggestions.
3. The cloud environment automatic resource management method according to claim 1 or 2, wherein the resource engine defines the resource models and resource model relationships to be managed in the cloud environment, including model classification, model definition, model attributes, model relationship classification, model relationship definition, model relationship attributes, attribute classification, and the automatic configuration of instance data for resource models and resource model relationships;
the performance engine defines the performance models of the resources to be managed in the cloud environment, including model definition, model attributes and the automatic configuration of performance model instance data;
the alarm engine defines the alarm models of the resources to be managed in the cloud environment, including model definition, model attributes and the automatic configuration of performance model instance data.
4. The cloud environment automatic resource management method according to claim 3, wherein the resource instances and resource relationship instances to be managed in the cloud environment are defined through resource attribute information management, including adding, modifying and deleting instances and basic query operations;
the performance data of the resource instances to be managed in the cloud environment is defined through resource performance information management, including the viewing of multiple indexes in multiple forms;
the alarm data of the resource instances to be managed in the cloud environment is defined through resource alarm information management, wherein performance index data is filtered in real time or periodically according to configured alarm rules, and the key resource and performance information is extracted to form alarm content for viewing.
5. The method according to claim 4, wherein the topology alarm defines resource usage profiles of different scenarios in the cloud environment, including networking topology from a view point of the whole network, tenant topology from a view point of the tenant, and resource, resource relationship, and alarm impact in a virtual topology application scenario at a platform level.
6. The method according to claim 4, wherein the report data of the resources to be managed in the cloud environment from different angles is defined by a report.
7. The method for automated resource management in cloud environment according to claim 4, wherein the optimization measure recommendation for resource management of operation and maintenance personnel in cloud environment is defined through comprehensive analysis of resource usage, and the optimization measure recommendation includes data comprehensive analysis of performance monitoring and tracking, fault problem positioning and analysis processing, and resource integration and optimization adjustment during resource operation.
8. A cloud environment automatic resource management system, characterized by comprising an engine driving part, an information management part, a visual display part and an intelligent analysis part, wherein the engine driving part comprises a resource engine module, a performance engine module and an alarm engine module; correspondingly, the information management part comprises a resource attribute information management module, a resource performance information management module and a resource alarm information management module;
the visual display part comprises a topology alarm module and a report module;
the intelligent analysis part comprises a resource usage comprehensive analysis module;
the resource engine module, the performance engine module and the alarm engine module are used to configure and generate their respective business models; the system acquires model instance data and generates alarm data according to configured rules; the results are then displayed through the topology alarm module and the report module according to actual requirements; and finally, the resource usage comprehensive analysis module collects the management experience of operation and maintenance personnel and analyzes it comprehensively to form automated operation and maintenance optimization suggestions.
9. The cloud environment automated resource management system of claim 8, wherein the resource engine module is configured to define resource models and resource model relationships to be managed in the cloud environment, including model classification, model definition, model attributes, model relationship classification, model relationship definition, model relationship attributes, attribute classification, and automated configuration of resource model and resource model relationship instance data;
the performance engine module is used for defining the performance models of the resources to be managed in the cloud environment, including model definition, model attributes and the automatic configuration of performance model instance data;
the alarm engine module is used for defining the alarm models of the resources to be managed in the cloud environment, including model definition, model attributes and the automatic configuration of performance model instance data;
the resource attribute information management module is used for defining resource instances and resource relation instances needing to be managed in the cloud environment, and comprises instance adding, modifying, deleting and basic query operations;
the resource performance information management module is used for defining performance data of resource instances needing to be managed in the cloud environment, and the performance data comprises the check of multiple indexes in multiple forms;
the resource alarm information management module is used for defining alarm data of a resource instance needing to be managed in a cloud environment, filtering performance index data in real time or periodically according to configured alarm rules, and extracting key information of resources and performance to form alarm content for viewing;
the topology alarm module is used for defining resource use profiles of different scenes in a cloud environment, and the resource use profiles comprise networking topology of a whole network view angle, tenant topology of a tenant view angle and resource, resource relation and alarm influence in a virtual topology application scene of a platform layer;
the report module is used for defining report data of resources needing to be managed in a cloud environment from different angles;
the resource use comprehensive analysis module is used for defining optimization measure suggestions of resource management of operation and maintenance personnel in a cloud environment, and the optimization measure suggestions comprise data comprehensive analysis of performance monitoring and tracking, fault problem positioning analysis and processing and resource integration optimization and adjustment in resource operation.
10. A computer readable medium having stored thereon computer instructions which, when executed by a processor, cause the processor to perform the method of any of claims 1 to 7.
CN202010741682.7A 2020-07-29 2020-07-29 Cloud environment automatic resource management method and system Pending CN111884853A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010741682.7A CN111884853A (en) 2020-07-29 2020-07-29 Cloud environment automatic resource management method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010741682.7A CN111884853A (en) 2020-07-29 2020-07-29 Cloud environment automatic resource management method and system

Publications (1)

Publication Number Publication Date
CN111884853A true CN111884853A (en) 2020-11-03

Family

ID=73200932

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010741682.7A Pending CN111884853A (en) 2020-07-29 2020-07-29 Cloud environment automatic resource management method and system

Country Status (1)

Country Link
CN (1) CN111884853A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112527766A (en) * 2020-12-04 2021-03-19 浪潮云信息技术股份公司 System and method for realizing configuration management database
CN112532456A (en) * 2020-12-04 2021-03-19 浪潮云信息技术股份公司 Alarm monitoring method in cloud environment
WO2023138013A1 (en) * 2022-01-19 2023-07-27 浪潮通信信息系统有限公司 Method and system for computing power gridding in multi-cloud scene

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103220364A (en) * 2013-04-27 2013-07-24 清华大学 Cloud-based system management training platform architecture
CN106100880A (en) * 2016-06-12 2016-11-09 南京邮电大学 A kind of cloud data resource is disposed and visual management method
CN106293701A (en) * 2016-07-26 2017-01-04 浪潮软件股份有限公司 The application engine method for designing of many tenants version management based on cloud server
CN106649239A (en) * 2016-12-23 2017-05-10 郑州云海信息技术有限公司 Method and device for generating report in cloud monitoring system based on visualization
CN107315776A (en) * 2017-05-27 2017-11-03 国网安徽省电力公司信息通信分公司 A kind of data management system based on cloud computing
CN108777637A (en) * 2018-05-30 2018-11-09 郑州云海信息技术有限公司 A kind of data center's total management system and method for supporting server isomery

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
卢乐天: "云中心IT运维管理系统规划设计方法研究", 《电信技术》 *

Similar Documents

Publication Publication Date Title
CN111884853A (en) Cloud environment automatic resource management method and system
CN108628929B (en) Method and apparatus for intelligent archiving and analysis
CN104407964B (en) A kind of centralized monitoring system and method based on data center
CN110659173A (en) Operation and maintenance system and method
CN108170580A (en) A kind of rule-based log alarming method, apparatus and system
CN110351150A (en) Fault rootstock determines method and device, electronic equipment and readable storage medium storing program for executing
CN107707381B (en) Virtual network element intelligent slice management system and method
CN102902752A (en) Method and system for monitoring log
CN106487601B (en) Resource monitoring method, device and system
CN102096688A (en) Method and device for generating data report
CN105915381A (en) System for realizing monitoring system business logic online modification
CN101989931A (en) Operation alarm processing method and device
CN114791846B (en) Method for realizing observability aiming at cloud-originated chaos engineering experiment
WO2023071761A1 (en) Anomaly positioning method and device
CN106407075A (en) A management method and system for a big data platform
CN111858713A (en) Object-based government information asset management method and system
CN114302103A (en) Intelligent monitoring operation and maintenance system
CN103049365A (en) Monitoring and evaluating method for information and application resource operating states
CN113435122B (en) Real-time flow data processing method, device, computer equipment and storage medium
CN110138583B (en) Display method for intelligent alarm analysis
CN112783906A (en) Log data management method and system for industrial internet
CN114757448B (en) Manufacturing inter-link optimal value chain construction method based on data space model
US20100324949A1 (en) Transforming generic business measure definitions into executable monitoring specifications
CN115334559A (en) Network detection method, device, equipment and medium
CN113159721A (en) Internet of things platform based on full life cycle of equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201103