CN111343017A - Method, system, equipment and medium for cloud platform resource alarm - Google Patents

Method, system, equipment and medium for cloud platform resource alarm Download PDF

Info

Publication number
CN111343017A
CN111343017A CN202010109622.3A CN202010109622A CN111343017A CN 111343017 A CN111343017 A CN 111343017A CN 202010109622 A CN202010109622 A CN 202010109622A CN 111343017 A CN111343017 A CN 111343017A
Authority
CN
China
Prior art keywords
alarm information
resource
resources
total amount
layer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010109622.3A
Other languages
Chinese (zh)
Other versions
CN111343017B (en
Inventor
王会
赵龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202010109622.3A priority Critical patent/CN111343017B/en
Publication of CN111343017A publication Critical patent/CN111343017A/en
Application granted granted Critical
Publication of CN111343017B publication Critical patent/CN111343017B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0681Configuration of triggering conditions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0604Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time
    • H04L41/0618Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time based on the physical or logical position
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0604Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time
    • H04L41/0622Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time based on time

Abstract

The invention discloses a method, a system, equipment and a storage medium for cloud platform resource alarm, wherein the method comprises the following steps: carrying out hierarchy division on cloud platform resources, and setting a convergence threshold value for each layer; responding to the current level resource of the lower layer to generate alarm information, and gathering the alarm information into an information pool of the resource of the upper layer; judging whether the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources at intervals of preset time; responding to the fact that the total amount of the gathered alarm information exceeds the gathering threshold value of the resource of the previous layer, gathering the alarm information into the information pool of the resource of the next previous layer, and judging whether the total amount of the gathered alarm information exceeds the gathering threshold value of the resource of the next previous layer again; and repeating the previous step until the total amount of the gathered alarm information does not exceed the gathering threshold of the corresponding level resource, displaying and informing the alarm information of all the lower-layer resources generating the alarm information, and determining and solving the problem of the current corresponding level resource.

Description

Method, system, equipment and medium for cloud platform resource alarm
Technical Field
The present invention relates to the field of alarm, and more particularly, to a method, a system, a computer device, and a readable medium for cloud platform resource alarm.
Background
Nowadays, more and more companies develop their own cloud platform project to manage various cloud resources. The ICM is used as a well-known cloud computing management platform, has rich functions and high stability, and can manage various bottom layer virtualization and various resources. In the ICM, different virtualization systems (V MWARE, OPENSTACK) can be managed, and resources such as a host, a cloud host, and storage can be taken in to manage underlying virtualization. The user can apply the virtual machine to carry out own service processing. In addition, a set of unified alarm management system is provided in the ICM, so that cloud resources and the like in the ICM can be monitored, and a user can be helped to know the running condition of bottom layer virtualization so as to reasonably allocate resources. The user can also check the condition of own resources, and alarm information can be generated when the monitored resources are abnormal so as to remind the user to process.
However, because the ICMs are all connected to the cloud platform, and the cloud platform has its own deployment structure, when an upper resource is abnormal, all lower resources are often alerted, and a large amount of alert information is generated. In addition, too much alarm information often causes difficulty in investigation, and the reason of alarm generation cannot be known specifically. After the user receives the alarm information, sometimes the alarm is eliminated by some means such as increasing or decreasing configuration, and the difficulty of troubleshooting the alarm problem of the operation and maintenance personnel is also caused. On the other hand, because only one alarm exists in the upper layer resource which really causes the alarm, the owner of the upper layer resource can only receive one alarm notification, and does not know the influence of the alarms on the lower layer resource and the user, and once the alarms are not processed in time, the larger blockage is often generated on the service, and the service operation is influenced.
Disclosure of Invention
In view of this, embodiments of the present invention provide a method, a system, a computer device, and a computer readable storage medium for cloud platform resource alarm, in which cloud platform resources are hierarchically divided, and an aggregation threshold is set for each layer, when alarm information of a bottom layer resource reaches the aggregation threshold, the alarm information is aggregated to a top layer, once the alarm is aggregated, the alarm information is only displayed and prompted on the upper layer resource, and the alarm of the bottom layer resource is only prompted, so that the alarm information can be displayed in a targeted manner, problems can be conveniently and quickly checked, and the accuracy and ease of alarm are improved.
Based on the above object, an aspect of the embodiments of the present invention provides a method for cloud platform resource alarm, including the following steps: carrying out hierarchy division on cloud platform resources, and setting a convergence threshold value for each layer; responding to the current level resource to generate alarm information, and converging the alarm information into an information pool of the previous level resource; judging whether the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources at intervals of preset time; responding to the fact that the total amount of the gathered alarm information exceeds the gathering threshold value of the previous layer of resources, gathering the alarm information into an information pool of the next previous layer of resources, and judging whether the total amount of the gathered alarm information exceeds the gathering threshold value of the next previous layer of resources again; and repeating the previous step until the total amount of the gathered alarm information does not exceed the gathering threshold of the corresponding level resource, displaying and informing the alarm information of all the resources generating the alarm information, and determining and solving the problem of the corresponding level resource.
In some embodiments, further comprising: and in response to the fact that the total amount of the gathered alarm information does not exceed the gathering threshold of the previous layer of resources, displaying and notifying the alarm information on all the current level resources generating the alarm information.
In some embodiments, further comprising: and locking the current level resource in response to the fact that the total amount of the gathered alarm information exceeds the upper layer gathering threshold value.
In some embodiments, the locking the current tier resource comprises: and putting the current level resource into a converged resource pool, and closing the operation authority of the current level resource.
In another aspect of the embodiments of the present invention, a system for cloud platform resource alarm is further provided, including: the grading module is configured for carrying out hierarchical division on cloud platform resources and setting a convergence threshold value for each layer; the aggregation module is used for responding to the lower layer current level resource to generate alarm information and aggregating the alarm information into the information pool of the upper layer resource; the first judgment module judges whether the total amount of the gathered alarm information exceeds the gathering threshold of the upper layer of resources at intervals of preset time; the second judgment module is configured to respond to the fact that the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources, gather the alarm information into an information pool of the next previous layer of resources, and judge whether the total amount of the gathered alarm information exceeds the gathering threshold of the next previous layer of resources again; and the execution module is configured to repeat the previous step until the total amount of the aggregated alarm information does not exceed the aggregation threshold of the corresponding level resource, display and inform all lower-layer resources generating the alarm information of the alarm information, and determine and solve the current problem of the corresponding level resource.
In some embodiments, further comprising: and the execution module is configured to respond that the total amount of the gathered alarm information does not exceed the gathering threshold of the previous layer of resources, and display and inform all current level resources generating the alarm information of the alarm information.
In some embodiments, further comprising: and the locking module is configured to lock the current level resource in response to the fact that the total amount of the gathered alarm information exceeds the upper-level gathering threshold.
In some embodiments, the locking module is further configured to: and putting the current level resource into a converged resource pool, and closing the operation authority of the current level resource.
In another aspect of the embodiments of the present invention, there is also provided a computer device, including: at least one processor; and a memory storing computer instructions executable on the processor, the instructions when executed by the processor implementing the steps of the method as above.
In a further aspect of the embodiments of the present invention, a computer-readable storage medium is also provided, in which a computer program for implementing the above method steps is stored when the computer program is executed by a processor.
The invention has the following beneficial technical effects: by carrying out hierarchical division on cloud platform resources and setting a convergence threshold value for each layer, after the alarm information of the bottom layer resources reaches the convergence threshold value, the alarm information can be converged to the top layer, once the alarm information is converged, the alarm information is only displayed and prompted on the upper layer resources, and the alarm information of the bottom layer resources is only prompted, so that the display of the alarm information can be pertinently realized, the problems can be conveniently and quickly checked, and the accuracy and the usability of the alarm are improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other embodiments can be obtained by using the drawings without creative efforts.
Fig. 1 is a schematic diagram of an embodiment of a cloud platform resource alarm method provided in the present invention;
fig. 2 is a schematic diagram of a hardware structure of a computer device for cloud platform resource warning according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the following embodiments of the present invention are described in further detail with reference to the accompanying drawings.
It should be noted that all expressions using "first" and "second" in the embodiments of the present invention are used for distinguishing two entities with the same name but different names or different parameters, and it should be noted that "first" and "second" are merely for convenience of description and should not be construed as limitations of the embodiments of the present invention, and they are not described in any more detail in the following embodiments.
Based on the above purpose, a first aspect of the embodiments of the present invention provides an embodiment of a method for cloud platform resource alarm. Fig. 1 is a schematic diagram illustrating an embodiment of a cloud platform resource warning method provided by the present invention. As shown in fig. 1, the embodiment of the present invention includes the following steps:
s1, carrying out hierarchical division on cloud platform resources, and setting a convergence threshold value for each layer;
s2, responding to the current level resource to generate alarm information, and gathering the alarm information into the information pool of the previous level resource;
s3, judging whether the total amount of the gathered alarm information exceeds the gathering threshold value of the upper layer of resources at preset time intervals;
s4, responding to the fact that the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources, gathering the alarm information into the information pool of the next previous layer of resources, and judging whether the total amount of the gathered alarm information exceeds the gathering threshold of the next previous layer of resources again; and
and S5, repeating the previous step until the total amount of the aggregated alarm information does not exceed the aggregation threshold of the corresponding level resource, displaying and notifying the alarm information of all the resources generating the alarm information, and determining and solving the problem of the corresponding level resource.
And carrying out hierarchical division on the cloud platform resources, and setting a convergence threshold value for each layer. Currently, a cloud management platform is classified into the following levels according to resource types: the virtual control center 1, the cluster 2, the host 3 and the virtual machine 4 are smaller in number and higher in hierarchy. The aggregation threshold is a critical value for aggregating the bottom layer alarms into the upper layer aggregated alarms, and is generally 90% by default. For example, if there are 10 virtual machines on a host and more than 9 virtual machines generate network loss problems, the host generates a convergence alarm indicating that 90% of the virtual machines generate network loss problems.
And responding to the current level resource to generate alarm information, and converging the alarm information into an information pool of the previous level resource. An alarm is generated when the lower-layer resource, for example, the virtual machine A1 has network loss; the lower layer resource converges the information into a converged information pool of the upper layer resource (the upper layer resource of the virtual machine A1 is the host 1); and the upper layer resource carries out timing scanning to obtain the information of the lower layer resource.
And judging whether the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources at intervals of preset time. The upper layer resource analyzes the information of the lower layer resource, for example, 10 virtual machines A1-A10 are arranged under A, and the information in the aggregated information pool is viewed in a monitoring period.
And in response to the fact that the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources, gathering the alarm information into an information pool of the next previous layer of resources, and judging whether the total amount of the gathered alarm information exceeds the gathering threshold of the next previous layer of resources again.
And repeating the previous step until the total amount of the gathered alarm information does not exceed the gathering threshold of the corresponding level resource, displaying and informing the alarm information of all the resources generating the alarm information, and determining and solving the problem of the corresponding level resource.
In some embodiments, further comprising: and in response to the fact that the total amount of the gathered alarm information does not exceed the gathering threshold of the previous layer of resources, displaying and notifying the alarm information on all the current level resources generating the alarm information. If the number of the virtual machines generating network loss connection is larger than the aggregation alarm threshold value, the alarm information is an aggregation alarm, and the host A aggregates the information into an aggregation information pool of an upper layer resource (the upper layer resource of the A is a cluster B). The cluster B repeats the above steps until the alarm information is not the aggregated alarm or reaches the top level resource. And after the generation of the convergence alarm is determined, displaying and pushing alarm information on the resource at the convergence position. The notification setting for the aggregated alert is typically 1 minute. The convergence alarm is different from the common alarm, the convergence alarm is generally urgent when formed, the influence range is large, and the notification is timely. After the alarm is confirmed, it means that it is known that the alarm is generated and no notification is needed, and at this time, the reason should be examined and processed with a focus.
In some embodiments, further comprising: and locking the current level resource in response to the fact that the total amount of the gathered alarm information exceeds the upper layer gathering threshold value. In some embodiments, the locking the current tier resource comprises: and putting the current level resource into a converged resource pool, and closing the operation authority of the current level resource. And after the alarm is recovered, an alarm recovery notice is generated, and the locked resources are automatically unlocked.
The embodiment of the invention provides a concept of convergence warning, tree-shaped hierarchical division is carried out on cloud platform resources, convergence threshold values are set for each layer, after the warning information of the bottom layer resources reaches the convergence threshold values, convergence is carried out on the top layer, once the convergence warning is carried out, the warning information is only displayed and prompted on the upper layer resources, and the warning of the bottom layer resources is only prompted, so that the problem of prompting of large-batch warning can be reduced. This alarm, which itself causes but the problem is displayed in the underlying resource, is an aggregated alarm. Meanwhile, in order to eliminate the root cause of alarm generation, after the owner of the upper layer resource is informed of the alarm (the owner of the upper layer resource, such as a host, a cluster and the like, is generally an operation and maintenance person and not a final client), only the upper layer resource can be operated, the lower layer resource is locked, only the operation of checking and not allowing to modify configuration and the like is carried out, after the alarm problem of the upper layer resource is checked and repaired, all alarm information can be restored, and at the moment, the unlocking is carried out, and normal business operation is carried out. The alarm aggregation is carried out through the tree hierarchy, the hierarchical resource locking after the alarm generation is carried out, the resource generated by the alarm information can be displayed more accurately, and the operation and maintenance personnel can conveniently carry out the troubleshooting. Therefore, the alarm information of the cloud management platform can be prevented from flooding, the troubleshooting accuracy of the alarm information is improved, and the accuracy and the usability of the cloud platform are improved. On the other hand, the owner of the upper-layer resource checks that the own resource generates the convergence alarm, so that the owner can pay more attention to the convergence alarm, more timely handle the problem and recover the service.
It should be particularly noted that, steps in the embodiments of the cloud platform resource alarm method described above may be intersected, replaced, added, and deleted, so that these methods for cloud platform resource alarm transformed by reasonable permutation and combination also belong to the scope of the present invention, and the scope of the present invention should not be limited to the embodiments.
Based on the above object, a second aspect of the embodiments of the present invention provides a cloud platform resource alarm system, including: the grading module is configured for carrying out hierarchical division on cloud platform resources and setting a convergence threshold value for each layer; the aggregation module is used for responding to the lower layer current level resource to generate alarm information and aggregating the alarm information into the information pool of the upper layer resource; the first judgment module judges whether the total amount of the gathered alarm information exceeds the gathering threshold of the upper layer of resources at intervals of preset time; the second judgment module is configured to respond to the fact that the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources, gather the alarm information into an information pool of the next previous layer of resources, and judge whether the total amount of the gathered alarm information exceeds the gathering threshold of the next previous layer of resources again; and the execution module is configured to repeat the previous step until the total amount of the aggregated alarm information does not exceed the aggregation threshold of the corresponding level resource, display and inform all lower-layer resources generating the alarm information of the alarm information, and determine and solve the current problem of the corresponding level resource.
In some embodiments, further comprising: and the execution module is configured to respond that the total amount of the gathered alarm information does not exceed the gathering threshold of the previous layer of resources, and display and inform all current level resources generating the alarm information of the alarm information.
In some embodiments, further comprising: and the locking module is configured to lock the current level resource in response to the fact that the total amount of the gathered alarm information exceeds the upper-level gathering threshold.
In some embodiments, the locking module is further configured to: and putting the current level resource into a converged resource pool, and closing the operation authority of the current level resource.
In view of the above object, a third aspect of the embodiments of the present invention provides a computer device, including: at least one processor; and a memory storing computer instructions executable on the processor, the instructions being executable by the processor to perform the steps of: s1, carrying out hierarchical division on cloud platform resources, and setting a convergence threshold value for each layer; s2, responding to the current level resource to generate alarm information, and gathering the alarm information into the information pool of the previous level resource; s3, judging whether the total amount of the gathered alarm information exceeds the gathering threshold value of the upper layer of resources at preset time intervals; s4, responding to the fact that the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources, gathering the alarm information into the information pool of the next previous layer of resources, and judging whether the total amount of the gathered alarm information exceeds the gathering threshold of the next previous layer of resources again; and S5, repeating the previous step until the total amount of the aggregated alarm information does not exceed the aggregation threshold of the corresponding level resource, displaying and notifying the alarm information of all the resources generating the alarm information, and determining and solving the problem of the corresponding level resource.
In some embodiments, further comprising: and in response to the fact that the total amount of the gathered alarm information does not exceed the gathering threshold of the previous layer of resources, displaying and notifying the alarm information on all the current level resources generating the alarm information.
In some embodiments, further comprising: and locking the current level resource in response to the fact that the total amount of the gathered alarm information exceeds the upper layer gathering threshold value.
In some embodiments, the locking the current tier resource comprises: and putting the current level resource into a converged resource pool, and closing the operation authority of the current level resource.
Fig. 2 is a schematic hardware structure diagram of an embodiment of the computer device for cloud platform resource alarm provided in the present invention.
Taking the apparatus shown in fig. 2 as an example, the apparatus includes a processor 301 and a memory 302, and may further include: an input device 303 and an output device 304.
The processor 301, the memory 302, the input device 303 and the output device 304 may be connected by a bus or other means, and fig. 2 illustrates the connection by a bus as an example.
The memory 302, which is a non-volatile computer-readable storage medium, may be used to store non-volatile software programs, non-volatile computer-executable programs, and modules, such as program instructions/modules corresponding to the method for cloud platform resource alerting in this embodiment of the present application. The processor 301 executes various functional applications and data processing of the server by running the nonvolatile software program, instructions and modules stored in the memory 302, that is, the method for alarming the cloud platform resource of the above method embodiment is implemented.
The memory 302 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the method of cloud platform resource alerting, and the like. Further, the memory 302 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, memory 302 optionally includes memory located remotely from processor 301, which may be connected to a local module via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The input device 303 may receive information such as a user name and a password that are input. The output means 304 may comprise a display device such as a display screen.
Program instructions/modules corresponding to one or more methods of cloud platform resource alerting are stored in the memory 302 and, when executed by the processor 301, perform the method of cloud platform resource alerting in any of the above-described method embodiments.
Any embodiment of the computer device executing the method for cloud platform resource warning may achieve the same or similar effects as any corresponding method embodiment described above.
The invention also provides a computer readable storage medium storing a computer program which, when executed by a processor, performs the method as above.
Finally, it should be noted that, as one of ordinary skill in the art can appreciate that all or part of the processes in the methods according to the embodiments described above can be implemented by instructing relevant hardware through a computer program, and the program of the method for cloud platform resource alarm can be stored in a computer-readable storage medium, and when executed, the program can include the processes according to the embodiments of the methods described above. The storage medium of the program may be a magnetic disk, an optical disk, a Read Only Memory (ROM), a Random Access Memory (RAM), or the like. The embodiments of the computer program may achieve the same or similar effects as any of the above-described method embodiments.
Furthermore, the methods disclosed according to embodiments of the present invention may also be implemented as a computer program executed by a processor, which may be stored in a computer-readable storage medium. Which when executed by a processor performs the above-described functions defined in the methods disclosed in embodiments of the invention.
Further, the above method steps and system elements may also be implemented using a controller and a computer readable storage medium for storing a computer program for causing the controller to implement the functions of the above steps or elements.
Further, it should be appreciated that the computer-readable storage media (e.g., memory) herein can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory. By way of example, and not limitation, nonvolatile memory can include Read Only Memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM), which can act as external cache memory. By way of example and not limitation, RAM is available in a variety of forms such as synchronous RAM (DRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), and Direct Rambus RAM (DRRAM). The storage devices of the disclosed aspects are intended to comprise, without being limited to, these and other suitable types of memory.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as software or hardware depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the disclosed embodiments of the present invention.
The various illustrative logical blocks, modules, and circuits described in connection with the disclosure herein may be implemented or performed with the following components designed to perform the functions herein: a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination of these components. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP, and/or any other such configuration.
The steps of a method or algorithm described in connection with the disclosure herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal.
In one or more exemplary designs, the functions may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, includes Compact Disc (CD), laser disc, optical disc, Digital Versatile Disc (DVD), floppy disk, blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the present disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments of the invention may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that, as used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly supports the exception. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items.
The numbers of the embodiments disclosed in the embodiments of the present invention are merely for description, and do not represent the merits of the embodiments.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, and the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
Those of ordinary skill in the art will understand that: the discussion of any embodiment above is meant to be exemplary only, and is not intended to intimate that the scope of the disclosure, including the claims, of embodiments of the invention is limited to these examples; within the idea of an embodiment of the invention, also technical features in the above embodiment or in different embodiments may be combined and there are many other variations of the different aspects of the embodiments of the invention as described above, which are not provided in detail for the sake of brevity. Therefore, any omissions, modifications, substitutions, improvements, and the like that may be made without departing from the spirit and principles of the embodiments of the present invention are intended to be included within the scope of the embodiments of the present invention.

Claims (10)

1. A cloud platform resource alarm method is characterized by comprising the following steps:
carrying out hierarchy division on cloud platform resources, and setting a convergence threshold value for each layer;
responding to the current level resource to generate alarm information, and converging the alarm information into an information pool of the previous level resource;
judging whether the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources at intervals of preset time;
responding to the fact that the total amount of the gathered alarm information exceeds the gathering threshold value of the previous layer of resources, gathering the alarm information into an information pool of the next previous layer of resources, and judging whether the total amount of the gathered alarm information exceeds the gathering threshold value of the next previous layer of resources again; and
and repeating the previous step until the total amount of the gathered alarm information does not exceed the gathering threshold of the corresponding level resource, displaying and informing the alarm information of all the resources generating the alarm information, and determining and solving the problem of the corresponding level resource.
2. The method of claim 1, further comprising:
and in response to the fact that the total amount of the gathered alarm information does not exceed the gathering threshold of the previous layer of resources, displaying and notifying the alarm information on all the current level resources generating the alarm information.
3. The method of claim 1, further comprising:
and locking the current level resource in response to the fact that the total amount of the gathered alarm information exceeds the upper layer gathering threshold value.
4. The method of claim 3, wherein the locking the current tier resource comprises:
and putting the current level resource into a converged resource pool, and closing the operation authority of the current level resource.
5. A system for cloud platform resource alerting, comprising:
the grading module is configured for carrying out hierarchical division on cloud platform resources and setting a convergence threshold value for each layer;
the aggregation module is used for responding to the lower layer current level resource to generate alarm information and aggregating the alarm information into the information pool of the upper layer resource;
the first judgment module judges whether the total amount of the gathered alarm information exceeds the gathering threshold of the upper layer of resources at intervals of preset time; and
the second judgment module is configured to respond to the fact that the total amount of the gathered alarm information exceeds the gathering threshold of the previous layer of resources, gather the alarm information into an information pool of the next previous layer of resources, and judge whether the total amount of the gathered alarm information exceeds the gathering threshold of the next previous layer of resources again;
and the execution module is configured to repeat the previous step until the total amount of the aggregated alarm information does not exceed the aggregation threshold of the corresponding level resource, display and inform all lower-layer resources generating the alarm information of the alarm information, and determine and solve the current problem of the corresponding level resource.
6. The system of claim 5, further comprising:
and the execution module is configured to respond that the total amount of the gathered alarm information does not exceed the gathering threshold of the previous layer of resources, and display and inform all current level resources generating the alarm information of the alarm information.
7. The system of claim 5, further comprising:
and the locking module is configured to lock the current level resource in response to the fact that the total amount of the gathered alarm information exceeds the upper-level gathering threshold.
8. The system of claim 7, wherein the locking module is further configured to:
and putting the current level resource into a converged resource pool, and closing the operation authority of the current level resource.
9. A computer device, comprising:
at least one processor; and
a memory storing computer instructions executable on the processor, the instructions when executed by the processor implementing the steps of the method of any one of claims 1 to 4.
10. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 4.
CN202010109622.3A 2020-02-22 2020-02-22 Method, system, equipment and medium for cloud platform resource alarm Active CN111343017B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010109622.3A CN111343017B (en) 2020-02-22 2020-02-22 Method, system, equipment and medium for cloud platform resource alarm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010109622.3A CN111343017B (en) 2020-02-22 2020-02-22 Method, system, equipment and medium for cloud platform resource alarm

Publications (2)

Publication Number Publication Date
CN111343017A true CN111343017A (en) 2020-06-26
CN111343017B CN111343017B (en) 2022-12-09

Family

ID=71186941

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010109622.3A Active CN111343017B (en) 2020-02-22 2020-02-22 Method, system, equipment and medium for cloud platform resource alarm

Country Status (1)

Country Link
CN (1) CN111343017B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104410535A (en) * 2014-12-23 2015-03-11 浪潮电子信息产业股份有限公司 Intelligent cloud resource monitoring and warning method
CN107368365A (en) * 2017-07-25 2017-11-21 携程旅游信息技术(上海)有限公司 Cloud platform automatic O&M method, system, equipment and storage medium
CN108769207A (en) * 2018-05-30 2018-11-06 郑州云海信息技术有限公司 A kind of cloud platform resource monitoring method and system
CN108989132A (en) * 2018-08-24 2018-12-11 深圳前海微众银行股份有限公司 Fault warning processing method, system and computer readable storage medium
CN110704283A (en) * 2019-09-05 2020-01-17 北京浪潮数据技术有限公司 Method, device and medium for uniformly generating alarm information

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104410535A (en) * 2014-12-23 2015-03-11 浪潮电子信息产业股份有限公司 Intelligent cloud resource monitoring and warning method
CN107368365A (en) * 2017-07-25 2017-11-21 携程旅游信息技术(上海)有限公司 Cloud platform automatic O&M method, system, equipment and storage medium
CN108769207A (en) * 2018-05-30 2018-11-06 郑州云海信息技术有限公司 A kind of cloud platform resource monitoring method and system
CN108989132A (en) * 2018-08-24 2018-12-11 深圳前海微众银行股份有限公司 Fault warning processing method, system and computer readable storage medium
CN110704283A (en) * 2019-09-05 2020-01-17 北京浪潮数据技术有限公司 Method, device and medium for uniformly generating alarm information

Also Published As

Publication number Publication date
CN111343017B (en) 2022-12-09

Similar Documents

Publication Publication Date Title
US8499331B1 (en) Policy based network compliance
US8051028B2 (en) Method and apparatus for generating configuration rules for computing entities within a computing environment using association rule mining
US9794153B2 (en) Determining a risk level for server health check processing
US8024772B1 (en) Application service policy compliance server
CN112698915A (en) Multi-cluster unified monitoring alarm method, system, equipment and storage medium
CN113687969A (en) Alarm information generation method and device, electronic equipment and readable storage medium
CN111124722B (en) Method, equipment and medium for isolating fault memory
CN111104238B (en) CE-based memory diagnosis method, device and medium
AU2016357206A1 (en) Deployment assurance checks for monitoring industrial control systems
US11675647B2 (en) Determining root-cause of failures based on machine-generated textual data
CN111343017B (en) Method, system, equipment and medium for cloud platform resource alarm
CN114205224A (en) Multimedia equipment remote operation and maintenance control method and system based on Internet of things
US9282005B1 (en) IT infrastructure policy breach investigation interface
CN109462617B (en) Method and device for detecting communication behavior of equipment in local area network
EP2141884A1 (en) Anti-intrusion method and system for a communication network
CN111147542A (en) Secret-free access setting method, device, equipment and medium
US7472183B1 (en) Approaches for capturing illegal and undesired behavior in network components and component interactions
US11792075B2 (en) State-based entity behavior analysis
CN113010375B (en) Equipment alarm method and related equipment
CN114138615A (en) Service alarm processing method, device, equipment and storage medium
CN112817827A (en) Operation and maintenance method, device, server, equipment, system and medium
CN113158185B (en) Safety detection method and device
TWI789193B (en) Method and device for automatically checking authority of computer folder
US20230208727A1 (en) Data integrity illustration and evaluation system and method
TWI778787B (en) Monitoring and warning method and terminal equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant