CN114518934A - Unified operation and maintenance platform architecture system - Google Patents
Unified operation and maintenance platform architecture system Download PDFInfo
- Publication number
- CN114518934A CN114518934A CN202111660543.2A CN202111660543A CN114518934A CN 114518934 A CN114518934 A CN 114518934A CN 202111660543 A CN202111660543 A CN 202111660543A CN 114518934 A CN114518934 A CN 114518934A
- Authority
- CN
- China
- Prior art keywords
- layer
- service
- management
- maintenance
- unified
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012423 maintenance Methods 0.000 title claims abstract description 72
- 238000007726 management method Methods 0.000 claims abstract description 78
- 238000012544 monitoring process Methods 0.000 claims abstract description 69
- 238000012545 processing Methods 0.000 claims abstract description 19
- 238000011161 development Methods 0.000 claims abstract description 6
- 239000003795 chemical substances by application Substances 0.000 claims description 12
- 239000011159 matrix material Substances 0.000 claims description 9
- 238000005516 engineering process Methods 0.000 claims description 8
- 238000000034 method Methods 0.000 claims description 8
- 238000004458 analytical method Methods 0.000 claims description 7
- 238000013475 authorization Methods 0.000 claims description 6
- 230000008520 organization Effects 0.000 claims description 6
- 238000004140 cleaning Methods 0.000 claims description 4
- 238000001914 filtration Methods 0.000 claims description 4
- 238000011068 loading method Methods 0.000 claims description 4
- 108010028984 3-isopropylmalate dehydratase Proteins 0.000 claims description 3
- 238000004422 calculation algorithm Methods 0.000 claims description 3
- 238000010367 cloning Methods 0.000 claims description 3
- 238000013507 mapping Methods 0.000 claims description 3
- 238000005192 partition Methods 0.000 claims description 3
- 238000013468 resource allocation Methods 0.000 claims description 3
- 239000000344 soap Substances 0.000 claims description 3
- 238000007619 statistical method Methods 0.000 claims description 3
- 230000004931 aggregating effect Effects 0.000 claims 1
- 238000013480 data collection Methods 0.000 claims 1
- 238000013461 design Methods 0.000 abstract description 4
- 238000012384 transportation and delivery Methods 0.000 abstract description 4
- 238000013473 artificial intelligence Methods 0.000 abstract description 3
- 239000000306 component Substances 0.000 description 8
- 230000007246 mechanism Effects 0.000 description 7
- 238000003860 storage Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 5
- 238000004220 aggregation Methods 0.000 description 3
- 230000002776 aggregation Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 238000003745 diagnosis Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 101000797623 Homo sapiens Protein AMBP Proteins 0.000 description 1
- 102100032859 Protein AMBP Human genes 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000008358 core component Substances 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004941 influx Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 210000001503 joint Anatomy 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0766—Error or fault reporting or storing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/079—Root cause analysis, i.e. error or fault diagnosis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3003—Monitoring arrangements specially adapted to the computing system or computing system component being monitored
- G06F11/3006—Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3051—Monitoring arrangements for monitoring the configuration of the computing system or of the computing system component, e.g. monitoring the presence of processing resources, peripherals, I/O links, software programs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/324—Display of status information
- G06F11/327—Alarm or error message display
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
- G06F9/45558—Hypervisor-specific management and integration aspects
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0631—Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0677—Localisation of faults
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/10—Network architectures or network communication protocols for network security for controlling access to devices or network resources
- H04L63/105—Multiple levels of security
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/02—Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- Computing Systems (AREA)
- Computer Security & Cryptography (AREA)
- Computer Hardware Design (AREA)
- Environmental & Geological Engineering (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a unified operation and maintenance platform architecture system, which comprises the following unit modules: a functional architecture module: the module comprises an acquisition layer, a technical operation and maintenance domain layer, a public component domain layer, a service application domain layer and a centralized display layer; the technical architecture module comprises a display layer, a service layer, a processing layer, an operation layer, an acquisition layer and a resource layer. The invention covers all fields of operation and maintenance services, including monitoring, managing, controlling, serving, safety, big data, artificial intelligence and other aspects, provides product capability from the perspective of application scenes to meet the operation and maintenance requirements of different enterprises at different development stages, takes CMDB data as a core so as to meet the unified management of enterprise global resources, and also ensure that a data association architecture of each technical domain has flexible expandability to meet the operation and maintenance requirements of enterprises of different scales and different maturity, and takes user experience and engineering implementation amount into consideration in platform design so as to improve the usability of users and reduce project delivery cost.
Description
Technical Field
The invention relates to the technical field of media operation, in particular to a unified operation and maintenance platform architecture system.
Background
With the continuous expansion of the business, the construction and the expansion of a business system bring a series of pain points. The variety, the brand and the quantity of equipment are various, the system architecture and the version are different, and the artificial routing inspection monitoring is finite; multiple operation and maintenance processes, multiple media and insufficient electronization coverage; multiple operation and maintenance forms, a traditional environment, a virtual environment and a cloud environment. The support system is urgently needed to be changed towards the direction of 'IT as a service' by relying on the ITSS national standard and following the advanced service management concept of the ITIL to improve the capability and the operation and maintenance capability of the information application product, so that the operation cost is more effectively saved, the working flow is simplified, and the unified automatic supervision is realized.
Disclosure of Invention
The invention aims to solve the defects in the prior art and provides a unified operation and maintenance platform architecture system.
In order to achieve the purpose, the invention adopts the following technical scheme:
a unified operation and maintenance platform architecture system comprises the following unit modules:
a functional architecture module: the module comprises an acquisition layer, a technical operation and maintenance domain layer, a public component domain layer, a service application domain layer and a centralized display layer;
the technical architecture module comprises a display layer, a service layer, a processing layer, an operation layer, an acquisition layer and a resource layer;
the display layer is a centralized display portal layer of the unified monitoring system and is an inlet of the unified monitoring system; the service layer provides uniform operation and maintenance service for the uniform monitoring system, and the top layer is a northbound API (application programming interface) interface which provides uniform interface service for the display layer and a third-party system; the processing layer provides support for upper business services, and the support comprises processing capabilities of inquiry, storage, calculation, aggregation, conversion, cleaning, extraction, filtration and loading; the operation layer is the basis for deploying and operating the unified monitoring system, and H3Linux provides operating system resources for the unified monitoring system; the acquisition layer acquires the energy, state and configuration attribute data of various resources through a protocol; the resource layer comprises all objects of enterprise IT operation and maintenance management, including machine room dynamic loop equipment, IT infrastructure equipment, cloud environment, a database, middleware and application;
deploying the architecture module: the method is based on an H3C Matrix containerized deployment platform, and the platform deploys and monitors the micro-service based on a Kubernets cluster;
the authority management module: adopting RBAC model authority control to support function authorization and data authorization;
customizing the large screen module: the module supports the realization of the self-combination and the centralized display of the chart in a dragging mode.
Preferably, the acquisition layer can be used for interfacing management objects of terminals, networks, clouds and safety universes through Agent and non-Agent acquisition modes and acquisition of a third-party system;
the technical operation and maintenance domain layer comprises the fields of basic architecture management, hardware monitoring, service monitoring, dynamic ring management, video monitoring and wireless management professional technical operation and maintenance;
the public component domain layer is used for extracting the service function with public attribute to form a public module for other service scenes to be directly called, thereby avoiding repeated development and reducing the redundancy of the system, and comprises the following steps: the system comprises a flow arrangement engine, a knowledge search engine, a knowledge gallery, an AI algorithm and an API interface functional module;
the business application domain layer is an operation and maintenance business domain divided according to business characteristics and application scenes of IT operation and maintenance, and comprises monitoring management, resource management, service flow management, automatic management and intelligent analysis;
the centralized display layer provides a unified entrance for the daily operation and maintenance work of operation and maintenance personnel, and the platform provides a plurality of display modes of a PC desktop, a large screen and a mobile terminal.
Preferably, the presentation layer is a centralized presentation portal layer of the unified monitoring system, is an entrance of the unified monitoring system, and mainly comprises an administrator view, a viewer view, a tenant view, large screen monitoring, a desktop portal and a customized scene portal, and the mainly used technology stack comprises Html5, Javascript, Css, Vue and SpringBoot mainstream web front-end technology.
Preferably, the service layer: the upper layer is a northbound API interface, unified interface services are provided for a display layer and a third-party system, an API Gate is a unified inlet of all service requests, a Cas Server provides service request authentication for all service requests, access to the third-party system is achieved through single sign-on, and a RBAC role authority management model provides unified function level and data level authority control for operation and maintenance services. The following are business function services, which include authentication, organization and user management, log operation, notification management, alarm management, topology management, log management, statistical analysis, machine room facility monitoring, hardware device monitoring, application performance monitoring, business monitoring, user experience monitoring, service flow management, resource management, server automation, and network automation.
Preferably, the operation layer is a basis for deployment and operation of a unified monitoring system, H3Linux provides operating system resources for the unified monitoring system, the display layer, the service layer and the processing layer services on the upper layer are packaged and isolated in a Docker container mode, are arranged and managed through kubernets, are uniformly clustered and deployed through a Matrix graphical interface system, and provide functions of self-monitoring, backup, software installation and uninstallation.
Preferably, the acquisition layer supports Agent and non-Agent modes for data acquisition, and acquires the performance, state and configuration attribute data of various resources through various protocols such as SNMP, SSH, Telnet, FTP, sFTP, WMI, IPMI, NetConf, NetFlow, NetStream, JDBC, Restful, Soap, SDK, JMX, Socket and SMI-S.
Preferably, the resource layer includes all objects of the enterprise IT operation and maintenance management, including the machine room dynamic ring device, the IT infrastructure device, the cloud environment, the database, the middleware, and the application.
Preferably, the H3C Matrix deployment platform operates in a cluster manner, and the cluster is composed of:
master node: the system is responsible for resource management and container scheduling work of the whole cluster, and three physical servers are needed to be used as the cluster;
and (3) Worker node: sharing and processing cluster service, users can carry out service software installation selection according to service requirements and carry out resource allocation according to cluster load conditions;
one node is automatically selected from the three Master nodes to serve as a Master Master node, the Master Master node is responsible for managing and monitoring all the nodes in the cluster, and the configured northbound service virtual IP is issued to the Master Master node.
Preferably, the authority management module automatically filters data and displays the data in a tree structure hierarchy through the attribution organization mapping of resources and personnel so as to meet the requirement of centralized unified operation and maintenance of enterprises with a multi-level organizational structure.
Preferably, the customized large-screen module supports the diagrammatized display of third-party data after accessing according to a platform standard format, and simultaneously presets a plurality of graphically defined service components and supports the functions of large-screen partition layout, sequencing, cloning and previewing.
Compared with the prior art, the invention has the beneficial effects that: the invention covers all fields of operation and maintenance services, including monitoring, managing, controlling, serving, safety, big data, artificial intelligence and other aspects, provides product capability from the perspective of application scenes to meet the operation and maintenance requirements of different enterprises at different development stages, takes CMDB data as a core so as to meet the unified management of enterprise global resources, and also ensure that a data association architecture of each technical domain has flexible expandability to meet the operation and maintenance requirements of enterprises of different scales and different maturity, and takes user experience and engineering implementation amount into consideration in platform design so as to improve the usability of users and reduce project delivery cost.
Drawings
In order to more particularly and intuitively illustrate an embodiment of the present invention or a prior art solution, a brief description of the drawings needed for use in the description of the embodiment or the prior art will be provided below.
FIG. 1 is a functional architectural design diagram of the present invention;
FIG. 2 is a technical architecture layout;
FIG. 3 is a deployment architecture layout.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments.
Referring to fig. 1-3, a unified operation and maintenance platform architecture system includes the following unit modules:
a functional architecture module: the module comprises an acquisition layer, a technical operation and maintenance domain layer, a public component domain layer, a service application domain layer and a centralized display layer;
the technical architecture module comprises a display layer, a service layer, a processing layer, an operation layer, an acquisition layer and a resource layer;
the display layer is a centralized display portal layer of the unified monitoring system and is an inlet of the unified monitoring system; the service layer provides uniform operation and maintenance service for the uniform monitoring system, and the top layer is a northbound API (application programming interface) interface which provides uniform interface service for the display layer and a third-party system; the processing layer provides support for upper business services, and the support comprises processing capabilities of inquiry, storage, calculation, aggregation, conversion, cleaning, extraction, filtration and loading; the operation layer is the basis for deploying and operating the unified monitoring system, and H3Linux provides operating system resources for the unified monitoring system; the acquisition layer acquires the energy, state and configuration attribute data of various resources through a protocol; the resource layer comprises all objects of enterprise IT operation and maintenance management, including machine room dynamic loop equipment, IT infrastructure equipment, cloud environment, a database, middleware and application;
deploying the architecture module: the method is based on an H3C Matrix containerization deployment platform, and the platform deploys and monitors the micro-service based on a Kubernetes cluster;
the authority management module: adopting RBAC model authority control to support function authorization and data authorization;
customizing the large screen module: the module supports the realization of the self-combination and the centralized display of the chart in a dragging mode.
In the embodiment, the acquisition layer can be used for butt-jointing management objects of terminals, networks, clouds and safety universes through Agent and non-Agent acquisition modes and acquisition of a third-party system;
the technical operation and maintenance domain layer comprises the fields of basic architecture management, hardware monitoring, service monitoring, dynamic ring management, video monitoring and wireless management professional technical operation and maintenance;
the public component domain layer is used for extracting the service function with public attribute to form a public module for other service scenes to be directly called, thereby avoiding repeated development and reducing the redundancy of the system, and comprises the following steps: the system comprises a flow arrangement engine, a knowledge search engine, a knowledge gallery, an AI algorithm and an API interface functional module;
the business application domain layer is an operation and maintenance business domain divided according to business characteristics and application scenes of IT operation and maintenance, and comprises monitoring management, resource management, service flow management, automatic management and intelligent analysis; the monitoring management has comprehensive and deep monitoring capability, and comprises monitoring on the global heterogeneous resources such as networks, hardware, storage, basic software, cloud platforms, business applications, videos, dynamic rings, wireless resources and the like.
Resource management is a core component of a platform, and mainly manages related information of an operation and maintenance object (namely, a resource), and provides information such as resource attributes and relationships for other components for consumption.
The service flow assembly mainly covers the flows and the related business of the work orders in the operation and maintenance work, and the work of the operation and maintenance personnel is connected in series and recorded through the flow work orders, so that the work standardization and the transaction work ordering are finally realized.
The automatic operation and maintenance management comprises the functions of network automation, server automation, storage automation, script management automation, application delivery automation and the like, the automation of IT operation and maintenance management is realized from the aspect of operation and maintenance operation, and the operation and maintenance efficiency and the user satisfaction of an IT department are really improved.
The intelligent analysis component mainly comprises functions of log acquisition and analysis, fault root cause diagnosis and positioning, usability, capacity analysis and the like, assists in quickly completing daily operation and maintenance work such as fault positioning, service operation diagnosis, system capacity expansion and the like, improves operation and maintenance work efficiency, and provides high-quality operation and maintenance support for stable and efficient operation of a service system.
The centralized display layer provides a unified entrance for the daily operation and maintenance work of operation and maintenance personnel, and the platform provides a plurality of display modes of a PC desktop, a large screen and a mobile terminal.
In this embodiment, the presentation layer is a centralized presentation portal layer of the unified monitoring system, is an entrance of the unified monitoring system, and mainly includes an administrator view, a viewer view, a tenant view, large screen monitoring, a desktop portal, and a customized scene portal, and the mainly used technology stack includes Html5, Javascript, Css, Vue, and SpringBoot mainstream web front-end technologies.
In this embodiment, the service layer: the upper layer is a northbound API interface, unified interface services are provided for a display layer and a third-party system, an API Gate is a unified inlet of all service requests, a Cas Server provides service request authentication for all service requests, access to the third-party system is achieved through single sign-on, and a RBAC role authority management model provides unified function level and data level authority control for operation and maintenance services. The following are business function services, which include authentication, organization and user management, log manipulation, notification management, alarm management, topology management, log management, statistical analysis, and machine room facilities monitoring, hardware device monitoring, application performance monitoring, business monitoring, user experience monitoring, service flow management, resource management, server automation, and network automation.
In the embodiment, the operation layer is the basis of the deployment and operation of the unified monitoring system, the H3Linux provides operating system resources for the unified monitoring system, the display layer, the service layer and the processing layer on the upper layer are packaged and isolated in a Docker container mode, the services are arranged and managed through Kubernets, then the unified cluster installation and deployment are carried out through a Matrix graphical interface system, and the functions of self-monitoring, backup, software installation and unloading are provided.
And (3) treatment layer: providing support for upper-layer business service, including processing capabilities of inquiry, storage, calculation, aggregation, conversion, cleaning, extraction, filtration, loading and the like, wherein a processing layer uses a distributed cluster technology to ensure that the service provided for the business layer has the characteristics of high availability, high concurrency, high performance and the like, a bottom-layer technology stack comprises a relational database MySQL (storage configuration attribute and management data), a time sequence database Influx DB (storing various index data to meet the high-performance inquiry and analysis requirements on the time sequence data), Redis (meeting the cache requirements of high-speed inquiry), an Elastic Search document database (storing document data to meet the requirements of large-scale log storage inquiry, full-text retrieval and the like), a database Orient DB (storing CI association relations to meet the requirements of high-performance inquiry), Kafka (meeting the high-speed throughput processing of large-scale messages), Zookeeper (to meet the requirements of distributed cluster coordination), spincloud, actividi (as a process engine to meet the process orchestration of service process management and automation operations), and so on.
In the embodiment, the acquisition layer supports the Agent mode and the non-Agent mode to acquire data, and the acquisition of the performance, state and configuration attribute data of various resources is realized through various protocols of SNMP, SSH, Telnet, FTP, sFTP, WMI, IPMI, NetConf, NetFlow, NetStream, JDBC, Restful, Soap, SDK, JMX, Socket and SMI-S.
In the embodiment, the resource layer comprises all objects of enterprise IT operation and maintenance management, including a machine room dynamic ring device, an IT infrastructure device, a cloud environment, a database, a middleware and an application.
In this embodiment, the H3C Matrix deployment platform operates in a cluster manner, and the cluster composition is:
master node: the system is responsible for resource management and container scheduling work of the whole cluster, and three physical servers are needed to be used as the cluster;
and (3) Worker node: sharing and processing cluster service, users can carry out service software installation selection according to service requirements and carry out resource allocation according to cluster load conditions;
one node is automatically selected from the three Master nodes to serve as a main Master node, the main Master node is responsible for managing and monitoring all the nodes in the cluster, and the configured northbound service virtual IP is issued to the main Master node.
In the embodiment, the authority management module automatically filters data and displays the data in a tree structure hierarchy through the mapping of the attribution mechanisms of resources and personnel so as to meet the requirement of centralized unified operation and maintenance of enterprises with a multi-level organization structure, and the personnel attribution mechanisms are associated with the resource attribution mechanisms so as to enable the personnel of each mechanism to only process resource objects of the corresponding mechanism. In each mechanism, resources are grouped to realize that different personnel in the same mechanism manage different resource objects; and the hierarchical and separate right management can be realized by combining the functional right and the data right. If more detailed authority division is needed, the user can customize the authority according to the need; different authorities are formed by combining the function menu and the operation buttons of the platform. Different rights packages can in turn be combined into different roles. Different roles are associated with different accounts, so that different personnel can be controlled to operate different menus and function items.
In the embodiment, the customized large-screen module supports the access of third-party data according to a platform standard format and then graphical display, and simultaneously presets a plurality of graphically defined service components and supports the functions of large-screen partition layout, sequencing, cloning and previewing.
In the scheme, the platform system functions cover all fields of operation and maintenance services, including monitoring, management, control, service, safety, big data, artificial intelligence and other aspects, product capability is provided from the perspective of an application scene to meet operation and maintenance requirements of different enterprises in different development stages, CMDB data is taken as a core so as to meet the unified management of global resources of the enterprises and also ensure that a data association framework of each technical field has flexible expandability so as to meet the operation and maintenance requirements of enterprises with different scales and different maturity, the framework has sufficient openness so as to quickly realize the butt joint with a third-party system, the framework has hot plug capability so as not to influence the normal operation of other service modules when a certain service module is started and stopped, and the platform has the capabilities of unified portal, unified alarm, unified resources, unified flow engine and unified knowledge management, the operation and maintenance requirements of centralized management are met, the user experience and the engineering implementation amount are considered in the platform design, and therefore the usability of users is improved and the project delivery cost is reduced.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art should be able to substitute or change the technical solution and the inventive concept of the present invention within the technical scope of the present invention.
Claims (10)
1. A unified operation and maintenance platform architecture system is characterized by comprising the following unit modules:
the functional architecture module: the module comprises an acquisition layer, a technical operation and maintenance domain layer, a public component domain layer, a service application domain layer and a centralized display layer;
the technical architecture module comprises a display layer, a service layer, a processing layer, an operation layer, an acquisition layer and a resource layer;
the display layer is a centralized display portal layer of the unified monitoring system and is an inlet of the unified monitoring system; the service layer provides uniform operation and maintenance service for the uniform monitoring system, and the top layer is a northbound API (application programming interface) interface which provides uniform interface service for the display layer and a third-party system; the processing layer provides support for upper business services, and the support comprises the processing capabilities of inquiring, storing, calculating, aggregating, converting, cleaning, extracting, filtering and loading; the operation layer is the basis for deploying and operating the unified monitoring system, and H3Linux provides operating system resources for the unified monitoring system; the acquisition layer acquires the energy, state and configuration attribute data of various resources through a protocol; the resource layer comprises all objects of enterprise IT operation and maintenance management, including machine room dynamic loop equipment, IT infrastructure equipment, cloud environment, a database, middleware and application;
deploying the architecture module: the method is based on an H3C Matrix containerization deployment platform, and the platform deploys and monitors the micro-service based on a Kubernetes cluster;
the authority management module: adopting RBAC model authority control to support function authorization and data authorization;
customizing a large screen module: the module supports the realization of the self-combination and the centralized display of the chart in a dragging mode.
2. The unified operation and maintenance platform architecture system according to claim 1, wherein the collection layer can interface management objects of terminals, networks, clouds and security universes through Agent and non-Agent collection modes and collection of a third-party system;
the technical operation and maintenance domain layer comprises the fields of basic architecture management, hardware monitoring, service monitoring, dynamic ring management, video monitoring and wireless management professional technical operation and maintenance;
the public component domain layer is used for extracting the service function with public attribute to form a public module for other service scenes to be directly called, thereby avoiding repeated development and reducing the redundancy of the system, and comprises the following steps: the system comprises a flow arrangement engine, a knowledge search engine, a knowledge gallery, an AI algorithm and an API interface functional module;
the business application domain layer is an operation and maintenance business domain divided according to business characteristics and application scenes of IT operation and maintenance, and comprises monitoring management, resource management, service flow management, automatic management and intelligent analysis;
the centralized display layer provides a unified entrance for the daily operation and maintenance work of operation and maintenance personnel, and the platform provides a plurality of display modes of a PC desktop, a large screen and a mobile terminal.
3. The unified operation and maintenance platform architecture system according to claim 2, wherein the presentation layer is a centralized presentation portal layer of the unified monitoring system, is an entrance of the unified monitoring system, and mainly comprises an administrator view, a viewer view, a tenant view, a large screen monitor, a desktop portal, and a customized scenarization portal, and the mainly used technology stack comprises a web front-end technology of a main stream of Html5, Javascript, Css, Vue, and SpringBoot.
4. The unified operation and maintenance platform architecture system according to claim 3, wherein the service layer: the upper layer is a northbound API interface, unified interface services are provided for a display layer and a third-party system, an API Gate is a unified inlet of all service requests, a Cas Server provides service request authentication for all service requests, access to the third-party system is achieved through single sign-on, and a RBAC role authority management model provides unified function level and data level authority control for operation and maintenance services. The following are business function services, which include authentication, organization and user management, log operation, notification management, alarm management, topology management, log management, statistical analysis, machine room facility monitoring, hardware device monitoring, application performance monitoring, business monitoring, user experience monitoring, service flow management, resource management, server automation, and network automation.
5. The architecture system of claim 4, wherein the operation layer is a basis for deployment and operation of the unified monitoring system, H3Linux provides operating system resources for the unified monitoring system, the display layer, the service layer and the processing layer of the upper layer are packaged and isolated in a Docker container manner, are arranged and managed by kubernets, are uniformly installed and deployed in a cluster by a Matrix graphical interface system, and provide functions of self-monitoring, backup, software installation and uninstallation.
6. The framework system of claim 5, wherein the collection layer supports Agent and non-Agent modes for data collection, and collects performance, status, and configuration attribute data of various resources through SNMP, SSH, Telnet, FTP, sFTP, WMI, IPMI, NetConf, NetFlow, NetStream, JDBC, Restful, Soap, SDK, JMX, Socket, and SMI-S protocols.
7. The architecture system of claim 6, wherein the resource layer comprises all objects of enterprise IT operation and maintenance management, including machine room dynamic ring equipment, IT infrastructure equipment, cloud environment, database, middleware, and applications.
8. The system of claim 7, wherein the H3C Matrix deployment platform operates in a cluster, and the cluster is composed of:
master node: the system is responsible for resource management and container scheduling work of the whole cluster, and three physical servers are needed to be used as the cluster;
and (3) Worker node: sharing and processing cluster service, users can carry out service software installation selection according to service requirements and carry out resource allocation according to cluster load conditions;
one node is automatically selected from the three Master nodes to serve as a main Master node, the main Master node is responsible for managing and monitoring all the nodes in the cluster, and the configured northbound service virtual IP is issued to the main Master node.
9. The architecture system of claim 8, wherein the rights management module automatically filters data and displays the data in a tree-type hierarchy through the attribution organization mapping of resources and personnel, so as to meet the requirement of centralized unified operation and maintenance of enterprises with multi-level organizational structures.
10. The architecture system of a unified operation and maintenance platform according to claim 9, wherein the customized large screen module supports graphical display after third-party data is accessed according to a platform standard format, and also presets a plurality of graphically defined service components, and supports large screen partition layout, sorting, cloning and previewing functions.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111660543.2A CN114518934A (en) | 2021-12-31 | 2021-12-31 | Unified operation and maintenance platform architecture system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111660543.2A CN114518934A (en) | 2021-12-31 | 2021-12-31 | Unified operation and maintenance platform architecture system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114518934A true CN114518934A (en) | 2022-05-20 |
Family
ID=81597717
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111660543.2A Pending CN114518934A (en) | 2021-12-31 | 2021-12-31 | Unified operation and maintenance platform architecture system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114518934A (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114679366A (en) * | 2022-05-25 | 2022-06-28 | 广州嘉为科技有限公司 | Tenant-oriented operation and maintenance tool opening method, system and medium in multi-cloud environment |
CN115118745A (en) * | 2022-06-08 | 2022-09-27 | 浙江工业大学 | Performance equipment information interconnection platform system and construction method thereof |
CN115421394A (en) * | 2022-09-20 | 2022-12-02 | 浪潮通信信息系统有限公司 | Method and device for constructing standard model in smart home architecture |
CN116562848A (en) * | 2023-05-05 | 2023-08-08 | 江西意孚欧科技有限公司 | Operation and maintenance management platform |
CN116579732A (en) * | 2023-04-06 | 2023-08-11 | 三亚宇航科技有限公司 | Operation and maintenance supervision method based on Internet of things |
CN116775255A (en) * | 2023-08-15 | 2023-09-19 | 长沙伊士格信息科技有限责任公司 | Global integration system supporting wide integration scene |
CN116562848B (en) * | 2023-05-05 | 2024-10-25 | 西安绿点信息科技有限公司 | Operation and maintenance management platform |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109657076A (en) * | 2018-12-06 | 2019-04-19 | 安徽海豚新媒体产业发展有限公司 | One kind is multi-functional to melt media center service platform |
CN110266533A (en) * | 2019-06-18 | 2019-09-20 | 湖南晖龙集团股份有限公司 | Big data platform management system |
CN110458528A (en) * | 2019-08-07 | 2019-11-15 | 上海数讯信息技术有限公司 | A kind of full-service configuration management platform based on CMDB operation management |
CN111885439A (en) * | 2020-07-24 | 2020-11-03 | 西安众联润科信息技术有限公司 | Optical network integrated management and duty management system |
CN111917887A (en) * | 2020-08-17 | 2020-11-10 | 普元信息技术股份有限公司 | System for realizing data governance under big data environment |
CN113794578A (en) * | 2021-07-08 | 2021-12-14 | 中国南方电网有限责任公司 | Communication network monitoring architecture system based on cloud platform |
-
2021
- 2021-12-31 CN CN202111660543.2A patent/CN114518934A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109657076A (en) * | 2018-12-06 | 2019-04-19 | 安徽海豚新媒体产业发展有限公司 | One kind is multi-functional to melt media center service platform |
CN110266533A (en) * | 2019-06-18 | 2019-09-20 | 湖南晖龙集团股份有限公司 | Big data platform management system |
CN110458528A (en) * | 2019-08-07 | 2019-11-15 | 上海数讯信息技术有限公司 | A kind of full-service configuration management platform based on CMDB operation management |
CN111885439A (en) * | 2020-07-24 | 2020-11-03 | 西安众联润科信息技术有限公司 | Optical network integrated management and duty management system |
CN111917887A (en) * | 2020-08-17 | 2020-11-10 | 普元信息技术股份有限公司 | System for realizing data governance under big data environment |
CN113794578A (en) * | 2021-07-08 | 2021-12-14 | 中国南方电网有限责任公司 | Communication network monitoring architecture system based on cloud platform |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114679366A (en) * | 2022-05-25 | 2022-06-28 | 广州嘉为科技有限公司 | Tenant-oriented operation and maintenance tool opening method, system and medium in multi-cloud environment |
CN115118745A (en) * | 2022-06-08 | 2022-09-27 | 浙江工业大学 | Performance equipment information interconnection platform system and construction method thereof |
CN115421394A (en) * | 2022-09-20 | 2022-12-02 | 浪潮通信信息系统有限公司 | Method and device for constructing standard model in smart home architecture |
CN116579732A (en) * | 2023-04-06 | 2023-08-11 | 三亚宇航科技有限公司 | Operation and maintenance supervision method based on Internet of things |
CN116562848A (en) * | 2023-05-05 | 2023-08-08 | 江西意孚欧科技有限公司 | Operation and maintenance management platform |
CN116562848B (en) * | 2023-05-05 | 2024-10-25 | 西安绿点信息科技有限公司 | Operation and maintenance management platform |
CN116775255A (en) * | 2023-08-15 | 2023-09-19 | 长沙伊士格信息科技有限责任公司 | Global integration system supporting wide integration scene |
CN116775255B (en) * | 2023-08-15 | 2023-11-21 | 长沙伊士格信息科技有限责任公司 | Global integration system supporting wide integration scene |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114518934A (en) | Unified operation and maintenance platform architecture system | |
WO2021017301A1 (en) | Management method and apparatus based on kubernetes cluster, and computer-readable storage medium | |
CN106559488B (en) | A method of establishing the power grid geographical information space service of tenant's driving | |
CN105843182B (en) | A kind of power scheduling accident prediction system and method based on OMS | |
WO2023142054A1 (en) | Container microservice-oriented performance monitoring and alarm method and alarm system | |
US10061371B2 (en) | System and method for monitoring and managing data center resources in real time incorporating manageability subsystem | |
EP2625614B1 (en) | System and method for monitoring and managing data center resources in real time incorporating manageability subsystem | |
CN109379217B (en) | A kind of different producer's arranging service device of Metropolitan Area Network (MAN) | |
CN107181808A (en) | A kind of privately owned cloud system and operation method | |
CN112162821B (en) | Container cluster resource monitoring method, device and system | |
CN108848132B (en) | Power distribution scheduling main station system based on cloud | |
CN103973815A (en) | Method for unified monitoring of storage environment across data centers | |
CN104601673B (en) | Extensible high-availability server layered monitoring system | |
CN109542583B (en) | Virtual equipment management method based on double buses | |
CN104637265A (en) | Dispatch-automated multilevel integration intelligent watching alarming system | |
Trakadas et al. | Scalable monitoring for multiple virtualized infrastructures for 5G services | |
CN114244676A (en) | Intelligent IT integrated gateway system | |
CN113596150A (en) | Message pushing method and device, computer equipment and storage medium | |
CN108011769A (en) | A kind of implementation method of visualized O&M system | |
CN113127526A (en) | Distributed data storage and retrieval system based on Kubernetes | |
CN108401035A (en) | A kind of integrated monitoring apparatus and method based on MDC | |
CN112068953B (en) | Cloud resource fine management traceability system and method | |
CN108471452B (en) | Single cabinet data center monitoring method, system and device | |
CN113824801B (en) | Intelligent integration terminal unified access management component system | |
CN115719147A (en) | Power transmission line inspection data processing method, device and platform |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |