CN114518934A - Unified operation and maintenance platform architecture system - Google Patents

Unified operation and maintenance platform architecture system Download PDF

Info

Publication number
CN114518934A
CN114518934A CN202111660543.2A CN202111660543A CN114518934A CN 114518934 A CN114518934 A CN 114518934A CN 202111660543 A CN202111660543 A CN 202111660543A CN 114518934 A CN114518934 A CN 114518934A
Authority
CN
China
Prior art keywords
layer
service
management
maintenance
unified
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111660543.2A
Other languages
Chinese (zh)
Inventor
管华骥
丁成波
熊海生
杨钲
廉龙彬
栾宜杰
李民明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anhui Dolphin New Media Industry Development Co ltd
Original Assignee
Anhui Dolphin New Media Industry Development Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anhui Dolphin New Media Industry Development Co ltd filed Critical Anhui Dolphin New Media Industry Development Co ltd
Priority to CN202111660543.2A priority Critical patent/CN114518934A/en
Publication of CN114518934A publication Critical patent/CN114518934A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0766Error or fault reporting or storing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079Root cause analysis, i.e. error or fault diagnosis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3051Monitoring arrangements for monitoring the configuration of the computing system or of the computing system component, e.g. monitoring the presence of processing resources, peripherals, I/O links, software programs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0677Localisation of faults
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/10Network architectures or network communication protocols for network security for controlling access to devices or network resources
    • H04L63/105Multiple levels of security
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Computing Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • Environmental & Geological Engineering (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a unified operation and maintenance platform architecture system, which comprises the following unit modules: a functional architecture module: the module comprises an acquisition layer, a technical operation and maintenance domain layer, a public component domain layer, a service application domain layer and a centralized display layer; the technical architecture module comprises a display layer, a service layer, a processing layer, an operation layer, an acquisition layer and a resource layer. The invention covers all fields of operation and maintenance services, including monitoring, managing, controlling, serving, safety, big data, artificial intelligence and other aspects, provides product capability from the perspective of application scenes to meet the operation and maintenance requirements of different enterprises at different development stages, takes CMDB data as a core so as to meet the unified management of enterprise global resources, and also ensure that a data association architecture of each technical domain has flexible expandability to meet the operation and maintenance requirements of enterprises of different scales and different maturity, and takes user experience and engineering implementation amount into consideration in platform design so as to improve the usability of users and reduce project delivery cost.

Description

Unified operation and maintenance platform architecture system
Technical Field
The invention relates to the technical field of media operation, in particular to a unified operation and maintenance platform architecture system.
Background
With the continuous expansion of the business, the construction and the expansion of a business system bring a series of pain points. The variety, the brand and the quantity of equipment are various, the system architecture and the version are different, and the artificial routing inspection monitoring is finite; multiple operation and maintenance processes, multiple media and insufficient electronization coverage; multiple operation and maintenance forms, a traditional environment, a virtual environment and a cloud environment. The support system is urgently needed to be changed towards the direction of 'IT as a service' by relying on the ITSS national standard and following the advanced service management concept of the ITIL to improve the capability and the operation and maintenance capability of the information application product, so that the operation cost is more effectively saved, the working flow is simplified, and the unified automatic supervision is realized.
Disclosure of Invention
The invention aims to solve the defects in the prior art and provides a unified operation and maintenance platform architecture system.
In order to achieve the purpose, the invention adopts the following technical scheme:
a unified operation and maintenance platform architecture system comprises the following unit modules:
a functional architecture module: the module comprises an acquisition layer, a technical operation and maintenance domain layer, a public component domain layer, a service application domain layer and a centralized display layer;
the technical architecture module comprises a display layer, a service layer, a processing layer, an operation layer, an acquisition layer and a resource layer;
the display layer is a centralized display portal layer of the unified monitoring system and is an inlet of the unified monitoring system; the service layer provides uniform operation and maintenance service for the uniform monitoring system, and the top layer is a northbound API (application programming interface) interface which provides uniform interface service for the display layer and a third-party system; the processing layer provides support for upper business services, and the support comprises processing capabilities of inquiry, storage, calculation, aggregation, conversion, cleaning, extraction, filtration and loading; the operation layer is the basis for deploying and operating the unified monitoring system, and H3Linux provides operating system resources for the unified monitoring system; the acquisition layer acquires the energy, state and configuration attribute data of various resources through a protocol; the resource layer comprises all objects of enterprise IT operation and maintenance management, including machine room dynamic loop equipment, IT infrastructure equipment, cloud environment, a database, middleware and application;
deploying the architecture module: the method is based on an H3C Matrix containerized deployment platform, and the platform deploys and monitors the micro-service based on a Kubernets cluster;
the authority management module: adopting RBAC model authority control to support function authorization and data authorization;
customizing the large screen module: the module supports the realization of the self-combination and the centralized display of the chart in a dragging mode.
Preferably, the acquisition layer can be used for interfacing management objects of terminals, networks, clouds and safety universes through Agent and non-Agent acquisition modes and acquisition of a third-party system;
the technical operation and maintenance domain layer comprises the fields of basic architecture management, hardware monitoring, service monitoring, dynamic ring management, video monitoring and wireless management professional technical operation and maintenance;
the public component domain layer is used for extracting the service function with public attribute to form a public module for other service scenes to be directly called, thereby avoiding repeated development and reducing the redundancy of the system, and comprises the following steps: the system comprises a flow arrangement engine, a knowledge search engine, a knowledge gallery, an AI algorithm and an API interface functional module;
the business application domain layer is an operation and maintenance business domain divided according to business characteristics and application scenes of IT operation and maintenance, and comprises monitoring management, resource management, service flow management, automatic management and intelligent analysis;
the centralized display layer provides a unified entrance for the daily operation and maintenance work of operation and maintenance personnel, and the platform provides a plurality of display modes of a PC desktop, a large screen and a mobile terminal.
Preferably, the presentation layer is a centralized presentation portal layer of the unified monitoring system, is an entrance of the unified monitoring system, and mainly comprises an administrator view, a viewer view, a tenant view, large screen monitoring, a desktop portal and a customized scene portal, and the mainly used technology stack comprises Html5, Javascript, Css, Vue and SpringBoot mainstream web front-end technology.
Preferably, the service layer: the upper layer is a northbound API interface, unified interface services are provided for a display layer and a third-party system, an API Gate is a unified inlet of all service requests, a Cas Server provides service request authentication for all service requests, access to the third-party system is achieved through single sign-on, and a RBAC role authority management model provides unified function level and data level authority control for operation and maintenance services. The following are business function services, which include authentication, organization and user management, log operation, notification management, alarm management, topology management, log management, statistical analysis, machine room facility monitoring, hardware device monitoring, application performance monitoring, business monitoring, user experience monitoring, service flow management, resource management, server automation, and network automation.
Preferably, the operation layer is a basis for deployment and operation of a unified monitoring system, H3Linux provides operating system resources for the unified monitoring system, the display layer, the service layer and the processing layer services on the upper layer are packaged and isolated in a Docker container mode, are arranged and managed through kubernets, are uniformly clustered and deployed through a Matrix graphical interface system, and provide functions of self-monitoring, backup, software installation and uninstallation.
Preferably, the acquisition layer supports Agent and non-Agent modes for data acquisition, and acquires the performance, state and configuration attribute data of various resources through various protocols such as SNMP, SSH, Telnet, FTP, sFTP, WMI, IPMI, NetConf, NetFlow, NetStream, JDBC, Restful, Soap, SDK, JMX, Socket and SMI-S.
Preferably, the resource layer includes all objects of the enterprise IT operation and maintenance management, including the machine room dynamic ring device, the IT infrastructure device, the cloud environment, the database, the middleware, and the application.
Preferably, the H3C Matrix deployment platform operates in a cluster manner, and the cluster is composed of:
master node: the system is responsible for resource management and container scheduling work of the whole cluster, and three physical servers are needed to be used as the cluster;
and (3) Worker node: sharing and processing cluster service, users can carry out service software installation selection according to service requirements and carry out resource allocation according to cluster load conditions;
one node is automatically selected from the three Master nodes to serve as a Master Master node, the Master Master node is responsible for managing and monitoring all the nodes in the cluster, and the configured northbound service virtual IP is issued to the Master Master node.
Preferably, the authority management module automatically filters data and displays the data in a tree structure hierarchy through the attribution organization mapping of resources and personnel so as to meet the requirement of centralized unified operation and maintenance of enterprises with a multi-level organizational structure.
Preferably, the customized large-screen module supports the diagrammatized display of third-party data after accessing according to a platform standard format, and simultaneously presets a plurality of graphically defined service components and supports the functions of large-screen partition layout, sequencing, cloning and previewing.
Compared with the prior art, the invention has the beneficial effects that: the invention covers all fields of operation and maintenance services, including monitoring, managing, controlling, serving, safety, big data, artificial intelligence and other aspects, provides product capability from the perspective of application scenes to meet the operation and maintenance requirements of different enterprises at different development stages, takes CMDB data as a core so as to meet the unified management of enterprise global resources, and also ensure that a data association architecture of each technical domain has flexible expandability to meet the operation and maintenance requirements of enterprises of different scales and different maturity, and takes user experience and engineering implementation amount into consideration in platform design so as to improve the usability of users and reduce project delivery cost.
Drawings
In order to more particularly and intuitively illustrate an embodiment of the present invention or a prior art solution, a brief description of the drawings needed for use in the description of the embodiment or the prior art will be provided below.
FIG. 1 is a functional architectural design diagram of the present invention;
FIG. 2 is a technical architecture layout;
FIG. 3 is a deployment architecture layout.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments.
Referring to fig. 1-3, a unified operation and maintenance platform architecture system includes the following unit modules:
a functional architecture module: the module comprises an acquisition layer, a technical operation and maintenance domain layer, a public component domain layer, a service application domain layer and a centralized display layer;
the technical architecture module comprises a display layer, a service layer, a processing layer, an operation layer, an acquisition layer and a resource layer;
the display layer is a centralized display portal layer of the unified monitoring system and is an inlet of the unified monitoring system; the service layer provides uniform operation and maintenance service for the uniform monitoring system, and the top layer is a northbound API (application programming interface) interface which provides uniform interface service for the display layer and a third-party system; the processing layer provides support for upper business services, and the support comprises processing capabilities of inquiry, storage, calculation, aggregation, conversion, cleaning, extraction, filtration and loading; the operation layer is the basis for deploying and operating the unified monitoring system, and H3Linux provides operating system resources for the unified monitoring system; the acquisition layer acquires the energy, state and configuration attribute data of various resources through a protocol; the resource layer comprises all objects of enterprise IT operation and maintenance management, including machine room dynamic loop equipment, IT infrastructure equipment, cloud environment, a database, middleware and application;
deploying the architecture module: the method is based on an H3C Matrix containerization deployment platform, and the platform deploys and monitors the micro-service based on a Kubernetes cluster;
the authority management module: adopting RBAC model authority control to support function authorization and data authorization;
customizing the large screen module: the module supports the realization of the self-combination and the centralized display of the chart in a dragging mode.
In the embodiment, the acquisition layer can be used for butt-jointing management objects of terminals, networks, clouds and safety universes through Agent and non-Agent acquisition modes and acquisition of a third-party system;
the technical operation and maintenance domain layer comprises the fields of basic architecture management, hardware monitoring, service monitoring, dynamic ring management, video monitoring and wireless management professional technical operation and maintenance;
the public component domain layer is used for extracting the service function with public attribute to form a public module for other service scenes to be directly called, thereby avoiding repeated development and reducing the redundancy of the system, and comprises the following steps: the system comprises a flow arrangement engine, a knowledge search engine, a knowledge gallery, an AI algorithm and an API interface functional module;
the business application domain layer is an operation and maintenance business domain divided according to business characteristics and application scenes of IT operation and maintenance, and comprises monitoring management, resource management, service flow management, automatic management and intelligent analysis; the monitoring management has comprehensive and deep monitoring capability, and comprises monitoring on the global heterogeneous resources such as networks, hardware, storage, basic software, cloud platforms, business applications, videos, dynamic rings, wireless resources and the like.
Resource management is a core component of a platform, and mainly manages related information of an operation and maintenance object (namely, a resource), and provides information such as resource attributes and relationships for other components for consumption.
The service flow assembly mainly covers the flows and the related business of the work orders in the operation and maintenance work, and the work of the operation and maintenance personnel is connected in series and recorded through the flow work orders, so that the work standardization and the transaction work ordering are finally realized.
The automatic operation and maintenance management comprises the functions of network automation, server automation, storage automation, script management automation, application delivery automation and the like, the automation of IT operation and maintenance management is realized from the aspect of operation and maintenance operation, and the operation and maintenance efficiency and the user satisfaction of an IT department are really improved.
The intelligent analysis component mainly comprises functions of log acquisition and analysis, fault root cause diagnosis and positioning, usability, capacity analysis and the like, assists in quickly completing daily operation and maintenance work such as fault positioning, service operation diagnosis, system capacity expansion and the like, improves operation and maintenance work efficiency, and provides high-quality operation and maintenance support for stable and efficient operation of a service system.
The centralized display layer provides a unified entrance for the daily operation and maintenance work of operation and maintenance personnel, and the platform provides a plurality of display modes of a PC desktop, a large screen and a mobile terminal.
In this embodiment, the presentation layer is a centralized presentation portal layer of the unified monitoring system, is an entrance of the unified monitoring system, and mainly includes an administrator view, a viewer view, a tenant view, large screen monitoring, a desktop portal, and a customized scene portal, and the mainly used technology stack includes Html5, Javascript, Css, Vue, and SpringBoot mainstream web front-end technologies.
In this embodiment, the service layer: the upper layer is a northbound API interface, unified interface services are provided for a display layer and a third-party system, an API Gate is a unified inlet of all service requests, a Cas Server provides service request authentication for all service requests, access to the third-party system is achieved through single sign-on, and a RBAC role authority management model provides unified function level and data level authority control for operation and maintenance services. The following are business function services, which include authentication, organization and user management, log manipulation, notification management, alarm management, topology management, log management, statistical analysis, and machine room facilities monitoring, hardware device monitoring, application performance monitoring, business monitoring, user experience monitoring, service flow management, resource management, server automation, and network automation.
In the embodiment, the operation layer is the basis of the deployment and operation of the unified monitoring system, the H3Linux provides operating system resources for the unified monitoring system, the display layer, the service layer and the processing layer on the upper layer are packaged and isolated in a Docker container mode, the services are arranged and managed through Kubernets, then the unified cluster installation and deployment are carried out through a Matrix graphical interface system, and the functions of self-monitoring, backup, software installation and unloading are provided.
And (3) treatment layer: providing support for upper-layer business service, including processing capabilities of inquiry, storage, calculation, aggregation, conversion, cleaning, extraction, filtration, loading and the like, wherein a processing layer uses a distributed cluster technology to ensure that the service provided for the business layer has the characteristics of high availability, high concurrency, high performance and the like, a bottom-layer technology stack comprises a relational database MySQL (storage configuration attribute and management data), a time sequence database Influx DB (storing various index data to meet the high-performance inquiry and analysis requirements on the time sequence data), Redis (meeting the cache requirements of high-speed inquiry), an Elastic Search document database (storing document data to meet the requirements of large-scale log storage inquiry, full-text retrieval and the like), a database Orient DB (storing CI association relations to meet the requirements of high-performance inquiry), Kafka (meeting the high-speed throughput processing of large-scale messages), Zookeeper (to meet the requirements of distributed cluster coordination), spincloud, actividi (as a process engine to meet the process orchestration of service process management and automation operations), and so on.
In the embodiment, the acquisition layer supports the Agent mode and the non-Agent mode to acquire data, and the acquisition of the performance, state and configuration attribute data of various resources is realized through various protocols of SNMP, SSH, Telnet, FTP, sFTP, WMI, IPMI, NetConf, NetFlow, NetStream, JDBC, Restful, Soap, SDK, JMX, Socket and SMI-S.
In the embodiment, the resource layer comprises all objects of enterprise IT operation and maintenance management, including a machine room dynamic ring device, an IT infrastructure device, a cloud environment, a database, a middleware and an application.
In this embodiment, the H3C Matrix deployment platform operates in a cluster manner, and the cluster composition is:
master node: the system is responsible for resource management and container scheduling work of the whole cluster, and three physical servers are needed to be used as the cluster;
and (3) Worker node: sharing and processing cluster service, users can carry out service software installation selection according to service requirements and carry out resource allocation according to cluster load conditions;
one node is automatically selected from the three Master nodes to serve as a main Master node, the main Master node is responsible for managing and monitoring all the nodes in the cluster, and the configured northbound service virtual IP is issued to the main Master node.
In the embodiment, the authority management module automatically filters data and displays the data in a tree structure hierarchy through the mapping of the attribution mechanisms of resources and personnel so as to meet the requirement of centralized unified operation and maintenance of enterprises with a multi-level organization structure, and the personnel attribution mechanisms are associated with the resource attribution mechanisms so as to enable the personnel of each mechanism to only process resource objects of the corresponding mechanism. In each mechanism, resources are grouped to realize that different personnel in the same mechanism manage different resource objects; and the hierarchical and separate right management can be realized by combining the functional right and the data right. If more detailed authority division is needed, the user can customize the authority according to the need; different authorities are formed by combining the function menu and the operation buttons of the platform. Different rights packages can in turn be combined into different roles. Different roles are associated with different accounts, so that different personnel can be controlled to operate different menus and function items.
In the embodiment, the customized large-screen module supports the access of third-party data according to a platform standard format and then graphical display, and simultaneously presets a plurality of graphically defined service components and supports the functions of large-screen partition layout, sequencing, cloning and previewing.
In the scheme, the platform system functions cover all fields of operation and maintenance services, including monitoring, management, control, service, safety, big data, artificial intelligence and other aspects, product capability is provided from the perspective of an application scene to meet operation and maintenance requirements of different enterprises in different development stages, CMDB data is taken as a core so as to meet the unified management of global resources of the enterprises and also ensure that a data association framework of each technical field has flexible expandability so as to meet the operation and maintenance requirements of enterprises with different scales and different maturity, the framework has sufficient openness so as to quickly realize the butt joint with a third-party system, the framework has hot plug capability so as not to influence the normal operation of other service modules when a certain service module is started and stopped, and the platform has the capabilities of unified portal, unified alarm, unified resources, unified flow engine and unified knowledge management, the operation and maintenance requirements of centralized management are met, the user experience and the engineering implementation amount are considered in the platform design, and therefore the usability of users is improved and the project delivery cost is reduced.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art should be able to substitute or change the technical solution and the inventive concept of the present invention within the technical scope of the present invention.

Claims (10)

1. A unified operation and maintenance platform architecture system is characterized by comprising the following unit modules:
the functional architecture module: the module comprises an acquisition layer, a technical operation and maintenance domain layer, a public component domain layer, a service application domain layer and a centralized display layer;
the technical architecture module comprises a display layer, a service layer, a processing layer, an operation layer, an acquisition layer and a resource layer;
the display layer is a centralized display portal layer of the unified monitoring system and is an inlet of the unified monitoring system; the service layer provides uniform operation and maintenance service for the uniform monitoring system, and the top layer is a northbound API (application programming interface) interface which provides uniform interface service for the display layer and a third-party system; the processing layer provides support for upper business services, and the support comprises the processing capabilities of inquiring, storing, calculating, aggregating, converting, cleaning, extracting, filtering and loading; the operation layer is the basis for deploying and operating the unified monitoring system, and H3Linux provides operating system resources for the unified monitoring system; the acquisition layer acquires the energy, state and configuration attribute data of various resources through a protocol; the resource layer comprises all objects of enterprise IT operation and maintenance management, including machine room dynamic loop equipment, IT infrastructure equipment, cloud environment, a database, middleware and application;
deploying the architecture module: the method is based on an H3C Matrix containerization deployment platform, and the platform deploys and monitors the micro-service based on a Kubernetes cluster;
the authority management module: adopting RBAC model authority control to support function authorization and data authorization;
customizing a large screen module: the module supports the realization of the self-combination and the centralized display of the chart in a dragging mode.
2. The unified operation and maintenance platform architecture system according to claim 1, wherein the collection layer can interface management objects of terminals, networks, clouds and security universes through Agent and non-Agent collection modes and collection of a third-party system;
the technical operation and maintenance domain layer comprises the fields of basic architecture management, hardware monitoring, service monitoring, dynamic ring management, video monitoring and wireless management professional technical operation and maintenance;
the public component domain layer is used for extracting the service function with public attribute to form a public module for other service scenes to be directly called, thereby avoiding repeated development and reducing the redundancy of the system, and comprises the following steps: the system comprises a flow arrangement engine, a knowledge search engine, a knowledge gallery, an AI algorithm and an API interface functional module;
the business application domain layer is an operation and maintenance business domain divided according to business characteristics and application scenes of IT operation and maintenance, and comprises monitoring management, resource management, service flow management, automatic management and intelligent analysis;
the centralized display layer provides a unified entrance for the daily operation and maintenance work of operation and maintenance personnel, and the platform provides a plurality of display modes of a PC desktop, a large screen and a mobile terminal.
3. The unified operation and maintenance platform architecture system according to claim 2, wherein the presentation layer is a centralized presentation portal layer of the unified monitoring system, is an entrance of the unified monitoring system, and mainly comprises an administrator view, a viewer view, a tenant view, a large screen monitor, a desktop portal, and a customized scenarization portal, and the mainly used technology stack comprises a web front-end technology of a main stream of Html5, Javascript, Css, Vue, and SpringBoot.
4. The unified operation and maintenance platform architecture system according to claim 3, wherein the service layer: the upper layer is a northbound API interface, unified interface services are provided for a display layer and a third-party system, an API Gate is a unified inlet of all service requests, a Cas Server provides service request authentication for all service requests, access to the third-party system is achieved through single sign-on, and a RBAC role authority management model provides unified function level and data level authority control for operation and maintenance services. The following are business function services, which include authentication, organization and user management, log operation, notification management, alarm management, topology management, log management, statistical analysis, machine room facility monitoring, hardware device monitoring, application performance monitoring, business monitoring, user experience monitoring, service flow management, resource management, server automation, and network automation.
5. The architecture system of claim 4, wherein the operation layer is a basis for deployment and operation of the unified monitoring system, H3Linux provides operating system resources for the unified monitoring system, the display layer, the service layer and the processing layer of the upper layer are packaged and isolated in a Docker container manner, are arranged and managed by kubernets, are uniformly installed and deployed in a cluster by a Matrix graphical interface system, and provide functions of self-monitoring, backup, software installation and uninstallation.
6. The framework system of claim 5, wherein the collection layer supports Agent and non-Agent modes for data collection, and collects performance, status, and configuration attribute data of various resources through SNMP, SSH, Telnet, FTP, sFTP, WMI, IPMI, NetConf, NetFlow, NetStream, JDBC, Restful, Soap, SDK, JMX, Socket, and SMI-S protocols.
7. The architecture system of claim 6, wherein the resource layer comprises all objects of enterprise IT operation and maintenance management, including machine room dynamic ring equipment, IT infrastructure equipment, cloud environment, database, middleware, and applications.
8. The system of claim 7, wherein the H3C Matrix deployment platform operates in a cluster, and the cluster is composed of:
master node: the system is responsible for resource management and container scheduling work of the whole cluster, and three physical servers are needed to be used as the cluster;
and (3) Worker node: sharing and processing cluster service, users can carry out service software installation selection according to service requirements and carry out resource allocation according to cluster load conditions;
one node is automatically selected from the three Master nodes to serve as a main Master node, the main Master node is responsible for managing and monitoring all the nodes in the cluster, and the configured northbound service virtual IP is issued to the main Master node.
9. The architecture system of claim 8, wherein the rights management module automatically filters data and displays the data in a tree-type hierarchy through the attribution organization mapping of resources and personnel, so as to meet the requirement of centralized unified operation and maintenance of enterprises with multi-level organizational structures.
10. The architecture system of a unified operation and maintenance platform according to claim 9, wherein the customized large screen module supports graphical display after third-party data is accessed according to a platform standard format, and also presets a plurality of graphically defined service components, and supports large screen partition layout, sorting, cloning and previewing functions.
CN202111660543.2A 2021-12-31 2021-12-31 Unified operation and maintenance platform architecture system Pending CN114518934A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111660543.2A CN114518934A (en) 2021-12-31 2021-12-31 Unified operation and maintenance platform architecture system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111660543.2A CN114518934A (en) 2021-12-31 2021-12-31 Unified operation and maintenance platform architecture system

Publications (1)

Publication Number Publication Date
CN114518934A true CN114518934A (en) 2022-05-20

Family

ID=81597717

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111660543.2A Pending CN114518934A (en) 2021-12-31 2021-12-31 Unified operation and maintenance platform architecture system

Country Status (1)

Country Link
CN (1) CN114518934A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114679366A (en) * 2022-05-25 2022-06-28 广州嘉为科技有限公司 Tenant-oriented operation and maintenance tool opening method, system and medium in multi-cloud environment
CN115118745A (en) * 2022-06-08 2022-09-27 浙江工业大学 Performance equipment information interconnection platform system and construction method thereof
CN115421394A (en) * 2022-09-20 2022-12-02 浪潮通信信息系统有限公司 Method and device for constructing standard model in smart home architecture
CN116562848A (en) * 2023-05-05 2023-08-08 江西意孚欧科技有限公司 Operation and maintenance management platform
CN116579732A (en) * 2023-04-06 2023-08-11 三亚宇航科技有限公司 Operation and maintenance supervision method based on Internet of things
CN116775255A (en) * 2023-08-15 2023-09-19 长沙伊士格信息科技有限责任公司 Global integration system supporting wide integration scene
CN116562848B (en) * 2023-05-05 2024-10-25 西安绿点信息科技有限公司 Operation and maintenance management platform

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109657076A (en) * 2018-12-06 2019-04-19 安徽海豚新媒体产业发展有限公司 One kind is multi-functional to melt media center service platform
CN110266533A (en) * 2019-06-18 2019-09-20 湖南晖龙集团股份有限公司 Big data platform management system
CN110458528A (en) * 2019-08-07 2019-11-15 上海数讯信息技术有限公司 A kind of full-service configuration management platform based on CMDB operation management
CN111885439A (en) * 2020-07-24 2020-11-03 西安众联润科信息技术有限公司 Optical network integrated management and duty management system
CN111917887A (en) * 2020-08-17 2020-11-10 普元信息技术股份有限公司 System for realizing data governance under big data environment
CN113794578A (en) * 2021-07-08 2021-12-14 中国南方电网有限责任公司 Communication network monitoring architecture system based on cloud platform

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109657076A (en) * 2018-12-06 2019-04-19 安徽海豚新媒体产业发展有限公司 One kind is multi-functional to melt media center service platform
CN110266533A (en) * 2019-06-18 2019-09-20 湖南晖龙集团股份有限公司 Big data platform management system
CN110458528A (en) * 2019-08-07 2019-11-15 上海数讯信息技术有限公司 A kind of full-service configuration management platform based on CMDB operation management
CN111885439A (en) * 2020-07-24 2020-11-03 西安众联润科信息技术有限公司 Optical network integrated management and duty management system
CN111917887A (en) * 2020-08-17 2020-11-10 普元信息技术股份有限公司 System for realizing data governance under big data environment
CN113794578A (en) * 2021-07-08 2021-12-14 中国南方电网有限责任公司 Communication network monitoring architecture system based on cloud platform

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114679366A (en) * 2022-05-25 2022-06-28 广州嘉为科技有限公司 Tenant-oriented operation and maintenance tool opening method, system and medium in multi-cloud environment
CN115118745A (en) * 2022-06-08 2022-09-27 浙江工业大学 Performance equipment information interconnection platform system and construction method thereof
CN115421394A (en) * 2022-09-20 2022-12-02 浪潮通信信息系统有限公司 Method and device for constructing standard model in smart home architecture
CN116579732A (en) * 2023-04-06 2023-08-11 三亚宇航科技有限公司 Operation and maintenance supervision method based on Internet of things
CN116562848A (en) * 2023-05-05 2023-08-08 江西意孚欧科技有限公司 Operation and maintenance management platform
CN116562848B (en) * 2023-05-05 2024-10-25 西安绿点信息科技有限公司 Operation and maintenance management platform
CN116775255A (en) * 2023-08-15 2023-09-19 长沙伊士格信息科技有限责任公司 Global integration system supporting wide integration scene
CN116775255B (en) * 2023-08-15 2023-11-21 长沙伊士格信息科技有限责任公司 Global integration system supporting wide integration scene

Similar Documents

Publication Publication Date Title
CN114518934A (en) Unified operation and maintenance platform architecture system
WO2021017301A1 (en) Management method and apparatus based on kubernetes cluster, and computer-readable storage medium
CN106559488B (en) A method of establishing the power grid geographical information space service of tenant's driving
CN105843182B (en) A kind of power scheduling accident prediction system and method based on OMS
WO2023142054A1 (en) Container microservice-oriented performance monitoring and alarm method and alarm system
US10061371B2 (en) System and method for monitoring and managing data center resources in real time incorporating manageability subsystem
EP2625614B1 (en) System and method for monitoring and managing data center resources in real time incorporating manageability subsystem
CN109379217B (en) A kind of different producer's arranging service device of Metropolitan Area Network (MAN)
CN107181808A (en) A kind of privately owned cloud system and operation method
CN112162821B (en) Container cluster resource monitoring method, device and system
CN108848132B (en) Power distribution scheduling main station system based on cloud
CN103973815A (en) Method for unified monitoring of storage environment across data centers
CN104601673B (en) Extensible high-availability server layered monitoring system
CN109542583B (en) Virtual equipment management method based on double buses
CN104637265A (en) Dispatch-automated multilevel integration intelligent watching alarming system
Trakadas et al. Scalable monitoring for multiple virtualized infrastructures for 5G services
CN114244676A (en) Intelligent IT integrated gateway system
CN113596150A (en) Message pushing method and device, computer equipment and storage medium
CN108011769A (en) A kind of implementation method of visualized O&M system
CN113127526A (en) Distributed data storage and retrieval system based on Kubernetes
CN108401035A (en) A kind of integrated monitoring apparatus and method based on MDC
CN112068953B (en) Cloud resource fine management traceability system and method
CN108471452B (en) Single cabinet data center monitoring method, system and device
CN113824801B (en) Intelligent integration terminal unified access management component system
CN115719147A (en) Power transmission line inspection data processing method, device and platform

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination