CN110677288A

CN110677288A - Edge computing system and method generally used for multi-scene deployment

Info

Publication number: CN110677288A
Application number: CN201910911688.1A
Authority: CN
Inventors: 黄舒泉
Original assignee: Zhejiang 99Cloud Information Service Co Ltd
Current assignee: Zhejiang 99Cloud Information Service Co Ltd
Priority date: 2019-09-25
Filing date: 2019-09-25
Publication date: 2020-01-10

Abstract

The invention relates to the field of edge cloud computing, in particular to an edge computing system and method generally used for multi-scene deployment. The edge computing system and the method which are generally used for multi-scene deployment comprise a cloud platform and are characterized in that: the cloud platform is internally provided with a plurality of management servers which are respectively a configuration management server, a fault management server, a host management server, a service management server and a software management server. The beneficial effects are as follows: the magnitude deployment can be flexibly deployed in a severe environment to work, the robustness of the system is improved due to the reduction of the quantity, when the server is down, the system can be automatically recovered and put into service again in a very short time, and the maintenance cost caused by system expansion which possibly occurs in the future is reduced; and the method has ultra-low time delay, and improves the capability of high-complexity calculation.

Description

Edge computing system and method generally used for multi-scene deployment

Technical Field

The invention relates to the field of edge cloud computing, in particular to an edge computing system and method generally used for multi-scene deployment.

Background

In the prior art, as the 5G technology matures, a cloud operating system needs to be deployed in some remote areas or regional services with strong non-traditional technological strength according to the needs of customers to meet the data processing requirements of the remote areas or regional services, and a traditional cloud operating system is large in size, high in construction complexity, high in requirements for deployment environments and large in machine room; and the management mode of the traditional cloud platform cannot remotely configure and monitor the system, the response to the system error is slow, and the operation cost and the maintenance cost are high.

Disclosure of Invention

The invention aims to provide an edge computing system and method generally used for multi-scene deployment, and the technical scheme adopted by the invention is as follows:

the invention discloses an edge computing system and method generally used for multi-scene deployment, which comprises a cloud platform and is characterized in that: the cloud platform is internally provided with a plurality of management servers, namely a configuration management server, a fault management server, a host management server, a service management server and a software management server, wherein,

the configuration management is responsible for carrying out installation configuration of each component, and each time the system is started, the system stock service, the controller configuration service and the calculation configurator service are all re-executed, so that the system can be quickly restored to normal configuration after being restarted;

the fault management can count alarm times and check logs, and simultaneously comprises physical and virtual resources of a central cloud and an edge cloud;

the host management can monitor hardware resources and collect and synchronize virtual machine alarm, key processes and H/W faults from resource arrangement service, service management and configuration management; the host management can automatically restart the host by using different scheduling strategies according to cluster states, key processes, resource thresholds, faults of the physical host and the like under the condition that the virtual host is shut down;

the service management uses multiple channels to avoid the disconnection of communication and the split brain problem of the service and monitor the service state;

the software management provides a life cycle management mechanism for the shutdown problem of the virtual machine during upgrading, when hot migration is needed, resources on the host needing to be updated are automatically transferred to the available host, and the resources are automatically distributed to the updated host after updating is completed.

Further, the host management may use different scheduling policies to automatically restart the host according to a cluster state, a key process, a resource threshold, a failure of the physical host, and the like when the virtual host is powered off.

The invention has the beneficial effects that: the system can be automatically recovered and put into service again in a very short time when a server is down, so that the maintenance cost caused by system expansion which possibly occurs in the future is reduced; and the method has ultra-low time delay, and improves the capability of high-complexity calculation.

Drawings

FIG. 1 is an operating system architecture diagram of the present invention;

FIG. 2 is a schematic diagram of configuration management of the present invention;

FIG. 3 is a schematic diagram of the fault management of the present invention;

FIG. 4 is a schematic diagram of host management of the present invention;

FIG. 5 is a schematic diagram of the service management of the present invention;

FIG. 6 is a software management schematic of the present invention;

FIG. 7 is a diagram of a conventional cloud platform architecture of the present invention;

Detailed Description

The invention will be further described with reference to the following figures and examples.

The following system explanation and explanation are given by taking a 1+1 high-availability dual-control node control cluster as an example:

i. fig. 1 is a complete architecture diagram of the cloud operating system of the present invention, and the system architecture design includes a control node, a computing node, a storage node, a virtual network element interface, an operation support system, and a service support system, where a cloud computing platform, a virtual machine, and a distributed storage system are three components of a bottom layer; the operation support system and the service support system exchange data with the control node, and the virtual network element interface exchanges a calculation result with the calculation node; the virtual machine is optimized at the computing node, and an SR-IOV, OVS-DPDK and Intel network acceleration scheme is introduced into the network part; forming a distributed storage scheme Ceph in a storage node set; deploying virtual EPCs and virtual CPEs in virtual machines on upper-layer virtual network element interfaces VNFs to realize support of telecommunication network elements;

fig. 2 is a schematic diagram of a configuration management service, where sysinv provides state management of the entire software, modification of system configuration, and controllerconfig/controlproteconfig is responsible for setting the system configuration according to the role of a physical node;

FIG. 3 is a schematic diagram of fault management, wherein other system modules directly send alarm and log information to FM-manager through FM-API, and a central log system of fault management can collect log information of all nodes in the system; the fault management alarm system receives alarm information of all node roles;

FIG. 4 is a schematic diagram of a mainframe management service showing the cooperation between the mainframe management service and other management services and monitoring modules, the mainframe management using rmon to monitor the storage and usage of the central processing unit and the memory; using a pmon management base process to monitor computing and block storage services; providing a heartbeat detection service of the platform using hbs service; providing a management service to the server BWC using the hwmond service; using other service modules of the MTC service main pipe MTCE platform to provide interfaces to the outside;

v. fig. 5 is a schematic diagram of service management, the service management is composed of three components, the high-availability controller of the service management is a redundancy model, a 1+1 high-availability dual-control node is adopted to control a cluster, a main control node and a standby control node are in real-time communication, when the main control node fails, an HA process is automatically triggered, the standby node is switched to be the main control node, and the service management can be expanded to be N + M or N control nodes; the high-reliability message service can use at most three independent communication paths to avoid the split brain problem of communication, each path of the LAG protection link is configured, and the HMAC SHA-512 is used for carrying out identity verification on the message; its service monitoring may be active or passive;

fig. 6 is a schematic diagram of software management, where the software management provides a patch production tool and a management service of the patch, supports hot patches and reboot required patches, and needs to restart nodes when replacing kernel patches; through the real-time migration service of the virtual machine, the service is ensured not to be interrupted when the reboot patch is installed on the management node;

fig. 7 is an architecture diagram of a conventional cloud platform, and as a supplementary description, the conventional cloud platform places a computing node, a network node, and a storage node in a resource pool composed of stacks, a user calls a corresponding resource of the resource pool using an API, and a bottom layer includes a physical storage, a network switch, and a server, and also includes stacks.

List of abbreviations, english and key term definitions:

KVM (Kernel-based Virtual Machine): the kernel-based virtual machine is a virtualization infrastructure used in a Linux kernel, and can convert the Linux kernel into a virtual machine monitor;

EPC (evolved Packet core): the system is characterized in that only a packet domain is available, a circuit domain is not available, the system is based on an all-IP structure, control and bearing are separated, and a network structure is flattened;

cpe (customer premix equipment): a mobile signal access device for receiving mobile signals and forwarding the mobile signals by wireless WIFI signals;

ceph: an open source software storage platform that applies object storage on a single distributed computer cluster and provides interfaces for object level, block level, or file level storage;

SR-IOV: a virtualization hardware acceleration scheme is originally designed for sharing network resources among virtual machines;

OVS-DPDK: open vSwitch and DPDK will combine virtual machine acceleration schemes.

The invention is light-weight to deploy, can be flexibly deployed in a severe environment to work, improves the robustness of the system due to the reduction of the volume, can automatically recover and put into service again in a very short time when a server is down, and reduces the maintenance cost caused by system expansion which possibly occurs in the future; and the method has ultra-low time delay, and improves the capability of high-complexity calculation.

The present invention is not limited to the above embodiments, and any technical solutions similar or identical to the present invention, which are made in the light of the present invention, are within the scope of the present invention.

The techniques, shapes, and configurations not described in detail in the present invention are all known techniques.

Claims

1. An edge computing system and method generally used for multi-scene deployment comprises a cloud platform and is characterized in that: the cloud platform is internally provided with a plurality of management servers, namely a configuration management server, a fault management server, a host management server, a service management server and a software management server, wherein,

2. The system and method of claim 1, wherein the edge computing system is generally used for multi-scene deployment, and comprises: the host management can automatically restart the host by using different scheduling strategies according to cluster states, key processes, resource thresholds, faults of the physical host and the like under the condition that the virtual host is shut down.