WO2020207266A1 - Network system, instance management method, device, and storage medium - Google Patents
Network system, instance management method, device, and storage medium Download PDFInfo
- Publication number
- WO2020207266A1 WO2020207266A1 PCT/CN2020/081570 CN2020081570W WO2020207266A1 WO 2020207266 A1 WO2020207266 A1 WO 2020207266A1 CN 2020081570 W CN2020081570 W CN 2020081570W WO 2020207266 A1 WO2020207266 A1 WO 2020207266A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- instance
- edge cloud
- cloud node
- edge
- migrated
- Prior art date
Links
- 238000003860 storage Methods 0.000 title claims abstract description 47
- 238000007726 management method Methods 0.000 title abstract description 497
- 238000012545 processing Methods 0.000 claims abstract description 18
- 230000005012 migration Effects 0.000 claims description 148
- 238000013508 migration Methods 0.000 claims description 148
- 238000000034 method Methods 0.000 claims description 135
- 230000008569 process Effects 0.000 claims description 47
- 238000004590 computer program Methods 0.000 claims description 19
- 230000004044 response Effects 0.000 abstract description 13
- 230000001934 delay Effects 0.000 abstract description 5
- 238000012423 maintenance Methods 0.000 description 189
- 238000012544 monitoring process Methods 0.000 description 82
- 238000004891 communication Methods 0.000 description 43
- 238000010276 construction Methods 0.000 description 27
- 230000006870 function Effects 0.000 description 25
- 230000008030 elimination Effects 0.000 description 18
- 238000003379 elimination reaction Methods 0.000 description 18
- 239000012634 fragment Substances 0.000 description 15
- 238000003672 processing method Methods 0.000 description 14
- 238000005516 engineering process Methods 0.000 description 13
- 238000012217 deletion Methods 0.000 description 12
- 230000037430 deletion Effects 0.000 description 12
- 238000010586 diagram Methods 0.000 description 12
- 238000009826 distribution Methods 0.000 description 12
- 230000005540 biological transmission Effects 0.000 description 11
- 230000002159 abnormal effect Effects 0.000 description 10
- 230000009471 action Effects 0.000 description 9
- 238000007596 consolidation process Methods 0.000 description 7
- 238000007405 data analysis Methods 0.000 description 7
- 238000013467 fragmentation Methods 0.000 description 7
- 238000006062 fragmentation reaction Methods 0.000 description 7
- 230000010354 integration Effects 0.000 description 6
- 230000009286 beneficial effect Effects 0.000 description 4
- 230000005236 sound signal Effects 0.000 description 4
- 238000012795 verification Methods 0.000 description 4
- 239000008186 active pharmaceutical agent Substances 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000001360 synchronised effect Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 230000009118 appropriate response Effects 0.000 description 2
- 230000006399 behavior Effects 0.000 description 2
- 238000005315 distribution function Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 238000012954 risk control Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000013468 resource allocation Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/0893—Assignment of logical groups to network elements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/0896—Bandwidth or capacity management, i.e. automatically increasing or decreasing capacities
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/50—Network service management, e.g. ensuring proper service fulfilment according to agreements
- H04L41/5041—Network service management, e.g. ensuring proper service fulfilment according to agreements characterised by the time relationship between creation and deployment of a service
- H04L41/5051—Service on demand, e.g. definition and deployment of services in real time
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/50—Network service management, e.g. ensuring proper service fulfilment according to agreements
- H04L41/5041—Network service management, e.g. ensuring proper service fulfilment according to agreements characterised by the time relationship between creation and deployment of a service
- H04L41/5054—Automatic deployment of services triggered by the service manager, e.g. service implementation by automatic configuration of network components
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
- H04L67/1001—Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
- H04L67/1004—Server selection for load balancing
- H04L67/1008—Server selection for load balancing based on parameters of servers, e.g. available memory or workload
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
- H04L67/1001—Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
- H04L67/1004—Server selection for load balancing
- H04L67/1012—Server selection for load balancing based on compliance of requirements or conditions with available server resources
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
- H04L67/1001—Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
- H04L67/1004—Server selection for load balancing
- H04L67/1021—Server selection for load balancing based on client or server locations
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
- H04L67/1095—Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/08—Configuration management of networks or network elements
- H04L41/0894—Policy-based network configuration management
Definitions
- This application relates to the field of computer technology, and in particular to a network system, instance management and control method, equipment, and storage medium.
- the concept of cloud computing is based on centralized resource management and control. Even if multiple data centers are used for interconnection, all software and hardware resources are still treated as unified resources for management, scheduling, and sales. With the advent of the era of 5G and the Internet of Things and the gradual increase of cloud computing applications, the terminal side has higher and higher requirements for cloud resources in terms of latency and bandwidth. The centralized cloud network can no longer meet the increasing demand on the terminal side. Cloud resource requirements.
- Various aspects of the present application provide a network system, instance management and control method, device, and storage medium to reduce service response delay and bandwidth cost.
- the embodiment of the present application provides an instance management and control method, including: determining at least one instance deployed in at least one edge cloud node in a network system, the at least one instance can provide cloud computing services for the service demander; The instance is managed, so that the at least one instance provides cloud computing services for the service demander.
- An embodiment of the present application also provides a network system, including: a central management and control device, and at least one edge cloud node; at least one instance is deployed in the at least one edge cloud node, and the at least one instance can provide a cloud for service demanders Computing services; the central management and control device is used to manage and control the at least one instance, so that the at least one instance provides cloud computing services for the service demander.
- An embodiment of the present application also provides a central management and control device, including: a memory and a processor; the memory is used to store a computer program; when the computer program is executed by the processor, the processor is caused to implement the application The steps in the example management method provided in the embodiment.
- the embodiment of the present application also provides a computer-readable storage medium storing a computer program.
- the computer program is executed by one or more processors, the one or more processors are caused to implement the Examples of steps in the control method.
- the ability of cloud computing is considered to be placed on the edge side closer to the terminal, so a network system including edge cloud nodes is provided.
- the edge cloud node Instances that provide cloud computing services are deployed in the central control equipment. Under the control of the central control equipment, these instances can provide cloud computing services. This achieves the purpose of providing services to users with the help of resources in edge cloud nodes, so that "put cloud computing to a distance "Processing in edge cloud nodes closer to the terminal" has become a reality, which is conducive to reducing service response delay and bandwidth costs.
- Fig. 1a is a schematic structural diagram of a network system provided by an exemplary embodiment of this application.
- FIG. 1b is a schematic structural diagram of a central management and control device and an edge management and control device provided by an exemplary embodiment of this application;
- FIG. 1c is a schematic structural diagram of another network system provided by an exemplary embodiment of this application.
- FIG. 2a is a schematic flowchart of an example management and control method provided by an exemplary embodiment of this application;
- FIG. 2b is a schematic flowchart of an example upgrade method provided by an exemplary embodiment of this application.
- FIG. 2c is a schematic flowchart of an example migration method provided by an exemplary embodiment of this application.
- FIG. 3 is a schematic structural diagram of a central management and control device provided by an exemplary embodiment of this application.
- a network system including edge cloud nodes is provided.
- instances that provide cloud computing services are deployed in the edge cloud nodes. Under the control of the central control device, these instances can provide cloud computing services to achieve With the purpose of providing services to users with the help of resources in edge cloud nodes, it becomes a reality to "place cloud computing in edge cloud nodes closer to the terminal for processing", which is conducive to reducing service response delays and reducing bandwidth costs.
- Fig. 1a is a schematic structural diagram of a network system provided by an exemplary embodiment of this application.
- the network system 100 includes: a central management and control device 101 and at least one edge cloud node 102; at least one edge cloud node 102 is connected to the central management and control device 101 in a network.
- the network system 100 in this embodiment is a cloud computing platform built on edge infrastructure based on cloud computing technology and edge computing capabilities, and is a cloud platform with computing, network, storage, and security capabilities at the edge.
- Edge cloud is a relative concept.
- Edge cloud refers to a cloud computing platform that is relatively close to the terminal. In other words, it is different from central cloud or traditional cloud computing platform.
- Central cloud or traditional cloud computing platform can include large-scale resources and centralized locations. Data centers, and edge cloud nodes cover a wider network range, and therefore have the characteristics of being closer to the terminal.
- the resource scale of a single edge cloud node is small, but the number of edge cloud nodes is large, and multiple edge cloud nodes constitute the original Part of the edge cloud in the embodiment.
- the terminal in this embodiment refers to the demand side of cloud computing services, for example, it may be a terminal or a user side in the Internet, or a terminal or a user side in the Internet of Things.
- the edge cloud network is a network based on the infrastructure between the central cloud or the traditional cloud computing system and the terminal.
- the network system 100 includes at least one edge cloud node 102, and each edge cloud node 102 includes a series of edge infrastructures.
- edge infrastructures include, but are not limited to: distributed data centers (DC), wireless computer rooms or clusters, operations Communication networks, core network equipment, base stations, edge gateways, home gateways, computing devices and/or storage devices and other edge devices and corresponding network environments. It is explained here that the locations, capabilities, and included infrastructure of different edge cloud nodes 102 may be the same or different.
- the network system 100 of this embodiment is combined with a central cloud or a traditional cloud computing platform and other central networks and terminals to form a "cloud edge-end three-body coordination" network architecture.
- the network can be forwarded and stored.
- Tasks such as computing and/or intelligent data analysis are processed in each edge cloud node 102 in the network system 100. Since each edge cloud node 102 is closer to the terminal, the response delay can be reduced and the central cloud or traditional cloud The pressure on the computing platform reduces bandwidth costs.
- a central management and control device 101 is deployed.
- the central management and control device 101 uses the edge cloud node 102 as the management and control object for resource scheduling, image management, instance management and control, operation and maintenance, network, security, etc.
- At least one edge cloud node 102 in the network system 100 is uniformly managed and controlled, so that cloud computing services are placed in each edge cloud node 102 for processing.
- the central control device 101 can be deployed in one or more cloud computing data centers, or it can be deployed in one or more traditional data centers, and the central control device 101 can also be connected to at least one edge cloud under its control.
- the nodes jointly constitute an edge cloud network, which is not limited in this embodiment.
- the edge cloud node 102 For an edge cloud node 102, various resources may be provided externally, such as computing resources such as CPU and GPU, storage resources such as memory and hard disk, and network resources such as bandwidth.
- the edge cloud node 102 can also create a corresponding instance based on the image, and provide various cloud computing services externally through the instance.
- the image is the basic file needed to create an instance in the edge cloud node.
- it can be an image file such as an operating system, application, or operation configuration required to provide users with cloud computing services, and it can be in line with edge cloud node computing deployment Requirements, according to a specific series of documents in a certain format made into documents.
- images which can be virtual machine (VM) image files, container (Docker) image files, or various types of application packaging files, etc.
- VM virtual machine
- Docker container
- application packaging files etc.
- the image form can be compatible with the virtualization technology used by cloud computing services. Regarding, this embodiment does not limit this.
- the implementation form of the instance can be a virtual machine, container, or application.
- the central management and control device 101 can perform resource scheduling on at least one edge cloud node 102 according to resource requirements, or can perform image management and distribution for at least one edge cloud node 102 according to image requirements.
- resource scheduling on at least one edge cloud node 102 and provide mirroring for at least one edge cloud node 102 according to cloud computing service requirements.
- cloud computing service requirements include resource requirements and mirroring requirements.
- the central management and control device 101 may provide a requirement submission portal to the outside, and the requirement submission portal may be a web page, an application page, or a command window. The role of the requirement submission portal is for the requirement to submit its own requirement description information to the central control device 101.
- the resource demand description information can be submitted to the central management and control device 101 through the above demand submission entry.
- the resource demand description information includes: edge cloud node selection parameters and resource selection parameters; edge cloud node selection parameters include scheduling domains and/or For the performance requirements of edge cloud nodes, the resource selection parameters include resource type, resource quantity, and performance requirements for resource equipment.
- the central management and control device 101 may perform resource scheduling on at least one edge cloud node according to the resource requirement description information.
- a resource scheduling method includes: the central management and control device 101 determines the scheduled target edge cloud node and the scheduled target edge cloud node from at least one edge cloud node 102 of the network system 100 according to resource demand description information Resource information; according to the resource information, the corresponding resource device in the target edge cloud node is controlled to allocate or reserve resources.
- the mirroring demand description information can be submitted to the central management and control device 101 through the above demand submission entry.
- the mirroring demand description information can point to the mirror that needs to be used, which can be the mirror itself, or the name, ID and other identification types of the mirror.
- the information can also be some function description information of the cloud computing service, which can reflect the required image.
- the central management and control device 101 can obtain the image according to the description information of the image demand; provide the image to the edge cloud node in the network system 100 that needs the image, so that the edge cloud node creates a corresponding instance based on the image, and the instance provides the corresponding cloud externally Computing services.
- the service demand description information can be submitted to the central management and control device 101 through the above demand submission portal.
- the service demand description information includes resource demand description information and mirroring demand description information.
- resource requirement description information and mirroring requirement description information please refer to the previous description, which will not be repeated here. It is worth noting that the resource requirement description information and the mirroring requirement description information in the service requirement description information can be submitted together or separately.
- the central management and control device 101 can perform resource scheduling on at least one edge cloud node 102 in the network system 100 according to the service demand description information; provide a mirror image of the scheduled resources in the at least one edge cloud node 102 to use the The scheduled resources provide corresponding cloud computing services.
- the central management and control device 101 can not only provide a mirror image for at least one edge cloud node 102 for the edge cloud node 102 to create a corresponding instance, but can also manage and control the instances in at least one edge cloud node 102.
- the instances in the edge cloud node may be created based on the image provided by the central management and control device 101, or based on other images, or may be migrated from other edge cloud nodes or other systems, which is not limited.
- At least one instance in the edge cloud node 102 can provide cloud computing services for the service demander, where the service demander can be any device, application, system, or another service that needs to use the cloud computing service provided by the instance in the edge cloud node .
- the service demander can be but not limited to: online video system, risk management system, customer information management system, data distribution system, etc.
- the central management and control device 101 can manage and control at least one instance of at least one edge cloud node 102, so that these instances can provide cloud computing services for service demanders.
- the central management and control device 101 can perform various management and control on at least one instance, for example, it can include at least one of upgrade, migration, shutdown, restart, and release, but is not limited thereto.
- the instance upgrade and migration will be described in detail below.
- the central management and control equipment 101 performs upgrade management and control of instances mainly including:
- the central management and control device 101 determines the instance to be upgraded from at least one instance.
- the instance to be upgraded can be one or more; it sends an upgrade request to the service demander, so that the service demander can determine the instance to be upgraded based on the business situation of the instance to be upgraded Upgrade strategy.
- the upgrade request carries the identification information of the instance to be upgraded, such as the ID and name of the instance to be upgraded. It can also be the ID and name of the service corresponding to the instance to be upgraded. It can also be the ID, name and other information of the image corresponding to the instance to be upgraded. .
- the service demander can determine the instance to be upgraded according to the upgrade request, and combine the business conditions on the instance to be upgraded, such as the business request on the instance to be upgraded and the response status of the business request, to determine whether the instance to be upgraded is It is suitable for upgrading, when is suitable for upgrading, what method is used for upgrading, etc., and then an upgrade strategy can be generated for the instance to be upgraded and returned to the central control device 101.
- the central management and control device 101 receives the upgrade strategy sent by the service demander, and upgrades the instance to be upgraded according to the upgrade strategy.
- the service demander can combine the business conditions on the instance to be upgraded, such as the number of business requests that have been received and not yet completed (referred to as inventory business requests), and whether there are any new business requests ( Incremental service request), etc., to determine when the instance to be upgraded can be upgraded, that is, the upgrade strategy can include the upgrade time. If all existing service requests on the instance to be upgraded have been responded, and there are no incremental service requests, in this case, the upgrade service request of the instance to be upgraded will not be interrupted and will not affect the user experience, then it is considered OK Upgrade the instance to be upgraded.
- inventory business requests the number of business requests that have been received and not yet completed
- Incremental service request Incremental service request
- the instance to be upgraded when it considers that the instance to be upgraded can be upgraded, it can return an upgrade notification to the central management and control device 101, and the upgrade notification carries the time to instruct the central management and control device 101 to upgrade the instance to be upgraded after receiving the upgrade notification.
- Information the way that the upgrade notification carries the time information can be explicit or implicit.
- the instance to be upgraded after receiving the upgrade notification, the instance to be upgraded can be upgraded.
- the service demander can also estimate the appropriate upgrade time in combination with the business conditions on the instance to be upgraded, and send the upgrade time to the central control device 101 in the upgrade notification.
- the central management and control device 101 After receiving the upgrade notification, the central management and control device 101 obtains the upgrade time therefrom, and starts to upgrade the instance to be upgraded at the upgrade time.
- the upgrade strategy may include an upgrade time, which is determined by the service demander in combination with the business situation on the instance to be upgraded.
- the upgrade strategy may not include the upgrade time.
- the upgrade time can be determined by the central management and control device 101 according to factors such as the status of the instance to be upgraded, the load situation of the central management and control device 101, and the like.
- the upgrade strategy may include an upgrade method, where the upgrade method refers to the method used to upgrade the instance to be upgraded, which can be determined by the service demander in combination with the business situation on the instance to be upgraded. Depending on the image type, the upgrade method is also different.
- the central control device 101 can start to upgrade the instance to be upgraded at the upgrade time specified in the upgrade strategy; if the upgrade strategy includes the upgrade method, the central control device 101 can use the upgrade method specified in the upgrade strategy to treat The upgrade instance is upgraded; if the upgrade strategy includes an upgrade time and an upgrade method, the central management and control device 101 can adopt the upgrade method specified in the upgrade strategy, and start to upgrade the instance to be upgraded at the upgrade time specified in the upgrade strategy.
- the upgrade of the instance can be initiated by the central control device 101.
- the central management and control device 101 can monitor the version information of the mirror corresponding to each instance, and when a new version of the mirror is found, it can determine that the instance corresponding to the new version of the mirror needs to be upgraded; or, it can also monitor the running status of each instance , Life cycle and other information.
- problems such as loopholes, instability, insufficiency, excessive consumption of CPU or memory resources are found in the running process of the instance, it can be determined that the instance with these problems needs to be upgraded.
- upgrading the instance can also be initiated by the service demander.
- the service demander can send upgrade description information to the central management and control device 101.
- the upgrade description information includes instance filter conditions.
- the instances to be upgraded can be filtered out Instance.
- the instance filter condition can be the identification information of the instance to be upgraded, such as the ID and name of the instance to be upgraded, or the ID and name of the image corresponding to the instance to be upgraded, or the ID and name of the service corresponding to the instance to be upgraded. Determine the instance to be upgraded.
- the instance filter condition may also be identifying information indicating to upgrade all the instances, such as “all”, “1”, etc.
- the identifying information can be flexibly set.
- the upgrade description information sent by the service demander can be received, the instance filter condition is obtained from the upgrade description information, and the instance to be upgraded is determined from at least one instance according to the instance filter condition;
- the party sends an upgrade request to request the service demander to determine an upgrade strategy for the upgraded instance based on the business situation on the instance to be upgraded; after the service demander returns the upgrade strategy of the instance to be upgraded, the instance to be upgraded can be upgraded according to the upgrade strategy.
- upgrading the instance to be upgraded mainly refers to: shutting down the instance to be upgraded, updating the instance to be upgraded according to the mirror image of the corresponding version (generally, the new version), and restarting the instance after the update.
- the image version required for the upgrade of the instance to be upgraded can be determined by the central management and control device 101, for example, the latest version of the corresponding image is used as the image version required for the upgrade, or it can be specified by the service demander.
- the service demander may carry the image version required for the upgrade in the upgrade description information and provide it to the central management and control device 101.
- the upgrade description information may include "all or specified instances from mirror version A to mirror version B). Upgrade" and other information.
- the central management and control device 101 can obtain the image version required for the upgrade from the upgrade description information, and then, according to the upgrade strategy, use the image corresponding to the image version to upgrade the instance to be upgraded.
- the instance upgrade process ends.
- the instance needs to be migrated in some cases. For example, in the case that the entire edge cloud node is faulty or unavailable, the instances in the edge cloud node need to be migrated to other edge cloud nodes. For another example, in the case of a failure or downtime of a physical machine hosting an instance, the instance on the physical machine needs to be migrated to another physical machine. For another example, it may be necessary to migrate one or some instances from one edge cloud node to other edge cloud nodes due to business needs. For another example, when resources need to be merged, one or some instances need to be migrated. Under the management and control of the central control device 101, instances in the edge cloud nodes can be migrated, and the migration process mainly includes:
- the central management and control device 101 determines the instance to be migrated from at least one instance. There may be one or more instances to be migrated; if there are multiple instances to be migrated, the multiple instances to be migrated can be deployed in the same edge cloud node or in different edge cloud nodes.
- the central management and control device 101 may monitor the state of at least one instance deployed in at least one edge cloud node 102, and obtain the failed instance and/or the instance in which a specified event occurs during operation according to the state of the at least one instance.
- the instance to be migrated, and then the instance to be migrated is migrated.
- a failed instance refers to an instance that cannot run normally, for example, it can be an instance on a physical machine that is down, or an instance that itself is down. Such instances need to be migrated in order to continue to serve the demand side.
- the designated event mainly refers to some events that the instance can still run normally after occurrence, which can be flexibly set according to application requirements, and there is no restriction on this.
- the specified event can be some early warning or alarm events, etc. Although some early warning or alarm events occur, the instance does not produce actual problems and can still run (that is, no failure), but there are hidden dangers of failure, which can be carried out in time before failure Migration to avoid problems such as service interruption caused by faults.
- the central management and control device 101 maintains the information of each edge cloud node and the information of each instance deployed in each edge cloud node. Based on this, the edge cloud node to which the instance to be migrated belongs can be determined. For ease of description and distinction, the The edge cloud node to which the instance belongs before the migration is recorded as the first edge cloud node.
- the central management and control device 101 can determine the instance to be migrated from at least one instance according to resource merging requirements, and then migrate the instance to be migrated.
- resource merging is mainly the process of integrating resource fragments through instance migration.
- resource merging requirements can be system-level or node-level.
- System-level resource merging refers to the integration of resource fragments in the entire network system through instance migration; node-level resource merging refers to the consideration from the dimension of edge cloud nodes, through instance migration to edge cloud The resource fragments in the node are integrated.
- the resource consolidation requirement may be provided by the service demander.
- a service demander needs to deploy a new instance, if the available resources on each resource device in the edge cloud node it serves are not enough to carry the new instance, it can request the migration of the instance in the edge cloud node. Resource integration, so as to provide sufficient resources for new instances.
- the resource merging requirement may also be a regular behavior of the resource scheduling module of the central management and control device 101.
- the resource scheduling module of the central management and control device 101 periodically performs resource fragmentation checks. When it is found that the fragmentation rate reaches a certain threshold and instance migration can be performed, it integrates the resource fragments in each edge cloud node to improve the resources in the edge cloud node. Utilization rate.
- the resource consolidation requirements contain information related to resource consolidation.
- the resource merging requirements may include information about instances that need to be migrated to achieve the purpose of resource merging. Based on this, the instances to be migrated can be directly determined according to the resource merging requirements.
- the resource merging requirements may include information about edge cloud nodes that need to be merged. Based on this, the edge cloud node that needs to be merged can be determined according to the resource merging requirements.
- the edge cloud node that needs to be merged is called the first edge cloud node; in turn, the resources in the first edge cloud node can be combined The remaining available resources on the device and the resources required by each instance in the first edge cloud node determine the instance to be migrated.
- the central management and control device 101 can determine whether the first edge cloud node to which the instance to be migrated belongs meets the intra-node migration condition; if the first edge cloud node meets the intra-node migration condition, then The instance to be migrated is migrated within the edge cloud node; if the first edge cloud node does not meet the intra-node migration condition, the instance to be migrated is migrated across edge cloud nodes.
- the central management and control device 101 may determine whether the first edge cloud node is currently available; if the first edge cloud node is currently available, determine whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated; The available resources of an edge cloud node are sufficient to carry the instance to be migrated, and it is determined that the first edge cloud node meets the migration conditions within the node; if the first edge cloud node is currently in an unavailable state, or the available resources of the first edge cloud node are insufficient to carry the instance In the migration instance, it is determined that the first edge cloud node does not meet the intra-node migration condition.
- the migration of instances is divided into two types: intra-node migration and cross-node migration.
- the available resources of the first edge cloud node mainly refer to the available resources on each resource device in the first edge cloud node; accordingly, judging whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated mainly refers to judging Whether there is a resource device with sufficient resources available in the first edge cloud node to carry the instance to be migrated.
- the instance migration for resource merging is mainly intra-node migration, and of course, it can also be cross-node migration.
- the instance to be migrated may also be determined A resource device, where the resource device is a resource device whose remaining available resources in the first edge cloud node can carry the instance to be migrated.
- the resource device is a resource device whose remaining available resources in the first edge cloud node can carry the instance to be migrated.
- cross-node migration can be performed for the instance to be migrated.
- resource merging in the process of cross-node migration for the instances to be migrated, priority is given to migrating the instances to be migrated to other edge cloud nodes that have been used and the remaining available resources can carry the resource devices of the instances to be migrated; Further, in the case that there are multiple resources that have been used and remaining available resources can carry the resource equipment of the instance to be migrated, the principle of minimum resource fragmentation can be used to select the matching degree between the remaining available resources and the resources required by the instance to be migrated. Higher resource equipment, try to produce less resource fragments or no resource fragments.
- the continuity of the cloud computing service provided by the instance can be ensured through the hot migration technology.
- the hot migration technology please refer to the prior art, which will not be repeated here.
- the central management and control device 101 can select a second edge cloud node from at least one edge cloud node, the second edge cloud node is different from the first edge cloud node, and the available resources in the second edge cloud node are sufficient to carry the migration Instance, that is, enough resources; migrate the instance to be migrated to the second edge cloud node, and send the attribute information of the instance to be migrated in the second edge cloud node to the service demander, so that the service demander can base on the attribute information Perform business scheduling for the instances to be migrated.
- the attribute information of the instance to be migrated in the second edge cloud node means that after the instance to be migrated is migrated to the second edge cloud node, an external (for example, a service demander or a third party authorized by the service demander) conducts an operation on the instance to be migrated.
- Information required for service scheduling may include, but is not limited to, for example, information such as the area where the second edge cloud node is located, operator information, and/or public network IP. Taking the service demander as an example, it is possible to determine whether to request the service according to the area and operator information of the second edge cloud node in the above attribute information, combined with the operator information and area of the terminal that initiated the service request.
- the public network IP in the above attribute information can be provided to The terminal, the terminal's request can access the instance to be migrated in the second edge cloud node, achieving the purpose of scheduling the service request of the terminal to the instance to be migrated in the second edge cloud node.
- the following methods can be used but not limited to:
- Method 1 According to the distance between other edge cloud nodes and the first edge cloud node, select the edge cloud node whose distance from the first edge cloud node is less than the set distance threshold, or select the closest distance to the first edge cloud node An edge cloud node, or an edge cloud node arbitrarily selected from the N edge cloud nodes closest to the first edge cloud node as the second edge cloud node.
- the second edge cloud node is closest or relatively close to the first edge cloud node, which can save data transmission time and help improve migration efficiency.
- the distance between other edge cloud nodes and the first edge cloud node may be the average distance between other edge cloud nodes and the first edge cloud node, or the distance between other edge cloud nodes and the first edge cloud node.
- the distance between the centers may also be the distance between other edge cloud nodes and the closest outer edge of the first edge cloud node, etc., which can be adaptively defined according to requirements.
- Method 2 You can select edge cloud nodes with relatively sufficient bandwidth resources according to the bandwidth resources of other edge cloud nodes. For example, select the edge cloud node with the largest bandwidth resource, or select the bandwidth resource greater than the set bandwidth threshold, or select the bandwidth utilization rate lower The edge cloud node serves as the second edge cloud node. In method 2, the bandwidth resources of the second edge cloud node are sufficient, which can increase the data transmission rate, which is beneficial to improve the migration efficiency.
- Method 3 According to the current load situation of other edge cloud nodes, select the edge cloud node with relatively light load, for example, select the edge cloud node with the smallest load, or select the edge cloud node with the load less than the set load threshold as the second Edge cloud node.
- the load of the second edge cloud node is lighter, it has sufficient resources and can handle instance migration in time, which is beneficial to improve migration efficiency.
- the central management and control device 101 may reserve or allocate resources for the instance to be migrated in the second edge cloud node according to the resource requirements of the instance to be migrated; After the resource reservation or allocation is successful, the instance to be migrated is migrated to the resources reserved or allocated in the second edge cloud node.
- the resource requirements of the instances to be migrated can be combined to determine the type of resources, the amount of resources and/or the performance requirements of the resource equipment required by the instances to be migrated, and resource reservation or allocation can be performed in the second edge cloud node based on this information , which can provide resource guarantee for successful instance migration.
- the process of the central management and control device 101 in the second edge cloud node to reserve or allocate resources for the instances to be migrated please refer to the content of the subsequent resource scheduling part, which will not be repeated here.
- the central management and control device 101 may also notify the service demander of the migration event, so that the service demander can make appropriate response actions, such as Update the information of the instance in the service demander, or make a disaster recovery response to the downtime during the instance migration.
- the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
- the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
- the central control device 101 may also send a migration request to the service demander for the service demander Determine a migration strategy for the instance to be migrated in combination with the business situation on the instance to be migrated; receive the migration strategy sent by the service demander, and migrate the instance to be migrated to the second edge cloud node according to the migration strategy.
- the migration strategy mainly includes at least one information of whether to migrate, migration time, and migration mode.
- the service demander can determine when to perform the migration based on the number of stock service requests and incremental service requests on the instance to be migrated and the response status. For example, all stock service requests on the instance to be migrated can be responded to. And if there are not many incremental business requests, determine the instance migration.
- the central management and control device 101 may send the attribute information of the instance to be migrated in the second edge cloud node together with the migration request to the service demander.
- the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
- the instance to be migrated is an instance that has a specified event but can still run normally
- the instance to be migrated can continue to run on the first edge cloud node, so that the business during the migration process can continue to be scheduled
- To the instance to be migrated in the first edge cloud node ensure business continuity.
- the central management and control device can release the instance to be migrated in the first edge cloud node when there are no more business requests on the instance to be migrated running on the first edge cloud node.
- the service requester no after determining that the service requester no longer has any service requests on the instance to be migrated running in the first edge cloud node, and there is neither an inventory service request nor an incremental service request, it can send a release to the central control device 101 Notification; the central management and control device 101 receives the release notification sent by the service demander, and releases the instance to be migrated running in the first edge cloud node according to the release notification. Further, the central management and control device 101 may also synchronize the running state of the instance to be migrated running in the first edge cloud node to the instance to be migrated in the second edge cloud node.
- migrating the instance to be migrated to the second edge cloud node is mainly to control the corresponding resource equipment in the second edge cloud node to reserve or The process of creating an instance to be migrated on the allocated resources.
- the central management and control device 101 may provide the corresponding resource device in the second edge cloud node with the application image or instance snapshot corresponding to the instance to be migrated, so that the corresponding resource device in the second edge cloud node can perform the pre-processing based on the application image or instance snapshot. Create an instance to be migrated on the reserved or allocated resources, but it is not limited to this.
- the central management and control device 101 of this embodiment may encapsulate its own instance upgrade, instance migration, and other management and control functions into a series of application programming interfaces (Application Programming Interface, API) and open them to service demanders.
- API Application Programming Interface
- These open APIs are called Open APIs (OpenAPI), and the central management and control device 101 can interact with the service demander through OpenAPI.
- the central management and control device 101 can directly control and schedule at least one edge cloud node 102, but it is not limited to this.
- an edge management and control device 103 is also included in the network system 100.
- the number of edge management and control devices 103 may be one or multiple.
- the edge management and control device 103 may be deployed in one or more edge cloud nodes 102.
- an edge management and control device 103 is separately deployed in each edge cloud node 102.
- each edge cloud node includes one or more resource devices.
- the edge management and control device 103 may be deployed on one resource device in a centralized manner, or may be deployed on multiple resource devices in a distributed manner.
- each edge cloud node can include one or more proprietary devices in addition to resource devices.
- the edge management and control device 103 can be deployed on one dedicated device or distributed on multiple dedicated devices.
- the proprietary device refers to the physical device used to deploy the edge management and control device 103, which is different from the resource device.
- the edge management and control device 103 can also be deployed with the central management and control device 101, which is not limited here.
- the central management and control device 101 may be deployed in one or more cloud computing data centers or traditional data centers, and may also be deployed in an edge cloud network together with at least one edge cloud node.
- the central management and control device of this embodiment can be a logical device with the capabilities of resource scheduling and image management. These functions can be implemented on one physical machine or virtual machine, or distributed in multiple devices. On a physical machine or a virtual machine.
- the central management and control device in this embodiment may also be one or more physical devices with capabilities such as resource scheduling and image management.
- the embodiment of the present application does not limit the implementation structure of the central management and control device 101, and any device structure with the foregoing capabilities is applicable to the embodiment of the present application.
- the edge management and control device 103 can also be a logical device, which has the ability to deploy a physical machine (for example, a resource device or a proprietary device in an edge cloud node) or a virtual machine. It can also be deployed on multiple physical machines (such as resource devices or proprietary devices in edge cloud nodes) or virtual machines in a decentralized manner.
- the edge control device can also be one or more physical devices with corresponding capabilities.
- the embodiments of the present application do not limit the implementation structure of the edge management and control device 103, and any device structure with corresponding capabilities is applicable to the embodiments of the present application.
- the edge management and control device 103 can assist and cooperate with the central management and control device 101 to manage and control at least one edge cloud node 102. With the assistance of the edge management and control device 103, the central management and control device 101 can manage and schedule at least one edge cloud node 102 more conveniently and efficiently, thereby achieving the purpose of making full use of edge resources.
- the central management and control device 101 and the edge management and control device 103 can establish a secure and encrypted communication channel, and interact based on the communication channel.
- the communication channel includes a control interface and a data interface
- the central management and control device 101 interacts with the edge management and control device 103 on the control plane and the data plane based on the control interface and the data interface to complete the scheduling and management of the edge cloud node 102.
- the data interface is used for data transmission between the central management and control device 101 and the edge management and control device 103.
- the control interface has but not limited to the following functions:
- the central control device 101 can perform resource scheduling on edge cloud nodes from multiple dimensions through a control interface with resource scheduling capabilities (can be referred to as resource scheduling interface for short).
- the edge cloud node is the central control device 101 for resource scheduling Object;
- the central management and control device 101 can provide images to edge cloud nodes through a control interface with image management and distribution capabilities (referred to as image management interfaces), so that the edge cloud nodes can create images based on the received images
- image management interfaces image management and distribution capabilities
- Operation and maintenance management capability The central control device 101 performs operation and maintenance management on edge cloud nodes through a control interface with operation and maintenance management capabilities (referred to as the operation and maintenance management interface).
- the operation and maintenance management includes but is not limited to: control edge cloud nodes Application, virtualization software, etc., monitor the status, resource usage and infrastructure of the instance.
- the central management and control device 101 of this embodiment has but not limited to the following functions:
- service requirements such as the specifications of cloud computing services, the areas where cloud computing services need to be deployed, the distribution of operator networks, network delays, load conditions, bandwidth costs, required resource types and/or resource equipment Performance requirements, etc., to schedule edge cloud nodes;
- the image required for cloud computing services can be obtained, and the image can be provided to the corresponding resource equipment in the edge cloud node for configuration and installation, so that the corresponding resource equipment can create corresponding instances to provide cloud computing services;
- Operation and maintenance management and control of edge cloud nodes can be performed, including but not limited to: management and control of applications, virtualized components, instance status, resource usage and/or infrastructure conditions in edge cloud nodes, to achieve remote operation and maintenance, logs Management etc.
- the central control equipment can also have other functions, such as security assurance functions, involving the security of the central control equipment, the link security between the central control equipment and the edge control equipment, and the edge cloud nodes.
- Security of cloud nodes responsible for maintaining networking information in the network system.
- At least one edge cloud node 102 can form a resource pool, and each edge cloud node 102 serves as a scheduling object, and provides various resources or cloud computing services externally under the scheduling of the central management and control device 101.
- the central management and control device 101 and the edge management and control device 102 cooperate with each other to perform resource scheduling on at least one edge cloud node 102, and can also perform mirror management and distribution for at least one edge cloud node 102.
- it can also perform resource scheduling on at least one edge cloud node 102.
- the edge cloud node 102 performs resource scheduling and provides a mirror image for at least one edge cloud node 102.
- the management and control of the instances in the edge cloud node 102 is also a problem that the network system 100 needs to solve. Successfully solving this problem is also "putting cloud computing in the distance The basis of processing in the edge cloud node closer to the terminal.
- the central management and control device 101 and the edge management and control device 103 cooperate with each other, and can also manage and control instances in at least one edge cloud node 102, such as at least one of upgrade, migration, shutdown, restart, and release.
- the edge management and control device 103 may assist the central management and control device 101 to upgrade the instance to be upgraded by using the mirror corresponding to the mirror version according to the upgrade strategy.
- the central management and control device 101 may send the upgrade policy and the image corresponding to the image version to the edge management and control device 103, and the edge management and control device 103 uses the image corresponding to the image version to upgrade the instance to be upgraded according to the upgrade policy.
- the central management and control device 101 can send the upgrade strategy and the image corresponding to the mirror version to the edge management and control device 103 in the edge cloud node to which the instance to be upgraded belongs, and the waiting
- the edge management and control device 103 in the edge cloud node to which the upgraded instance belongs uses the image corresponding to the image version to upgrade the instance to be upgraded according to the upgrade strategy.
- the upgrade method indicated by the upgrade strategy can be used.
- the mirror corresponding to the mirror version is provided to the resource device where the instance to be upgraded is located, and the resource device uses the mirror to upgrade the instance. Upgrade.
- the edge management and control device 103 may assist the central management and control device 101 to control the corresponding resource device in the second edge cloud node to reserve or allocate resources for the instance to be migrated.
- the central management and control device 101 may determine the scheduled resource information in the second edge cloud node according to the resource requirements of the instance to be migrated, and provide the resource information to the edge management and control device 103, and the edge management and control device 103 controls the resource according to the resource information.
- the corresponding resource device in the second edge cloud node reserves or allocates resources for the instance to be migrated.
- the central management and control device 101 can provide resource information to the edge management and control device 103 in the second edge cloud node, and the edge management and control device in the second edge cloud node 103, according to the resource information, controls the corresponding resource device in the second edge cloud node to reserve or allocate resources for the instance to be migrated.
- the edge management and control device 103 may also assist the central management and control device 101 in migrating the instance to be migrated to the resources reserved or allocated by the corresponding resource device in the second edge cloud node.
- the central management and control device 101 may send a migration instruction to the edge management and control device 103.
- the migration instruction instructs the edge management and control device 103 to obtain the image or instance snapshot corresponding to the instance to be migrated and provide it to the corresponding resource device in the second edge cloud node for the second edge cloud
- the corresponding resource device in the node creates an instance to be migrated on the reserved or allocated resources according to the image or instance snapshot.
- the central management and control device 101 may send a migration instruction to the edge management and control device 103 in the second edge cloud node to instruct the edge management and control device 103 in the second edge cloud node
- the image or snapshot corresponding to the instance to be migrated is obtained and provided to the corresponding resource device in the second edge cloud node for the corresponding resource device in the second edge cloud node to create the instance to be migrated on the reserved or allocated resources according to the image or snapshot.
- the way in which the edge management and control device 103 obtains the snapshot will be different according to the storage mode of the snapshot. If the snapshot is stored in the first edge cloud node, it depends on whether the state of the first edge cloud node is available. If the first edge cloud node is unavailable, it is not suitable to use the snapshot for instance migration, and you need to use mirroring instead. Instance migration; if the first edge cloud node is in an available state, the edge management and control device can obtain a snapshot from the first edge cloud node.
- the process of obtaining the snapshot has nothing to do with the state of the first edge cloud node, and the edge management and control device can obtain the snapshot from other edge cloud nodes when the other edge cloud nodes are available. .
- the edge management and control device After obtaining the snapshot, the edge management and control device provides the snapshot copy to the corresponding resource device in the second edge cloud node for the corresponding resource device to create an instance to be migrated through the snapshot. Among them, creating an instance through a snapshot can restore the data saved when the snapshot is taken.
- the edge management and control device 103 may first determine whether the mirror corresponding to the instance to be migrated is stored in the second edge cloud node when acquiring the mirror. If the second edge cloud node has a corresponding image, the edge management and control device can directly provide the corresponding image in the second edge cloud node to the corresponding resource device in the second edge cloud node for the corresponding resource device to create an instance to be migrated through the image.
- the edge management and control device can request the corresponding image from the central management and control device; the central management and control device can obtain the image from the mirror library and provide it to the edge management and control device, or instruct the edge management and control device to store the corresponding image from another The image is obtained at the edge cloud node; the edge management and control device provides the corresponding resource device in the second edge cloud node after obtaining the corresponding image, so that the corresponding resource device can create an instance to be migrated through the image.
- the process of the central management and control device instructing the edge management and control device to obtain images from other edge cloud nodes that store corresponding images may refer to the description in the subsequent image management and distribution related embodiments, which will not be repeated here.
- the central management and control device can perform resource scheduling on at least one edge cloud, which mainly refers to determining the target edge cloud node and target edge cloud node that can be scheduled from at least one edge cloud node 102 in the network system 100 according to service demand description information Scheduled resource information; the resource information is sent to the edge management and control device 103 for the edge management and control device 103 to control the corresponding resource device in the target edge cloud node for resource allocation or reservation.
- the number of target edge cloud nodes can be specified by the user, or can be independently determined by the resource center management and control device according to the service requirement description information, and it can be one or more.
- the service demand description information can be directly submitted by the service demander, or it can be extracted or calculated from the service-related information submitted by the service demander.
- the service demander can be a user, an application, a physical machine, or another service that requires a certain service.
- the resource scheduling function described here mainly includes the selection of edge cloud nodes and the resource scheduling within the edge cloud nodes, but it is not limited to these two aspects.
- the internal resource scheduling of the edge cloud node is specifically embodied as the operation of determining the scheduled resource information in the target edge cloud node and providing resource information.
- the main purpose is to allocate cloud computing services to the final at the granularity of each edge cloud node.
- Basic resources such as server and other resource equipment.
- the central control equipment can maintain the information of the resources contained in each edge cloud node as the basis for resource scheduling.
- the service requirement description information includes edge cloud node selection parameters and resource selection parameters.
- the edge cloud node selection parameter refers to the parameter required to select the target edge cloud node;
- the resource selection parameter refers to the information required to select the scheduled resource in the edge cloud node.
- the central management and control equipment can parse out the edge cloud node selection parameters and resource selection parameters from the service demand description information; determine the scheduled target edge cloud node from at least one edge cloud node according to the edge cloud node selection parameters, and according to the resource The selection parameters determine the scheduled resource information in the target edge cloud node.
- the service requirement description information may include the scheduling domain and/or the QoS requirements of the cloud computing service, and these parameters may be used as edge cloud node selection parameters.
- the scheduling domain points to the area where cloud computing services need to be deployed, which determines the geographic location of edge cloud nodes that should be scheduled.
- the QoS requirements of cloud computing services may include the requirements of cloud computing services on network delay, load conditions, and/or bandwidth costs.
- the central management and control device can select the edge cloud node that can meet the scheduling domain and/or QoS requirements as the target according to the QoS requirements of the scheduling domain and/or cloud computing service, combined with the geographic location and resource remaining amount of at least one edge cloud node Edge cloud node.
- the central management and control device may select the edge cloud node pointed to by the scheduling domain as the target edge cloud node in combination with the geographic location of at least one edge cloud node according to the scheduling domain.
- the central management and control device can also select the edge cloud node that meets the network delay, load and/or bandwidth cost requirements based on the QoS requirements of the cloud computing service, such as network delay, load conditions, and/or bandwidth cost requirements.
- the edge cloud node serves as the target edge cloud node.
- the central management and control equipment can also select the edge cloud node that can meet the scheduling domain and QoS requirements as the target edge cloud based on the QoS requirements of the scheduling domain and cloud computing services at the same time, combined with the geographic location and remaining amount of resources of at least one edge cloud node. node.
- the service requirement description information can also include the resource type, the number of resources, and/or the performance of the resource equipment required by the cloud computing service. These parameters can be As a resource selection parameter. Based on this, after determining the target edge cloud node, the central management and control device can determine the scheduled resource information in the target edge cloud node according to the resource selection parameters.
- the resource information here may include: resource type, resource quantity, and/or performance requirements for resource devices, so that the edge management and control device can control the corresponding resource device in the target edge cloud node to allocate or reserve resources accordingly.
- resource types may include, but are not limited to: computing resources such as CPU and GPU, storage resources such as memory and hard disk, and resource types such as bandwidth resources.
- computing resources such as CPU and GPU
- storage resources such as memory and hard disk
- resource types such as bandwidth resources.
- the number of resources can be 12 CPUs, 24 CPUs, etc.
- memory resources such as an example
- the number of resources can be 16G memory, 32G memory, etc.
- bandwidth resources such as an example, the number of resources can be 1M bandwidth, 10M Bandwidth etc.
- the central management and control device can also have the function of computing power orchestration.
- the computing power orchestration is oriented to relatively complex application scenarios. Multiple cloud computing services are bound together as the smallest resource requirement unit. In this way, in the resource scheduling In the process, multiple cloud computing services can be bound together as a whole, and one or several edge cloud nodes can be selected for them, and the same or several edge cloud nodes can provide resources for them together. Computing power scheduling improves the diversity of resource scheduling and increases the flexibility of resource scheduling, but it does not affect the overall process of resource scheduling.
- the image management function of the central control device mainly refers to the management of images and the provision of required images for edge cloud nodes.
- the edge cloud node can create an instance on the corresponding resource device according to the image, and then the created instance can provide users with required cloud computing services.
- edge cloud nodes In practical applications, there are various scenarios where mirroring needs to be provided for edge cloud nodes.
- a user such as a service demander
- the central management and control device can provide a corresponding image for the scheduled target edge cloud node.
- the central management and control device can provide a corresponding image for the scheduled target edge cloud node.
- the central management and control device can provide a corresponding image for the scheduled target edge cloud node.
- the edge cloud node of the computing service provides a corresponding image, so that the edge cloud node creates a new instance based on the image, so as to achieve the purpose of capacity expansion.
- the edge cloud node that needs to be mirrored is recorded as the third edge cloud node.
- the third edge cloud node can be any edge cloud node in the network system, depending on the application scenario. Depends.
- the following takes the central control device to provide a mirror image for the third edge cloud node as an example to describe the image management function of the central control device.
- the central control device can first determine the target image that needs to be provided to the third edge cloud node; then, provide the target image for the third edge cloud node for use by the third edge cloud node
- the target image provides cloud computing services.
- a mirror library is maintained, and the mirror library is used to store images in the system.
- Users can choose to use the mirror in the mirror library.
- the user can be provided with a mirror configuration interface with a drop-down menu.
- the drop-down menu includes many mirrors that can be selected by the user, and the user can choose the mirror to use.
- the central management and control device can obtain the image required by the third edge cloud node from the mirror library, and then provide the image to the third edge cloud node and use the image The permissions are open to the corresponding users.
- the central management and control device may directly issue the target image to the third edge cloud node, or instruct the third edge cloud node to download the target image to a designated storage location.
- the central control device can also maintain the correspondence between the issued image and the edge cloud node where the issued image is located.
- the correspondence relationship may include the identification information of the issued image and the identification information of the edge cloud node where the image has been issued.
- the issued image refers to the image that the central control device has provided (for example, issued) to one or some edge cloud nodes; the edge cloud node where the issued image is located refers to the edge cloud node to which the issued image is provided.
- the same image may be provided (for example, distributed) to one edge cloud node, or may be provided (for example, distributed) to multiple edge cloud nodes.
- the central management and control device can also control the third edge cloud node from the image that already has the image.
- Other edge cloud nodes acquire the image without directly providing the image to the third edge cloud node, which can reduce the processing burden of the central control device to a certain extent, and can also improve the efficiency of image acquisition under the condition of reasonable control.
- the central management and control device may determine the image that needs to be provided to the third edge cloud node.
- the third edge cloud node The image provided by the cloud node is recorded as the target image; according to the information of the target image, a match is made in the correspondence between the maintained issued image and the edge cloud node where the issued image is located; if the corresponding relationship is matched with the target Mirror the corresponding fourth edge cloud node, which means that the target image has been provided to the fourth edge cloud node, and the target image at the fourth edge cloud node can be provided to the third edge cloud node; among them, the fourth edge cloud
- the node can also be an edge cloud node in the network system, and the number can be one or more.
- the target image at the fourth edge cloud node can be obtained.
- the central management and control device may specifically send the information of the fourth edge cloud node and the target mirror to the edge management and control device; the edge management and control device 103 may communicate with the fourth edge cloud node according to the The target image information, the target image at the fourth edge cloud node is provided to the corresponding resource device in the third edge cloud node, for the corresponding resource device to create an instance that can provide cloud computing services based on the target image, and then provide it to the service demander The cloud computing service.
- the information of the fourth edge cloud node may be any information that can identify the fourth edge cloud node, for example, it may be information such as the ID, name, or geographic location of the fourth edge cloud node.
- the information of the target image can be any information that can identify the target image, such as the ID, name, or number of the target image.
- the central management and control device 101 may specifically send information about the fourth edge cloud node and the target image to the third edge cloud
- the edge management and control device in the node is used for the edge management and control device in the third edge cloud node to obtain the target image from the fourth edge cloud node through the communication channel between it and the edge management and control device in the fourth edge cloud node and provide it to the first edge cloud node.
- the edge management and control device 103 in the third edge cloud node can receive the information of the fourth edge cloud node and the target mirror image sent by the central management and control device 101, and according to the information of the fourth edge cloud node and the target mirror image, through it and the first
- the communication channel between the edge management and control devices in the four-edge cloud node obtains the target image from the fourth edge cloud node, and provides the target image to the corresponding resource device in the third edge cloud node, so that the corresponding resource device can create an image based on the target image.
- the edge management and control device 103 in the third edge cloud node obtains the target image from the fourth edge cloud node through the communication channel between it and the edge management and control device in the fourth edge cloud node.
- the process includes: third The edge management and control device 103 in the edge cloud node sends a request for acquiring the target image to the edge management and control device 103 in the fourth edge cloud node through the communication channel between it and the edge management and control device in the fourth edge cloud node. Carry the information of the target image.
- the edge management and control device 103 in the fourth edge cloud node receives the request, and determines whether the target image exists in the fourth edge cloud node according to the target image information carried in the request, and whether the target image exists in the fourth edge cloud node Next, through the communication channel between it and the edge management and control device 103 in the third edge cloud node, the target image is returned to the edge management and control device 103 in the third edge cloud node, or the target image is mirrored in the fourth edge cloud node The storage address of is returned to the edge management and control device 103 in the third edge cloud node.
- the edge management and control device 103 in the third edge cloud node receives the target image returned by the edge management and control device 103 in the fourth edge cloud node, or receives the target image returned by the edge management and control device 103 in the fourth edge cloud node in the fourth edge cloud.
- the storage address in the node read or download the target image according to the storage address.
- the edge management and control device 103 in the third edge cloud node and the edge management and control device 103 in the fourth edge cloud node may establish a communication channel by themselves, or may establish a channel under the control of the central management and control device 101.
- the central management and control device can also control the establishment of communication channels between different edge management and control devices, and is responsible for maintaining the information of the existing communication channels between the edge management and control devices, for example, which edge management and control devices have established communication channels and communication When the channel is established, the status of the communication channel, and the retention time information.
- the central management and control device determines that the target image has been provided to the fourth edge cloud node, and before providing the information of the fourth edge cloud node and the target image to the edge management and control device in the third edge cloud node, it can also According to the information of the existing communication channel between the maintained edge management and control devices, determine whether there is a communication channel between the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node; if the judgment result is No, that is, there is no communication channel between the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node, you can control the edge management and control device in the third edge cloud node and the fourth edge cloud
- the edge management and control device in the node establishes a communication channel, so that the edge management and control device in the third edge cloud node can obtain the target image from the fourth edge cloud node through the communication channel.
- the central management and control device After the edge management and control device in the third edge cloud node establishes a communication channel with the edge management and control device in the fourth edge cloud node, the central management and control device provides the fourth edge cloud node and target image information to the third edge cloud node Edge control equipment in China.
- the judgment result is yes, that is, there is already a communication channel between the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node, you can directly mirror the fourth edge cloud node and the target The information is provided to the edge management and control device in the third edge cloud node.
- the central management and control device can also provide the information of the fourth edge cloud node and the target image to the edge management and control device in the third edge cloud node according to the existing communication channel between the maintained edge management and control devices.
- Information determine whether there is a communication channel between the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node; if the judgment result is no, that is, the edge management and control device in the third edge cloud node and If there is no communication channel between the edge management and control devices in the fourth edge cloud node, the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node can be controlled to establish a communication channel to facilitate the third
- the edge management and control device in the edge cloud node can obtain the target image from the fourth edge cloud node through the communication channel.
- the central management and control device may also provide the target image at the fourth edge cloud node to the third edge cloud node according to the fourth edge cloud node.
- the target image at the fourth edge cloud node is provided to the third edge cloud node; if the judgment result is no, the target image can be obtained from the image library and the target image is provided to the third edge cloud node.
- the fourth edge cloud node can be combined with the operator to which the fourth edge cloud node belongs to determine whether the operator to which the fourth edge cloud node belongs is the same as the operator to which the first edge cloud node belongs; if the judgment result is yes, it means that the fourth edge cloud node is The first edge cloud node is an edge cloud node under the same operator.
- the two can perform data transmission, and the data transmission rate is faster than the cross-operator data transmission rate, which is suitable for providing target mirroring for the first edge cloud node.
- the location attribute of the fourth edge cloud node can be combined to determine whether the distance between the fourth edge cloud node and the third edge cloud node is less than the set distance threshold; if the judgment result is yes, the fourth edge cloud node Close to the third edge cloud node, it is suitable to provide the target image for the third edge cloud node.
- the fourth edge cloud node that is closer to the third edge cloud node provides a mirror image for the third edge cloud node, which is convenient for the third edge cloud node.
- the edge cloud node quickly obtains the image to improve efficiency.
- the distance between the fourth edge cloud node and the third edge cloud node can be the average distance between two edge cloud nodes, or the distance between the centers of two edge cloud nodes, or two edge clouds
- the distance between nodes and the nearest outer edge can be flexibly defined according to requirements.
- the bandwidth attribute of the fourth edge cloud node can be combined to determine whether the available bandwidth of the fourth edge cloud node is greater than the set bandwidth threshold; if the judgment result is yes, it means that the bandwidth resource of the fourth edge cloud node is relatively abundant, and it is suitable for
- the third edge cloud node provides the target image, so that the fourth edge cloud node with sufficient bandwidth resources provides the image for the third edge cloud node, which can ensure the transmission rate of the image, and facilitate the third edge cloud node to quickly obtain the image and improve efficiency .
- the load attribute of the fourth edge cloud node can be combined with the load attribute of the fourth edge cloud node to determine whether the load of the fourth edge cloud node is less than the set load threshold; if the judgment result is yes, it means that the load of the fourth edge cloud node is lighter, and it is suitable for The third edge cloud node provides the target image, so that the lighter-loaded fourth edge cloud node provides the image for the third edge cloud node.
- it can achieve load balancing, and on the other hand, it is also convenient for the third edge cloud node to quickly obtain the image. ,Improve efficiency.
- multiple attributes of the fourth edge cloud node can be combined to use the above several methods in combination, and then select a target image suitable for the first edge cloud node.
- the fourth edge cloud node For example, if there are multiple fourth edge cloud nodes, the operators to which the multiple fourth edge cloud nodes belong can be combined to select the first edge cloud node belonging to the same operator as the first edge cloud node from the multiple fourth edge cloud nodes.
- the fourth edge cloud node provides a target image for the first edge cloud node.
- the target image may have been provided to the third edge cloud node.
- the third edge cloud node For example, in a business expansion scenario, it is necessary to create a new instance in the edge cloud node that is currently providing cloud computing services to the service demander. The image used is the same as the image used by the previous instance. If the edge cloud node still stores the image used by the previous instance, there is no need to repeatedly provide the image for the edge cloud node.
- the central management and control device can determine the maintained issued image and the edge cloud node where the issued image is located before providing the target image at the fourth edge cloud node to the third edge cloud node Whether the third edge cloud node is included in the corresponding relationship; if the judgment result is yes, it indicates that the target image has been provided to the third edge cloud node, and the target image is still stored in the third edge cloud node, then the target image can be The information is provided to the third edge cloud node for the third edge cloud node to read the target image stored in it, without the need to transmit the target image again, which can save network resources consumed by the transmission of the target image; if the judgment result is no, it indicates that it has not been sent to The third edge cloud node has provided the target image, or the target image no longer exists in the third edge cloud node, the target image at the fourth edge cloud node may be provided to the third edge cloud node.
- the edge management and control device when the edge management and control device is deployed in the third edge cloud node, if the central management and control device determines that the corresponding relationship between the maintained issued image and the edge cloud node where the issued image contains the target image, the target image
- the information of the image is provided to the edge management and control device in the third edge cloud node, and the edge management and control device in the third edge cloud node can obtain the target image from the storage space of the image in the third edge cloud node according to the information of the target image.
- the image is provided to the corresponding resource device in the third edge cloud node, so that the corresponding resource device can create an instance that can provide cloud computing services according to the target image.
- the same edge cloud node may provide multiple cloud computing services for the same user or different users, and may receive multiple images, and these images will be stored in the edge cloud node.
- Edge cloud nodes can provide a certain amount of storage space for storing images. Considering that the storage space of the image in the edge cloud node is limited, in order to have enough storage space to store the newly received image, the edge cloud node needs to eliminate the locally stored image.
- the central management and control device is responsible for providing a mirroring elimination strategy for edge cloud nodes. The central management and control device can generate the elimination strategy of the image, deliver the elimination strategy to each edge cloud node, and each edge cloud node performs elimination processing on the stored image according to the elimination strategy.
- the central management and control equipment can issue the elimination strategy to the edge management and control equipment, and the edge management and control equipment eliminates the images stored in each edge cloud node according to the elimination strategy. Furthermore, in the case where edge management and control equipment is deployed in each edge cloud node, the central management and control equipment can issue the elimination strategy to the edge management and control equipment in each edge cloud node, and the edge management and control equipment in each edge cloud node will be The elimination strategy eliminates the image stored in the edge cloud node to which it belongs.
- the elimination strategy may be an elimination strategy with the earliest receiving time, that is, according to the receiving time of the image, the image with the earliest receiving time is preferentially eliminated.
- the elimination strategy may be the elimination strategy with the least frequency of use, that is, the image with the least frequency of use is preferentially eliminated according to the frequency of use of the image.
- the elimination strategy may be the elimination strategy with the largest resource occupation, that is, according to the size of the storage space occupied by the image, the image with the largest storage space is first eliminated.
- the image stored in the node can be eliminated regularly according to the above elimination strategy; or, whenever a new image needs to be received or acquired, it can be judged whether there is enough storage space in the node for storage If there is not enough storage space in the current node for the new image, the image stored in the node is eliminated according to the above elimination strategy, so as to store the new image.
- the edge of the third edge cloud node can determine whether there is enough storage space in the third edge cloud node to store the target image; if there is not enough storage space in the third edge cloud node, it will eliminate the image stored in the third edge cloud node according to the elimination strategy. In order to have enough storage space to store the target image. Optionally, if there is enough storage space in the third edge cloud node, the image stored in the third edge cloud node may not be eliminated temporarily.
- the network system 100 further includes: an image construction device 104.
- the image construction device 104 may be deployed in one or more edge cloud nodes, and is mainly responsible for the construction and verification of application images.
- the image construction device 104 can provide an edge cloud environment, can build an image that is compatible with the edge cloud environment, and can also verify whether the image is compatible with the edge cloud environment.
- the image that is not compatible with the edge cloud environment can be reconstructed or output Adapted prompt information, etc. Based on the image building device 104, the user can add a new image to the network system 100.
- a user can submit a third request for adding a new image to the central management and control device, and the third request includes image construction information;
- the device sends a construction request, which includes image construction information; after receiving the construction request, the image construction device obtains the image construction information from it, constructs an image adapted to the edge cloud environment based on the image construction information, and returns the constructed image to the center Control equipment; the central control equipment receives the newly constructed mirror image returned by the mirror construction equipment and adds it to the mirror library to continuously enrich the mirror library.
- a mirroring rule and specification can be provided to users (such as service demanders), allowing users to make or generate mirrors by themselves.
- the mirrors generated or made by users need to conform to the edge cloud Environmental safety, regulations and other related requirements.
- the user can send a fourth request for adding a new image to the central control device.
- the fourth request includes the image to be added.
- the new image refers to the image made or generated by the user. This embodiment does not It does not limit the way users make or generate images.
- the central control device receives the fourth request, obtains the image to be added from the fourth request, and sends the image to be added to the image construction device; the image construction device adapts the image to be added to the edge cloud environment; if it is to be added The image is adapted to the edge cloud environment, and the image construction device returns a message to the central control device that the new image is adapted to the edge cloud environment; if the new image is not compatible with the edge cloud environment, the image construction device returns to the central control device A message that the new image is not compatible with the edge cloud environment.
- the central control equipment if it receives a message from the mirror construction device that the new image to be added is adapted to the edge cloud environment, it will add the new mirror to the mirror library; if the mirror construction service is received, the mirror construction device returns The message that the new image to be added is not compatible with the edge cloud environment, or informs the user to re-submit the new image after reconstruction, or informs the user to provide the reconstruction method of the new image for the image building service image building equipment According to the reconstruction method, the newly added image is reconstructed into an image adapted to the edge cloud environment.
- the central control device can provide the reconstruction method to the image construction device, and the image construction device reconstructs the newly added image according to the reconstruction method to make it compatible with the edge cloud environment It adapts and returns the reconstructed image to the central control device; the central control device receives the reconstructed image and adds it to the mirror library.
- the image construction device 104 may be a logical device with functions such as image construction and verification (for example, it may be an instance that can provide image construction environment and resources, and has functions such as application deployment and image verification). These functions can be It can be implemented on one physical machine or virtual machine, or it can be distributed on multiple physical machines or virtual machines.
- the image construction device 104 of this embodiment may also be one or more physical devices with functions such as image construction and verification.
- the embodiments of this application do not limit the implementation structure of the image construction device, and any device structure with the above-mentioned functions is applicable to the embodiments of this application.
- the central management and control device can count the usage frequency of each mirror in the mirror library regularly or in real time, use mirrors with a frequency less than the frequency threshold as the mirrors to be deleted, and execute the mirror deletion process to delete them.
- the central management and control device may also receive a mirror deletion request submitted by a user (such as a service demander), use the mirror deleted in the mirror deletion request as a mirror to be deleted, and execute the mirror deletion process to delete it.
- the image deletion request may carry information of the image to be deleted, such as ID, name, or serial number.
- any of the above methods can be used to determine the image to be deleted.
- the image to be deleted can be deleted from the mirror library on the one hand, and the image to be deleted can be indicated to be stored on the other hand.
- the edge cloud node will delete the image to be deleted.
- the central management and control device may match the maintained corresponding relationship between the issued image and the edge cloud node where the issued image is located according to the image to be deleted, and determine the edge cloud node storing the image to be deleted according to the matching result.
- the fifth edge cloud node corresponding to the image to be deleted is matched in the corresponding relationship, it means that the image to be deleted has been issued to the fifth edge cloud node, and the image to be deleted is still stored in the fifth edge cloud node, so
- the fifth edge cloud node sends a deletion instruction, and the deletion instruction carries information about the image to be deleted to instruct the fifth edge cloud node to delete the image to be deleted stored therein.
- the fifth edge cloud node may be one or multiple.
- the central management and control device may specifically send a deletion instruction to the edge management and control device 103; the edge management and control device 103 receives the deletion instruction issued by the central management and control device, and then deletes the instruction from the Obtain the information of the image to be deleted in, and determine whether the image to be deleted is stored in the fifth edge cloud node according to the information of the image to be deleted; if the image to be deleted is stored in the storage, delete the image to be deleted in the fifth edge cloud node.
- the central management and control device 101 may specifically send a deletion instruction to the edge management and control device 103 in the fifth edge cloud node; the edge management and control device in the fifth edge cloud node 103 receives the delete instruction issued by the central management and control device, obtains the information of the image to be deleted from the delete instruction, and determines whether the image to be deleted is stored in the fifth edge cloud node according to the information of the image to be deleted; The image to be deleted stored in the fifth edge cloud node is deleted.
- the central control device deletes the image to be deleted from the image library, and the edge cloud node storing the image to be deleted also deletes the image to be deleted stored in it, the image deletion process is completed.
- the capabilities that can be supported by hardware or software under the control of the central control device 101 or the edge control device 103 are in the form of virtualization as an example
- Provide resources such as computing, network, and storage, and the corresponding image will be mounted to the corresponding instance in the form of a system disk.
- the capabilities of these resource devices can be used to provide cloud computing services.
- the resource device provides computing, network, and storage resources for the instance under the control of the edge management and control device, including: the edge management and control device applies for related resources from the resources allocated or reserved in the target edge node cloud according to the resource template provided by the central management and control device
- the computing resources, storage resources and/or network resources of the target edge cloud node are used to create related resources by calling the calculation, storage, network and other executors in the target edge cloud node.
- resource creation actions include: processing storage-related resources, creating an instance system disk based on the configuration information and content of the image, creating a corresponding data disk based on the resource template; creating network resources that the instance depends on, such as IP addresses, virtual switches And so on; and combine resource templates to create computing resources.
- the central management and control device may perform operation and maintenance management and control on at least one edge cloud node with the assistance of the edge management and control device.
- the edge management and control device may perform operation and maintenance monitoring on at least one edge cloud node and report the operation and maintenance monitoring data to the central management and control device, so that the central management and control device can manage and control at least one edge cloud node according to the operation and maintenance monitoring data.
- the central management and control device can perform operation and maintenance control on at least one edge cloud node according to the operation and maintenance monitoring data reported by the edge management and control device.
- the operation and maintenance monitoring of at least one edge cloud node can be carried out under the control of the central management and control equipment and the operation and maintenance monitoring data is reported to the central management and control equipment for the central management and control equipment according to the operation and maintenance
- the monitoring data controls the operation and maintenance of at least one edge cloud node.
- the edge management and control device may periodically perform operation and maintenance monitoring on at least one edge cloud node according to a timed task and report the operation and maintenance monitoring data to the central management and control device.
- the edge management and control equipment mainly performs functions such as monitoring, data collection, and reporting, while the operation and maintenance decisions are determined by the central management and control equipment.
- the central management and control device controls the edge management and control device to perform operation and maintenance monitoring of at least one edge cloud node, which can adopt but not limited to the following optional implementation manners:
- the central management and control device may send the first type of operation and maintenance monitoring instruction to the edge management and control device to instruct the edge management and control device to perform operation and maintenance monitoring on at least one edge cloud node from at least one operation and maintenance dimension and to The operation and maintenance monitoring data in the operation and maintenance dimension is reported to the central control equipment.
- the first type of operation and maintenance monitoring instruction is a monitoring instruction that instructs the edge management and control device to perform operation and maintenance monitoring of at least one edge cloud node from at least one operation and maintenance dimension and report operation and maintenance monitoring data in at least one operation and maintenance dimension.
- edge management and control equipment For edge management and control equipment, it can receive the first type of operation and maintenance monitoring instructions sent by the central management and control equipment, and according to the first type of operation and maintenance monitoring instructions, perform operation and maintenance monitoring of at least one edge cloud node from at least one operation and maintenance dimension, and Report the operation and maintenance monitoring data on at least one operation and maintenance dimension to the central control equipment.
- the central management and control device controls the operation and maintenance of at least one edge cloud node according to the operation and maintenance monitoring data in at least one operation and maintenance dimension reported by the edge management and control device. It is worth noting that at least one operation and maintenance dimension can be flexibly set according to application requirements and preset into edge control equipment and central control equipment. For examples of operation and maintenance dimensions, refer to the subsequent embodiments.
- the central management and control device may selectively perform operation and maintenance control on at least one edge cloud node in one or some operation and maintenance dimensions. Based on this, the central management and control device can send the second type of operation and maintenance monitoring instructions to the edge management and control device.
- the second type of operation and maintenance monitoring instruction corresponds to the specified operation and maintenance dimension, and is used to instruct the edge management and control device to check at least one edge in the specified operation and maintenance dimension.
- the cloud node performs operation and maintenance monitoring and reports the operation and maintenance monitoring data on the specified operation and maintenance dimension.
- For edge management and control equipment it can receive the second type of operation and maintenance monitoring instructions sent by the central management and control equipment, and perform operation and maintenance monitoring on at least one edge cloud node in the specified operation and maintenance dimension according to the second type of operation and maintenance monitoring instructions, and specify The operation and maintenance monitoring data in the operation and maintenance dimension is reported to the central management and control device, so that the central management and control device can perform operation and maintenance control on at least one edge cloud node according to the operation and maintenance monitoring data in the designated operation and maintenance dimension.
- the central management and control device is also used to receive the operation and maintenance monitoring data in the specified operation and maintenance dimension sent by the edge management and control device, and perform operation and maintenance control on at least one edge cloud node according to the operation and maintenance monitoring data in the specified operation and maintenance dimension.
- the edge management and control device periodically performs operation and maintenance monitoring of at least one edge cloud node according to a timed task, may periodically perform operation and maintenance monitoring of at least one edge cloud node from at least one operation and maintenance dimension according to a timed task; Further, the operation and maintenance monitoring data in at least one operation and maintenance dimension can be reported to the central control equipment. Among them, the monitoring period on different operation and maintenance dimensions may be the same or different.
- the edge management and control device can scan the edge cloud node for security vulnerabilities every 10 minutes, or monitor the traffic of the edge cloud node every 5 minutes.
- each designated operation and maintenance dimension can correspond to a second-type operation and maintenance monitoring instruction, that is, the central control device can send multiple second-type operation and maintenance monitoring instructions to the edge control device.
- the second type of operation and maintenance monitoring instruction corresponds to a specified operation and maintenance dimension.
- multiple designated operation and maintenance dimensions can also correspond to the same second-type operation and maintenance monitoring instruction, that is, the central management and control device can send a second-type operation and maintenance monitoring to the edge management and control device Instruction, this second type of operation and maintenance monitoring instruction corresponds to multiple specified operation and maintenance dimensions.
- the aforementioned at least one operation and maintenance dimension or specified operation and maintenance dimension may include but is not limited to the following dimensions: the object dimension in the running state, the log dimension, the security dimension, and the resource dimension.
- the object dimension in the running state may include the operating state dimension of the object and/or the life cycle dimension of the object;
- the security dimension may include: the traffic attack dimension and/or the security vulnerability dimension.
- the central management and control device performs O&M control on at least one edge cloud node, including but not limited to at least one of the following O&M control examples:
- Operation and maintenance management and control example 1 The central management and control device controls the edge management and control device to monitor the status of objects in at least one edge cloud node that are in operation.
- the control method includes sending a first type of operation and maintenance monitoring instruction to the edge management and control device or sending a second type of operation and maintenance monitoring instruction corresponding to the operating state dimension of the object.
- the edge management and control equipment is under the control of the central management and control equipment, or periodically according to timing tasks, monitors the status of the objects in the running state of at least one edge cloud node, and reports the running status of the monitored objects in the running state to Central control equipment.
- the central management and control equipment identifies objects with abnormal operating status from the operating status of the objects in the operating status reported by the edge management and control equipment.
- the objects with abnormal operating status are called target objects, and exception handling is performed on the target objects.
- the objects in the running state in the edge cloud node include, but are not limited to: instances, images, containers, other virtual components, physical machines, CPUs, and/or hard disks.
- the abnormal situation of the running state will be different.
- possible abnormal conditions include, but are not limited to: interruption, error reporting, and/or failure.
- possible abnormal conditions include, but are not limited to: crashes, black screens, alarms, and/or crashes of applications running on the physical machine.
- the exception handling method will be different, for example, it can include but not limited to: alarm, stop or restart the target object, migrate, delete and rebuild the target object, etc.
- Operation and maintenance control example 2 The central control device controls the edge control device to monitor the life cycle of at least one edge cloud node in the running state.
- the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the life cycle dimension of the object.
- the edge control device is under the control of the central control device, or periodically according to a scheduled task, monitors the life cycle of at least one edge cloud node in the running state, and reports the life cycle of the monitored object in the running state Give the center control equipment.
- the central management and control device controls the stopping, restarting, migration or deletion of the running object after stopping, according to the life cycle of the running object reported by the edge management and control device.
- Operation and maintenance control example 3 The central control device controls the edge control device to collect log data in at least one edge cloud node.
- the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the log dimension.
- the edge management and control device collects log data in at least one edge cloud node under the control of the central management and control device or periodically according to a timed task, and reports the collected log data to the central management and control device.
- the central management and control device receives the log data reported by the edge management and control device, performs data analysis on the log data, and performs follow-up actions based on the data analysis results, such as billing, risk control, and/or adding or subtracting instances.
- log data may include, but is not limited to: various performance, indicators and other data in edge cloud nodes, such as: instance bandwidth traffic, instance current running status, instance IO load, physical machine bandwidth traffic, physical machine The current operating status, the IO load of the physical machine, the operating status of the edge management and control equipment, and/or the operating status of other virtualization components, etc.
- the central control device can not only collect the log data of each edge cloud node reported by the edge control device, but also has the ability to perform data inspection. For some data, if the data stored by the central control device is inconsistent with the data in the edge cloud node, The latest data can be actively synchronized with the edge cloud node, for example, the latest version of the image can be synchronized with the edge cloud node.
- Operation and maintenance control example 4 The central control device controls the edge control device to monitor the traffic of at least one edge cloud node.
- the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the traffic attack dimension.
- the edge management and control equipment is under the control of the central management and control equipment, or periodically according to a timing task, monitors the flow of at least one edge cloud node, and reports the monitored traffic attack events to the central management and control equipment.
- the central management and control equipment blocks traffic attack events that occur in edge cloud nodes.
- the edge management and control device may also report the monitored flow data to the central management and control device, and the central management and control device may also perform flow attack defense on at least one edge cloud node based on the flow data.
- Operation and maintenance control example 5 The central control equipment controls the edge control equipment to scan for network security vulnerabilities on at least one edge cloud node.
- the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the network security dimension.
- the edge management and control equipment is under the control of the central management and control equipment, or periodically according to timing tasks, scans for network security vulnerabilities on at least one edge cloud node, and reports the scanned network security vulnerabilities to the central management and control equipment.
- the central control equipment receives the network security vulnerabilities reported by the edge control equipment, and repairs the network security vulnerabilities.
- Operation and maintenance control example 6 The central control device controls the edge control device to monitor the resource usage in at least one edge cloud node.
- the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the resource dimension.
- the edge management and control device is under the control of the central management and control device, or periodically according to a timed task, monitors the resource usage in at least one edge cloud node, and reports the monitored resource usage information to the central management and control device.
- the central management and control device performs resource expansion or reduction on at least one edge cloud node based on the resource usage information reported by the edge management and control device.
- the resources here include various resource information, such as equipment resources such as physical machines, storage resources, computing resources such as CPUs and GPUs, and network resources such as bandwidth.
- each edge management and control device can, under the control of the central management and control equipment, perform operation and maintenance monitoring on its edge cloud node and monitor the operation and maintenance of its edge cloud node.
- the operation and maintenance monitoring data is reported to the central control equipment.
- the central management and control device can receive the operation and maintenance monitoring data reported by the edge management and control device in each edge cloud node, and perform operation and maintenance management and control on each edge cloud node according to the operation and maintenance monitoring data in each edge cloud node.
- a structural framework of a central management and control device includes: a resource scheduling management and control module, a mirror management and control module, and a central operation and maintenance module; the central operation and maintenance module further includes: a central monitoring unit, a central log unit, and a central security unit.
- a structural framework of edge management and control equipment includes: a resource scheduling service module, a mirroring service module, and an edge operation and maintenance module; the edge operation and maintenance module further includes: an edge monitoring unit, an edge log unit, and an edge security unit.
- the resource scheduling management and control module in the central management and control device cooperates with the resource scheduling service module in the edge management and control device to perform resource scheduling on edge cloud nodes.
- the resource scheduling function please refer to the description below.
- the image management and control module in the central management and control device cooperates with the image service module in the edge management and control device to perform image management and distribution for edge cloud nodes.
- image management and distribution functions please refer to the description below.
- the central operation and maintenance module in the central management and control equipment and the edge operation and maintenance module in the edge management and control equipment cooperate with each other to perform operation and maintenance management and control on the edge cloud nodes.
- the above operation and maintenance control examples 1-6 can be implemented by the corresponding units in the central operation and maintenance module and the edge operation and maintenance module.
- Operation and maintenance control example 3 can be realized by the cooperation of the central log unit in the central operation and maintenance module and the edge log unit in the edge operation and maintenance module.
- the central log unit sends the first type of operation and maintenance monitoring instruction or the second type of operation and maintenance monitoring instruction corresponding to the log dimension to the edge log unit; the edge log unit collects the edge according to the first or second type of operation and maintenance monitoring instruction
- the log data in the cloud node is reported to the central log unit; the central log unit performs data analysis on the log data and executes follow-up actions based on the data analysis results.
- Operation and maintenance control examples 4 and 5 can be realized by the cooperation of the central security unit in the central operation and maintenance module and the edge security unit in the edge operation and maintenance module.
- the central security unit sends the first type of operation and maintenance monitoring instruction to the edge security unit or sends the second type of operation and maintenance instruction corresponding to the traffic attack or the network security dimension;
- the edge security unit can be based on the first or second type of operation and maintenance Instruct the edge cloud nodes to perform traffic monitoring or network security vulnerability scanning, and report the monitored traffic attack events or network vulnerability security issues to the central security unit;
- the central security unit blocks traffic attack events or conducts network security vulnerability issues repair.
- Operation and maintenance control examples 1, 2 and 6 can be implemented by the central monitoring unit in the central operation and maintenance module and the edge monitoring unit in the edge operation and maintenance module, and the detailed implementation process is not repeated.
- the central management and control equipment can understand the health, resource usage, log data and/or infrastructure conditions of each instance in the edge cloud node, and can realize remote operation and maintenance, log management, etc.
- the edge management and control device in addition to the central management and control device that can perform O&M management and control on at least one edge cloud node, in the case where the central management and control device does not perform O&M management and control on the edge cloud node or cannot perform O&M management and control on the edge cloud node ,
- the edge management and control device can autonomously perform operation and maintenance management and control on at least one edge cloud node.
- the edge management and control device can monitor the connection between it and the central management and control device. When the connection with the central management and control device is lost, it can be determined that the central management and control device cannot perform operation and maintenance control on the edge cloud node.
- One operation and maintenance dimension controls the operation and maintenance of at least one edge cloud node.
- the edge management and control device can wait to receive the central management and control device to send If it does not receive the first type of operation and maintenance monitoring instruction sent by the central control device, it can be determined that the central control device is incorrect or cannot perform the operation and maintenance control on at least one edge cloud node. At least one operation and maintenance dimension controls the operation and maintenance of at least one edge cloud node.
- the edge management and control device and the central management and control device may pre-appoint the waiting time for the first type of operation and maintenance monitoring instruction. If the waiting time is exceeded and the first type of operation and maintenance monitoring instruction sent by the central management and control device is not received, then It is determined that the first type of operation and maintenance monitoring instruction sent by the central control equipment has not been received.
- the central management and control device sends the second type of operation and maintenance monitoring instructions corresponding to the specified operation and maintenance dimension to the edge management and control device to control the manner in which the edge management and control device performs operation and maintenance monitoring of at least one edge cloud node from the specified operation and maintenance dimension
- the edge management and control device can wait to receive the second type of operation and maintenance monitoring instruction sent by the central control device. If the second type of operation and maintenance monitoring instruction sent by the central control device is not received in the specified operation and maintenance dimension, it can be determined that the central control device is in If the specified operation and maintenance dimension is incorrect or unable to perform operation and maintenance control on at least one edge cloud node, it is possible to autonomously perform operation and maintenance control on at least one edge cloud node from the specified operation and maintenance dimension.
- the edge management and control device autonomously controls the operation and maintenance of at least one edge cloud node from at least one operation and maintenance dimension when the connection with the central management and control device is lost, then after the connection with the central management and control device is restored,
- the operation and maintenance control data during the loss of connection can be synchronized to the central control equipment.
- the operation and maintenance control data mainly includes data such as strategies, methods, and effects of operation and maintenance control, and of course, it can also include operation and maintenance monitoring data.
- the above-mentioned at least one operation and maintenance dimension or the designated operation and maintenance dimension may include, but is not limited to, the following dimensions: the object dimension in the running state, the log dimension, the security dimension, and the resource dimension.
- the object dimension in the running state may include the operating state dimension of the object and/or the life cycle dimension of the object;
- the security dimension may include: the traffic attack dimension and/or the security vulnerability dimension.
- the edge management and control device autonomously performs O&M control on at least one edge cloud node, including but not limited to at least one of the following O&M control examples:
- Operation and maintenance management and control example a autonomously monitor the status of objects in the running state in at least one edge cloud node, and perform exception handling for the monitored target objects whose running status is abnormal.
- the objects in the running state and the abnormal conditions of the running state please refer to the above description, which will not be repeated here.
- the edge management and control device when the edge management and control device performs abnormal processing on the target object, it is specifically used to: analyze the abnormal operating state of the target object, and determine at least one candidate processing method according to the analysis result; In the candidate processing method, the target processing method is acquired, and the target object is abnormally processed according to the target processing method.
- the edge management and control device when the edge management and control device obtains the target processing mode, it is specifically used to: when the edge management and control device maintains a connection with the central management and control device, report at least one candidate processing method to the central management and control device for the central management and control device to use.
- Select the processing method receive the processing method returned by the central control device as the target processing method; or, in the case that the edge control device loses the connection with the central control device, output at least one candidate processing method to the edge operation and maintenance control personnel for the edge
- the operation and maintenance personnel select the processing method; in response to the selection operation of the edge operation and maintenance management and control personnel, determine the selected processing method as the target processing method; or, in the case of loss of connection with the central control equipment, follow the set selection strategy,
- the target processing method is selected from at least one candidate processing method.
- Operation and maintenance management and control example b Autonomously monitor the life cycle of objects in the running state in at least one edge cloud node, and control the objects in the running state to stop, restart or delete after stopping according to the monitoring results. For containers or instances, you can control the container or instance to stop execution, restart after stopping, or delete the container or instance, etc.
- Operation and maintenance control example c autonomously collect log data in at least one edge cloud node, perform data analysis on the log data, and perform follow-up actions based on the data analysis results.
- Log data includes, but is not limited to, the bandwidth traffic of the instance in the edge cloud node, the current running status of the instance, the IO load of the instance, the bandwidth traffic of the physical machine, the current running status of the physical machine, the IO load of the physical machine, and the operation of edge control equipment. Status and/or operation status of other virtualization components.
- subsequent actions such as billing, risk control, and resource reallocation can be performed according to the analysis result of the log data, but are not limited to this.
- Operation and maintenance control example d Autonomously monitor the traffic of at least one edge cloud node, and block the monitored traffic attack events.
- Operation and maintenance control example e Autonomously scan for network security vulnerabilities on at least one edge cloud node, and fix the scanned network security vulnerabilities.
- Operation and maintenance control example f autonomously monitor the resource usage in at least one edge cloud node, and perform resource expansion or reduction on at least one edge cloud node according to the monitoring result.
- the resources here include but are not limited to: equipment resources such as physical machines, storage resources such as memory and disks, computing resources such as CPU and GPU, and network resources such as bandwidth. For these resources, when the usage is high, the capacity can be expanded for these resources, and when the usage is low, the capacity can be reduced for these resources.
- each edge management and control device can autonomously belong to its edge when the central management and control device is incorrect or cannot perform operation and maintenance control on its edge cloud node.
- Cloud nodes perform operation and maintenance management and control.
- the edge management and control device may periodically perform the operation, maintenance, management and control on at least one edge cloud node according to a timing task .
- the edge management and control device can monitor the traffic of at least one edge cloud node every 10 minutes according to the scheduled task, and block the monitored traffic attack event.
- the edge management and control device may scan for network security vulnerabilities on at least one edge cloud node every 5 minutes according to a scheduled task, and fix the scanned network security vulnerabilities.
- the edge management and control device can also autonomously control the operation and maintenance of at least one edge cloud node according to other independent strategies. For example, it can autonomously control the operation and maintenance of at least one edge cloud node at a fixed time every day. .
- the central management and control device is combined with the edge management and control device, and the central management and control device can perform operation, maintenance, management and control on at least one edge cloud node with the assistance of the edge management and control device, except
- the edge management and control equipment also has a certain ability of self-operation, maintenance, and control.
- the edge cloud node can be independently operated and maintained to achieve Two-level operation and maintenance management and control can more fully and comprehensively control the operation and maintenance of edge cloud nodes, and provide conditions for "putting cloud computing in edge cloud nodes closer to the terminal for processing", and then can use edge cloud nodes
- the resources to provide users with cloud computing services are conducive to reducing response delays, reducing the pressure on the central cloud or traditional cloud computing platforms, and reducing bandwidth costs.
- the resources, mirroring, instances, operation and maintenance of edge cloud nodes are uniformly controlled based on centralized management and control, and the edge cloud nodes can be managed and coordinated to the greatest extent. , It can reduce errors caused by single-point self-control or unsynchronized information of the entire network, and can use the characteristics of centralized management to achieve the optimization of resource scheduling, avoiding the waste of local resources at the edge.
- the embodiments of the present application provide example management and control methods from the perspective of central management and control equipment, which are described in detail below.
- Fig. 2a is a schematic flowchart of an example management and control method provided by an exemplary embodiment of this application. As shown in Figure 2a, the method includes:
- the network system includes at least one edge cloud node, at least one instance is deployed in the at least one edge cloud node, and at least one instance can provide cloud computing services for service demanders.
- the central management and control device determines at least one instance in the edge cloud node, and controls the at least one instance so that the at least one instance provides cloud computing services for the service demander.
- the service demander here may be any device, application, system or another service that needs to use the cloud computing service provided by the instance in the edge cloud node. Taking the system as an example, the service demander can be, but not limited to: online video systems, risk management systems, customer information management systems, or data distribution systems.
- the central management and control device may perform various management and control on at least one instance, for example, it may include at least one of upgrade, migration, shutdown, restart, and release, but is not limited thereto.
- the process for the central control equipment to upgrade and control an instance includes the following steps:
- the central control device can determine the instance to be upgraded from at least one instance, and there can be one or more instances to be upgraded; send an upgrade request to the service demander, so that the service demander can determine the instance to be upgraded based on the business situation of the instance to be upgraded Upgrade strategy.
- the upgrade request carries the identification information of the instance to be upgraded, such as the ID and name of the instance to be upgraded. It can also be the ID and name of the service corresponding to the instance to be upgraded. It can also be the ID, name and other information of the image corresponding to the instance to be upgraded. .
- the service demander can determine the instance to be upgraded according to the upgrade request, and combine the business conditions on the instance to be upgraded, such as the business request and response status on the instance to be upgraded, to determine whether the instance to be upgraded is suitable for upgrade. What time is suitable for upgrading, what method to use for upgrading, etc., and then generate an upgrade strategy for the instance to be upgraded and return it to the central control device.
- the central control equipment receives the upgrade strategy sent by the service demander, and upgrades the instance to be upgraded according to the upgrade strategy.
- the service demander can combine the business conditions on the instance to be upgraded, such as the number of business requests that have been received and not yet completed (referred to as inventory business requests), and whether there are any new business requests ( Incremental service request), etc., to determine when the instance to be upgraded can be upgraded, that is, the upgrade strategy can include the upgrade time.
- the central control device can start to upgrade the instance to be upgraded at the upgrade time specified in the upgrade policy.
- the upgrade strategy may include an upgrade method. Based on this, the central control device may use the upgrade method specified in the upgrade strategy to upgrade the instance to be upgraded.
- the upgrade strategy may include an upgrade time and an upgrade method, and the central management and control device may adopt the upgrade method specified in the upgrade strategy, and upgrade the instance to be upgraded at the upgrade time specified in the upgrade strategy.
- the upgrade strategy may also include information such as whether to upgrade, and in the case of upgrade, it further includes the upgrade time and/or the upgrade method.
- upgrading the instance can be initiated by the central control device.
- the central control device can monitor the version information of the mirror corresponding to each instance, and when a new version of the mirror is found, it can determine that the instance corresponding to the new version of the mirror needs to be upgraded; or, it can also monitor the running status of each instance, Life cycle and other information.
- problems such as vulnerabilities, instability, insufficiency, excessive consumption of CPU or memory resources are found during the running of an instance, it can be determined that the instance with these problems needs to be upgraded.
- upgrading the instance may also be initiated by the service demander.
- the service demander can send upgrade description information to the central management and control device, and the upgrade description information includes instance filter conditions.
- step 21b includes: receiving upgrade description information sent by the service demander; and determining the instance to be upgraded from at least one instance according to the instance filter condition.
- upgrading the instance to be upgraded mainly refers to: shutting down the instance to be upgraded, updating the instance to be upgraded according to the mirror image of the corresponding version (generally, the new version), and restarting the instance after the update.
- the image version required for upgrading the instance to be upgraded can be determined by the central management and control device.
- the latest version of the corresponding image can be used as the image version required for the upgrade, or it can be specified by the service demander.
- the service demander can carry the image version required for the upgrade in the upgrade description information and provide it to the central management and control device.
- the upgrade description information can include "Upgrade from mirror version A to mirror version B for all or specified instances. "And other information.
- upgrading the instance to be upgraded according to the upgrade strategy includes: according to the upgrade strategy, using the mirror corresponding to the mirror version to upgrade the instance to be upgraded.
- using the mirror corresponding to the mirror version to upgrade the instance to be upgraded can be: sending the upgrade strategy and the mirror corresponding to the mirror version to the edge management and control equipment in the network system , So that the edge management and control device uses the mirror corresponding to the mirror version to upgrade the instance to be upgraded according to the upgrade strategy.
- the process of the central management and control device for instance migration management and control includes the following steps:
- step 22c Determine that the first edge cloud node meets the intra-node migration condition; if the judgment result is yes, that is, the first edge cloud node meets the intra-node migration condition, go to step 23c; if the judgment result is no, the first edge cloud node does not meet the For intra-node migration conditions, go to step 24c.
- the instance to be migrated is migrated within the edge cloud node.
- the instance needs to be migrated. For example, in the case that the entire edge cloud node is faulty or unavailable, the instances in the edge cloud node need to be migrated to other edge cloud nodes. For another example, in the case of a failure or downtime of a physical machine hosting an instance, the instance on the physical machine needs to be migrated to another physical machine. For another example, it may be necessary to migrate one or some instances from one edge cloud node to other edge cloud nodes due to business needs. For another example, when resources need to be merged, one or some instances need to be migrated.
- instances in the edge cloud node can be migrated.
- the central control device determines the instance to be migrated from at least one instance. There may be one or more instances to be migrated; if there are multiple instances to be migrated, the multiple instances to be migrated can be deployed in the same edge cloud node or in different edge cloud nodes.
- the central management and control device may monitor the state of at least one instance deployed in at least one edge cloud node, and obtain a failed instance and/or an instance in which a specified event occurs during operation as an instance to be migrated according to the state of at least one instance.
- a failed instance refers to an instance that cannot operate normally, for example, it can be an instance on a physical machine where the downtime occurs, or an instance that itself is down.
- the designated event mainly refers to some events that the instance can still run normally after occurrence, which can be flexibly set according to application requirements, and there is no restriction on this.
- the specified event can be some early warning or alarm events, etc.
- the instance does not produce actual problems and can still run (that is, no failure), but there are hidden dangers of failure and can be migrated before failure.
- the central control equipment maintains the information of each edge cloud node and the information of each instance deployed in each edge cloud node. Based on this, the edge cloud node to which the instance to be migrated belongs can be determined. For ease of description and distinction, the instance to be migrated The edge cloud node to which it belongs before the migration is recorded as the first edge cloud node.
- the central management and control device may determine the instance to be migrated from at least one instance according to resource merging requirements, and then migrate the instance to be migrated.
- resource merging is mainly the process of integrating resource fragments through instance migration. After integration, the resource fragments in edge cloud nodes will be reduced or even nonexistent, which is conducive to improving resource utilization in edge cloud nodes.
- resource merging requirements can be system-level or node-level.
- System-level resource merging refers to the integration of resource fragments in the entire network system from the perspective of the entire network system through instance migration; node-level resource merging refers to the perspective of edge cloud nodes and the use of instance migration The resource fragments in the node are integrated.
- the resource consolidation requirement may be provided by the service demander.
- the service demander needs to deploy a new instance, if the available resources on each resource device in the edge cloud node it serves are not enough to carry the new instance, the instance in the edge cloud node can be migrated to implement resources Integration to provide sufficient resources for new instances.
- the resource consolidation requirement can also be the regular behavior of the central control equipment. For example, the central management and control equipment regularly performs resource fragmentation checks. When the fragmentation rate reaches a certain threshold and instance migration can be performed, the resource fragmentation in each edge cloud node is integrated to improve resource utilization in the edge cloud node.
- the resource consolidation requirements contain information related to resource consolidation.
- the resource merging requirements may include information about instances that need to be migrated to achieve the purpose of resource merging. Based on this, the instances to be migrated can be directly determined according to the resource merging requirements.
- the resource merging requirements may include information about edge cloud nodes that need to be merged. Based on this, the edge cloud node that needs to be merged can be determined according to the resource merging requirements.
- the edge cloud node that needs to be merged is called the first edge cloud node; in turn, the resources in the first edge cloud node can be combined The remaining available resources on the device and the resources required by each instance in the first edge cloud node determine the instance to be migrated.
- the central control device can determine whether the first edge cloud node to which the instance to be migrated belongs meets the intra-node migration conditions; if the first edge cloud node meets the intra-node migration conditions, it will be treated
- the migration instance performs intra-edge cloud node migration; if the first edge cloud node does not meet the intra-node migration condition, the migration instance to be migrated is migrated across edge cloud nodes.
- the central management and control device may determine whether the first edge cloud node is currently available; if the first edge cloud node is currently available, determine whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated; The available resources of the edge cloud node are sufficient to carry the instance to be migrated, and it is determined that the first edge cloud node meets the migration conditions within the node; if the first edge cloud node is currently in an unavailable state, or the available resources of the first edge cloud node are insufficient to carry the instance to be migrated For example, it is determined that the first edge cloud node does not meet the intra-node migration condition.
- the migration of instances is divided into two types: intra-node migration and cross-node migration.
- the available resources of the first edge cloud node mainly refer to the available resources on each resource device in the first edge cloud node; accordingly, judging whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated mainly refers to judging Whether there is a resource device with sufficient resources available in the first edge cloud node to carry the instance to be migrated.
- the instance migration for resource merging is mainly intra-node migration, and of course, it can also be cross-node migration.
- the instance to be migrated may also be determined A resource device, where the resource device is a resource device whose remaining available resources in the first edge cloud node can carry the instance to be migrated.
- the resource device is a resource device whose remaining available resources in the first edge cloud node can carry the instance to be migrated.
- cross-node migration can be performed for the instance to be migrated.
- resource merging in the process of cross-node migration for the instances to be migrated, priority is given to migrating the instances to be migrated to other edge cloud nodes that have been used and the remaining available resources can carry the resource devices of the instances to be migrated; Further, in the case that there are multiple resources that have been used and remaining available resources can carry the resource equipment of the instance to be migrated, the principle of minimum resource fragmentation can be used to select the matching degree between the remaining available resources and the resources required by the instance to be migrated. Higher resource equipment, try to produce less resource fragments or no resource fragments.
- the continuity of the cloud computing service provided by the instance can be ensured through the hot migration technology.
- the hot migration technology please refer to the prior art, which will not be repeated here.
- the central control device can select a second edge cloud node from at least one edge cloud node, the second edge cloud node is different from the first edge cloud node, and the available resources in the second edge cloud node are sufficient to carry the instance to be migrated , That is, sufficient resources; migrate the instance to be migrated to the second edge cloud node, and send the attribute information of the instance to be migrated in the second edge cloud node to the service demander, so that the service demander can target The instance to be migrated performs business scheduling.
- the attribute information of the instance to be migrated in the second edge cloud node means that after the instance to be migrated is migrated to the second edge cloud node, an external (for example, a service demander or a third party authorized by the service demander) conducts an operation on the instance to be migrated.
- Information required for service scheduling may include, but is not limited to, for example, information such as the area where the second edge cloud node is located, operator information, and/or public network IP.
- the following methods can be used but not limited to:
- Method 1 According to the distance between other edge cloud nodes and the first edge cloud node, select the edge cloud node whose distance from the first edge cloud node is less than the set distance threshold, or select the closest distance to the first edge cloud node An edge cloud node, or an edge cloud node arbitrarily selected from the N edge cloud nodes closest to the first edge cloud node as the second edge cloud node.
- the second edge cloud node is closest or relatively close to the first edge cloud node, which can save data transmission time and help improve migration efficiency.
- Method 2 You can select edge cloud nodes with relatively sufficient bandwidth resources according to the bandwidth resources of other edge cloud nodes. For example, select the edge cloud node with the largest bandwidth resource, or select the bandwidth resource greater than the set bandwidth threshold, or select the bandwidth utilization rate lower The edge cloud node serves as the second edge cloud node. In method 2, the bandwidth resources of the second edge cloud node are sufficient, which can increase the data transmission rate and help improve migration efficiency.
- Method 3 According to the current load situation of other edge cloud nodes, select the edge cloud node with relatively light load, for example, select the edge cloud node with the smallest load, or select the edge cloud node with the load less than the set load threshold as the second Edge cloud node.
- the load of the second edge cloud node is lighter, it has sufficient resources and can handle instance migration in time, which is beneficial to improve migration efficiency.
- the central management and control device may reserve or allocate resources for the instance to be migrated in the second edge cloud node according to the resource requirements of the instance to be migrated; After the reservation or allocation is successful, the instance to be migrated is migrated to the resources reserved or allocated in the second edge cloud node.
- the resource requirements of the instances to be migrated can be combined to determine the type of resources, the amount of resources and/or the performance requirements of the resource equipment required by the instances to be migrated, and resource reservation or allocation can be performed in the second edge cloud node based on this information , Which can provide resource guarantee for successful instance migration.
- the central control device can also notify the service demander of the migration event, and the service demander can make appropriate response actions, such as updating the The information of the instance in the service demander, or the disaster recovery response to the downtime during the instance migration.
- the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
- the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
- the central control device can also send a migration request to the service demander for the service demander to combine
- the business situation on the instance to be migrated determines the migration strategy for the instance to be migrated; the migration strategy sent by the service demander is received, and the instance to be migrated is migrated to the second edge cloud node according to the migration strategy.
- the migration strategy mainly includes at least one information of whether to migrate, migration time, and migration mode.
- the central management and control device may send the attribute information of the instance to be migrated in the second edge cloud node together with the migration request to the service demander.
- the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
- the instance to be migrated is an instance that has a specified event but can still run normally, during the migration process, the instance to be migrated continues to run on the first edge cloud node, so that the business during the migration process can continue to be scheduled to Ensure business continuity on the instances to be migrated in the first edge cloud node.
- the central management and control device can release the instance to be migrated in the first edge cloud node.
- the service requester no may send a release notice to the central control device ;
- the central control device receives the release notification sent by the service demander, and releases the instance to be migrated running in the first edge cloud node according to the release notification.
- the central management and control device may also synchronize the running state of the instance to be migrated running in the first edge cloud node to the instance to be migrated in the second edge cloud node.
- migrating the instance to be migrated to the second edge cloud node is mainly to control the corresponding resource equipment in the second edge cloud node to reserve or allocate according to the mirror or snapshot corresponding to the instance to be migrated The process of creating an instance to be migrated on the resource.
- the central management and control device may determine the scheduled resource information in the second edge cloud node according to the resource requirements of the instance to be migrated, and send the resource information to the edge management and control device, and The edge management and control device controls the corresponding resource device in the second edge cloud node to reserve or allocate resources for the instance to be migrated according to the resource information. Then, the central management and control device can send a migration instruction to the edge management and control device.
- the migration instruction instructs the edge management and control device to obtain the image or instance snapshot corresponding to the instance to be migrated and provide it to the corresponding resource device in the second edge cloud node for the second edge cloud node According to the image or instance snapshot, the corresponding resource device creates an instance to be migrated on the reserved or allocated resources.
- the central management and control device may send a migration instruction to the edge management and control device in the second edge cloud node to instruct the edge management and control device in the second edge cloud node to obtain the instance to be migrated
- the corresponding image or snapshot is provided to the corresponding resource device in the second edge cloud node for the corresponding resource device in the second edge cloud node to create an instance to be migrated on the reserved or allocated resources according to the image or snapshot.
- the instance in the edge cloud node can provide cloud computing services to the service demander, achieving the purpose of providing services to users by using the resources in the edge cloud node, so that " It has become a reality to place cloud computing in edge cloud nodes closer to the terminal, which will help reduce response delays, reduce the pressure on the central cloud or traditional cloud computing platforms corresponding to edge cloud nodes, and reduce bandwidth costs.
- FIG. 3 is a schematic structural diagram of a central management and control device provided by an exemplary embodiment of this application. As shown in FIG. 3, the central management and control device includes: a memory 31 and a processor 32.
- the memory 31 is used to store computer programs, and can be configured to store various other data to support operations on the central control device. Examples of these data include instructions, messages, pictures, videos, etc. used to operate any application or method on the central control device.
- the processor 32 is coupled with the memory 31 and is configured to execute the computer program in the memory 31 to determine at least one instance deployed in at least one edge cloud node in the network system, and the at least one instance can provide the cloud for the service demander Computing services: At least one instance is managed and controlled so that at least one instance provides cloud computing services for the service demander.
- the management and control of at least one instance includes: at least one of upgrade, migration, shutdown, restart, and release.
- the central management and control device further includes: a communication component 33.
- the processor 32 upgrades at least one instance, it is specifically configured to: determine the instance to be upgraded from the at least one instance; send an upgrade request to the service demander through the communication component 33, so that the service demander can combine the instance to be upgraded
- the above business situation determines the upgrade strategy for the instance to be upgraded; the communication component 33 receives the upgrade strategy returned by the service demander, and upgrades the instance to be upgraded according to the upgrade strategy.
- the processor 32 determines the instance to be upgraded from at least one instance, it is specifically configured to: receive the upgrade description information sent by the service demander through the communication component 33, where the upgrade description information includes instance filter conditions; according to the instance filter conditions, from at least In one instance, the instance to be upgraded is determined.
- the upgrade description information also includes: the image version required for the upgrade. Then, when the processor 32 upgrades the instance to be upgraded according to the upgrade strategy, it is specifically configured to: according to the upgrade strategy, use the mirror corresponding to the mirror version to upgrade the instance to be upgraded.
- the processor 32 uses the mirror corresponding to the mirror version to upgrade the instance to be upgraded according to the upgrade strategy, it is specifically used to: send the mirror corresponding to the upgrade strategy and the mirror version to the edge management and control device in the network system for the edge
- the control equipment uses the mirror corresponding to the mirror version to upgrade the instance to be upgraded.
- the upgrade strategy includes but is not limited to: at least one piece of information in whether to upgrade, upgrade time, and upgrade method.
- the processor 32 migrates at least one instance, it is specifically configured to: determine the instance to be migrated from the at least one instance, and the edge cloud node to which the instance to be migrated belongs is recorded as the first edge cloud node; If the first edge cloud node meets the intra-node migration condition, the instance to be migrated is migrated within the edge cloud node; if the first edge cloud node does not meet the intra-node migration condition, the instance to be migrated is migrated across edge cloud nodes.
- the processor 32 determines the instance to be migrated from the at least one instance, it is specifically configured to: according to the state of the at least one instance, use the failed instance and/or the instance in which a specified event occurs during operation as the instance to be migrated .
- the processor 32 determines the instance to be migrated from the at least one instance, it is specifically configured to: determine the instance to be migrated from the at least one instance according to resource merging requirements.
- the processor 32 determines the instance to be migrated according to the resource merging demand, it is specifically configured to: determine the first edge cloud node that needs resource merging according to the resource merging demand; combine the remaining resources on each resource device in the first edge cloud node The available resources of and the resources required by each instance in the first edge cloud node determine the instance to be migrated.
- the processor 32 is further configured to: determine whether the first edge cloud node is in an available state; if the first edge cloud node is in an available state, determine whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated; If the available resources of an edge cloud node are sufficient to carry the instance to be migrated, it is determined that the first edge cloud node meets the migration conditions within the node; if the first edge cloud node is in an unavailable state, or the available resources of the first edge cloud node are insufficient to carry the instance For the migration instance, it is determined that the first edge cloud node does not meet the intra-node migration condition.
- the processor 32 is specifically configured to select a second edge cloud node from at least one edge cloud node when the instance to be migrated is migrated across edge cloud nodes, where the second edge cloud node is different from the first edge cloud node; Migrate the instance to be migrated to the second edge cloud node, and send the attribute information of the instance to be migrated in the second edge cloud node to the service demander, so that the service demander can perform business scheduling for the instance to be migrated based on the attribute information .
- the instance to be migrated is an instance in which a specified event occurs during operation, that is, an example in which a specified event occurs but can still run normally
- the processor 32 migrates the instance to be migrated to the second edge cloud node, It is specifically used for: sending a migration request to the service demander through the communication component 33, so that the service demander can determine the migration strategy for the instance to be migrated in combination with the business situation on the instance to be migrated; receiving the migration strategy sent by the service demander through the communication component 33, According to the migration strategy, the instance to be migrated is migrated to the second edge cloud node.
- the processor 32 migrates the instance to be migrated to the second edge cloud node, it is specifically configured to: according to the resource requirements of the instance to be migrated, control the corresponding resource device in the second edge cloud node to perform resources for the instance to be migrated Reservation or allocation: After the resource reservation or allocation is successful, the instance to be migrated is migrated to the resource reserved or allocated by the corresponding resource device in the second edge cloud node.
- the processor 32 migrates the instance to be migrated to the resource reserved or allocated by the corresponding resource device in the second edge cloud node, it is specifically configured to: control the corresponding resource device in the second edge cloud node according to the instance to be migrated The corresponding image or snapshot creates an instance to be migrated on the reserved or allocated resources.
- the processor 32 is specifically configured to send a migration instruction to the edge management and control device in the network system, and the migration instruction instructs the edge management and control device to obtain the image or instance snapshot corresponding to the instance to be migrated and provide it to the second
- the corresponding resource device in the edge cloud node allows the corresponding resource device to create the instance to be migrated on the reserved or allocated resources.
- the processor 32 is further configured to: receive a release notification sent by the service demander through the communication component 33, and release the instance to be migrated running in the first edge cloud node according to the release notification; wherein, during the migration process , The instance to be migrated continues to run in the first edge cloud node; wherein the release notification is sent after the service demander determines that there is no longer any service request on the instance to be migrated running in the first edge cloud node.
- the central management and control device further includes: a display 34, a power supply component 35, an audio component 36 and other components. Only some of the components are schematically shown in FIG. 3, which does not mean that the central control equipment only includes the components shown in FIG. In addition, the components in the dashed box in Figure 3 are optional components, which may be determined by the implementation of the central control equipment. If the central management and control device is a server-shaped device, it may optionally not include the display 34 and the audio component 36; if the central management and control device is a terminal device-type device, it may optionally include the display 34 and the audio component 36.
- an embodiment of the present application also provides a computer-readable storage medium storing a computer program.
- the computer program is executed by one or more processors, the one or more processors can implement the above method in the above-mentioned method embodiments. Steps or operations performed by the equipment.
- the communication component in FIG. 3 is configured to facilitate wired or wireless communication between the device where the communication component is located and other devices.
- the device where the communication component is located can access a wireless network based on communication standards, such as WiFi, 2G or 3G, or a combination of them.
- the communication component receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel.
- the communication component may further include a near field communication (NFC) module, radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, and Bluetooth (BT) technology Wait.
- NFC near field communication
- RFID radio frequency identification
- IrDA infrared data association
- UWB ultra-wideband
- BT Bluetooth
- the display in FIG. 3 described above includes a screen, and the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user.
- the touch panel includes one or more touch sensors to sense touch, sliding, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure related to the touch or slide operation.
- the power supply components in Figure 3 above provide power for various components of the equipment where the power supply components are located.
- the power supply component may include a power management system, one or more power supplies, and other components associated with the generation, management, and distribution of power for the equipment where the power supply component is located.
- the audio component in FIG. 3 may be configured to output and/or input audio signals.
- the audio component includes a microphone (MIC).
- the microphone When the device where the audio component is located is in an operating mode, such as call mode, recording mode, and voice recognition mode, the microphone is configured to receive external audio signals.
- the received audio signal can be further stored in a memory or sent via a communication component.
- the audio component further includes a speaker for outputting audio signals.
- the embodiments of the present invention may be provided as methods, systems, or computer program products. Therefore, the present invention may adopt the form of a complete hardware embodiment, a complete software embodiment, or an embodiment combining software and hardware. Moreover, the present invention may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program codes.
- a computer-usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.
- These computer program instructions can also be stored in a computer-readable memory that can guide a computer or other programmable data processing equipment to work in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including the instruction device.
- the device implements the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
- These computer program instructions can also be loaded on a computer or other programmable data processing equipment, so that a series of operation steps are executed on the computer or other programmable equipment to produce computer-implemented processing, so as to execute on the computer or other programmable equipment.
- the instructions provide steps for implementing functions specified in a flow or multiple flows in the flowchart and/or a block or multiple blocks in the block diagram.
- the computing device includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.
- processors CPU
- input/output interfaces network interfaces
- memory volatile and non-volatile memory
- the memory may include non-permanent memory in computer readable media, random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of computer readable media.
- RAM random access memory
- ROM read-only memory
- flash RAM flash memory
- Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology.
- the information can be computer-readable instructions, data structures, program modules, or other data.
- Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices. According to the definition in this article, computer-readable media does not include transitory media, such as modulated data signals and carrier waves.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Computer Hardware Design (AREA)
- General Engineering & Computer Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Stored Programmes (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Embodiments of the present application provide a network system, an instance management method, a device, and a storage medium. In the embodiments of the application, the concept of edge computing is used to place cloud computing capabilities closer to a terminal at an edge location, thereby providing a network system comprising edge cloud nodes. In the network system, instances capable of providing a cloud computing service to a service demand side are deployed in an edge cloud node, and can provide such services to a service demand side under the control of a central control device, thereby achieving the purpose of providing services to users using resources located in edge cloud nodes. The invention thus realizes "relocating cloud computing processing closer to a terminal by processing in an edge cloud node," and helps reduce response delays and bandwidth costs.
Description
本申请要求2019年04月08日递交的申请号为201910277465.4、发明名称为“网络系统、实例管控方法、设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed on April 8, 2019 with the application number 201910277465.4 and the invention title "network system, instance control method, equipment and storage medium", the entire content of which is incorporated into this application by reference .
本申请涉及计算机技术领域,尤其涉及一种网络系统、实例管控方法、设备及存储介质。This application relates to the field of computer technology, and in particular to a network system, instance management and control method, equipment, and storage medium.
目前,对云计算的概念都是基于集中式的资源管控来提出的,即使采用多个数据中心互联互通形式,依然将所有的软硬件资源视为统一的资源进行管理,调度和售卖。随着5G、物联网时代的到来以及云计算应用的逐渐增加,终端侧对云资源在时延、带宽等性能上的要求越来越高,集中式的云网络已经无法满足终端侧日渐增高的云资源需求。At present, the concept of cloud computing is based on centralized resource management and control. Even if multiple data centers are used for interconnection, all software and hardware resources are still treated as unified resources for management, scheduling, and sales. With the advent of the era of 5G and the Internet of Things and the gradual increase of cloud computing applications, the terminal side has higher and higher requirements for cloud resources in terms of latency and bandwidth. The centralized cloud network can no longer meet the increasing demand on the terminal side. Cloud resource requirements.
发明内容Summary of the invention
本申请的多个方面提供一种网络系统、实例管控方法、设备及存储介质,用以降低服务的响应时延,降低带宽成本。Various aspects of the present application provide a network system, instance management and control method, device, and storage medium to reduce service response delay and bandwidth cost.
本申请实施例提供一种实例管控方法,包括:确定部署于网络系统中至少一个边缘云节点中的至少一个实例,所述至少一个实例可为服务需求方提供云计算服务;对所述至少一个实例进行管控,以供所述至少一个实例为所述服务需求方提供云计算服务。The embodiment of the present application provides an instance management and control method, including: determining at least one instance deployed in at least one edge cloud node in a network system, the at least one instance can provide cloud computing services for the service demander; The instance is managed, so that the at least one instance provides cloud computing services for the service demander.
本申请实施例还提供一种网络系统,包括:中心管控设备,以及至少一个边缘云节点;所述至少一个边缘云节点中部署有至少一个实例,所述至少一个实例可为服务需求方提供云计算服务;所述中心管控设备,用于对所述至少一个实例进行管控,以供所述至少一个实例为所述服务需求方提供云计算服务。An embodiment of the present application also provides a network system, including: a central management and control device, and at least one edge cloud node; at least one instance is deployed in the at least one edge cloud node, and the at least one instance can provide a cloud for service demanders Computing services; the central management and control device is used to manage and control the at least one instance, so that the at least one instance provides cloud computing services for the service demander.
本申请实施例还提供一种中心管控设备,包括:存储器和处理器;所述存储器,用于存储计算机程序;当所述计算机程序被所述处理器执行时,致使所述处理器实现本申请实施例提供的实例管控方法中的步骤。An embodiment of the present application also provides a central management and control device, including: a memory and a processor; the memory is used to store a computer program; when the computer program is executed by the processor, the processor is caused to implement the application The steps in the example management method provided in the embodiment.
本申请实施例还提供一种存储有计算机程序的计算机可读存储介质,当所述计算机程序被一个或多个处理器执行时,致使所述一个或多个处理器实现本申请实施例提供的 实例管控方法中的步骤。The embodiment of the present application also provides a computer-readable storage medium storing a computer program. When the computer program is executed by one or more processors, the one or more processors are caused to implement the Examples of steps in the control method.
在本申请实施例中,结合边缘计算的概念,考虑将云计算的能力放到距离终端更近的边缘侧,于是提供一种包括边缘云节点的网络系统,在该网络系统中,边缘云节点中部署有提供云计算服务的实例,在中心管控设备的管控下,这些实例可以提供云计算服务,达到了借助边缘云节点中的资源为用户提供服务的目的,使得“将云计算放到距离终端更近的边缘云节点中处理”成为现实,有利于降低服务的响应时延,降低带宽成本。In the embodiments of the present application, in combination with the concept of edge computing, the ability of cloud computing is considered to be placed on the edge side closer to the terminal, so a network system including edge cloud nodes is provided. In the network system, the edge cloud node Instances that provide cloud computing services are deployed in the central control equipment. Under the control of the central control equipment, these instances can provide cloud computing services. This achieves the purpose of providing services to users with the help of resources in edge cloud nodes, so that "put cloud computing to a distance "Processing in edge cloud nodes closer to the terminal" has become a reality, which is conducive to reducing service response delay and bandwidth costs.
此处所说明的附图用来提供对本申请的进一步理解,构成本申请的一部分,本申请的示意性实施例及其说明用于解释本申请,并不构成对本申请的不当限定。在附图中:The drawings described here are used to provide a further understanding of the application and constitute a part of the application. The exemplary embodiments and descriptions of the application are used to explain the application and do not constitute an improper limitation of the application. In the attached picture:
图1a为本申请示例性实施例提供的一种网络系统的结构示意图;Fig. 1a is a schematic structural diagram of a network system provided by an exemplary embodiment of this application;
图1b为本申请示例性实施例提供的中心管控设备与边缘管控设备的一种结构示意图;FIG. 1b is a schematic structural diagram of a central management and control device and an edge management and control device provided by an exemplary embodiment of this application;
图1c为本申请示例性实施例提供的另一种网络系统的结构示意图;FIG. 1c is a schematic structural diagram of another network system provided by an exemplary embodiment of this application;
图2a为本申请示例性实施例提供的一种实例管控方法的流程示意图;FIG. 2a is a schematic flowchart of an example management and control method provided by an exemplary embodiment of this application;
图2b为本申请示例性实施例提供的一种实例升级方法的流程示意图;FIG. 2b is a schematic flowchart of an example upgrade method provided by an exemplary embodiment of this application;
图2c为本申请示例性实施例提供的一种实例迁移方法的流程示意图;FIG. 2c is a schematic flowchart of an example migration method provided by an exemplary embodiment of this application;
图3为本申请示例性实施例提供的一种中心管控设备的结构示意图。FIG. 3 is a schematic structural diagram of a central management and control device provided by an exemplary embodiment of this application.
为使本申请的目的、技术方案和优点更加清楚,下面将结合本申请具体实施例及相应的附图对本申请技术方案进行清楚、完整地描述。显然,所描述的实施例仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purpose, technical solutions, and advantages of the present application clearer, the technical solutions of the present application will be described clearly and completely in conjunction with specific embodiments of the present application and the corresponding drawings. Obviously, the described embodiments are only a part of the embodiments of the present application, rather than all the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
针对现有集中式的云网络已经无法满足终端日渐增高的云资源需求的技术问题,在本申请一些实施例中,结合边缘计算的概念,考虑将云计算的能力放到距离终端更近的边缘侧,于是提供一种包括边缘云节点的网络系统,在该网络系统中,边缘云节点中部署有提供云计算服务的实例,在中心管控设备的管控下,这些实例可以提供云计算服务,达到了借助边缘云节点中的资源为用户提供服务的目的,使得“将云计算放到距离终端更近的边缘云节点中处理”成为现实,有利于降低服务的响应时延,降低带宽成本。In view of the technical problem that the existing centralized cloud network can no longer meet the terminal's increasing demand for cloud resources, in some embodiments of this application, in combination with the concept of edge computing, it is considered to place the cloud computing capability on the edge closer to the terminal Therefore, a network system including edge cloud nodes is provided. In this network system, instances that provide cloud computing services are deployed in the edge cloud nodes. Under the control of the central control device, these instances can provide cloud computing services to achieve With the purpose of providing services to users with the help of resources in edge cloud nodes, it becomes a reality to "place cloud computing in edge cloud nodes closer to the terminal for processing", which is conducive to reducing service response delays and reducing bandwidth costs.
以下结合附图,详细说明本申请各实施例提供的技术方案。The technical solutions provided by the embodiments of the present application will be described in detail below with reference to the accompanying drawings.
图1a为本申请示例性实施例提供的一种网络系统的结构示意图。如图1a所示,该网络系统100包括:中心管控设备101和至少一个边缘云节点102;至少一个边缘云节点102均与中心管控设备101网络连接。Fig. 1a is a schematic structural diagram of a network system provided by an exemplary embodiment of this application. As shown in FIG. 1a, the network system 100 includes: a central management and control device 101 and at least one edge cloud node 102; at least one edge cloud node 102 is connected to the central management and control device 101 in a network.
本实施例的网络系统100是基于云计算技术和边缘计算的能力,构筑在边缘基础设施之上的云计算平台,是一种边缘位置的具备计算、网络、存储以及安全等能力的云平台。The network system 100 in this embodiment is a cloud computing platform built on edge infrastructure based on cloud computing technology and edge computing capabilities, and is a cloud platform with computing, network, storage, and security capabilities at the edge.
与中心云或者传统的云计算平台相对应,本实施例的网络系统100可以视为一种边缘云网络系统。边缘云是个相对概念,边缘云是指相对靠近终端的云计算平台,或者说,与中心云或者传统的云计算平台相区别,中心云或者传统的云计算平台可以包括资源规模化且位置集中的数据中心,而边缘云节点覆盖的网络范围更广泛,也因此具备距离终端更近的特性,单个边缘云节点的资源规模较小,但是边缘云节点的数量多,多个边缘云节点构成了本实施例中边缘云的组成部分。本实施例的终端是指云计算服务的需求端,例如可以是互联网中的终端或者用户端,或者物联网中的终端或用户端。边缘云网络是基于中心云或者传统的云计算系统与终端之间的基础设施构建的网络。其中,网络系统100包括至少一个边缘云节点102,每个边缘云节点102包括一系列的边缘基础设施,这些边缘基础设施包括但不限于:分布式数据中心(DC)、无线机房或集群,运营商的通信网络、核心网设备、基站、边缘网关、家庭网关、计算设备和/或存储设备等边缘设备及对应的网络环境等等。在此说明,不同边缘云节点102的位置、能力以及包含的基础设施可以相同,也可以不相同。Corresponding to a central cloud or a traditional cloud computing platform, the network system 100 of this embodiment can be regarded as an edge cloud network system. Edge cloud is a relative concept. Edge cloud refers to a cloud computing platform that is relatively close to the terminal. In other words, it is different from central cloud or traditional cloud computing platform. Central cloud or traditional cloud computing platform can include large-scale resources and centralized locations. Data centers, and edge cloud nodes cover a wider network range, and therefore have the characteristics of being closer to the terminal. The resource scale of a single edge cloud node is small, but the number of edge cloud nodes is large, and multiple edge cloud nodes constitute the original Part of the edge cloud in the embodiment. The terminal in this embodiment refers to the demand side of cloud computing services, for example, it may be a terminal or a user side in the Internet, or a terminal or a user side in the Internet of Things. The edge cloud network is a network based on the infrastructure between the central cloud or the traditional cloud computing system and the terminal. Wherein, the network system 100 includes at least one edge cloud node 102, and each edge cloud node 102 includes a series of edge infrastructures. These edge infrastructures include, but are not limited to: distributed data centers (DC), wireless computer rooms or clusters, operations Communication networks, core network equipment, base stations, edge gateways, home gateways, computing devices and/or storage devices and other edge devices and corresponding network environments. It is explained here that the locations, capabilities, and included infrastructure of different edge cloud nodes 102 may be the same or different.
其中,本实施例的网络系统100与中心云或传统的云计算平台等中心网络、终端结合可形成“云边端三体协同”的网络架构,在该网络架构中,可以将网络转发、存储、计算和/或智能化数据分析等任务放在网络系统100中的各边缘云节点102中处理,由于各边缘云节点102更靠近终端,因此可以降低响应时延,减轻中心云或传统的云计算平台的压力,降低带宽成本。Among them, the network system 100 of this embodiment is combined with a central cloud or a traditional cloud computing platform and other central networks and terminals to form a "cloud edge-end three-body coordination" network architecture. In this network architecture, the network can be forwarded and stored. Tasks such as computing and/or intelligent data analysis are processed in each edge cloud node 102 in the network system 100. Since each edge cloud node 102 is closer to the terminal, the response delay can be reduced and the central cloud or traditional cloud The pressure on the computing platform reduces bandwidth costs.
如何合理地调度多个边缘云节点资源,以及如何管控好多个边缘云节点以正确和稳定的逻辑进行云计算服务,是一个重要的挑战。在本实施例的网络系统100中,部署有中心管控设备101,中心管控设备101以边缘云节点102为管控对象,在资源调度,镜像管理,实例管控,运维,网络,安全等各方面对网络系统100中的至少一个边缘云节点102进行统一管控,从而将云计算服务放到各边缘云节点102中处理。在部署实施上, 中心管控设备101可以部署在一个或多个云计算数据中心中,或者,可以部署在一个或多个传统数据中心中,中心管控设备101也可以和其管控的至少一个边缘云节点共同构成边缘云网络,本实施例对此不做限定。How to reasonably schedule the resources of multiple edge cloud nodes and how to manage and control multiple edge cloud nodes to perform cloud computing services with correct and stable logic is an important challenge. In the network system 100 of this embodiment, a central management and control device 101 is deployed. The central management and control device 101 uses the edge cloud node 102 as the management and control object for resource scheduling, image management, instance management and control, operation and maintenance, network, security, etc. At least one edge cloud node 102 in the network system 100 is uniformly managed and controlled, so that cloud computing services are placed in each edge cloud node 102 for processing. In terms of deployment and implementation, the central control device 101 can be deployed in one or more cloud computing data centers, or it can be deployed in one or more traditional data centers, and the central control device 101 can also be connected to at least one edge cloud under its control. The nodes jointly constitute an edge cloud network, which is not limited in this embodiment.
对一个边缘云节点102来说,可以对外提供各种资源,例如CPU、GPU等计算资源,内存、硬盘等存储资源,带宽等网络资源等。另外,边缘云节点102还可以根据镜像创建相应实例,通过实例对外提供各种云计算服务。其中,镜像是在边缘云节点中创建实例所需的基础文件,例如可以是为用户提供云计算服务所需的操作系统、应用、或操作配置等镜像文件,其可以是符合边缘云节点计算部署要求,根据特定的一系列文件按照一定的格式制作成的文件。另外,镜像的形态是多样的,可以是虚拟机(Virtual Machine,VM)镜像文件、容器(Docker)镜像文件或各类型的应用打包文件等,镜像形态可以与云计算服务需要使用的虚拟化技术有关,本实施例对此不做限定。与镜像对应,实例的实现形态可以是虚拟机、容器或应用程序等。For an edge cloud node 102, various resources may be provided externally, such as computing resources such as CPU and GPU, storage resources such as memory and hard disk, and network resources such as bandwidth. In addition, the edge cloud node 102 can also create a corresponding instance based on the image, and provide various cloud computing services externally through the instance. Among them, the image is the basic file needed to create an instance in the edge cloud node. For example, it can be an image file such as an operating system, application, or operation configuration required to provide users with cloud computing services, and it can be in line with edge cloud node computing deployment Requirements, according to a specific series of documents in a certain format made into documents. In addition, there are various forms of images, which can be virtual machine (VM) image files, container (Docker) image files, or various types of application packaging files, etc. The image form can be compatible with the virtualization technology used by cloud computing services. Regarding, this embodiment does not limit this. Corresponding to the image, the implementation form of the instance can be a virtual machine, container, or application.
结合上述,在本实施例中,中心管控设备101可以根据资源需求对至少一个边缘云节点102进行资源调度,也可以根据镜像需求针对至少一个边缘云节点102进行镜像的管理和分发,当然,也可以根据云计算服务需求既对至少一个边缘云节点102进行资源调度,又为至少一个边缘云节点102提供镜像。其中,云计算服务需求包括了资源需求和镜像需求。可选地,中心管控设备101可以对外提供需求提交入口,该需求提交入口可以是web页面、应用页面或命令窗等。该需求提交入口的作用是供需求方向中心管控设备101提交自己的需求描述信息。In combination with the above, in this embodiment, the central management and control device 101 can perform resource scheduling on at least one edge cloud node 102 according to resource requirements, or can perform image management and distribution for at least one edge cloud node 102 according to image requirements. Of course, It is possible to perform resource scheduling on at least one edge cloud node 102 and provide mirroring for at least one edge cloud node 102 according to cloud computing service requirements. Among them, cloud computing service requirements include resource requirements and mirroring requirements. Optionally, the central management and control device 101 may provide a requirement submission portal to the outside, and the requirement submission portal may be a web page, an application page, or a command window. The role of the requirement submission portal is for the requirement to submit its own requirement description information to the central control device 101.
对于资源需求方,可以通过上述需求提交入口向中心管控设备101提交资源需求描述信息,该资源需求描述信息包括:边缘云节点选择参数和资源选择参数;边缘云节点选择参数包括调度域和/或对边缘云节点的性能要求等,资源选择参数包括资源类型、资源数量、以及对资源设备的性能要求等。中心管控设备101可根据资源需求描述信息,对至少一个边缘云节点进行资源调度。可选地,一种资源调度方式包括:中心管控设备101根据资源需求描述信息,从网络系统100的至少一个边缘云节点102中确定被调度的目标边缘云节点以及目标边缘云节点中被调度的资源信息;根据该资源信息控制目标边缘云节点中相应资源设备进行资源分配或预留。For the resource demander, the resource demand description information can be submitted to the central management and control device 101 through the above demand submission entry. The resource demand description information includes: edge cloud node selection parameters and resource selection parameters; edge cloud node selection parameters include scheduling domains and/or For the performance requirements of edge cloud nodes, the resource selection parameters include resource type, resource quantity, and performance requirements for resource equipment. The central management and control device 101 may perform resource scheduling on at least one edge cloud node according to the resource requirement description information. Optionally, a resource scheduling method includes: the central management and control device 101 determines the scheduled target edge cloud node and the scheduled target edge cloud node from at least one edge cloud node 102 of the network system 100 according to resource demand description information Resource information; according to the resource information, the corresponding resource device in the target edge cloud node is controlled to allocate or reserve resources.
对于镜像需求方,可以通过上述需求提交入口向中心管控设备101提交镜像需求描述信息,该镜像需求描述信息可指向需要使用的镜像,可以是镜像本身,也可以是镜像的名称、ID等标识类信息,还可以是一些对云计算服务的功能描述信息,这些信息可以 反映出所需的镜像。中心管控设备101可根据镜像需求描述信息,获取镜像;将镜像提供给网络系统100中需要该镜像的边缘云节点,以供该边缘云节点根据该镜像创建相应实例,由该实例对外提供相应云计算服务。For the mirroring demand side, the mirroring demand description information can be submitted to the central management and control device 101 through the above demand submission entry. The mirroring demand description information can point to the mirror that needs to be used, which can be the mirror itself, or the name, ID and other identification types of the mirror. The information can also be some function description information of the cloud computing service, which can reflect the required image. The central management and control device 101 can obtain the image according to the description information of the image demand; provide the image to the edge cloud node in the network system 100 that needs the image, so that the edge cloud node creates a corresponding instance based on the image, and the instance provides the corresponding cloud externally Computing services.
对云计算服务需求方,可以通过上述需求提交入口向中心管控设备101提交服务需求描述信息,该服务需求描述信息包括资源需求描述信息和镜像需求描述信息。关于资源需求描述信息和镜像需求描述信息可参见前面的描述,在此不再赘述。值得的说明的是,服务需求描述信息中的资源需求描述信息和镜像需求描述信息可以是一并提交,也可以分开提交。中心管控设备101可根据服务需求描述信息,对网络系统100中至少一个边缘云节点102进行资源调度;为至少一个边缘云节点102中被调度的资源提供镜像,以利用至少一个边缘云节点中被调度的资源提供相应云计算服务。For the cloud computing service demander, the service demand description information can be submitted to the central management and control device 101 through the above demand submission portal. The service demand description information includes resource demand description information and mirroring demand description information. For resource requirement description information and mirroring requirement description information, please refer to the previous description, which will not be repeated here. It is worth noting that the resource requirement description information and the mirroring requirement description information in the service requirement description information can be submitted together or separately. The central management and control device 101 can perform resource scheduling on at least one edge cloud node 102 in the network system 100 according to the service demand description information; provide a mirror image of the scheduled resources in the at least one edge cloud node 102 to use the The scheduled resources provide corresponding cloud computing services.
关于上述资源调度和镜像管理与分发的详细过程,可参见下述实施例,在此暂不详述。For the detailed process of the foregoing resource scheduling and image management and distribution, refer to the following embodiments, which will not be described in detail here.
在本实施例中,中心管控设备101不仅可以为至少一个边缘云节点102提供镜像,供边缘云节点102创建相应实例,还可以对至少一个边缘云节点102中的实例进行管控。至少一个边缘云节点102中的实例可以是至少一个,即一个或多个。边缘云节点中的实例可以是根据中心管控设备101提供的镜像创建的,也可以是根据其它镜像创建的,也可以是从其它边缘云节点或其它系统中迁移过来的,对此不做限定。至少一个边缘云节点102中的实例可为服务需求方提供云计算服务,这里的服务需求方可以是任何需要使用边缘云节点中的实例提供的云计算服务的设备、应用、系统或另一服务。以系统为例,服务需求方可以是但不限于:在线视频系统、风险管控系统、客户信息管理系统、数据分发系统等。中心管控设备101可对至少一个边缘云节点102中的至少一个实例进行管控,便于这些实例为服务需求方提供云计算服务。In this embodiment, the central management and control device 101 can not only provide a mirror image for at least one edge cloud node 102 for the edge cloud node 102 to create a corresponding instance, but can also manage and control the instances in at least one edge cloud node 102. There may be at least one instance in the at least one edge cloud node 102, that is, one or more instances. The instances in the edge cloud node may be created based on the image provided by the central management and control device 101, or based on other images, or may be migrated from other edge cloud nodes or other systems, which is not limited. At least one instance in the edge cloud node 102 can provide cloud computing services for the service demander, where the service demander can be any device, application, system, or another service that needs to use the cloud computing service provided by the instance in the edge cloud node . Taking the system as an example, the service demander can be but not limited to: online video system, risk management system, customer information management system, data distribution system, etc. The central management and control device 101 can manage and control at least one instance of at least one edge cloud node 102, so that these instances can provide cloud computing services for service demanders.
其中,中心管控设备101可以对至少一个实例进行各种管控,例如可以包括升级、迁移、关停、重启和释放等中的至少一种,但不限于此。下面将对实例升级和迁移进行详细说明。The central management and control device 101 can perform various management and control on at least one instance, for example, it can include at least one of upgrade, migration, shutdown, restart, and release, but is not limited thereto. The instance upgrade and migration will be described in detail below.
实例升级:Instance upgrade:
在实际应用中,随着业务需求的变化或镜像版本的更新,有可能对镜像或相应实例进行升级。其中,中心管控设备101对实例进行升级管控主要包括:In actual applications, as business requirements change or the mirror version is updated, it is possible to upgrade the mirror or the corresponding instance. Among them, the central management and control equipment 101 performs upgrade management and control of instances mainly including:
中心管控设备101从至少一个实例中确定待升级实例,待升级实例可以是一个或多个;向服务需求方发送升级请求,以供服务需求方结合待升级实例上的业务情况为待升 级实例确定升级策略。该升级请求携带有待升级实例的标识类信息,例如待升级实例的ID、名称等,也可以是待升级实例对应服务的ID、名称等,还可以是待升级实例对应镜像的ID、名称等信息。服务需求方在接收到升级请求后,可根据该升级请求确定待升级实例,结合待升级实例上的业务情况,例如待升级实例上的业务请求及业务请求的响应状态等,判断待升级实例是否适合升级,什么时间适合升级,采用什么方法进行升级等,进而可为该待升级实例生成升级策略并返回给中心管控设备101。中心管控设备101接收服务需求方发送的升级策略,依据升级策略对待升级实例进行升级。The central management and control device 101 determines the instance to be upgraded from at least one instance. The instance to be upgraded can be one or more; it sends an upgrade request to the service demander, so that the service demander can determine the instance to be upgraded based on the business situation of the instance to be upgraded Upgrade strategy. The upgrade request carries the identification information of the instance to be upgraded, such as the ID and name of the instance to be upgraded. It can also be the ID and name of the service corresponding to the instance to be upgraded. It can also be the ID, name and other information of the image corresponding to the instance to be upgraded. . After receiving the upgrade request, the service demander can determine the instance to be upgraded according to the upgrade request, and combine the business conditions on the instance to be upgraded, such as the business request on the instance to be upgraded and the response status of the business request, to determine whether the instance to be upgraded is It is suitable for upgrading, when is suitable for upgrading, what method is used for upgrading, etc., and then an upgrade strategy can be generated for the instance to be upgraded and returned to the central control device 101. The central management and control device 101 receives the upgrade strategy sent by the service demander, and upgrades the instance to be upgraded according to the upgrade strategy.
在一可选实施例中,服务需求方可结合待升级实例上的业务情况,例如已接收到且尚未完成的业务请求(简称为存量业务请求)的数量,是否还有新增的业务请求(增量业务请求)等,判断什么时间可以对待升级实例进行升级,也就是说,升级策略中可以包括升级时间。如果待升级实例上的存量业务请求均已被响应,且不再有增量业务请求,在这种情况下,对待升级实例进行升级业务请求不会被中断,不会影响用户感受,则认为可以对待升级实例进行升级。对服务需求方来说,在认为可以对待升级实例进行升级时,可以向中心管控设备101返回升级通知,该升级通知携带有指示中心管控设备101在接收到升级通知后对待升级实例进行升级的时间信息,升级通知携带该时间信息的方式可以是显式的,也可以是隐式的。对中心管控设备101而言,在接收到升级通知后可对待升级实例进行升级。In an optional embodiment, the service demander can combine the business conditions on the instance to be upgraded, such as the number of business requests that have been received and not yet completed (referred to as inventory business requests), and whether there are any new business requests ( Incremental service request), etc., to determine when the instance to be upgraded can be upgraded, that is, the upgrade strategy can include the upgrade time. If all existing service requests on the instance to be upgraded have been responded, and there are no incremental service requests, in this case, the upgrade service request of the instance to be upgraded will not be interrupted and will not affect the user experience, then it is considered OK Upgrade the instance to be upgraded. For the service demander, when it considers that the instance to be upgraded can be upgraded, it can return an upgrade notification to the central management and control device 101, and the upgrade notification carries the time to instruct the central management and control device 101 to upgrade the instance to be upgraded after receiving the upgrade notification. Information, the way that the upgrade notification carries the time information can be explicit or implicit. For the central management and control device 101, after receiving the upgrade notification, the instance to be upgraded can be upgraded.
当然,除此上述方式之外,服务需求方也可以结合待升级实例上的业务情况,预估出合适的升级时间,将该升级时间携带在升级通知中发送给中心管控设备101。中心管控设备101接收到升级通知后,从中获取升级时间,并在该升级时间开始对待升级实例进行升级。Of course, in addition to the above-mentioned method, the service demander can also estimate the appropriate upgrade time in combination with the business conditions on the instance to be upgraded, and send the upgrade time to the central control device 101 in the upgrade notification. After receiving the upgrade notification, the central management and control device 101 obtains the upgrade time therefrom, and starts to upgrade the instance to be upgraded at the upgrade time.
升级策略可以包括升级时间,该升级时间由服务需求方结合待升级实例上的业务情况确定。当然,升级策略也可以不包括升级时间,升级时间可由中心管控设备101根据待升级实例的状态、中心管控设备101的负载情况等因素自行确定。除此之外,升级策略可以包括升级方法,这里的升级方法是指对待升级实例进行升级采用的方法,可由服务需求方结合待升级实例上的业务情况确定。根据镜像类型的不同,升级方法也不同。若升级策略包括升级时间,则中心管控设备101可以在升级策略中指定的升级时间开始对待升级实例进行升级;若升级策略包括升级方法,则中心管控设备101可以采用升级策略中指定的升级方法对待升级实例进行升级;若升级策略包括升级时间和升级方法,则中心管控设备101可以采用升级策略中指定的升级方法,在升级策略中指定的升级时 间开始对待升级实例进行升级。The upgrade strategy may include an upgrade time, which is determined by the service demander in combination with the business situation on the instance to be upgraded. Of course, the upgrade strategy may not include the upgrade time. The upgrade time can be determined by the central management and control device 101 according to factors such as the status of the instance to be upgraded, the load situation of the central management and control device 101, and the like. In addition, the upgrade strategy may include an upgrade method, where the upgrade method refers to the method used to upgrade the instance to be upgraded, which can be determined by the service demander in combination with the business situation on the instance to be upgraded. Depending on the image type, the upgrade method is also different. If the upgrade strategy includes the upgrade time, the central control device 101 can start to upgrade the instance to be upgraded at the upgrade time specified in the upgrade strategy; if the upgrade strategy includes the upgrade method, the central control device 101 can use the upgrade method specified in the upgrade strategy to treat The upgrade instance is upgraded; if the upgrade strategy includes an upgrade time and an upgrade method, the central management and control device 101 can adopt the upgrade method specified in the upgrade strategy, and start to upgrade the instance to be upgraded at the upgrade time specified in the upgrade strategy.
可选地,对实例进行升级,可由中心管控设备101发起。例如,中心管控设备101可以监控各实例对应镜像的版本信息,当发现新版本的镜像时,可以确定需要对与该新版本的镜像对应的实例进行升级;或者,也可以监控各实例的运行状态、生命周期等信息,当发现实例运行过程中出现漏洞、不稳定、功能不全、CPU或内存资源消耗过大等问题时,可以确定需要对出现这些问题的实例进行升级。Optionally, the upgrade of the instance can be initiated by the central control device 101. For example, the central management and control device 101 can monitor the version information of the mirror corresponding to each instance, and when a new version of the mirror is found, it can determine that the instance corresponding to the new version of the mirror needs to be upgraded; or, it can also monitor the running status of each instance , Life cycle and other information. When problems such as loopholes, instability, insufficiency, excessive consumption of CPU or memory resources are found in the running process of the instance, it can be determined that the instance with these problems needs to be upgraded.
可选地,对实例进行升级,也可以由服务需求方发起。例如,根据业务需求,需要对实例进行升级时,服务需求方可以向中心管控设备101发送升级描述信息,该升级描述信息包括实例过滤条件,基于该实例过滤条件可以从众多实例中过滤出待升级实例。实例过滤条件可以是待升级实例的标识类信息,例如待升级实例的ID、名称,或者和待升级实例对应镜像的ID、名称,或者待升级实例对应服务的ID、名称等,这些信息均可确定出待升级实例。或者,若需要对全部实例进行升级,则实例过滤条件也可以是指示对全部实例进行升级的标识性信息,例如“all”、“1”等,该标识性信息可灵活设定。对中心管控设备101而言,可接收服务需求方发送的升级描述信息,从该升级描述信息中获取实例过滤条件,根据该实例过滤条件,从至少一个实例中确定待升级实例;然后向服务需求方发送升级请求,以请求服务需求方结合待升级实例上的业务情况为该升级实例确定升级策略;在服务需求方返回待升级实例的升级策略后,可依据升级策略对待升级实例进行升级。Optionally, upgrading the instance can also be initiated by the service demander. For example, when an instance needs to be upgraded according to business requirements, the service demander can send upgrade description information to the central management and control device 101. The upgrade description information includes instance filter conditions. Based on the instance filter conditions, the instances to be upgraded can be filtered out Instance. The instance filter condition can be the identification information of the instance to be upgraded, such as the ID and name of the instance to be upgraded, or the ID and name of the image corresponding to the instance to be upgraded, or the ID and name of the service corresponding to the instance to be upgraded. Determine the instance to be upgraded. Or, if it is necessary to upgrade all the instances, the instance filter condition may also be identifying information indicating to upgrade all the instances, such as “all”, “1”, etc. The identifying information can be flexibly set. For the central control device 101, the upgrade description information sent by the service demander can be received, the instance filter condition is obtained from the upgrade description information, and the instance to be upgraded is determined from at least one instance according to the instance filter condition; The party sends an upgrade request to request the service demander to determine an upgrade strategy for the upgraded instance based on the business situation on the instance to be upgraded; after the service demander returns the upgrade strategy of the instance to be upgraded, the instance to be upgraded can be upgraded according to the upgrade strategy.
其中,对待升级实例进行升级主要是指:关停待升级实例,根据相应版本(一般是指新版本)的镜像对待升级实例进行更新,更新完后再重启实例。其中,对待升级实例进行升级所需的镜像版本可以由中心管控设备101确定,例如将相应镜像的最新版本作为升级所需的镜像版本,也可以由服务需求方指定。可选地,服务需求方可以将升级所需的镜像版本携带在升级描述信息中提供给中心管控设备101,例如该升级描述信息可以包括“对所有或指定实例进行镜像版本A到镜像版本B的升级”等信息。基于此,中心管控设备101可以从升级描述信息中获取升级所需的镜像版本,然后,依据升级策略,利用该镜像版本对应的镜像对待升级实例进行升级。当待升级实例全部完成升级后,此次实例升级过程结束。Among them, upgrading the instance to be upgraded mainly refers to: shutting down the instance to be upgraded, updating the instance to be upgraded according to the mirror image of the corresponding version (generally, the new version), and restarting the instance after the update. The image version required for the upgrade of the instance to be upgraded can be determined by the central management and control device 101, for example, the latest version of the corresponding image is used as the image version required for the upgrade, or it can be specified by the service demander. Optionally, the service demander may carry the image version required for the upgrade in the upgrade description information and provide it to the central management and control device 101. For example, the upgrade description information may include "all or specified instances from mirror version A to mirror version B). Upgrade" and other information. Based on this, the central management and control device 101 can obtain the image version required for the upgrade from the upgrade description information, and then, according to the upgrade strategy, use the image corresponding to the image version to upgrade the instance to be upgraded. When all the instances to be upgraded are upgraded, the instance upgrade process ends.
实例迁移:Instance migration:
在实际应用中,在一些情况下需要对实例进行迁移。例如,在整个边缘云节点故障或不可用的情况下,需要将该边缘云节点中的实例迁移到其它边缘云节点中。又例如, 在承载某个实例的物理机出现故障或宕机的情况下,需要将该物理机上的实例迁移到其它物理机上。又例如,可能因为业务需要,需要将某个或某些实例从一个边缘云节点迁移到其它边缘云节点中。又例如,在需要进行资源归并的情况下,也需要对某个或某些实例进行迁移。在中心管控设备101的管控下,可对边缘云节点中的实例进行迁移,该迁移过程主要包括:In practical applications, the instance needs to be migrated in some cases. For example, in the case that the entire edge cloud node is faulty or unavailable, the instances in the edge cloud node need to be migrated to other edge cloud nodes. For another example, in the case of a failure or downtime of a physical machine hosting an instance, the instance on the physical machine needs to be migrated to another physical machine. For another example, it may be necessary to migrate one or some instances from one edge cloud node to other edge cloud nodes due to business needs. For another example, when resources need to be merged, one or some instances need to be migrated. Under the management and control of the central control device 101, instances in the edge cloud nodes can be migrated, and the migration process mainly includes:
中心管控设备101从至少一个实例中确定待迁移实例。待迁移实例可以是一个或多个;若待迁移实例是多个,多个待迁移实例可部署于同一边缘云节点中,也可以部署于不同边缘云节点中。The central management and control device 101 determines the instance to be migrated from at least one instance. There may be one or more instances to be migrated; if there are multiple instances to be migrated, the multiple instances to be migrated can be deployed in the same edge cloud node or in different edge cloud nodes.
在一些应用场景中,中心管控设备101可以监控至少一个边缘云节点102中部署的至少一个实例的状态,根据至少一个实例的状态,获取出现故障的实例和/或运行中发生指定事件的实例作为待迁移实例,进而对待迁移实例进行迁移。其中,出现故障的实例是指不能正常运行的实例,例如可以是发生宕机的物理机上的实例,也可以是本身宕机的实例等,这类实例需要进行迁移,以便能够继续为服务需求方提供云计算服务。指定事件主要是指一些出现后实例仍能正常运行的事件,可以根据应用需求灵活设定,对此不做限定。举例说明,指定事件可以是一些预警或告警事件等,虽然发生一些预警或告警事件,但实例并未产生实际问题,仍可运行(即未故障),但有故障隐患,可在故障前及时进行迁移,以避免故障引起的服务中断等问题。另外,中心管控设备101维护有各边缘云节点的信息以及各边缘云节点中部署的各实例的信息,基于此,可以确定待迁移实例所属的边缘云节点,为便于描述和区分,将待迁移实例在迁移前所属的边缘云节点记为第一边缘云节点。In some application scenarios, the central management and control device 101 may monitor the state of at least one instance deployed in at least one edge cloud node 102, and obtain the failed instance and/or the instance in which a specified event occurs during operation according to the state of the at least one instance. The instance to be migrated, and then the instance to be migrated is migrated. Among them, a failed instance refers to an instance that cannot run normally, for example, it can be an instance on a physical machine that is down, or an instance that itself is down. Such instances need to be migrated in order to continue to serve the demand side. Provide cloud computing services. The designated event mainly refers to some events that the instance can still run normally after occurrence, which can be flexibly set according to application requirements, and there is no restriction on this. For example, the specified event can be some early warning or alarm events, etc. Although some early warning or alarm events occur, the instance does not produce actual problems and can still run (that is, no failure), but there are hidden dangers of failure, which can be carried out in time before failure Migration to avoid problems such as service interruption caused by faults. In addition, the central management and control device 101 maintains the information of each edge cloud node and the information of each instance deployed in each edge cloud node. Based on this, the edge cloud node to which the instance to be migrated belongs can be determined. For ease of description and distinction, the The edge cloud node to which the instance belongs before the migration is recorded as the first edge cloud node.
在另一些应用场景中,随着时间的积累,边缘云节点中会出现一些资源碎片,或者需要部署一个资源规格较大的实例,但边缘云节点中的资源设备上可能已经没有满足资源规格要求的可用资源,这些情况下可以通过实例迁移对边缘云节点中的资源进行归并,这可以充分利用资源碎片,进而产出规格较大的资源块,有利于提高资源利用率。基于此,中心管控设备101可以根据资源归并需求,从至少一个实例中确定待迁移实例,进而对待迁移实例进行迁移。其中,资源归并主要是通过实例迁移对资源碎片进行整合的过程,经过整合后,边缘云节点中的资源碎片会减少甚至不存在,这有利于提高边缘云节点中的资源利用率。值得说明的是,资源归并需求可以是系统级的,也可以节点级的。系统级的资源归并是指从整个网络系统的维度考虑,通过实例迁移对整个网络系统中的资源碎片进行整合;节点级的资源归并是指从边缘云节点的维度考虑,通过实例迁移对 边缘云节点中的资源碎片进行整合。In other application scenarios, as time accumulates, there will be some resource fragments in the edge cloud node, or an instance with a larger resource specification needs to be deployed, but the resource equipment in the edge cloud node may not meet the resource specification requirements In these cases, the resources in the edge cloud nodes can be merged through instance migration, which can make full use of resource fragments, and then produce larger resource blocks, which is beneficial to improve resource utilization. Based on this, the central management and control device 101 can determine the instance to be migrated from at least one instance according to resource merging requirements, and then migrate the instance to be migrated. Among them, resource merging is mainly the process of integrating resource fragments through instance migration. After integration, the resource fragments in edge cloud nodes will be reduced or even nonexistent, which is conducive to improving resource utilization in edge cloud nodes. It is worth noting that resource merging requirements can be system-level or node-level. System-level resource merging refers to the integration of resource fragments in the entire network system through instance migration; node-level resource merging refers to the consideration from the dimension of edge cloud nodes, through instance migration to edge cloud The resource fragments in the node are integrated.
可选地,资源归并需求可以是服务需求方提供的。例如,服务需求方需要部署一个新的实例时,若为其服务的边缘云节点中各资源设备上的可用资源均不足以承载该新实例,可以要求对该边缘云节点中的实例进行迁移实现资源整合,从而为新实例提供足够的资源。或者,资源归并需求也可以是中心管控设备101的资源调度模块的定期行为。例如,中心管控设备101的资源调度模块定期执行资源碎片检查,当发现碎片率达到一定的阈值并可以执行实例迁移时,对各边缘云节点中的资源碎片进行整合,提高边缘云节点中的资源利用率。Optionally, the resource consolidation requirement may be provided by the service demander. For example, when a service demander needs to deploy a new instance, if the available resources on each resource device in the edge cloud node it serves are not enough to carry the new instance, it can request the migration of the instance in the edge cloud node. Resource integration, so as to provide sufficient resources for new instances. Alternatively, the resource merging requirement may also be a regular behavior of the resource scheduling module of the central management and control device 101. For example, the resource scheduling module of the central management and control device 101 periodically performs resource fragmentation checks. When it is found that the fragmentation rate reaches a certain threshold and instance migration can be performed, it integrates the resource fragments in each edge cloud node to improve the resources in the edge cloud node. Utilization rate.
其中,资源归并需求中包含有与资源归并相关的信息。例如,资源归并需求中可以包含为了达到资源归并目的需要迁移的实例的信息,基于此,可根据资源归并需求,直接确定待迁移实例。又例如,资源归并需求中可以包含需要资源归并的边缘云节点的信息。基于此,可根据资源归并需求,确定需要进行资源归并的边缘云节点,本实施例中将需要资源归并的边缘云节点称为第一边缘云节点;进而可以结合第一边缘云节点中各资源设备上剩余的可用资源和第一边缘云节点中各实例需要的资源,确定待迁移实例。Among them, the resource consolidation requirements contain information related to resource consolidation. For example, the resource merging requirements may include information about instances that need to be migrated to achieve the purpose of resource merging. Based on this, the instances to be migrated can be directly determined according to the resource merging requirements. For another example, the resource merging requirements may include information about edge cloud nodes that need to be merged. Based on this, the edge cloud node that needs to be merged can be determined according to the resource merging requirements. In this embodiment, the edge cloud node that needs to be merged is called the first edge cloud node; in turn, the resources in the first edge cloud node can be combined The remaining available resources on the device and the resources required by each instance in the first edge cloud node determine the instance to be migrated.
无论是哪种应用场景,在确定待迁移实例后,中心管控设备101可以判断待迁移实例所属的第一边缘云节点是否满足节点内迁移条件;若第一边缘云节点满足节点内迁移条件,则对待迁移实例进行边缘云节点内的迁移;若第一边缘云节点不满足节点内迁移条件,则对待迁移实例进行跨边缘云节点的迁移。Regardless of the application scenario, after determining the instance to be migrated, the central management and control device 101 can determine whether the first edge cloud node to which the instance to be migrated belongs meets the intra-node migration condition; if the first edge cloud node meets the intra-node migration condition, then The instance to be migrated is migrated within the edge cloud node; if the first edge cloud node does not meet the intra-node migration condition, the instance to be migrated is migrated across edge cloud nodes.
可选地,中心管控设备101可以判断第一边缘云节点当前是否处于可用状态;若第一边缘云节点当前处于可用状态,判断第一边缘云节点的可用资源是否足够承载待迁移实例;若第一边缘云节点的可用资源足够承载待迁移实例,确定第一边缘云节点满足节点内迁移条件;若第一边缘云节点当前处于不可用状态,或者第一边缘云节点的可用资源不足以承载待迁移实例,确定第一边缘云节点不满足节点内迁移条件。在本申请实施例中,将实例的迁移划分为两种类型:节点内迁移和跨节点迁移。其中,第一边缘云节点的可用资源主要是指第一边缘云节点中各台资源设备上的可用资源;相应地,判断第一边缘云节点的可用资源是否足够承载待迁移实例主要是指判断第一边缘云节点中是否存在可用资源足以承载待迁移实例的资源设备。Optionally, the central management and control device 101 may determine whether the first edge cloud node is currently available; if the first edge cloud node is currently available, determine whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated; The available resources of an edge cloud node are sufficient to carry the instance to be migrated, and it is determined that the first edge cloud node meets the migration conditions within the node; if the first edge cloud node is currently in an unavailable state, or the available resources of the first edge cloud node are insufficient to carry the instance In the migration instance, it is determined that the first edge cloud node does not meet the intra-node migration condition. In the embodiments of the present application, the migration of instances is divided into two types: intra-node migration and cross-node migration. Among them, the available resources of the first edge cloud node mainly refer to the available resources on each resource device in the first edge cloud node; accordingly, judging whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated mainly refers to judging Whether there is a resource device with sufficient resources available in the first edge cloud node to carry the instance to be migrated.
值得说明的是,在资源归并场景中,为了实现资源归并的实例迁移主要是节点内迁移,当然,也可以是跨节点迁移。可选地,在根据第一边缘云节点中各资源设备上剩余的可用资源和第一边缘云节点中各实例需要的资源确定待迁移实例的过程中,还可以确 定待迁移实例需要迁移到的资源设备,该资源设备是第一边缘云节点中剩余的可用资源可以承载待迁移实例的资源设备。当然,若第一边缘云节点中不存在剩余的可用资源可以承载待迁移实例的资源设备,可以针对待迁移实例进行跨节点迁移。鉴于资源归并的目的,在针对待迁移实例进行跨节点迁移的过程中,优先考虑将待迁移实例迁移到其它边缘云节点中已经被使用且剩余的可用资源可以承载待迁移实例的资源设备上;进一步,在有多个已经被使用且剩余的可用资源可以承载待迁移实例的资源设备的情况下,可以以资源碎片最小为原则,从中选择剩余的可用资源与待迁移实例需要的资源的匹配度较高的资源设备,尽量产生较少的资源碎片或不产生资源碎片。It is worth noting that in the resource merging scenario, the instance migration for resource merging is mainly intra-node migration, and of course, it can also be cross-node migration. Optionally, in the process of determining the instance to be migrated according to the remaining available resources on each resource device in the first edge cloud node and the resources required by each instance in the first edge cloud node, the instance to be migrated may also be determined A resource device, where the resource device is a resource device whose remaining available resources in the first edge cloud node can carry the instance to be migrated. Of course, if there are no remaining available resources in the first edge cloud node that can carry resource devices of the instance to be migrated, cross-node migration can be performed for the instance to be migrated. In view of the purpose of resource merging, in the process of cross-node migration for the instances to be migrated, priority is given to migrating the instances to be migrated to other edge cloud nodes that have been used and the remaining available resources can carry the resource devices of the instances to be migrated; Further, in the case that there are multiple resources that have been used and remaining available resources can carry the resource equipment of the instance to be migrated, the principle of minimum resource fragmentation can be used to select the matching degree between the remaining available resources and the resources required by the instance to be migrated. Higher resource equipment, try to produce less resource fragments or no resource fragments.
对于节点内迁移:可选地,可以通过热迁移技术保证实例所提供云计算服务的连续性,关于热迁移技术可参见现有技术,在此不再赘述。For intra-node migration: Optionally, the continuity of the cloud computing service provided by the instance can be ensured through the hot migration technology. For the hot migration technology, please refer to the prior art, which will not be repeated here.
对于跨节点迁移:中心管控设备101可以从至少一个边缘云节点选择第二边缘云节点,第二边缘云节点不同于第一边缘云节点,且第二边缘云节点中的可用资源足够承载待迁移实例,即有足够资源;将待迁移实例迁移到第二边缘云节点中,并将待迁移实例在第二边缘云节点中的属性信息发送给服务需求方,以供服务需求方基于该属性信息针对待迁移实例进行业务调度。其中,待迁移实例在第二边缘云节点中的属性信息是指在待迁移实例迁移到第二边缘云节点之后,外部(例如服务需求方或服务需求方授权的第三方)针对待迁移实例进行业务调度所需的信息,例如可以包括但不限于:第二边缘云节点所在的地区、运营商信息和/或公网IP等信息。以服务需求方为例,可以根据上述属性信息中第二边缘云节点所在的地区和运营商信息,结合发起业务请求的终端使用网络的运营商信息和所在地区等信息,判断是否将该业务请求分配到第二边缘云节点中的待迁移实例;若确定将业务请求分配到第二边缘云节点中的待迁移实例,则可以通过系统的调度能力,将上述属性信息中的公网IP提供给终端,终端的请求就可以访问到第二边缘云节点中的待迁移实例,达到将终端的业务请求调度到第二边缘云节点中的待迁移实例上的目的。For cross-node migration: the central management and control device 101 can select a second edge cloud node from at least one edge cloud node, the second edge cloud node is different from the first edge cloud node, and the available resources in the second edge cloud node are sufficient to carry the migration Instance, that is, enough resources; migrate the instance to be migrated to the second edge cloud node, and send the attribute information of the instance to be migrated in the second edge cloud node to the service demander, so that the service demander can base on the attribute information Perform business scheduling for the instances to be migrated. Among them, the attribute information of the instance to be migrated in the second edge cloud node means that after the instance to be migrated is migrated to the second edge cloud node, an external (for example, a service demander or a third party authorized by the service demander) conducts an operation on the instance to be migrated. Information required for service scheduling may include, but is not limited to, for example, information such as the area where the second edge cloud node is located, operator information, and/or public network IP. Taking the service demander as an example, it is possible to determine whether to request the service according to the area and operator information of the second edge cloud node in the above attribute information, combined with the operator information and area of the terminal that initiated the service request. Assigned to the instance to be migrated in the second edge cloud node; if it is determined to assign the service request to the instance to be migrated in the second edge cloud node, the public network IP in the above attribute information can be provided to The terminal, the terminal's request can access the instance to be migrated in the second edge cloud node, achieving the purpose of scheduling the service request of the terminal to the instance to be migrated in the second edge cloud node.
可选地,在选择第二边缘云节点时,可以采用但不限于以下方式:Optionally, when selecting the second edge cloud node, the following methods can be used but not limited to:
方式1:可以根据其它边缘云节点与第一边缘云节点之间的距离,选择与第一边缘云节点的距离小于设定距离阈值的边缘云节点,或者选择与第一边缘云节点距离最近的边缘云节点,或者从与第一边缘云节点距离最近的N个边缘云节点中任意选择一个边缘云节点,作为第二边缘云节点。在方式1中,第二边缘云节点距离第一边缘云节点距离最近或较近,可节约数据传输时间,有利于提高迁移效率。可选地,其它边缘云节点与 第一边缘云节点之间的距离可以是其它边缘云节点与第一边缘云节点之间的平均距离,也可以是其它边缘云节点与第一边缘云节点的中心之间的距离,还可以是其它边缘云节点与第一边缘云节点最靠近的外边缘之间的距离等,可根据需求适应性定义。Method 1: According to the distance between other edge cloud nodes and the first edge cloud node, select the edge cloud node whose distance from the first edge cloud node is less than the set distance threshold, or select the closest distance to the first edge cloud node An edge cloud node, or an edge cloud node arbitrarily selected from the N edge cloud nodes closest to the first edge cloud node as the second edge cloud node. In Manner 1, the second edge cloud node is closest or relatively close to the first edge cloud node, which can save data transmission time and help improve migration efficiency. Optionally, the distance between other edge cloud nodes and the first edge cloud node may be the average distance between other edge cloud nodes and the first edge cloud node, or the distance between other edge cloud nodes and the first edge cloud node. The distance between the centers may also be the distance between other edge cloud nodes and the closest outer edge of the first edge cloud node, etc., which can be adaptively defined according to requirements.
方式2:可以根据其它边缘云节点的带宽资源,从中选择带宽资源相对充足的边缘云节点,例如选择带宽资源最大的,或者选择带宽资源大于设定带宽阈值的,或者选择带宽使用率较低的边缘云节点,作为第二边缘云节点。在方式2中,第二边缘云节点的带宽资源充足,可提高数据传输速率,有利于提高迁移效率。Method 2: You can select edge cloud nodes with relatively sufficient bandwidth resources according to the bandwidth resources of other edge cloud nodes. For example, select the edge cloud node with the largest bandwidth resource, or select the bandwidth resource greater than the set bandwidth threshold, or select the bandwidth utilization rate lower The edge cloud node serves as the second edge cloud node. In method 2, the bandwidth resources of the second edge cloud node are sufficient, which can increase the data transmission rate, which is beneficial to improve the migration efficiency.
方式3:可以根据其它边缘云节点当前的负载情况,从中选择负载相对较轻的边缘云节点,例如选择负载量最小的,或者选择负载量小于设定负载量阈值的边缘云节点,作为第二边缘云节点。在方式3中,第二边缘云节点的负载较轻,可有足够资源且能够及时处理实例迁移,有利于提高迁移效率。Method 3: According to the current load situation of other edge cloud nodes, select the edge cloud node with relatively light load, for example, select the edge cloud node with the smallest load, or select the edge cloud node with the load less than the set load threshold as the second Edge cloud node. In mode 3, the load of the second edge cloud node is lighter, it has sufficient resources and can handle instance migration in time, which is beneficial to improve migration efficiency.
可选地,在将待迁移实例迁移到第二边缘云节点时,中心管控设备101可根据待迁移实例的资源需求,在第二边缘云节点中为待迁移实例进行资源预留或分配;在资源预留或分配成功后,将待迁移实例迁移到第二边缘云节点中预留或分配的资源上。例如,可结合待迁移实例的资源需求,确定待迁移实例需要的资源类型、资源量和/或对资源设备的性能要求等信息,根据这些信息在第二边缘云节点中进行资源预留或分配,可为实例成功迁移提供资源保障。关于中心管控设备101在第二边缘云节点中为待迁移实例进行资源预留或分配的过程,可参见后续资源调度部分的内容,在此不再赘述。Optionally, when migrating the instance to be migrated to the second edge cloud node, the central management and control device 101 may reserve or allocate resources for the instance to be migrated in the second edge cloud node according to the resource requirements of the instance to be migrated; After the resource reservation or allocation is successful, the instance to be migrated is migrated to the resources reserved or allocated in the second edge cloud node. For example, the resource requirements of the instances to be migrated can be combined to determine the type of resources, the amount of resources and/or the performance requirements of the resource equipment required by the instances to be migrated, and resource reservation or allocation can be performed in the second edge cloud node based on this information , Which can provide resource guarantee for successful instance migration. Regarding the process of the central management and control device 101 in the second edge cloud node to reserve or allocate resources for the instances to be migrated, please refer to the content of the subsequent resource scheduling part, which will not be repeated here.
可选地,若待迁移实例是出现故障的实例,即不可正常运行的实例,中心管控设备101还可以将该迁移事件通知给服务需求方,这样服务需求方可以做出合适的响应动作,比如更新该实例在服务需求方中的信息,或针对实例迁移过程中的宕机情况做出容灾响应。进一步,可在通知迁移事件的过程中,一并将待迁移实例在第二边缘云节点中的属性信息提供给服务需求方。当然,也可以在将待迁移实例成功迁移至第二边缘云节点之后,将待迁移实例在第二边缘云节点中的属性信息提供给服务需求方。Optionally, if the instance to be migrated is a failed instance, that is, an instance that is not functioning normally, the central management and control device 101 may also notify the service demander of the migration event, so that the service demander can make appropriate response actions, such as Update the information of the instance in the service demander, or make a disaster recovery response to the downtime during the instance migration. Further, in the process of notifying the migration event, the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander. Of course, after the instance to be migrated is successfully migrated to the second edge cloud node, the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
可选地,若待迁移实例是运行过程中发生指定事件的实例,即虽发生指定事件但仍可正常运行的实例,中心管控设备101还可以向服务需求方发送迁移请求,以供服务需求方结合待迁移实例上的业务情况为待迁移实例确定迁移策略;接收服务需求方发送的迁移策略,依据迁移策略将待迁移实例迁移到第二边缘云节点中。该迁移策略主要包括是否迁移、迁移时间以及迁移方式中的至少一个信息。可选地,服务需求方可以根据待迁移实例上的存量业务请求和增量业务请求的数量以及响应状态,确定什么时间进行迁 移,例如可以在待迁移实例上的存量业务请求均已被响应,且增量业务请求不多的情况下,确定进行实例迁移。Optionally, if the instance to be migrated is an instance in which a specified event occurs during operation, that is, an instance that can still run normally despite the occurrence of a specified event, the central control device 101 may also send a migration request to the service demander for the service demander Determine a migration strategy for the instance to be migrated in combination with the business situation on the instance to be migrated; receive the migration strategy sent by the service demander, and migrate the instance to be migrated to the second edge cloud node according to the migration strategy. The migration strategy mainly includes at least one information of whether to migrate, migration time, and migration mode. Optionally, the service demander can determine when to perform the migration based on the number of stock service requests and incremental service requests on the instance to be migrated and the response status. For example, all stock service requests on the instance to be migrated can be responded to. And if there are not many incremental business requests, determine the instance migration.
进一步可选地,中心管控设备101可以将待迁移实例在第二边缘云节点中的属性信息连同上述迁移请求一并发送给服务需求方。或者,也可以在将待迁移实例成功迁移至第二边缘云节点之后,将待迁移实例在第二边缘云节点中的属性信息提供给服务需求方。Further optionally, the central management and control device 101 may send the attribute information of the instance to be migrated in the second edge cloud node together with the migration request to the service demander. Alternatively, after the instance to be migrated is successfully migrated to the second edge cloud node, the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
进一步可选地,若待迁移实例是发生指定事件但仍可正常运行的实例,在迁移过程中,待迁移实例可继续运行在第一边缘云节点中,这样迁移过程中的业务请可继续调度到第一边缘云节点中的待迁移实例上,保证业务连续性。在待迁移实例成功迁移到第二边缘云节点中,且服务需求方在确保将新的业务请求全部调度到第二边缘云节点中,且第一边缘云节点中的业务请求逐步减少最终没有新的业务请求,即运行于第一边缘云节点中的待迁移实例上不再有任何业务请求的情况下,中心管控设备可将第一边缘云节点中的待迁移实例释放掉。可选地,服务需求方在确定运行于第一边缘云节点中的待迁移实例上不再有任何业务请求,既没有存量业务请求也没有增量业务请求之后,可以向中心管控设备101发送释放通知;中心管控设备101接收服务需求方发送的释放通知,根据该释放通知将运行在第一边缘云节点中的待迁移实例释放掉。进一步,中心管控设备101还可以将运行在第一边缘云节点中的待迁移实例的运行状态同步给第二边缘云节点中的待待迁移实例。Further optionally, if the instance to be migrated is an instance that has a specified event but can still run normally, during the migration process, the instance to be migrated can continue to run on the first edge cloud node, so that the business during the migration process can continue to be scheduled To the instance to be migrated in the first edge cloud node, ensure business continuity. After the instance to be migrated is successfully migrated to the second edge cloud node, and the service demander is ensuring that all new business requests are scheduled to the second edge cloud node, and the business requests in the first edge cloud node are gradually reduced, there is no new The central management and control device can release the instance to be migrated in the first edge cloud node when there are no more business requests on the instance to be migrated running on the first edge cloud node. Optionally, after determining that the service requester no longer has any service requests on the instance to be migrated running in the first edge cloud node, and there is neither an inventory service request nor an incremental service request, it can send a release to the central control device 101 Notification; the central management and control device 101 receives the release notification sent by the service demander, and releases the instance to be migrated running in the first edge cloud node according to the release notification. Further, the central management and control device 101 may also synchronize the running state of the instance to be migrated running in the first edge cloud node to the instance to be migrated in the second edge cloud node.
进一步,无论待迁移实例是哪种实例,将待迁移实例迁移到第二边缘云节点中,主要是控制第二边缘云节点中相应资源设备根据待迁移实例对应的镜像或实例快照在预留或分配的资源上创建待迁移实例的过程。可选地,中心管控设备101可以向第二边缘云节点中相应资源设备提供待迁移实例对应的应用镜像或实例快照,以供第二边缘云节点中相应资源设备根据应用镜像或实例快照在预留或分配的资源上创建待迁移实例,但不限于此。Furthermore, regardless of the instance to be migrated, migrating the instance to be migrated to the second edge cloud node is mainly to control the corresponding resource equipment in the second edge cloud node to reserve or The process of creating an instance to be migrated on the allocated resources. Optionally, the central management and control device 101 may provide the corresponding resource device in the second edge cloud node with the application image or instance snapshot corresponding to the instance to be migrated, so that the corresponding resource device in the second edge cloud node can perform the pre-processing based on the application image or instance snapshot. Create an instance to be migrated on the reserved or allocated resources, but it is not limited to this.
可选地,本实施例的中心管控设备101可以将自己的实例升级、实例迁移等管控功能封装成一系列应用编程接口(Application Programming Interface,API)并开放给服务需求方使用。这些开放的API称为开放API(OpenAPI),中心管控设备101可通过OpenAPI与服务需求方进行交互。Optionally, the central management and control device 101 of this embodiment may encapsulate its own instance upgrade, instance migration, and other management and control functions into a series of application programming interfaces (Application Programming Interface, API) and open them to service demanders. These open APIs are called Open APIs (OpenAPI), and the central management and control device 101 can interact with the service demander through OpenAPI.
值得说明的是,在网络系统100中,中心管控设备101可以直接对至少一个边缘云节点102进行管控和调度,但并不限于此。如1b所示,在网络系统100中,除了包括 中心管控设备101和至少一个边缘云节点102之外,还包括边缘管控设备103。其中,边缘管控设备103的数量可以是一个,也可以是多个。另外,边缘管控设备103可以部署在一个或多个边缘云节点102中。在一可选实施例中,如图1b所示,每个边缘云节点102中分别部署边缘管控设备103。进一步,每个边缘云节点包括一台或多台资源设备,可选地,边缘管控设备103可集中部署在一台资源设备上,也可以分散部署在多台资源设备上。另外,每个边缘云节点除了包括资源设备之外,还可以包括一台或多台专有设备,其中边缘管控设备103可以集中部署在一台专有设备上,或分散部署在多台专有设备上。其中,专有设备是指用来部署边缘管控设备103的物理设备,不同于资源设备。此外,边缘管控设备103也可以与中心管控设备101部署在一起,在此不作限定。另外,中心管控设备101可以部署在一个或多个云计算数据中心或传统数据中心中,也可以和至少一个边缘云节点一起部署在边缘云网络中。It is worth noting that in the network system 100, the central management and control device 101 can directly control and schedule at least one edge cloud node 102, but it is not limited to this. As shown in 1b, in the network system 100, in addition to a central management and control device 101 and at least one edge cloud node 102, an edge management and control device 103 is also included. Among them, the number of edge management and control devices 103 may be one or multiple. In addition, the edge management and control device 103 may be deployed in one or more edge cloud nodes 102. In an optional embodiment, as shown in FIG. 1b, an edge management and control device 103 is separately deployed in each edge cloud node 102. Further, each edge cloud node includes one or more resource devices. Optionally, the edge management and control device 103 may be deployed on one resource device in a centralized manner, or may be deployed on multiple resource devices in a distributed manner. In addition, each edge cloud node can include one or more proprietary devices in addition to resource devices. The edge management and control device 103 can be deployed on one dedicated device or distributed on multiple dedicated devices. On the device. Among them, the proprietary device refers to the physical device used to deploy the edge management and control device 103, which is different from the resource device. In addition, the edge management and control device 103 can also be deployed with the central management and control device 101, which is not limited here. In addition, the central management and control device 101 may be deployed in one or more cloud computing data centers or traditional data centers, and may also be deployed in an edge cloud network together with at least one edge cloud node.
在此说明,本实施例的中心管控设备可以是一台具有资源调度和镜像管理等能力的逻辑设备,这些功能可以部署一台物理机或虚拟机上实现,也可以分散性地部署在多台物理机或虚拟机上。当然,本实施例的中心管控设备也可以是一台或多台具有资源调度和镜像管理等能力的物理设备。本申请实施例并不限定中心管控设备101的实现结构,凡是具有上述能力的设备结构均适用于本申请实施例。It is explained here that the central management and control device of this embodiment can be a logical device with the capabilities of resource scheduling and image management. These functions can be implemented on one physical machine or virtual machine, or distributed in multiple devices. On a physical machine or a virtual machine. Of course, the central management and control device in this embodiment may also be one or more physical devices with capabilities such as resource scheduling and image management. The embodiment of the present application does not limit the implementation structure of the central management and control device 101, and any device structure with the foregoing capabilities is applicable to the embodiment of the present application.
与中心管控设备101相类似,边缘管控设备103也可以是一台逻辑设备,其具有的能力可以部署一台物理机(例如边缘云节点中的资源设备或专有设备)或虚拟机上实现,也可以分散性地部署在多台物理机(例如边缘云节点中的资源设备或专有设备)或虚拟机上。当然,边缘管控设备也可以是一台或多台具有相应能力的物理设备。本申请实施例并不限定边缘管控设备103的实现结构,凡是具有相应能力的设备结构均适用于本申请实施例。Similar to the central management and control device 101, the edge management and control device 103 can also be a logical device, which has the ability to deploy a physical machine (for example, a resource device or a proprietary device in an edge cloud node) or a virtual machine. It can also be deployed on multiple physical machines (such as resource devices or proprietary devices in edge cloud nodes) or virtual machines in a decentralized manner. Of course, the edge control device can also be one or more physical devices with corresponding capabilities. The embodiments of the present application do not limit the implementation structure of the edge management and control device 103, and any device structure with corresponding capabilities is applicable to the embodiments of the present application.
在本实施例中,边缘管控设备103可辅助、配合中心管控设备101对至少一个边缘云节点102进行管控和调度。在边缘管控设备103的协助下,中心管控设备101可以更加方便、高效地对至少一个边缘云节点102进行管控和调度,进而达到充分利用边缘资源的目的。In this embodiment, the edge management and control device 103 can assist and cooperate with the central management and control device 101 to manage and control at least one edge cloud node 102. With the assistance of the edge management and control device 103, the central management and control device 101 can manage and schedule at least one edge cloud node 102 more conveniently and efficiently, thereby achieving the purpose of making full use of edge resources.
其中,中心管控设备101与边缘管控设备103之间可以建立安全、加密的通信通道,并基于该通信通道进行交互。该通信通道包括控制接口和数据接口,则中心管控设备101基于控制接口和数据接口与边缘管控设备103进行控制面和数据面的交互,完成对边缘云节点102的调度和管控。其中,数据接口用于在中心管控设备101与边缘管控设备103 之间进行数据传输。控制接口具备但不限于以下功能:Among them, the central management and control device 101 and the edge management and control device 103 can establish a secure and encrypted communication channel, and interact based on the communication channel. The communication channel includes a control interface and a data interface, and the central management and control device 101 interacts with the edge management and control device 103 on the control plane and the data plane based on the control interface and the data interface to complete the scheduling and management of the edge cloud node 102. The data interface is used for data transmission between the central management and control device 101 and the edge management and control device 103. The control interface has but not limited to the following functions:
1、资源调度能力:中心管控设备101通过具有资源调度能力的控制接口(可简称为资源调度接口)可从多个维度对边缘云节点进行资源调度,边缘云节点是中心管控设备101进行资源调度的对象;1. Resource scheduling capability: The central control device 101 can perform resource scheduling on edge cloud nodes from multiple dimensions through a control interface with resource scheduling capabilities (can be referred to as resource scheduling interface for short). The edge cloud node is the central control device 101 for resource scheduling Object;
2、镜像管理和分发能力:中心管控设备101通过具有镜像管理和分发能力的控制接口(简称为镜像管理接口)可将镜像提供给边缘云节点,这样,边缘云节点可根据收到的镜像创建相应实例,通过实例提供相应云计算服务;2. Image management and distribution capabilities: The central management and control device 101 can provide images to edge cloud nodes through a control interface with image management and distribution capabilities (referred to as image management interfaces), so that the edge cloud nodes can create images based on the received images Corresponding examples, providing corresponding cloud computing services through examples;
3、运维管理能力:中心管控设备101通过具有运维管理能力的控制接口(简称为运维管理接口)对边缘云节点进行运维管理,运维管理包括但不限于:管控边缘云节点中的应用、虚拟化软件等,监控实例的状态、资源使用量以及基础设施等。3. Operation and maintenance management capability: The central control device 101 performs operation and maintenance management on edge cloud nodes through a control interface with operation and maintenance management capabilities (referred to as the operation and maintenance management interface). The operation and maintenance management includes but is not limited to: control edge cloud nodes Application, virtualization software, etc., monitor the status, resource usage and infrastructure of the instance.
与上述控制接口具有的能力相对应,本实施例的中心管控设备101具有但不限于以下功能:Corresponding to the capabilities of the aforementioned control interface, the central management and control device 101 of this embodiment has but not limited to the following functions:
1、可根据服务需求描述信息,例如云计算服务的规格、需要部署云计算服务的区域、运营商网络的分布、网络时延、负载情况、带宽成本、需要的资源类型和/或资源设备的性能要求等,对边缘云节点进行调度;1. Information can be described according to service requirements, such as the specifications of cloud computing services, the areas where cloud computing services need to be deployed, the distribution of operator networks, network delays, load conditions, bandwidth costs, required resource types and/or resource equipment Performance requirements, etc., to schedule edge cloud nodes;
2、可获取云计算服务所需的镜像,将镜像提供给边缘云节点中相应资源设备进行配置安装,以供相应资源设备创建相应实例来提供云计算服务;2. The image required for cloud computing services can be obtained, and the image can be provided to the corresponding resource equipment in the edge cloud node for configuration and installation, so that the corresponding resource equipment can create corresponding instances to provide cloud computing services;
3、可对边缘云节点进行运维管控,包括但不限于:对边缘云节点中应用、虚拟化组件、实例的状态、资源用量和/或基础设施情况等进行管控,实现远程运维、日志管理等。3. Operation and maintenance management and control of edge cloud nodes can be performed, including but not limited to: management and control of applications, virtualized components, instance status, resource usage and/or infrastructure conditions in edge cloud nodes, to achieve remote operation and maintenance, logs Management etc.
除上述功能之外,中心管控设备也可以具有其它一些功能,例如安全保障功能,涉及对中心管控设备的安全、中心管控设备与边缘管控设备之间以及边缘云节点之间的链路安全、边缘云节点的安全;负责维护网络系统中组网信息等。In addition to the above functions, the central control equipment can also have other functions, such as security assurance functions, involving the security of the central control equipment, the link security between the central control equipment and the edge control equipment, and the edge cloud nodes. Security of cloud nodes; responsible for maintaining networking information in the network system.
在网络系统100中,至少一个边缘云节点102可形成资源池,每个边缘云节点102作为调度对象,在中心管控设备101的调度下对外提供各种资源或云计算服务。其中,中心管控设备101与边缘管控设备102相互配合,可以对至少一个边缘云节点102进行资源调度,也可以针对至少一个边缘云节点102进行镜像的管理和分发,当然,也可以既对至少一个边缘云节点102进行资源调度,又为至少一个边缘云节点102提供镜像。除了针对边缘云节点102进行资源调度和镜像管理和分发之外,对边缘云节点102中的实例进行管控也是网络系统100需要解决的一个问题,成功地解决该问题也是“将云计算放到距离终端更近的边缘云节点中处理”的基础。为此,中心管控设备101与边缘管 控设备103相互配合,还可以对至少一个边缘云节点102中的实例进行管控,例如升级、迁移、关停、重启和释放中的至少一种。In the network system 100, at least one edge cloud node 102 can form a resource pool, and each edge cloud node 102 serves as a scheduling object, and provides various resources or cloud computing services externally under the scheduling of the central management and control device 101. Among them, the central management and control device 101 and the edge management and control device 102 cooperate with each other to perform resource scheduling on at least one edge cloud node 102, and can also perform mirror management and distribution for at least one edge cloud node 102. Of course, it can also perform resource scheduling on at least one edge cloud node 102. The edge cloud node 102 performs resource scheduling and provides a mirror image for at least one edge cloud node 102. In addition to resource scheduling and image management and distribution for the edge cloud node 102, the management and control of the instances in the edge cloud node 102 is also a problem that the network system 100 needs to solve. Successfully solving this problem is also "putting cloud computing in the distance The basis of processing in the edge cloud node closer to the terminal. To this end, the central management and control device 101 and the edge management and control device 103 cooperate with each other, and can also manage and control instances in at least one edge cloud node 102, such as at least one of upgrade, migration, shutdown, restart, and release.
可选地,在对待升级实例进行升级的过程中,边缘管控设备103可协助中心管控设备101依据升级策略,利用镜像版本对应的镜像对待升级实例进行升级。例如,中心管控设备101可以将升级策略和镜像版本对应的镜像发送给边缘管控设备103,由边缘管控设备103依据升级策略,利用镜像版本对应的镜像对待升级实例进行升级。进一步,若每个边缘云节点中都部署有边缘管控设备103,则中心管控设备101可以将升级策略和镜像版本对应的镜像发送给待升级实例所属边缘云节点中的边缘管控设备103,由待升级实例所属边缘云节点中的边缘管控设备103依据升级策略,利用镜像版本对应的镜像对待升级实例进行升级。对边缘管控设备103来说,可采用升级策略指示的升级方法,在升级策略指示的升级时间,将镜像版本对应的镜像提供给待升级实例所在资源设备,由该资源设备利用该镜像对待升级实例进行升级。关于对待升级实例进行升级的其它描述,可参见前述实施例中的描述,在此不再赘述。Optionally, in the process of upgrading the instance to be upgraded, the edge management and control device 103 may assist the central management and control device 101 to upgrade the instance to be upgraded by using the mirror corresponding to the mirror version according to the upgrade strategy. For example, the central management and control device 101 may send the upgrade policy and the image corresponding to the image version to the edge management and control device 103, and the edge management and control device 103 uses the image corresponding to the image version to upgrade the instance to be upgraded according to the upgrade policy. Further, if an edge management and control device 103 is deployed in each edge cloud node, the central management and control device 101 can send the upgrade strategy and the image corresponding to the mirror version to the edge management and control device 103 in the edge cloud node to which the instance to be upgraded belongs, and the waiting The edge management and control device 103 in the edge cloud node to which the upgraded instance belongs uses the image corresponding to the image version to upgrade the instance to be upgraded according to the upgrade strategy. For the edge management and control device 103, the upgrade method indicated by the upgrade strategy can be used. At the upgrade time indicated by the upgrade strategy, the mirror corresponding to the mirror version is provided to the resource device where the instance to be upgraded is located, and the resource device uses the mirror to upgrade the instance. Upgrade. For other descriptions of upgrading the instance to be upgraded, reference may be made to the description in the foregoing embodiment, which is not repeated here.
可选地,在对待迁移实例进行迁移的过程中,边缘管控设备103可协助中心管控设备101控制第二边缘云节点中相应资源设备为待迁移实例进行资源预留或分配。其中,中心管控设备101可以根据待迁移实例的资源需求,确定第二边缘云节点中被调度的资源信息,将该资源信息提供给边缘管控设备103,由边缘管控设备103根据该资源信息,控制第二边缘云节点中相应资源设备为待迁移实例进行资源预留或分配。进一步,若第二边缘云节点中部署有边缘管控设备103,则中心管控设备101可以将资源信息提供给第二边缘云节点中的边缘管控设备103,由第二边缘云节点中的边缘管控设备103根据该资源信息,控制第二边缘云节点中相应资源设备为待迁移实例进行资源预留或分配。Optionally, during the migration of the instance to be migrated, the edge management and control device 103 may assist the central management and control device 101 to control the corresponding resource device in the second edge cloud node to reserve or allocate resources for the instance to be migrated. Among them, the central management and control device 101 may determine the scheduled resource information in the second edge cloud node according to the resource requirements of the instance to be migrated, and provide the resource information to the edge management and control device 103, and the edge management and control device 103 controls the resource according to the resource information. The corresponding resource device in the second edge cloud node reserves or allocates resources for the instance to be migrated. Further, if the edge management and control device 103 is deployed in the second edge cloud node, the central management and control device 101 can provide resource information to the edge management and control device 103 in the second edge cloud node, and the edge management and control device in the second edge cloud node 103, according to the resource information, controls the corresponding resource device in the second edge cloud node to reserve or allocate resources for the instance to be migrated.
另外,在对待迁移实例进行迁移的过程中,边缘管控设备103还可协助中心管控设备101将待迁移实例迁移到第二边缘云节点中相应资源设备预留或分配的资源上。中心管控设备101可以向边缘管控设备103发送迁移指令,该迁移指令指示边缘管控设备103获取待迁移实例对应的镜像或实例快照并提供给第二边缘云节点中相应资源设备,供第二边缘云节点中相应资源设备根据该镜像或实例快照在预留或分配的资源上创建待迁移实例。进一步,若第二边缘云节点中部署有边缘管控设备103,则中心管控设备101可以向第二边缘云节点中的边缘管控设备103发送迁移指令,指示第二边缘云节点中的边缘管控设备103获取待迁移实例对应的镜像或快照并提供给第二边缘云节点中相应资源设备,供第二边缘云节点中相应资源设备根据该镜像或快照在预留或分配的资源上创建 待迁移实例。In addition, in the process of migrating the instance to be migrated, the edge management and control device 103 may also assist the central management and control device 101 in migrating the instance to be migrated to the resources reserved or allocated by the corresponding resource device in the second edge cloud node. The central management and control device 101 may send a migration instruction to the edge management and control device 103. The migration instruction instructs the edge management and control device 103 to obtain the image or instance snapshot corresponding to the instance to be migrated and provide it to the corresponding resource device in the second edge cloud node for the second edge cloud The corresponding resource device in the node creates an instance to be migrated on the reserved or allocated resources according to the image or instance snapshot. Further, if the edge management and control device 103 is deployed in the second edge cloud node, the central management and control device 101 may send a migration instruction to the edge management and control device 103 in the second edge cloud node to instruct the edge management and control device 103 in the second edge cloud node The image or snapshot corresponding to the instance to be migrated is obtained and provided to the corresponding resource device in the second edge cloud node for the corresponding resource device in the second edge cloud node to create the instance to be migrated on the reserved or allocated resources according to the image or snapshot.
可选地,如果实例迁移过程使用的是快照,则根据快照的存储方式不同,边缘管控设备103获取快照的方式也会有所不同。如果快照存储在第一边缘云节点内,则要看第一边缘云节点的状态是否可用,如果第一边缘云节点处于不可用状态,则不适合使用快照进行实例迁移,需要改为使用镜像进行实例迁移;如果第一边缘云节点处于可用状态,则边缘管控设备可以从第一边缘云节点中获取快照。如果快照是分散存储在其它边缘云节点中,则获取快照的过程与第一边缘云节点的状态无关,边缘管控设备可以在其它边缘云节点处于可用状态的情况下从其它边缘云节点中获取快照。边缘管控设备在获取快照后,将快照拷贝提供给第二边缘云节点中相应资源设备,供相应资源设备通过快照创建待迁移实例。其中,通过快照创建实例能够恢复打快照时保存的数据。Optionally, if a snapshot is used in the instance migration process, the way in which the edge management and control device 103 obtains the snapshot will be different according to the storage mode of the snapshot. If the snapshot is stored in the first edge cloud node, it depends on whether the state of the first edge cloud node is available. If the first edge cloud node is unavailable, it is not suitable to use the snapshot for instance migration, and you need to use mirroring instead. Instance migration; if the first edge cloud node is in an available state, the edge management and control device can obtain a snapshot from the first edge cloud node. If the snapshots are scattered and stored in other edge cloud nodes, the process of obtaining the snapshot has nothing to do with the state of the first edge cloud node, and the edge management and control device can obtain the snapshot from other edge cloud nodes when the other edge cloud nodes are available. . After obtaining the snapshot, the edge management and control device provides the snapshot copy to the corresponding resource device in the second edge cloud node for the corresponding resource device to create an instance to be migrated through the snapshot. Among them, creating an instance through a snapshot can restore the data saved when the snapshot is taken.
可选地,如果实例迁移过程使用的是镜像,边缘管控设备103在获取镜像时,可以先判断第二边缘云节点中是否存储待迁移实例对应的镜像。如果第二边缘云节点有相应镜像,则边缘管控设备可以直接将第二边缘云节点中的相应镜像提供给第二边缘云节点中相应资源设备,供相应资源设备通过镜像创建出待迁移实例。如果第二边缘云节点没有相应镜像,边缘管控设备可以向中心管控设备请求相应镜像;中心管控设备可以从镜像库中获取镜像提供给边缘管控设备,或者指示边缘管控设备从其他存储有相应镜像的边缘云节点处获取镜像;边缘管控设备在获取相应镜像后提供给第二边缘云节点中相应资源设备,供相应资源设备通过镜像创建出待迁移实例。其中,中心管控设备指示边缘管控设备从其他存储有相应镜像的边缘云节点处获取镜像的过程可参见后续镜像管理与分发相关实施例中的描述,在此不再赘述。Optionally, if a mirror is used in the instance migration process, the edge management and control device 103 may first determine whether the mirror corresponding to the instance to be migrated is stored in the second edge cloud node when acquiring the mirror. If the second edge cloud node has a corresponding image, the edge management and control device can directly provide the corresponding image in the second edge cloud node to the corresponding resource device in the second edge cloud node for the corresponding resource device to create an instance to be migrated through the image. If the second edge cloud node does not have a corresponding image, the edge management and control device can request the corresponding image from the central management and control device; the central management and control device can obtain the image from the mirror library and provide it to the edge management and control device, or instruct the edge management and control device to store the corresponding image from another The image is obtained at the edge cloud node; the edge management and control device provides the corresponding resource device in the second edge cloud node after obtaining the corresponding image, so that the corresponding resource device can create an instance to be migrated through the image. The process of the central management and control device instructing the edge management and control device to obtain images from other edge cloud nodes that store corresponding images may refer to the description in the subsequent image management and distribution related embodiments, which will not be repeated here.
在本申请下述实施例中,将对中心管控设备或者中心管控设备与边缘管控设备配合所实现的其它各种功能展开描述。In the following embodiments of the present application, various other functions implemented by the central control device or the cooperation of the central control device and the edge control device will be described.
资源调度功能:Resource scheduling function:
中心管控设备可对至少一个边缘云进行资源调度,主要是指根据服务需求描述信息,从网络系统100中的至少一个边缘云节点102中确定可被调度的目标边缘云节点及目标边缘云节点中被调度的资源信息;将该资源信息发送给边缘管控设备103,以供边缘管控设备103控制目标边缘云节点中相应资源设备进行资源分配或预留。可选地,目标边缘云节点的数量可以由用户指定,也可以由资源中心管控设备根据服务需求描述信息自主确定,可以是一个,也可以是多个。服务需求描述信息可以由服务需求方直接提交,也可以是从服务需求方提交的服务相关的信息中提取或计算得到的。服务需求方可以是 用户,也可以是应用、物理机或需要某一服务的另一服务等。The central management and control device can perform resource scheduling on at least one edge cloud, which mainly refers to determining the target edge cloud node and target edge cloud node that can be scheduled from at least one edge cloud node 102 in the network system 100 according to service demand description information Scheduled resource information; the resource information is sent to the edge management and control device 103 for the edge management and control device 103 to control the corresponding resource device in the target edge cloud node for resource allocation or reservation. Optionally, the number of target edge cloud nodes can be specified by the user, or can be independently determined by the resource center management and control device according to the service requirement description information, and it can be one or more. The service demand description information can be directly submitted by the service demander, or it can be extracted or calculated from the service-related information submitted by the service demander. The service demander can be a user, an application, a physical machine, or another service that requires a certain service.
这里所描述的资源调度功能主要包括边缘云节点的选择和边缘云节点内的资源调度两个方面,但不限于这两个方面。其中,边缘云节点内部的资源调度具体体现为确定目标边缘云节点中被调度的资源信息和提供资源信息的操作,主要目的是在每一个边缘云节点的粒度上把云计算服务分配到最终的基础资源,例如服务器等资源设备上。其中,中心管控设备可维护各边缘云节点包含的资源的信息,作为资源调度的基础。The resource scheduling function described here mainly includes the selection of edge cloud nodes and the resource scheduling within the edge cloud nodes, but it is not limited to these two aspects. Among them, the internal resource scheduling of the edge cloud node is specifically embodied as the operation of determining the scheduled resource information in the target edge cloud node and providing resource information. The main purpose is to allocate cloud computing services to the final at the granularity of each edge cloud node. Basic resources, such as server and other resource equipment. Among them, the central control equipment can maintain the information of the resources contained in each edge cloud node as the basis for resource scheduling.
可选地,服务需求描述信息中包括边缘云节点选择参数和资源选择参数。边缘云节点选择参数是指选择目标边缘云节点所需的参数;资源选择参数是指选择边缘云节点内被调度的资源所需的信息。基于此,中心管控设备可以从服务需求描述信息中解析出边缘云节点选择参数和资源选择参数;根据边缘云节点选择参数从至少一个边缘云节点中确定被调度的目标边缘云节点,并根据资源选择参数确定目标边缘云节点中被调度的资源信息。Optionally, the service requirement description information includes edge cloud node selection parameters and resource selection parameters. The edge cloud node selection parameter refers to the parameter required to select the target edge cloud node; the resource selection parameter refers to the information required to select the scheduled resource in the edge cloud node. Based on this, the central management and control equipment can parse out the edge cloud node selection parameters and resource selection parameters from the service demand description information; determine the scheduled target edge cloud node from at least one edge cloud node according to the edge cloud node selection parameters, and according to the resource The selection parameters determine the scheduled resource information in the target edge cloud node.
例如,服务需求描述信息中可以包括调度域和/或云计算服务的QoS要求,这些参数可以作为边缘云节点选择参数。其中,调度域指向需要部署云计算服务的区域,这决定了应该被调度的边缘云节点的地理位置。云计算服务的QoS要求可以包括云计算服务对网络时延、负载情况和/或带宽成本等的要求。基于此,中心管控设备可以根据调度域和/或云计算服务的QoS要求,结合至少一个边缘云节点的地理位置和资源剩余量,选择能够满足调度域和/或QoS要求的边缘云节点作为目标边缘云节点。For example, the service requirement description information may include the scheduling domain and/or the QoS requirements of the cloud computing service, and these parameters may be used as edge cloud node selection parameters. Among them, the scheduling domain points to the area where cloud computing services need to be deployed, which determines the geographic location of edge cloud nodes that should be scheduled. The QoS requirements of cloud computing services may include the requirements of cloud computing services on network delay, load conditions, and/or bandwidth costs. Based on this, the central management and control device can select the edge cloud node that can meet the scheduling domain and/or QoS requirements as the target according to the QoS requirements of the scheduling domain and/or cloud computing service, combined with the geographic location and resource remaining amount of at least one edge cloud node Edge cloud node.
例如,中心管控设备可以根据调度域,结合至少一个边缘云节点的地理位置,选择调度域指向的边缘云节点作为目标边缘云节点。或者,中心管控设备还可以根据云计算服务的QoS要求,例如网络时延、负载情况和/或带宽成本等要求,从边缘云节点中选择满足网络时延、负载情况和/或带宽成本要求的边缘云节点作为目标边缘云节点。当然,中心管控设备也可以同时根据调度域和云计算服务的QoS要求,结合至少一个边缘云节点的地理位置和资源剩余量,选择能够同时满足调度域和QoS要求的边缘云节点作为目标边缘云节点。For example, the central management and control device may select the edge cloud node pointed to by the scheduling domain as the target edge cloud node in combination with the geographic location of at least one edge cloud node according to the scheduling domain. Alternatively, the central management and control device can also select the edge cloud node that meets the network delay, load and/or bandwidth cost requirements based on the QoS requirements of the cloud computing service, such as network delay, load conditions, and/or bandwidth cost requirements. The edge cloud node serves as the target edge cloud node. Of course, the central management and control equipment can also select the edge cloud node that can meet the scheduling domain and QoS requirements as the target edge cloud based on the QoS requirements of the scheduling domain and cloud computing services at the same time, combined with the geographic location and remaining amount of resources of at least one edge cloud node. node.
服务需求描述信息中除了包含调度域和/或云计算服务的QoS要求这些信息之外,还可以包括云计算服务所需的资源类型、资源数量和/或资源设备的性能等参数,这些参数可以作为资源选择参数。基于此,中心管控设备在确定目标边缘云节点之后,可以根据资源选择参数确定目标边缘云节点中被调度的资源信息。这里的资源信息可以包括:资源类型、资源数量和/或对资源设备的性能要求等信息,便于边缘管控设备据此控制目标 边缘云节点中相应资源设备进行资源分配或预留。例如,资源类型可以包括但不限于:CPU、GPU等计算资源,内存、硬盘等存储资源,带宽资源等资源类型。以CPU资源为例,资源数量可以是12个CPU、24个CPU等,以内存资源为例,资源数量可以是16G内存、32G内存等;以带宽资源为例,资源数量可以是1M带宽,10M带宽等。In addition to the information about the scheduling domain and/or the QoS requirements of the cloud computing service, the service requirement description information can also include the resource type, the number of resources, and/or the performance of the resource equipment required by the cloud computing service. These parameters can be As a resource selection parameter. Based on this, after determining the target edge cloud node, the central management and control device can determine the scheduled resource information in the target edge cloud node according to the resource selection parameters. The resource information here may include: resource type, resource quantity, and/or performance requirements for resource devices, so that the edge management and control device can control the corresponding resource device in the target edge cloud node to allocate or reserve resources accordingly. For example, resource types may include, but are not limited to: computing resources such as CPU and GPU, storage resources such as memory and hard disk, and resource types such as bandwidth resources. Taking CPU resources as an example, the number of resources can be 12 CPUs, 24 CPUs, etc., taking memory resources as an example, the number of resources can be 16G memory, 32G memory, etc.; taking bandwidth resources as an example, the number of resources can be 1M bandwidth, 10M Bandwidth etc.
可选地,中心管控设备还可以具有算力编排的功能,算力编排是面向相对复杂一些的应用场景,将多个云计算服务绑定在一起作为最小的资源需求单元,这样,在资源调度过程中,可将绑定在一起的多个云计算服务作为整体,为它们选择同一个或几个边缘云节点,由同一个或几个边缘云节点为它们共同提供资源。算力编排完善了资源调度的多样性,增加了资源调度的灵活性,但未对资源调度的整体流程产生影响。Optionally, the central management and control device can also have the function of computing power orchestration. The computing power orchestration is oriented to relatively complex application scenarios. Multiple cloud computing services are bound together as the smallest resource requirement unit. In this way, in the resource scheduling In the process, multiple cloud computing services can be bound together as a whole, and one or several edge cloud nodes can be selected for them, and the same or several edge cloud nodes can provide resources for them together. Computing power scheduling improves the diversity of resource scheduling and increases the flexibility of resource scheduling, but it does not affect the overall process of resource scheduling.
镜像管理与分发功能:Image management and distribution functions:
中心管控设备的镜像管理功能,主要是指对镜像进行管理,并为边缘云节点提供所需的镜像。这样,边缘云节点可根据镜像在相应资源设备上创建实例,进而由所创建的实例为用户提供所需的云计算服务。The image management function of the central control device mainly refers to the management of images and the provision of required images for edge cloud nodes. In this way, the edge cloud node can create an instance on the corresponding resource device according to the image, and then the created instance can provide users with required cloud computing services.
在实际应用中,需要为边缘云节点提供镜像的场景是多种多样的。例如,在用户(例如服务需求方)提交服务需求描述信息的情况下,中心管控设备可以为被调度的目标边缘云节点提供相应镜像。又例如,在边缘云节点上已有实例为用户提供云计算服务的情况下,用户需要进行业务扩容时,可以向中心管控设备提交扩容需求,为了实现扩容目的,需要为目前正为用户提供云计算服务的边缘云节点提供相应镜像,以便该边缘云节点基于镜像创建新的实例,从而达到扩容的目的。为便于描述和区分,在下面描述中,将需要为其提供镜像的边缘云节点记为第三边缘云节点,第三边缘云节点可以是网络系统中的任一边缘云节点,具体视应用场景而定。下面以中心管控设备为第三边缘云节点提供镜像为例,对中心管控设备的镜像管理功能进行说明。In practical applications, there are various scenarios where mirroring needs to be provided for edge cloud nodes. For example, in a case where a user (such as a service demander) submits service demand description information, the central management and control device can provide a corresponding image for the scheduled target edge cloud node. For another example, when there are instances on edge cloud nodes that provide users with cloud computing services, when users need to expand their business, they can submit their expansion requirements to the central control equipment. In order to achieve the purpose of expansion, they need to provide cloud computing services for users. The edge cloud node of the computing service provides a corresponding image, so that the edge cloud node creates a new instance based on the image, so as to achieve the purpose of capacity expansion. For ease of description and distinction, in the following description, the edge cloud node that needs to be mirrored is recorded as the third edge cloud node. The third edge cloud node can be any edge cloud node in the network system, depending on the application scenario. Depends. The following takes the central control device to provide a mirror image for the third edge cloud node as an example to describe the image management function of the central control device.
在需要为第三边缘云节点提供镜像时,中心管控设备可以先确定需要向第三边缘云节点提供的目标镜像;然后,为第三边缘云节点提供目标镜像,以供第三边缘云节点利用目标镜像提供云计算服务。When it is necessary to provide a mirror image for the third edge cloud node, the central control device can first determine the target image that needs to be provided to the third edge cloud node; then, provide the target image for the third edge cloud node for use by the third edge cloud node The target image provides cloud computing services.
在本实施例的网络系统100中,维护有镜像库,该镜像库用于存储系统中的镜像。用户可以选择使用镜像库中的镜像。例如,可以向用户提供一个镜像配置界面,该界面上设有下拉菜单,下拉菜单包括很多可供用户选择的镜像,用户可以选择自己使用的镜像。基于此,在需要为第三边缘云节点提供镜像时,中心管控设备可以从镜像库中获取 第三边缘云节点所需的镜像,然后将镜像提供给第三边缘云节点,并将镜像的使用权限开放给相应用户。可选地,中心管控设备可以直接将目标镜像下发给第三边缘云节点,也可以指示第三边缘云节点到指定存储位置下载目标镜像。In the network system 100 of this embodiment, a mirror library is maintained, and the mirror library is used to store images in the system. Users can choose to use the mirror in the mirror library. For example, the user can be provided with a mirror configuration interface with a drop-down menu. The drop-down menu includes many mirrors that can be selected by the user, and the user can choose the mirror to use. Based on this, when it is necessary to provide a mirror image for the third edge cloud node, the central management and control device can obtain the image required by the third edge cloud node from the mirror library, and then provide the image to the third edge cloud node and use the image The permissions are open to the corresponding users. Optionally, the central management and control device may directly issue the target image to the third edge cloud node, or instruct the third edge cloud node to download the target image to a designated storage location.
除此之外,中心管控设备还可以维护已下发镜像与已下发镜像所在边缘云节点的对应关系。该对应关系中可以包括已下发镜像的标识信息与已发下镜像所在边缘云节点的标识信息。已下发镜像是指中心管控设备已经提供(例如下发)给某个或某些边缘云节点的镜像;已下发镜像所在边缘云节点是指已下发镜像被提供给的边缘云节点。同一镜像可能被提供(例如下发)给一个边缘云节点,也可能被提供(例如下发)给多个边缘云节点。In addition, the central control device can also maintain the correspondence between the issued image and the edge cloud node where the issued image is located. The correspondence relationship may include the identification information of the issued image and the identification information of the edge cloud node where the image has been issued. The issued image refers to the image that the central control device has provided (for example, issued) to one or some edge cloud nodes; the edge cloud node where the issued image is located refers to the edge cloud node to which the issued image is provided. The same image may be provided (for example, distributed) to one edge cloud node, or may be provided (for example, distributed) to multiple edge cloud nodes.
基于所维护的已下发镜像与已下发镜像所在边缘云节点的对应关系,在需要为第三边缘云节点提供镜像时,中心管控设备还可以控制第三边缘云节点从已经具有该镜像的其它边缘云节点获取该镜像,无需直接向第三边缘云节点提供镜像,一定程度上可以减轻中心管控设备的处理负担,在控制合理的情况下,还可以提高镜像的获取效率。Based on the maintained correspondence between the issued image and the edge cloud node where the issued image is located, when the image needs to be provided for the third edge cloud node, the central management and control device can also control the third edge cloud node from the image that already has the image. Other edge cloud nodes acquire the image without directly providing the image to the third edge cloud node, which can reduce the processing burden of the central control device to a certain extent, and can also improve the efficiency of image acquisition under the condition of reasonable control.
详细地,在需要为第三边缘云节点提供镜像时,中心管控设备可以确定需要向第三边缘云节点提供的镜像,为了便于描述和区分,在本申请实施例中,将需要向第三边缘云节点提供的镜像记为目标镜像;根据目标镜像的信息,在所维护的已下发镜像与已下发镜像所在边缘云节点的对应关系中进行匹配;若在该对应关系中匹配到与目标镜像对应的第四边缘云节点,这说明该目标镜像已经被提供给第四边缘云节点,则可以将第四边缘云节点处的目标镜像提供给第三边缘云节点;其中,第四边缘云节点也可以网络系统中的边缘云节点,其数量可以是一个,也可以是多个。对第三边缘云节点来说,可在中心管控设备101的控制下,获取第四边缘云节点处的目标镜像。In detail, when a mirror image needs to be provided to the third edge cloud node, the central management and control device may determine the image that needs to be provided to the third edge cloud node. For ease of description and distinction, in the embodiment of the present application, the third edge cloud node The image provided by the cloud node is recorded as the target image; according to the information of the target image, a match is made in the correspondence between the maintained issued image and the edge cloud node where the issued image is located; if the corresponding relationship is matched with the target Mirror the corresponding fourth edge cloud node, which means that the target image has been provided to the fourth edge cloud node, and the target image at the fourth edge cloud node can be provided to the third edge cloud node; among them, the fourth edge cloud The node can also be an edge cloud node in the network system, and the number can be one or more. For the third edge cloud node, under the control of the central management and control device 101, the target image at the fourth edge cloud node can be obtained.
在此说明,在网络系统100包括边缘管控设备103的情况下,中心管控设备具体可以将第四边缘云节点与目标镜像的信息发送给边缘管控设备;边缘管控设备103根据第四边缘云节点与目标镜像的信息,将第四边缘云节点处的目标镜像提供给第三边缘云节点中的相应资源设备,供相应资源设备根据目标镜像创建可提供云计算服务的实例,进而为服务需求方提供该云计算服务。其中,第四边缘云节点的信息可以是任何能够标识第四边缘云节点的信息,例如可以是第四边缘云节点的ID、名称或地理位置等信息。目标镜像的信息可以是任何能够标识目标镜像的信息,例如可以是目标镜像的ID、名称或编号等。It is explained here that in the case that the network system 100 includes the edge management and control device 103, the central management and control device may specifically send the information of the fourth edge cloud node and the target mirror to the edge management and control device; the edge management and control device 103 may communicate with the fourth edge cloud node according to the The target image information, the target image at the fourth edge cloud node is provided to the corresponding resource device in the third edge cloud node, for the corresponding resource device to create an instance that can provide cloud computing services based on the target image, and then provide it to the service demander The cloud computing service. The information of the fourth edge cloud node may be any information that can identify the fourth edge cloud node, for example, it may be information such as the ID, name, or geographic location of the fourth edge cloud node. The information of the target image can be any information that can identify the target image, such as the ID, name, or number of the target image.
进一步,在第三边缘云节点和第四边缘云节点中均部署有边缘管控设备103的情况 下,则中心管控设备101具体可以将第四边缘云节点与目标镜像的信息发送给第三边缘云节点中的边缘管控设备,供第三边缘云节点中的边缘管控设备通过其与第四边缘云节点中的边缘管控设备之间的通信通道从第四边缘云节点处获取目标镜像并提供给第三边缘云节点中的相应资源设备。对第三边缘云节点中的边缘管控设备103来说,可接收中心管控设备101发送的第四边缘云节点和目标镜像的信息,根据第四边缘云节点与目标镜像的信息,通过其与第四边缘云节点中的边缘管控设备之间的通信通道,从第四边缘云节点处获取目标镜像,将目标镜像提供给第三边缘云节点中相应资源设备,供相应资源设备根据目标镜像创建可提供云计算服务的实例,进而提供云计算服务。Further, in the case where the edge management and control device 103 is deployed in both the third edge cloud node and the fourth edge cloud node, the central management and control device 101 may specifically send information about the fourth edge cloud node and the target image to the third edge cloud The edge management and control device in the node is used for the edge management and control device in the third edge cloud node to obtain the target image from the fourth edge cloud node through the communication channel between it and the edge management and control device in the fourth edge cloud node and provide it to the first edge cloud node. Corresponding resource equipment in the three edge cloud nodes. For the edge management and control device 103 in the third edge cloud node, it can receive the information of the fourth edge cloud node and the target mirror image sent by the central management and control device 101, and according to the information of the fourth edge cloud node and the target mirror image, through it and the first The communication channel between the edge management and control devices in the four-edge cloud node obtains the target image from the fourth edge cloud node, and provides the target image to the corresponding resource device in the third edge cloud node, so that the corresponding resource device can create an image based on the target image. Provide instances of cloud computing services, and then provide cloud computing services.
更进一步,第三边缘云节点中的边缘管控设备103通过其与第四边缘云节点中的边缘管控设备之间的通信通道,从第四边缘云节点获取目标镜像的一种过程包括:第三边缘云节点中的边缘管控设备103通过其与第四边缘云节点中的边缘管控设备之间的通信通道,向第四边缘云节点中的边缘管控设备103发送获取目标镜像的请求,该请求中携带有目标镜像的信息。第四边缘云节点中的边缘管控设备103接收该请求,根据该请求中携带的目标镜像的信息,判断第四边缘云节点中是否存在目标镜像,在第四边缘云节点中存在目标镜像的情况下,通过其与第三边缘云节点中边缘管控设备103之间的通信通道,将目标镜像返回给第三边缘云节点中的边缘管控设备103,或者,将目标镜像在第四边缘云节点中的存储地址返回给第三边缘云节点中的边缘管控设备103。第三边缘云节点中的边缘管控设备103接收第四边缘云节点中的边缘管控设备103返回的目标镜像,或者接收第四边缘云节点中的边缘管控设备103返回的目标镜像在第四边缘云节点中的存储地址,根据该存储地址读取或下载目标镜像。Furthermore, the edge management and control device 103 in the third edge cloud node obtains the target image from the fourth edge cloud node through the communication channel between it and the edge management and control device in the fourth edge cloud node. The process includes: third The edge management and control device 103 in the edge cloud node sends a request for acquiring the target image to the edge management and control device 103 in the fourth edge cloud node through the communication channel between it and the edge management and control device in the fourth edge cloud node. Carry the information of the target image. The edge management and control device 103 in the fourth edge cloud node receives the request, and determines whether the target image exists in the fourth edge cloud node according to the target image information carried in the request, and whether the target image exists in the fourth edge cloud node Next, through the communication channel between it and the edge management and control device 103 in the third edge cloud node, the target image is returned to the edge management and control device 103 in the third edge cloud node, or the target image is mirrored in the fourth edge cloud node The storage address of is returned to the edge management and control device 103 in the third edge cloud node. The edge management and control device 103 in the third edge cloud node receives the target image returned by the edge management and control device 103 in the fourth edge cloud node, or receives the target image returned by the edge management and control device 103 in the fourth edge cloud node in the fourth edge cloud. The storage address in the node, read or download the target image according to the storage address.
值得说明的是,第三边缘云节点中的边缘管控设备103与第四边缘云节点中的边缘管控设备103可以自行建立通信通道,也可以在中心管控设备101的控制下建立通道。可选地,中心管控设备还可以控制不同边缘管控设备之间建立通信通道,并负责维护边缘管控设备之间已有通信通道的信息,例如可以维护哪些边缘管控设备之间已经建立通信通道,通信通道何时建立,通信通道的状态,保持时长等信息。基于此,中心管控设备在确定目标镜像已经被提供给第四边缘云节点之后,且在将第四边缘云节点和目标镜像的信息提供给第三边缘云节点中的边缘管控设备之前,还可以根据所维护的边缘管控设备之间已有通信通道的信息,判断第三边缘云节点中的边缘管控设备与第四边缘云节点中的边缘管控设备之间是否已经存在通信通道;若判断结果为否,即第三边缘云节点中的边缘管控设备与第四边缘云节点中的边缘管控设备之间尚不存在通信通道,则可以 控制第三边缘云节点中的边缘管控设备和第四边缘云节点中的边缘管控设备建立通信通道,以便于第三边缘云节点中的边缘管控设备能够通过该通信通道从第四边缘云节点处获取目标镜像。并且,在第三边缘云节点中的边缘管控设备与第四边缘云节点中的边缘管控设备建立通信通道之后,中心管控设备将第四边缘云节点和目标镜像的信息提供给第三边缘云节点中的边缘管控设备。当然,若判断结果为是,即第三边缘云节点中的边缘管控设备与第四边缘云节点中的边缘管控设备之间已经存在通信通道,则可以直接将第四边缘云节点和目标镜像的信息提供给第三边缘云节点中的边缘管控设备。It is worth noting that the edge management and control device 103 in the third edge cloud node and the edge management and control device 103 in the fourth edge cloud node may establish a communication channel by themselves, or may establish a channel under the control of the central management and control device 101. Optionally, the central management and control device can also control the establishment of communication channels between different edge management and control devices, and is responsible for maintaining the information of the existing communication channels between the edge management and control devices, for example, which edge management and control devices have established communication channels and communication When the channel is established, the status of the communication channel, and the retention time information. Based on this, after the central management and control device determines that the target image has been provided to the fourth edge cloud node, and before providing the information of the fourth edge cloud node and the target image to the edge management and control device in the third edge cloud node, it can also According to the information of the existing communication channel between the maintained edge management and control devices, determine whether there is a communication channel between the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node; if the judgment result is No, that is, there is no communication channel between the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node, you can control the edge management and control device in the third edge cloud node and the fourth edge cloud The edge management and control device in the node establishes a communication channel, so that the edge management and control device in the third edge cloud node can obtain the target image from the fourth edge cloud node through the communication channel. In addition, after the edge management and control device in the third edge cloud node establishes a communication channel with the edge management and control device in the fourth edge cloud node, the central management and control device provides the fourth edge cloud node and target image information to the third edge cloud node Edge control equipment in China. Of course, if the judgment result is yes, that is, there is already a communication channel between the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node, you can directly mirror the fourth edge cloud node and the target The information is provided to the edge management and control device in the third edge cloud node.
值得说明的是,中心管控设备也可以在将第四边缘云节点和目标镜像的信息提供给第三边缘云节点中的边缘管控设备之后,根据所维护的边缘管控设备之间已有通信通道的信息,判断第三边缘云节点中的边缘管控设备与第四边缘云节点中的边缘管控设备之间是否已经存在通信通道;若判断结果为否,即第三边缘云节点中的边缘管控设备与第四边缘云节点中的边缘管控设备之间尚不存在通信通道,则可以控制第三边缘云节点中的边缘管控设备和第四边缘云节点中的边缘管控设备建立通信通道,以便于第三边缘云节点中的边缘管控设备能够通过该通信通道从第四边缘云节点处获取目标镜像。It is worth noting that the central management and control device can also provide the information of the fourth edge cloud node and the target image to the edge management and control device in the third edge cloud node according to the existing communication channel between the maintained edge management and control devices. Information, determine whether there is a communication channel between the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node; if the judgment result is no, that is, the edge management and control device in the third edge cloud node and If there is no communication channel between the edge management and control devices in the fourth edge cloud node, the edge management and control device in the third edge cloud node and the edge management and control device in the fourth edge cloud node can be controlled to establish a communication channel to facilitate the third The edge management and control device in the edge cloud node can obtain the target image from the fourth edge cloud node through the communication channel.
在一些可选实施例中,为了保证第三边缘云节点获取目标镜像的效率,中心管控设备在将第四边缘云节点处的目标镜像提供给第三边缘云节点之前,还可以根据第四边缘云节点的属性,判断第四边缘云节点是否适合为第三边缘云节点提供目标镜像;若判断结果为是,即第四边缘云节点适合为第三边缘云节点提供目标镜像,则可以将第四边缘云节点处的目标镜像提供给第三边缘云节点;若判断结果为否,则可以从镜像库中获取目标镜像并将目标镜像提供给第三边缘云节点。In some optional embodiments, in order to ensure the efficiency of obtaining the target image by the third edge cloud node, the central management and control device may also provide the target image at the fourth edge cloud node to the third edge cloud node according to the fourth edge cloud node. The attribute of the cloud node to determine whether the fourth edge cloud node is suitable for providing the target image for the third edge cloud node; if the judgment result is yes, that is, the fourth edge cloud node is suitable for providing the target image for the third edge cloud node, the first The target image at the fourth edge cloud node is provided to the third edge cloud node; if the judgment result is no, the target image can be obtained from the image library and the target image is provided to the third edge cloud node.
值得说明的是,根据应用场景和应用需求的不同,可以结合第四边缘云节点的不同属性,从不同角度判断第四边缘云节点是否适合为第三边缘云节点提供目标镜像。下面举例说明:It is worth noting that according to different application scenarios and application requirements, different attributes of the fourth edge cloud node can be combined to determine from different angles whether the fourth edge cloud node is suitable for providing a target image for the third edge cloud node. The following example illustrates:
例如,可以结合第四边缘云节点所属的运营商,判断第四边缘云节点所属的运营商与第一边缘云节点所属的运营商是否相同;若判断结果为是,说明第四边缘云节点与第一边缘云节点是同运营商下的边缘云节点,两者可以进行数据传输,且数据传输速率相对于跨运营商的数据传输速率要快,适合为第一边缘云节点提供目标镜像。For example, it can be combined with the operator to which the fourth edge cloud node belongs to determine whether the operator to which the fourth edge cloud node belongs is the same as the operator to which the first edge cloud node belongs; if the judgment result is yes, it means that the fourth edge cloud node is The first edge cloud node is an edge cloud node under the same operator. The two can perform data transmission, and the data transmission rate is faster than the cross-operator data transmission rate, which is suitable for providing target mirroring for the first edge cloud node.
又例如,可以结合第四边缘云节点的位置属性,判断第四边缘云节点到第三边缘云节点之间的距离是否小于设定的距离阈值;若判断结果为是,说明第四边缘云节点与第 三边缘云节点相距较近,适合为第三边缘云节点提供目标镜像,这样由与第三边缘云节点相距较近的第四边缘云节点为第三边缘云节点提供镜像,便于第三边缘云节点快速获取到镜像,提高效率。第四边缘云节点到第三边缘云节点之间的距离可以是两个边缘云节点之间的平均距离,也可以是两个边缘云节点的中心之间的距离,还可以是两个边缘云节点相距最近的外边缘之间的距离等,可根据需求灵活定义。For another example, the location attribute of the fourth edge cloud node can be combined to determine whether the distance between the fourth edge cloud node and the third edge cloud node is less than the set distance threshold; if the judgment result is yes, the fourth edge cloud node Close to the third edge cloud node, it is suitable to provide the target image for the third edge cloud node. In this way, the fourth edge cloud node that is closer to the third edge cloud node provides a mirror image for the third edge cloud node, which is convenient for the third edge cloud node. The edge cloud node quickly obtains the image to improve efficiency. The distance between the fourth edge cloud node and the third edge cloud node can be the average distance between two edge cloud nodes, or the distance between the centers of two edge cloud nodes, or two edge clouds The distance between nodes and the nearest outer edge can be flexibly defined according to requirements.
又例如,可以结合第四边缘云节点的带宽属性,判断第四边缘云节点的可用带宽是否大于设定带宽阈值;若判断结果为是,说明第四边缘云节点的带宽资源比较充裕,适合为第三边缘云节点提供目标镜像,这样由带宽资源比较充裕的第四边缘云节点为第三边缘云节点提供镜像,可保证镜像的传输速率,便于第三边缘云节点快速获取到镜像,提高效率。For another example, the bandwidth attribute of the fourth edge cloud node can be combined to determine whether the available bandwidth of the fourth edge cloud node is greater than the set bandwidth threshold; if the judgment result is yes, it means that the bandwidth resource of the fourth edge cloud node is relatively abundant, and it is suitable for The third edge cloud node provides the target image, so that the fourth edge cloud node with sufficient bandwidth resources provides the image for the third edge cloud node, which can ensure the transmission rate of the image, and facilitate the third edge cloud node to quickly obtain the image and improve efficiency .
又例如,可以结合第四边缘云节点的负载属性,判断第四边缘云节点的负载量是否小于设定负载量阈值;若判断结果为是,说明第四边缘云节点的负载较轻,适合为第三边缘云节点提供目标镜像,这样由负载较轻的第四边缘云节点为第三边缘云节点提供镜像,一方面可实现负载均衡,另一方面也便于第三边缘云节点快速获取到镜像,提高效率。For another example, it can be combined with the load attribute of the fourth edge cloud node to determine whether the load of the fourth edge cloud node is less than the set load threshold; if the judgment result is yes, it means that the load of the fourth edge cloud node is lighter, and it is suitable for The third edge cloud node provides the target image, so that the lighter-loaded fourth edge cloud node provides the image for the third edge cloud node. On the one hand, it can achieve load balancing, and on the other hand, it is also convenient for the third edge cloud node to quickly obtain the image. ,Improve efficiency.
值得说明的是,上面列举的几种方式可以择一使用,也可以以任意组合方式组合使用,关于组合使用的情况,对此不做过多描述。It is worth noting that the several methods listed above can be used alternatively or combined in any combination. Regarding the combined use, this will not be described too much.
进一步,在第四边缘云节点为多个的情况下,可以结合第四边缘云节点的多个属性,对上述几种方式进行组合使用,进而从中选择出适合为第一边缘云节点提供目标镜像的第四边缘云节点。例如,若第四边缘云节点为多个,则可以结合多个第四边缘云节点所属的运营商,从多个第四边缘云节点中选择出与第一边缘云节点属于同一运营商的第四边缘云节点;进而,若选择出的第四边缘云节点仍为多个,则可以进一步根据选择出的第四边缘云节点的负载量,从中选择负载量最小或低于设定负载量阈值的第四边缘云节点,为第一边缘云节点提供目标镜像。Further, in the case of multiple fourth edge cloud nodes, multiple attributes of the fourth edge cloud node can be combined to use the above several methods in combination, and then select a target image suitable for the first edge cloud node. The fourth edge cloud node. For example, if there are multiple fourth edge cloud nodes, the operators to which the multiple fourth edge cloud nodes belong can be combined to select the first edge cloud node belonging to the same operator as the first edge cloud node from the multiple fourth edge cloud nodes. Four edge cloud nodes; furthermore, if there are still multiple selected fourth edge cloud nodes, you can further select the minimum load or lower than the set load threshold according to the load of the selected fourth edge cloud node The fourth edge cloud node provides a target image for the first edge cloud node.
在一些可选实施例中,有可能已经向第三边缘云节点提供过目标镜像,例如,在业务扩容场景中,在目前正在为服务需求方提供云计算服务的边缘云节点中创建新实例需要使用的镜像与之前已有实例使用的镜像相同,如果该边缘云节点中还保存有之前已有实例使用的镜像,则可以不用重复为该边缘云节点提供镜像。针对这种情况,为了节约资源,中心管控设备在将第四边缘云节点处的目标镜像提供给第三边缘云节点之前,可 以判断所维护的已下发镜像与已下发镜像所在边缘云节点的对应关系中是否包括第三边缘云节点;若判断结果为是,表明已经向第三边缘云节点提供过目标镜像,且第三边缘云节点中仍保存有目标镜像,则可以将目标镜像的信息提供给第三边缘云节点,供第三边缘云节点读取其中存储的目标镜像,无需再次传输目标镜像,这可节约传输目标镜像消耗的网络资源等;若判断结果为否,表明尚未向第三边缘云节点提供过目标镜像,或者第三边缘云节点中已经不存在目标镜像,则可以将第四边缘云节点处的目标镜像提供给第三边缘云节点。其中,在第三边缘云节点中部署有边缘管控设备的情况下,若中心管控设备判断出所维护的已下发镜像与已下发镜像所在边缘云节点的对应关系中包含目标镜像,可以将目标镜像的信息提供给第三边缘云节点中的边缘管控设备,第三边缘云节点中的边缘管控设备根据目标镜像的信息可以从第三边缘云节点中存储镜像的空间中获取目标镜像,将目标镜像提供给第三边缘云节点中的相应资源设备,以供相应资源设备根据目标镜像创建可提供云计算服务的实例。In some optional embodiments, the target image may have been provided to the third edge cloud node. For example, in a business expansion scenario, it is necessary to create a new instance in the edge cloud node that is currently providing cloud computing services to the service demander. The image used is the same as the image used by the previous instance. If the edge cloud node still stores the image used by the previous instance, there is no need to repeatedly provide the image for the edge cloud node. In view of this situation, in order to save resources, the central management and control device can determine the maintained issued image and the edge cloud node where the issued image is located before providing the target image at the fourth edge cloud node to the third edge cloud node Whether the third edge cloud node is included in the corresponding relationship; if the judgment result is yes, it indicates that the target image has been provided to the third edge cloud node, and the target image is still stored in the third edge cloud node, then the target image can be The information is provided to the third edge cloud node for the third edge cloud node to read the target image stored in it, without the need to transmit the target image again, which can save network resources consumed by the transmission of the target image; if the judgment result is no, it indicates that it has not been sent to The third edge cloud node has provided the target image, or the target image no longer exists in the third edge cloud node, the target image at the fourth edge cloud node may be provided to the third edge cloud node. Among them, when the edge management and control device is deployed in the third edge cloud node, if the central management and control device determines that the corresponding relationship between the maintained issued image and the edge cloud node where the issued image contains the target image, the target image The information of the image is provided to the edge management and control device in the third edge cloud node, and the edge management and control device in the third edge cloud node can obtain the target image from the storage space of the image in the third edge cloud node according to the information of the target image. The image is provided to the corresponding resource device in the third edge cloud node, so that the corresponding resource device can create an instance that can provide cloud computing services according to the target image.
进一步可选地,同一边缘云节点有可能为同一用户或不同用户提供多种云计算服务,也就可能接收到多个镜像,这些镜像会被存储在边缘云节点中。边缘云节点可以提供一定存储空间,用来存储镜像。考虑到边缘云节点中镜像的存储空间有一定限制,为了能有足够的存储空间存储新接收的镜像,边缘云节点需要对本地存储的镜像进行淘汰处理。在本实施例中,中心管控设备负责为边缘云节点提供镜像的淘汰策略。中心管控设备可以生成镜像的淘汰策略,将该淘汰策略下发至各边缘云节点,各边缘云节点按照该淘汰策略对所存储的镜像进行淘汰处理。其中,在网络系统中包括边缘管控设备的情况下,中心管控设备可以将淘汰策略下发至边缘管控设备,由边缘管控设备根据淘汰策略对各边缘云节点中存储的镜像进行淘汰处理。进一步,在每个边缘云节点中均部署有边缘管控设备的情况下,中心管控设备可以将淘汰策略下发给各边缘云节点中的边缘管控设备,由各边缘云节点中的边缘管控设备根据淘汰策略对其所属边缘云节点中存储的镜像进行淘汰处理。Further optionally, the same edge cloud node may provide multiple cloud computing services for the same user or different users, and may receive multiple images, and these images will be stored in the edge cloud node. Edge cloud nodes can provide a certain amount of storage space for storing images. Considering that the storage space of the image in the edge cloud node is limited, in order to have enough storage space to store the newly received image, the edge cloud node needs to eliminate the locally stored image. In this embodiment, the central management and control device is responsible for providing a mirroring elimination strategy for edge cloud nodes. The central management and control device can generate the elimination strategy of the image, deliver the elimination strategy to each edge cloud node, and each edge cloud node performs elimination processing on the stored image according to the elimination strategy. Among them, in the case that the network system includes edge management and control equipment, the central management and control equipment can issue the elimination strategy to the edge management and control equipment, and the edge management and control equipment eliminates the images stored in each edge cloud node according to the elimination strategy. Furthermore, in the case where edge management and control equipment is deployed in each edge cloud node, the central management and control equipment can issue the elimination strategy to the edge management and control equipment in each edge cloud node, and the edge management and control equipment in each edge cloud node will be The elimination strategy eliminates the image stored in the edge cloud node to which it belongs.
可选地,淘汰策略可以是接收时间最早淘汰策略,即按照镜像的接收时间,优先淘汰接收时间最早的镜像。或者,淘汰策略可以是使用频次最少淘汰策略,即按照镜像的使用频率,优先淘汰使用频次最少的镜像。或者,淘汰策略可以是占用资源最大淘汰策略,即按照镜像占用的存储空间的大小,优先淘汰占用存储空间最大的镜像。Optionally, the elimination strategy may be an elimination strategy with the earliest receiving time, that is, according to the receiving time of the image, the image with the earliest receiving time is preferentially eliminated. Alternatively, the elimination strategy may be the elimination strategy with the least frequency of use, that is, the image with the least frequency of use is preferentially eliminated according to the frequency of use of the image. Alternatively, the elimination strategy may be the elimination strategy with the largest resource occupation, that is, according to the size of the storage space occupied by the image, the image with the largest storage space is first eliminated.
对边缘云节点来说,可以定期按照上述淘汰策略,对本节点中存储的镜像进行淘汰处理;或者,也可以在每当需要接收或获取新的镜像时,判断本节点中是否有足够存储 空间存储新的镜像,并在本节点中没有足够存储空间时,按照上述淘汰策略,对本节点中存储的镜像进行淘汰处理,以便于存储新的镜像。以第三边缘云节点需要从第四边缘云节点获取目标镜像为例,在第三边缘云节点中的边缘管控设备从第四边缘云节点处获取目标镜像之前,第三边缘云节点中的边缘管控设备可以判断第三边缘云节点中是否有足够存储空间存储目标镜像;若第三边缘云节点中没有足够存储空间,则根据淘汰策略,对第三边缘云节点中存储的镜像进行淘汰处理,以便有足够存储空间存储目标镜像。可选地,若第三边缘云节点中有足够存储空间,则可以暂时不对第三边缘云节点中存储的镜像进行淘汰处理。For edge cloud nodes, the image stored in the node can be eliminated regularly according to the above elimination strategy; or, whenever a new image needs to be received or acquired, it can be judged whether there is enough storage space in the node for storage If there is not enough storage space in the current node for the new image, the image stored in the node is eliminated according to the above elimination strategy, so as to store the new image. Taking the third edge cloud node to obtain the target image from the fourth edge cloud node as an example, before the edge management and control device in the third edge cloud node obtains the target image from the fourth edge cloud node, the edge of the third edge cloud node The management and control device can determine whether there is enough storage space in the third edge cloud node to store the target image; if there is not enough storage space in the third edge cloud node, it will eliminate the image stored in the third edge cloud node according to the elimination strategy. In order to have enough storage space to store the target image. Optionally, if there is enough storage space in the third edge cloud node, the image stored in the third edge cloud node may not be eliminated temporarily.
可选地,如图1c所示,该网络系统100还包括:镜像构建设备104。该镜像构建设备104可部署在一个或多个边缘云节点中,主要负责应用镜像的构建、验证等。镜像构建设备104可以提供边缘云环境,可以构建与边缘云环境适配的镜像,也可以验证镜像是否与边缘云环境适配,对于与边缘云环境不适配的镜像可以重构,或输出不适配的提示信息等。基于镜像构建设备104,用户可以向网络系统100中新增镜像。Optionally, as shown in FIG. 1c, the network system 100 further includes: an image construction device 104. The image construction device 104 may be deployed in one or more edge cloud nodes, and is mainly responsible for the construction and verification of application images. The image construction device 104 can provide an edge cloud environment, can build an image that is compatible with the edge cloud environment, and can also verify whether the image is compatible with the edge cloud environment. The image that is not compatible with the edge cloud environment can be reconstructed or output Adapted prompt information, etc. Based on the image building device 104, the user can add a new image to the network system 100.
在一种新增镜像的可选实施方式中,用户(例如服务需求方)可以向中心管控设备提交新增镜像的第三请求,该第三请求中包括镜像构建信息;中心管控设备向镜像构建设备发送构建请求,该构建请求包括镜像构建信息;镜像构建设备接收到构建请求之后,从中获取镜像构建信息,根据镜像构建信息构建与边缘云环境适配的镜像,将所构建的镜像返回给中心管控设备;中心管控设备接收镜像构建设备返回的新构建的镜像,并添加到镜像库中,不断丰富镜像库。In an optional implementation of adding a new image, a user (such as a service demander) can submit a third request for adding a new image to the central management and control device, and the third request includes image construction information; The device sends a construction request, which includes image construction information; after receiving the construction request, the image construction device obtains the image construction information from it, constructs an image adapted to the edge cloud environment based on the image construction information, and returns the constructed image to the center Control equipment; the central control equipment receives the newly constructed mirror image returned by the mirror construction equipment and adds it to the mirror library to continuously enrich the mirror library.
在另一种新增镜像的可选实施方式中,可以面向用户(例如服务需求方)提供一种镜像的规则和规范,让用户自己制作或生成镜像,用户生成或制作的镜像需要符合边缘云环境的安全、规范等相关要求。用户在制作或生成镜像之后,可以向中心管控设备发送新增镜像的第四请求,该第四请求中包括待新增镜像,该新增镜像是指用户制作或生成的镜像,本实施例并不限定用户制作或生成镜像的方式。中心管控设备接收第四请求,从第四请求中获取待新增镜像,将待新增镜像发送给镜像构建设备;镜像构建设备将待新增镜像与边缘云环境进行适配;若待新增镜像与边缘云环境适配,镜像构建设备向中心管控设备返回待新增镜像与边缘云环境适配的消息;若待新增镜像与边缘云环境不适配,镜像构建设备向中心管控设备返回待新增镜像与边缘云环境不适配的消息。In another optional implementation of newly-added mirroring, a mirroring rule and specification can be provided to users (such as service demanders), allowing users to make or generate mirrors by themselves. The mirrors generated or made by users need to conform to the edge cloud Environmental safety, regulations and other related requirements. After making or generating the image, the user can send a fourth request for adding a new image to the central control device. The fourth request includes the image to be added. The new image refers to the image made or generated by the user. This embodiment does not It does not limit the way users make or generate images. The central control device receives the fourth request, obtains the image to be added from the fourth request, and sends the image to be added to the image construction device; the image construction device adapts the image to be added to the edge cloud environment; if it is to be added The image is adapted to the edge cloud environment, and the image construction device returns a message to the central control device that the new image is adapted to the edge cloud environment; if the new image is not compatible with the edge cloud environment, the image construction device returns to the central control device A message that the new image is not compatible with the edge cloud environment.
对中心管控设备来说,若接收到镜像构建设备返回的待新增镜像与边缘云环境适配 的消息,则将待新增镜像添加至镜像库中;若接收到镜像构建服务镜像构建设备返回的待新增镜像与边缘云环境不适配的消息,或者通知用户对待新增镜像进行重构后重新提交,或者通知用户提供待新增镜像的重构方法,以供镜像构建服务镜像构建设备按照该重构方法将待新增镜像重构成与边缘云环境适配的镜像。若用户提供待新增镜像的重构方法,则中心管控设备可以将该重构方法提供给镜像构建设备,镜像构建设备按照该重构方法对待新增镜像进行重构,使之与边缘云环境相适配,并将重构后的镜像返回给中心管控设备;中心管控设备接收重构后的镜像并添加到镜像库中。For the central control equipment, if it receives a message from the mirror construction device that the new image to be added is adapted to the edge cloud environment, it will add the new mirror to the mirror library; if the mirror construction service is received, the mirror construction device returns The message that the new image to be added is not compatible with the edge cloud environment, or informs the user to re-submit the new image after reconstruction, or informs the user to provide the reconstruction method of the new image for the image building service image building equipment According to the reconstruction method, the newly added image is reconstructed into an image adapted to the edge cloud environment. If the user provides a reconstruction method for the newly added image, the central control device can provide the reconstruction method to the image construction device, and the image construction device reconstructs the newly added image according to the reconstruction method to make it compatible with the edge cloud environment It adapts and returns the reconstructed image to the central control device; the central control device receives the reconstructed image and adds it to the mirror library.
在此说明,镜像构建设备104可以是一台具有镜像构建、验证等功能的逻辑设备(例如可以是一个可提供镜像构建环境和资源,具备应用部署、镜像验证等功能的实例),这些功能可以部署一台物理机或虚拟机上实现,也可以分散性地部署在多台物理机或虚拟机上。当然,本实施例的镜像构建设备104也可以是一台或多台具有镜像构建、验证等功能的物理设备。本申请实施例并不限定镜像构建设备的实现结构,凡是具有上述功能的设备结构均适用于本申请实施例。It is explained here that the image construction device 104 may be a logical device with functions such as image construction and verification (for example, it may be an instance that can provide image construction environment and resources, and has functions such as application deployment and image verification). These functions can be It can be implemented on one physical machine or virtual machine, or it can be distributed on multiple physical machines or virtual machines. Of course, the image construction device 104 of this embodiment may also be one or more physical devices with functions such as image construction and verification. The embodiments of this application do not limit the implementation structure of the image construction device, and any device structure with the above-mentioned functions is applicable to the embodiments of this application.
在本申请实施例中,不仅可以向镜像库中新增镜像,也可以删除没有用或长时间不用的镜像,以节约存储空间。例如,中心管控设备可以定期或实时地统计镜像库中各镜像的使用频次,将使用频次小于频次阈值的镜像作为待删除镜像,并执行镜像删除流程将其删除。又例如,中心管控设备也可以接收用户(例如服务需求方)提交的镜像删除请求,将该镜像删除请求指示删除的镜像作为待删除镜像,并执行镜像删除流程将其删除。其中,镜像删除请求中可以携带需要删除的镜像的信息,例如ID、名称或编号等。In the embodiments of the present application, not only mirrors can be added to the mirror library, but also mirrors that are not used or have not been used for a long time can be deleted to save storage space. For example, the central management and control device can count the usage frequency of each mirror in the mirror library regularly or in real time, use mirrors with a frequency less than the frequency threshold as the mirrors to be deleted, and execute the mirror deletion process to delete them. For another example, the central management and control device may also receive a mirror deletion request submitted by a user (such as a service demander), use the mirror deleted in the mirror deletion request as a mirror to be deleted, and execute the mirror deletion process to delete it. Wherein, the image deletion request may carry information of the image to be deleted, such as ID, name, or serial number.
对中心管控设备来说,可以采用但不限于上述任一方式确定待删除镜像,在确定待删除镜像之后,一方面可以将待删除镜像从镜像库中删除,另一方面可以指示存储有待删除镜像的边缘云节点将待删除镜像删除。其中,中心管控设备可以根据待删除镜像,在所维护的已下发镜像与已下发镜像所在边缘云节点的对应关系中进行匹配,根据匹配结果确定存储有待删除镜像的边缘云节点。若在该对应关系中匹配到与待删除镜像对应的第五边缘云节点,说明曾经向第五边缘云节点下发过待删除镜像,且第五边缘云节点中仍存储有待删除镜像,于是向第五边缘云节点发送删除指令,该删除指令中携带有待删除镜像的信息,以指示第五边缘云节点将其中存储的待删除镜像删除。第五边缘云节点可能是一个,也可能是多个。For the central control device, any of the above methods can be used to determine the image to be deleted. After the image to be deleted is determined, the image to be deleted can be deleted from the mirror library on the one hand, and the image to be deleted can be indicated to be stored on the other hand. The edge cloud node will delete the image to be deleted. Among them, the central management and control device may match the maintained corresponding relationship between the issued image and the edge cloud node where the issued image is located according to the image to be deleted, and determine the edge cloud node storing the image to be deleted according to the matching result. If the fifth edge cloud node corresponding to the image to be deleted is matched in the corresponding relationship, it means that the image to be deleted has been issued to the fifth edge cloud node, and the image to be deleted is still stored in the fifth edge cloud node, so The fifth edge cloud node sends a deletion instruction, and the deletion instruction carries information about the image to be deleted to instruct the fifth edge cloud node to delete the image to be deleted stored therein. The fifth edge cloud node may be one or multiple.
在此说明,在网络系统100包括边缘管控设备103的情况下,中心管控设备具体可以将删除指令发送给边缘管控设备103;边缘管控设备103接收中心管控设备下发的删 除指令,从该删除指令中获取待删除镜像的信息,根据待删除镜像的信息,判断第五边缘云节点中是否存储有待删除镜像;如果存储有待删除镜像,将第五边缘云节点中的待删除镜像删除。进一步,若第五边缘云节点中部署有边缘管控设备103,则中心管控设备101具体可以将删除指令发送给第五边缘云节点中的边缘管控设备103;第五边缘云节点中的边缘管控设备103接收中心管控设备下发的删除指令,从该删除指令中获取待删除镜像的信息,根据待删除镜像的信息,判断第五边缘云节点中是否存储有待删除镜像;如果存储有待删除镜像,将第五边缘云节点中的存储的待删除镜像删除。It is explained here that when the network system 100 includes the edge management and control device 103, the central management and control device may specifically send a deletion instruction to the edge management and control device 103; the edge management and control device 103 receives the deletion instruction issued by the central management and control device, and then deletes the instruction from the Obtain the information of the image to be deleted in, and determine whether the image to be deleted is stored in the fifth edge cloud node according to the information of the image to be deleted; if the image to be deleted is stored in the storage, delete the image to be deleted in the fifth edge cloud node. Further, if the edge management and control device 103 is deployed in the fifth edge cloud node, the central management and control device 101 may specifically send a deletion instruction to the edge management and control device 103 in the fifth edge cloud node; the edge management and control device in the fifth edge cloud node 103 receives the delete instruction issued by the central management and control device, obtains the information of the image to be deleted from the delete instruction, and determines whether the image to be deleted is stored in the fifth edge cloud node according to the information of the image to be deleted; The image to be deleted stored in the fifth edge cloud node is deleted.
当中心管控设备将待删除镜像从镜像库中删除,且存储有待删除镜像的边缘云节点也将其中存储的待删除镜像删除后,镜像删除流程完成。When the central control device deletes the image to be deleted from the image library, and the edge cloud node storing the image to be deleted also deletes the image to be deleted stored in it, the image deletion process is completed.
对边缘云节点中的资源设备来说,无论是何种方式,在获取镜像后,在中心管控设备101或边缘管控设备103的控制下可通过硬件或软件支持的能力以虚拟化的形式为实例提供计算、网络和存储等资源,对应的镜像会以系统盘的形式挂载到对应的实例。在实例创建完成后,对实例尝试启动,在成功启动对应的实例后,就可以利用这些资源设备的能力提供云计算服务。其中,资源设备在边缘管控设备的控制下为实例提供计算、网络和存储等资源包括:边缘管控设备根据中心管控设备提供的资源模板从目标边缘节云点内分配或预留的资源中申请相关的计算资源、存储资源和/或网络资源;通过调用目标边缘云节点内的计算、存储、网络等执行器进行相关资源的创建动作。其中,资源的创建动作包括:处理存储相关的资源,根据镜像的配置信息及镜像内容创建实例的系统盘,根据资源模板创建对应的数据盘;创建实例依赖的网络资源,例如IP地址、虚拟交换机等;以及结合资源模板创建计算资源。Regarding the resource equipment in the edge cloud node, regardless of the method, after the image is obtained, the capabilities that can be supported by hardware or software under the control of the central control device 101 or the edge control device 103 are in the form of virtualization as an example Provide resources such as computing, network, and storage, and the corresponding image will be mounted to the corresponding instance in the form of a system disk. After the instance is created, try to start the instance. After the corresponding instance is successfully started, the capabilities of these resource devices can be used to provide cloud computing services. Among them, the resource device provides computing, network, and storage resources for the instance under the control of the edge management and control device, including: the edge management and control device applies for related resources from the resources allocated or reserved in the target edge node cloud according to the resource template provided by the central management and control device The computing resources, storage resources and/or network resources of the target edge cloud node are used to create related resources by calling the calculation, storage, network and other executors in the target edge cloud node. Among them, resource creation actions include: processing storage-related resources, creating an instance system disk based on the configuration information and content of the image, creating a corresponding data disk based on the resource template; creating network resources that the instance depends on, such as IP addresses, virtual switches And so on; and combine resource templates to create computing resources.
运维管理功能:Operation and maintenance management functions:
在本申请实施例中,中心管控设备可以在边缘管控设备的协助下,对至少一个边缘云节点进行运维管控。详细地,边缘管控设备可以对至少一个边缘云节点进行运维监控并将运维监控数据上报给中心管控设备,供中心管控设备根据运维监控数据对至少一个边缘云节点进行管控。中心管控设备可以根据边缘管控设备上报的运维监控数据对至少一个边缘云节点进行运维管控。可选地,对边缘管控设备来说,可在中心管控设备的控制下,对至少一个边缘云节点进行运维监控并将运维监控数据上报给中心管控设备,以供中心管控设备根据运维监控数据对至少一个边缘云节点进行运维管控。或者,边缘管控设备可以根据定时任务,周期性地对至少一个边缘云节点进行运维监控并将运维监控 数据上报给中心管控设备。无论是在哪种实施方式中,边缘管控设备主要发挥监控、数据采集、上报等功能,而运维决策由中心管控设备决定。In the embodiment of the present application, the central management and control device may perform operation and maintenance management and control on at least one edge cloud node with the assistance of the edge management and control device. In detail, the edge management and control device may perform operation and maintenance monitoring on at least one edge cloud node and report the operation and maintenance monitoring data to the central management and control device, so that the central management and control device can manage and control at least one edge cloud node according to the operation and maintenance monitoring data. The central management and control device can perform operation and maintenance control on at least one edge cloud node according to the operation and maintenance monitoring data reported by the edge management and control device. Optionally, for the edge management and control equipment, the operation and maintenance monitoring of at least one edge cloud node can be carried out under the control of the central management and control equipment and the operation and maintenance monitoring data is reported to the central management and control equipment for the central management and control equipment according to the operation and maintenance The monitoring data controls the operation and maintenance of at least one edge cloud node. Alternatively, the edge management and control device may periodically perform operation and maintenance monitoring on at least one edge cloud node according to a timed task and report the operation and maintenance monitoring data to the central management and control device. In either implementation mode, the edge management and control equipment mainly performs functions such as monitoring, data collection, and reporting, while the operation and maintenance decisions are determined by the central management and control equipment.
其中,中心管控设备控制边缘管控设备对至少一个边缘云节点进行运维监控,可以采用但不限于以下可选实施方式:Among them, the central management and control device controls the edge management and control device to perform operation and maintenance monitoring of at least one edge cloud node, which can adopt but not limited to the following optional implementation manners:
在一可选实施方式中,中心管控设备可以向边缘管控设备发送第一类运维监控指令,以指示边缘管控设备从至少一个运维维度对至少一个边缘云节点进行运维监控并将至少一个运维维度上的运维监控数据上报给中心管控设备。第一类运维监控指令是一种指示边缘管控设备从至少一个运维维度对至少一个边缘云节点进行运维监控并上报至少一个运维维度上的运维监控数据的监控指令。对边缘管控设备来说,可以接收中心管控设备发送的第一类运维监控指令,根据第一类运维监控指令,从至少一个运维维度上对至少一个边缘云节点进行运维监控,并将至少一个运维维度上的运维监控数据上报给中心管控设备。中心管控设备根据边缘管控设备上报的至少一个运维维度上的运维监控数据对至少一个边缘云节点进行运维管控。值得说明的是,至少一个运维维度可根据应用需求灵活设定,并预置到边缘管控设备和中心管控设备中。关于运维维度的举例参见后续实施例。In an optional embodiment, the central management and control device may send the first type of operation and maintenance monitoring instruction to the edge management and control device to instruct the edge management and control device to perform operation and maintenance monitoring on at least one edge cloud node from at least one operation and maintenance dimension and to The operation and maintenance monitoring data in the operation and maintenance dimension is reported to the central control equipment. The first type of operation and maintenance monitoring instruction is a monitoring instruction that instructs the edge management and control device to perform operation and maintenance monitoring of at least one edge cloud node from at least one operation and maintenance dimension and report operation and maintenance monitoring data in at least one operation and maintenance dimension. For edge management and control equipment, it can receive the first type of operation and maintenance monitoring instructions sent by the central management and control equipment, and according to the first type of operation and maintenance monitoring instructions, perform operation and maintenance monitoring of at least one edge cloud node from at least one operation and maintenance dimension, and Report the operation and maintenance monitoring data on at least one operation and maintenance dimension to the central control equipment. The central management and control device controls the operation and maintenance of at least one edge cloud node according to the operation and maintenance monitoring data in at least one operation and maintenance dimension reported by the edge management and control device. It is worth noting that at least one operation and maintenance dimension can be flexibly set according to application requirements and preset into edge control equipment and central control equipment. For examples of operation and maintenance dimensions, refer to the subsequent embodiments.
在另一可选实施方式中,中心管控设备可以有选择地在某个或某些运维维度上对至少一个边缘云节点进行运维管控。基于此,中心管控设备可以向边缘管控设备发送第二类运维监控指令,第二类运维监控指令与指定运维维度对应,用于指示边缘管控设备在指定运维维度上对至少一个边缘云节点进行运维监控并上报指定运维维度上的运维监控数据。对边缘管控设备来说,可接收中心管控设备发送的第二类运维监控指令,根据第二类运维监控指令在指定运维维度上对至少一个边缘云节点进行运维监控,并将指定运维维度上的运维监控数据上报给中心管控设备,以供中心管控设备根据指定运维维度上的运维监控数据对至少一个边缘云节点进行运维管控。中心管控设备还用于接收边缘管控设备发送的指定运维维度上的运维监控数据,根据指定运维维度上的运维监控数据对至少一个边缘云节点进行运维管控。In another optional implementation manner, the central management and control device may selectively perform operation and maintenance control on at least one edge cloud node in one or some operation and maintenance dimensions. Based on this, the central management and control device can send the second type of operation and maintenance monitoring instructions to the edge management and control device. The second type of operation and maintenance monitoring instruction corresponds to the specified operation and maintenance dimension, and is used to instruct the edge management and control device to check at least one edge in the specified operation and maintenance dimension. The cloud node performs operation and maintenance monitoring and reports the operation and maintenance monitoring data on the specified operation and maintenance dimension. For edge management and control equipment, it can receive the second type of operation and maintenance monitoring instructions sent by the central management and control equipment, and perform operation and maintenance monitoring on at least one edge cloud node in the specified operation and maintenance dimension according to the second type of operation and maintenance monitoring instructions, and specify The operation and maintenance monitoring data in the operation and maintenance dimension is reported to the central management and control device, so that the central management and control device can perform operation and maintenance control on at least one edge cloud node according to the operation and maintenance monitoring data in the designated operation and maintenance dimension. The central management and control device is also used to receive the operation and maintenance monitoring data in the specified operation and maintenance dimension sent by the edge management and control device, and perform operation and maintenance control on at least one edge cloud node according to the operation and maintenance monitoring data in the specified operation and maintenance dimension.
可选地,边缘管控设备根据定时任务,周期性地对至少一个边缘云节点进行运维监控可以是根据定时任务,周期性地从至少一个运维维度对至少一个边缘云节点进行运维监控;进一步,可以将至少一个运维维度上的运维监控数据上报给中心管控设备。其中,不同运维维度上的监控周期可以相同,也可以不相同。例如,边缘管控设备可以每隔10分钟对边缘云节点进行一次安全漏洞扫描,或者每隔5分钟对边缘云节点进行流量监控。Optionally, the edge management and control device periodically performs operation and maintenance monitoring of at least one edge cloud node according to a timed task, may periodically perform operation and maintenance monitoring of at least one edge cloud node from at least one operation and maintenance dimension according to a timed task; Further, the operation and maintenance monitoring data in at least one operation and maintenance dimension can be reported to the central control equipment. Among them, the monitoring period on different operation and maintenance dimensions may be the same or different. For example, the edge management and control device can scan the edge cloud node for security vulnerabilities every 10 minutes, or monitor the traffic of the edge cloud node every 5 minutes.
值得说明的是,指定运维维度可以是一个,也可以是多个。在指定运维维度是多个的情况下,每个指定运维维度可以对应一个第二类运维监控指令,即中心管控设备可以向边缘管控设备发送多个第二类运维监控指令,每个第二类运维监控指令对应一个指定运维维度。或者,在指定运维为度为多个的情况,多个指定运维维度也可以对应同一个第二类运维监控指令,即中心管控设备可以向边缘管控设备发送一个第二类运维监控指令,该第二类运维监控指令对应多个指定运维维度。It is worth noting that there can be one or more designated O&M dimensions. In the case of multiple designated operation and maintenance dimensions, each designated operation and maintenance dimension can correspond to a second-type operation and maintenance monitoring instruction, that is, the central control device can send multiple second-type operation and maintenance monitoring instructions to the edge control device. The second type of operation and maintenance monitoring instruction corresponds to a specified operation and maintenance dimension. Or, in the case of multiple designated operation and maintenance degrees, multiple designated operation and maintenance dimensions can also correspond to the same second-type operation and maintenance monitoring instruction, that is, the central management and control device can send a second-type operation and maintenance monitoring to the edge management and control device Instruction, this second type of operation and maintenance monitoring instruction corresponds to multiple specified operation and maintenance dimensions.
上述至少一个运维维度或指定运维维度可以包括但不限于以下维度:处于运行态的对象维度,日志维度,安全维度,资源维度等。进一步,处于运行态的对象维度可包括对象的运行状态维度和/或对象的生命周期维度;安全维度可包括:流量攻击维度和/或安全漏洞维度。The aforementioned at least one operation and maintenance dimension or specified operation and maintenance dimension may include but is not limited to the following dimensions: the object dimension in the running state, the log dimension, the security dimension, and the resource dimension. Further, the object dimension in the running state may include the operating state dimension of the object and/or the life cycle dimension of the object; the security dimension may include: the traffic attack dimension and/or the security vulnerability dimension.
结合上述列举的几个运维维度,中心管控设备在边缘管控设备协助下,对至少一个边缘云节点进行运维管控包括但不限于以下至少一种运维管控示例:Combining the several O&M dimensions listed above, the central management and control device, with the assistance of the edge management and control device, performs O&M control on at least one edge cloud node, including but not limited to at least one of the following O&M control examples:
运维管控示例1:中心管控设备控制边缘管控设备对至少一个边缘云节点中处于运行态的对象进行状态监控。其中,控制方式包括向边缘管控设备发送第一类运维监控指令或发送与对象的运行状态维度对应的第二类运维监控指令。边缘管控设备在中心管控设备的控制下,或者,根据定时任务周期性地,对至少一个边缘云节点中处于运行态的对象进行状态监控,将监控到的处于运行态的对象的运行状态上报给中心管控设备。中心管控设备从边缘管控设备上报的处于运行态的对象的运行状态中识别出运行状态异常的对象,为便于描述和区分,将运行状态异常的对象称为目标对象,并针对目标对象进行异常处理。其中,边缘云节点中处于运行态的对象包括但不限于:实例、镜像、容器、其它虚拟组件、物理机、CPU和/或硬盘等。根据处于运行态的对象的不同,运行状态异常情况也会有所不同。例如,对实例来说,可能的异常情况包括但不限于:中断、报错和/或故障等。又例如,对物理机来说,可能的异常情况包括但不限于:死机、黑屏、报警和/或物理机上运行的应用程序出现闪退等。根据目标对象以及运行状态异常情况的不同,异常处理方式也会有所不同,例如可以包括但不限于:报警,停止或重启目标对象,迁移,删除并重建目标对象等。Operation and maintenance management and control example 1: The central management and control device controls the edge management and control device to monitor the status of objects in at least one edge cloud node that are in operation. Among them, the control method includes sending a first type of operation and maintenance monitoring instruction to the edge management and control device or sending a second type of operation and maintenance monitoring instruction corresponding to the operating state dimension of the object. The edge management and control equipment is under the control of the central management and control equipment, or periodically according to timing tasks, monitors the status of the objects in the running state of at least one edge cloud node, and reports the running status of the monitored objects in the running state to Central control equipment. The central management and control equipment identifies objects with abnormal operating status from the operating status of the objects in the operating status reported by the edge management and control equipment. For ease of description and distinction, the objects with abnormal operating status are called target objects, and exception handling is performed on the target objects. . Among them, the objects in the running state in the edge cloud node include, but are not limited to: instances, images, containers, other virtual components, physical machines, CPUs, and/or hard disks. According to the different objects in the running state, the abnormal situation of the running state will be different. For example, for the instance, possible abnormal conditions include, but are not limited to: interruption, error reporting, and/or failure. For another example, for a physical machine, possible abnormal conditions include, but are not limited to: crashes, black screens, alarms, and/or crashes of applications running on the physical machine. Depending on the target object and the abnormal situation of the running state, the exception handling method will be different, for example, it can include but not limited to: alarm, stop or restart the target object, migrate, delete and rebuild the target object, etc.
运维管控示例2:中心管控设备控制边缘管控设备对至少一个边缘云节点中处于运行态的对象的生命周期进行监控。其中,控制方式包括向边缘管控设备发送第一类运维监控指令或发送与对象的生命周期维度对应的第二类运维监控指令。边缘管控设备在中心管控设备的控制下,或者,根据定时任务周期性地,监控至少一个边缘云节点中处于 运行态的对象的生命周期,并将监控到的处于运行态的对象的生命周期上报给中心管控设备。中心管控设备根据边缘管控设备上报的处于运行态的对象的生命周期,控制处于运行态的对象停止、停止后重启,迁移或删除。Operation and maintenance control example 2: The central control device controls the edge control device to monitor the life cycle of at least one edge cloud node in the running state. Among them, the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the life cycle dimension of the object. The edge control device is under the control of the central control device, or periodically according to a scheduled task, monitors the life cycle of at least one edge cloud node in the running state, and reports the life cycle of the monitored object in the running state Give the center control equipment. The central management and control device controls the stopping, restarting, migration or deletion of the running object after stopping, according to the life cycle of the running object reported by the edge management and control device.
运维管控示例3:中心管控设备控制边缘管控设备采集至少一个边缘云节点中的日志数据。其中,控制方式包括向边缘管控设备发送第一类运维监控指令或发送与日志维度对应的第二类运维监控指令。边缘管控设备在中心管控设备的控制下,或者,根据定时任务周期性地,采集至少一个边缘云节点中的日志数据,并将采集到的日志数据上报给中心管控设备。中心管控设备接收边缘管控设备上报的日志数据,对日志数据进行数据分析,并根据数据分析结果执行后续动作,例如可以计费、风控和/或增减实例等。根据日志数据的不同,后续动作也会有所不同。可选地,日志数据可以包括但不限于:边缘云节点中各项性能、指标等数据,例如:实例的带宽流量、实例当前的运行情况、实例的IO负载、物理机的带宽流量、物理机当前的运行情况、物理机的IO负载、边缘管控设备的运行情况和/或其它虚拟化组件的运行情况等。Operation and maintenance control example 3: The central control device controls the edge control device to collect log data in at least one edge cloud node. Among them, the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the log dimension. The edge management and control device collects log data in at least one edge cloud node under the control of the central management and control device or periodically according to a timed task, and reports the collected log data to the central management and control device. The central management and control device receives the log data reported by the edge management and control device, performs data analysis on the log data, and performs follow-up actions based on the data analysis results, such as billing, risk control, and/or adding or subtracting instances. Depending on the log data, follow-up actions will vary. Optionally, log data may include, but is not limited to: various performance, indicators and other data in edge cloud nodes, such as: instance bandwidth traffic, instance current running status, instance IO load, physical machine bandwidth traffic, physical machine The current operating status, the IO load of the physical machine, the operating status of the edge management and control equipment, and/or the operating status of other virtualization components, etc.
可选地,中心管控设备不仅可以收集边缘管控设备上报的各边缘云节点的日志数据,还具备数据巡检的能力,对于一些数据,若中心管控设备存储的与边缘云节点中的数据不一致,可以主动向该边缘云节点同步最新的数据,例如可以向边缘云节点同步最新版本的镜像等。Optionally, the central control device can not only collect the log data of each edge cloud node reported by the edge control device, but also has the ability to perform data inspection. For some data, if the data stored by the central control device is inconsistent with the data in the edge cloud node, The latest data can be actively synchronized with the edge cloud node, for example, the latest version of the image can be synchronized with the edge cloud node.
运维管控示例4:中心管控设备控制边缘管控设备对至少一个边缘云节点进行流量监控。其中,控制方式包括向边缘管控设备发送第一类运维监控指令或发送与流量攻击维度对应的第二类运维监控指令。边缘管控设备在中心管控设备的控制下,或者,根据定时任务周期性地,对至少一个边缘云节点进行流量监控,并将监控到的流量攻击事件上报给中心管控设备。中心管控设备对边缘云节点中出现的流量攻击事件进行阻断处理。可选地,边缘管控设备还可以将监控到的流量数据上报给中心管控设备,中心管控设备还可以根据流量数据对至少一个边缘云节点进行流量攻击防御等。Operation and maintenance control example 4: The central control device controls the edge control device to monitor the traffic of at least one edge cloud node. Among them, the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the traffic attack dimension. The edge management and control equipment is under the control of the central management and control equipment, or periodically according to a timing task, monitors the flow of at least one edge cloud node, and reports the monitored traffic attack events to the central management and control equipment. The central management and control equipment blocks traffic attack events that occur in edge cloud nodes. Optionally, the edge management and control device may also report the monitored flow data to the central management and control device, and the central management and control device may also perform flow attack defense on at least one edge cloud node based on the flow data.
运维管控示例5:中心管控设备控制边缘管控设备对至少一个边缘云节点进行网络安全漏洞扫描。其中,控制方式包括向边缘管控设备发送第一类运维监控指令或发送与网络安全维度对应的第二类运维监控指令。边缘管控设备在中心管控设备的控制下,或者,根据定时任务周期性地,对至少一个边缘云节点进行网络安全漏洞扫描,并将扫描到的网络安全漏洞问题上报给中心管控设备。中心管控设备接收边缘管控设备上报的网络安全漏洞问题,对该网络安全漏洞问题进行修复。Operation and maintenance control example 5: The central control equipment controls the edge control equipment to scan for network security vulnerabilities on at least one edge cloud node. Among them, the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the network security dimension. The edge management and control equipment is under the control of the central management and control equipment, or periodically according to timing tasks, scans for network security vulnerabilities on at least one edge cloud node, and reports the scanned network security vulnerabilities to the central management and control equipment. The central control equipment receives the network security vulnerabilities reported by the edge control equipment, and repairs the network security vulnerabilities.
运维管控示例6:中心管控设备控制边缘管控设备监控至少一个边缘云节点中的资源用量。其中,控制方式包括向边缘管控设备发送第一类运维监控指令或发送与资源维度对应的第二类运维监控指令。边缘管控设备在中心管控设备的控制下,或者,根据定时任务周期性地,监控至少一个边缘云节点中的资源用量,并将监控到的资源用量信息上报给中心管控设备。中心管控设备根据边缘管控设备上报的资源用量信息,对至少一个边缘云节点进行资源扩容或减容。这里的资源包括各种资源信息,例如物理机等设备资源,存储资源,CPU、GPU等计算资源,带宽等网络资源等等。Operation and maintenance control example 6: The central control device controls the edge control device to monitor the resource usage in at least one edge cloud node. Wherein, the control method includes sending the first type of operation and maintenance monitoring instruction to the edge management and control device or sending the second type of operation and maintenance monitoring instruction corresponding to the resource dimension. The edge management and control device is under the control of the central management and control device, or periodically according to a timed task, monitors the resource usage in at least one edge cloud node, and reports the monitored resource usage information to the central management and control device. The central management and control device performs resource expansion or reduction on at least one edge cloud node based on the resource usage information reported by the edge management and control device. The resources here include various resource information, such as equipment resources such as physical machines, storage resources, computing resources such as CPUs and GPUs, and network resources such as bandwidth.
进一步,若每个边缘云节点中均部署有边缘管控设备,则每个边缘管控设备可以在中心管控设备的控制下,对其所属边缘云节点进行运维监控并将其所属边缘云节点中的运维监控数据上报给中心管控设备。中心管控设备可以接收每个边缘云节点中的边缘管控设备上报的运维监控数据,根据每个边缘云节点中的运维监控数据对每个边缘云节点进行运维管控。Further, if edge management and control equipment is deployed in each edge cloud node, each edge management and control device can, under the control of the central management and control equipment, perform operation and maintenance monitoring on its edge cloud node and monitor the operation and maintenance of its edge cloud node. The operation and maintenance monitoring data is reported to the central control equipment. The central management and control device can receive the operation and maintenance monitoring data reported by the edge management and control device in each edge cloud node, and perform operation and maintenance management and control on each edge cloud node according to the operation and maintenance monitoring data in each edge cloud node.
本申请实施例并不限定中心管控设备与边缘管控设备的实现结构。可选地,一种中心管控设备的结构框架,包括:资源调度管控模块、镜像管控模块以及中心运维模块;该中心运维模块进一步包括:中心监控单元、中心日志单元以及中心安全单元等。相应地,一种边缘管控设备的结构框架示,包括:资源调度服务模块、镜像服务模块以及边缘运维模块;该边缘运维模块进一步包括:边缘监控单元、边缘日志单元以及边缘安全单元等。The embodiments of the present application do not limit the implementation structure of the central management and control device and the edge management and control device. Optionally, a structural framework of a central management and control device includes: a resource scheduling management and control module, a mirror management and control module, and a central operation and maintenance module; the central operation and maintenance module further includes: a central monitoring unit, a central log unit, and a central security unit. Correspondingly, a structural framework of edge management and control equipment includes: a resource scheduling service module, a mirroring service module, and an edge operation and maintenance module; the edge operation and maintenance module further includes: an edge monitoring unit, an edge log unit, and an edge security unit.
其中,中心管控设备中的资源调度管控模块与边缘管控设备中的资源调度服务模块相互配合,可对边缘云节点进行资源调度,资源调度功能可参见下文中的描述。中心管控设备中的镜像管控模块与边缘管控设备中的镜像服务模块相互配合,可针对边缘云节点进行镜像的管理与分发等,镜像管理与分发功能可参见下文中的描述。Among them, the resource scheduling management and control module in the central management and control device cooperates with the resource scheduling service module in the edge management and control device to perform resource scheduling on edge cloud nodes. For the resource scheduling function, please refer to the description below. The image management and control module in the central management and control device cooperates with the image service module in the edge management and control device to perform image management and distribution for edge cloud nodes. For image management and distribution functions, please refer to the description below.
中心管控设备中的中心运维模块与边缘管控设备中的边缘运维模块相互配合,可对边缘云节点进行运维管控。上述运维管控示例1-6可由中心运维模块和边缘运维模块中的相应单元配合实施。运维管控示例3可由中心运维模块中的中心日志单元和边缘运维模块中的边缘日志单元配合实现。详细地,中心日志单元向边缘日志单元发送第一类运维监控指令或发送与日志维度对应的第二类运维监控指令;边缘日志单元根据第一类或第二类运维监控指令采集边缘云节点中的日志数据并上报给中心日志单元;中心日志单元对日志数据进行数据分析,并根据数据分析结果执行后续动作。运维管控示例4和5,可由中心运维模块中的中心安全单元和边缘运维模块中的边缘安全单元配合实现。详细 地,中心安全单元向边缘安全单元发送第一类运维监控指令或发送与流量攻击或网络安全维度对应的第二类运维指令;边缘安全单元可以根据第一类或第二类运维指令对边缘云节点进行流量监控或网络安全漏洞扫描,并将监控到的流量攻击事件或网络漏洞安全问题上报给中心安全单元;中心安全单元对流量攻击事件进行阻断或对网络安全漏洞问题进行修复。运维管控示例1、2和6,可由中心运维模块中的中心监控单元和边缘运维模块中的边缘监控单元配合实现,详细实施过程不做赘述。The central operation and maintenance module in the central management and control equipment and the edge operation and maintenance module in the edge management and control equipment cooperate with each other to perform operation and maintenance management and control on the edge cloud nodes. The above operation and maintenance control examples 1-6 can be implemented by the corresponding units in the central operation and maintenance module and the edge operation and maintenance module. Operation and maintenance control example 3 can be realized by the cooperation of the central log unit in the central operation and maintenance module and the edge log unit in the edge operation and maintenance module. In detail, the central log unit sends the first type of operation and maintenance monitoring instruction or the second type of operation and maintenance monitoring instruction corresponding to the log dimension to the edge log unit; the edge log unit collects the edge according to the first or second type of operation and maintenance monitoring instruction The log data in the cloud node is reported to the central log unit; the central log unit performs data analysis on the log data and executes follow-up actions based on the data analysis results. Operation and maintenance control examples 4 and 5 can be realized by the cooperation of the central security unit in the central operation and maintenance module and the edge security unit in the edge operation and maintenance module. In detail, the central security unit sends the first type of operation and maintenance monitoring instruction to the edge security unit or sends the second type of operation and maintenance instruction corresponding to the traffic attack or the network security dimension; the edge security unit can be based on the first or second type of operation and maintenance Instruct the edge cloud nodes to perform traffic monitoring or network security vulnerability scanning, and report the monitored traffic attack events or network vulnerability security issues to the central security unit; the central security unit blocks traffic attack events or conducts network security vulnerability issues repair. Operation and maintenance control examples 1, 2 and 6 can be implemented by the central monitoring unit in the central operation and maintenance module and the edge monitoring unit in the edge operation and maintenance module, and the detailed implementation process is not repeated.
由上述可知,在边缘管控设备的协助下,中心管控设备可以了解边缘云节点中各实例的健康、资源用量、日志数据和/或基础设施的情况,可实现远程运维、日志管理等。It can be seen from the above that with the assistance of the edge management and control equipment, the central management and control equipment can understand the health, resource usage, log data and/or infrastructure conditions of each instance in the edge cloud node, and can realize remote operation and maintenance, log management, etc.
在本申请实施例中,除了中心管控设备可以对至少一个边缘云节点进行运维管控之外,在中心管控设备不对边缘云节点进行运维管控或者无法对边缘云节点进行运维管控的情况下,边缘管控设备可以自主地对至少一个边缘云节点进行运维管控。In the embodiments of the present application, in addition to the central management and control device that can perform O&M management and control on at least one edge cloud node, in the case where the central management and control device does not perform O&M management and control on the edge cloud node or cannot perform O&M management and control on the edge cloud node , The edge management and control device can autonomously perform operation and maintenance management and control on at least one edge cloud node.
例如,边缘管控设备可以监控其与中心管控设备之间的连接情况,在与中心管控设备失去连接的情况下,可以确定中心管控设备无法对边缘云节点进行运维管控,则可以自主地从至少一个运维维度对至少一个边缘云节点进行运维管控。For example, the edge management and control device can monitor the connection between it and the central management and control device. When the connection with the central management and control device is lost, it can be determined that the central management and control device cannot perform operation and maintenance control on the edge cloud node. One operation and maintenance dimension controls the operation and maintenance of at least one edge cloud node.
又例如,在中心管控设备通过向边缘管控设备发送第一类运维监控指令,以控制边缘管控设备对至少一个边缘云节点进行运维监控的方式下,边缘管控设备可以等待接收中心管控设备发送的第一类运维监控指令,若未接收到中心管控设备发送的第一类运维监控指令,可以确定中心管控设备不对或无法对至少一个边缘云节点进行运维管控,则可以自主地从至少一个运维维度对至少一个边缘云节点进行运维管控。可选地,边缘管控设备和中心管控设备可以预先约定第一类运维监控指令的等待时长,若超过了所述等待时长仍未接收到中心管控设备发送的第一类运维监控指令,则确定未接收到中心管控设备发送的第一类运维监控指令。For another example, when the central management and control device sends the first type of operation and maintenance monitoring instructions to the edge management and control device to control the edge management and control device to perform operation and maintenance monitoring of at least one edge cloud node, the edge management and control device can wait to receive the central management and control device to send If it does not receive the first type of operation and maintenance monitoring instruction sent by the central control device, it can be determined that the central control device is incorrect or cannot perform the operation and maintenance control on at least one edge cloud node. At least one operation and maintenance dimension controls the operation and maintenance of at least one edge cloud node. Optionally, the edge management and control device and the central management and control device may pre-appoint the waiting time for the first type of operation and maintenance monitoring instruction. If the waiting time is exceeded and the first type of operation and maintenance monitoring instruction sent by the central management and control device is not received, then It is determined that the first type of operation and maintenance monitoring instruction sent by the central control equipment has not been received.
又例如,在中心管控设备通过向边缘管控设备发送与指定运维维度对应的第二类运维监控指令,以控制边缘管控设备从指定运维维度对至少一个边缘云节点进行运维监控的方式下,边缘管控设备可以等待接收中心管控设备发送的第二类运维监控指令,若在指定运维维度上未接收到中心管控设备发送的第二类运维监控指令,可以确定中心管控设备在指定运维维度上不对或无法对至少一个边缘云节点进行运维管控,则可以自主地从指定运维维度对至少一个边缘云节点进行运维管控。For another example, the central management and control device sends the second type of operation and maintenance monitoring instructions corresponding to the specified operation and maintenance dimension to the edge management and control device to control the manner in which the edge management and control device performs operation and maintenance monitoring of at least one edge cloud node from the specified operation and maintenance dimension The edge management and control device can wait to receive the second type of operation and maintenance monitoring instruction sent by the central control device. If the second type of operation and maintenance monitoring instruction sent by the central control device is not received in the specified operation and maintenance dimension, it can be determined that the central control device is in If the specified operation and maintenance dimension is incorrect or unable to perform operation and maintenance control on at least one edge cloud node, it is possible to autonomously perform operation and maintenance control on at least one edge cloud node from the specified operation and maintenance dimension.
进一步可选地,若边缘管控设备在与中心管控设备失去连接的情况下,自主地从至少一个运维维度对至少一个边缘云节点进行运维管控,则在与中心管控设备恢复连接后, 还可以将失去连接期间的运维管控数据同步给中心管控设备。值得说明的是,运维管控数据主要包括运维管控的策略、方式、效果等数据,当然,也可以包括运维监控数据。Further optionally, if the edge management and control device autonomously controls the operation and maintenance of at least one edge cloud node from at least one operation and maintenance dimension when the connection with the central management and control device is lost, then after the connection with the central management and control device is restored, The operation and maintenance control data during the loss of connection can be synchronized to the central control equipment. It is worth noting that the operation and maintenance control data mainly includes data such as strategies, methods, and effects of operation and maintenance control, and of course, it can also include operation and maintenance monitoring data.
上述至少一个运维维度或指定运维维度可以包括但不限于以下几个维度:处于运行态的对象维度,日志维度,安全维度,资源维度等。进一步,处于运行态的对象维度可包括对象的运行状态维度和/或对象的生命周期维度;安全维度可包括:流量攻击维度和/或安全漏洞维度。The above-mentioned at least one operation and maintenance dimension or the designated operation and maintenance dimension may include, but is not limited to, the following dimensions: the object dimension in the running state, the log dimension, the security dimension, and the resource dimension. Further, the object dimension in the running state may include the operating state dimension of the object and/or the life cycle dimension of the object; the security dimension may include: the traffic attack dimension and/or the security vulnerability dimension.
结合上述列举的几个运维维度,边缘管控设备自主地对至少一个边缘云节点进行运维管控包括但不限于以下至少一种运维管控示例:Combining the several O&M dimensions listed above, the edge management and control device autonomously performs O&M control on at least one edge cloud node, including but not limited to at least one of the following O&M control examples:
运维管控示例a:自主地对至少一个边缘云节点中处于运行态的对象进行状态监控,并针对监控到的运行状态异常的目标对象进行异常处理。关于处于运行态的对象以及运行状态异常情况等,可参见上文中的描述,在此不再赘述。Operation and maintenance management and control example a: autonomously monitor the status of objects in the running state in at least one edge cloud node, and perform exception handling for the monitored target objects whose running status is abnormal. Regarding the objects in the running state and the abnormal conditions of the running state, please refer to the above description, which will not be repeated here.
可选地,在示例a中,边缘管控设备在针对目标对象进行异常处理时,具体用于:对目标对象的异常运行状态进行分析,根据分析结果确定至少一种候选处理方式;从至少一种候选处理方式中获取目标处理方式,根据目标处理方式对目标对象进行异常处理。Optionally, in example a, when the edge management and control device performs abnormal processing on the target object, it is specifically used to: analyze the abnormal operating state of the target object, and determine at least one candidate processing method according to the analysis result; In the candidate processing method, the target processing method is acquired, and the target object is abnormally processed according to the target processing method.
更进一步,边缘管控设备在获取目标处理方式时,具体用于:在边缘管控设备与中心管控设备保持连接的情况下,将至少一种候选处理方式上报给中心管控设备,以供中心管控设备从中选择处理方式;接收中心管控设备返回的处理方式作为目标处理方式;或者,在边缘管控设备与中心管控设备失去连接的情况下,输出至少一种候选处理方式至边缘运维管控人员,以供边缘运维人员从中选择处理方式;响应于边缘运维管控人员的选择操作,确定被选择的处理方式作为目标处理方式;或者,在与中心管控设备失去连接的情况下,按照设定的选择策略,从至少一种候选处理方式中选择目标处理方式。Furthermore, when the edge management and control device obtains the target processing mode, it is specifically used to: when the edge management and control device maintains a connection with the central management and control device, report at least one candidate processing method to the central management and control device for the central management and control device to use. Select the processing method; receive the processing method returned by the central control device as the target processing method; or, in the case that the edge control device loses the connection with the central control device, output at least one candidate processing method to the edge operation and maintenance control personnel for the edge The operation and maintenance personnel select the processing method; in response to the selection operation of the edge operation and maintenance management and control personnel, determine the selected processing method as the target processing method; or, in the case of loss of connection with the central control equipment, follow the set selection strategy, The target processing method is selected from at least one candidate processing method.
运维管控示例b、自主地监控至少一个边缘云节点中处于运行态的对象的生命周期,并根据监控结果控制处于运行态的对象停止、停止后重启或删除。对于容器或实例,可以控制容器或实例停止执行、停止后重启,或者将容器或实例删除等。Operation and maintenance management and control example b. Autonomously monitor the life cycle of objects in the running state in at least one edge cloud node, and control the objects in the running state to stop, restart or delete after stopping according to the monitoring results. For containers or instances, you can control the container or instance to stop execution, restart after stopping, or delete the container or instance, etc.
运维管控示例c:自主地采集至少一个边缘云节点中的日志数据,对日志数据进行数据分析,并根据数据分析结果执行后续动作。日志数据包括但不限于边缘云节点中实例的带宽流量、实例当前的运行情况、实例的IO负载、物理机的带宽流量、物理机当前的运行情况、物理机的IO负载、边缘管控设备的运行情况和/或其它虚拟化组件的运行情况等。可选地,根据日志数据的分析结果可以进行计费、风控、资源重分配等后续动作,但不限于此。Operation and maintenance control example c: autonomously collect log data in at least one edge cloud node, perform data analysis on the log data, and perform follow-up actions based on the data analysis results. Log data includes, but is not limited to, the bandwidth traffic of the instance in the edge cloud node, the current running status of the instance, the IO load of the instance, the bandwidth traffic of the physical machine, the current running status of the physical machine, the IO load of the physical machine, and the operation of edge control equipment. Status and/or operation status of other virtualization components. Optionally, subsequent actions such as billing, risk control, and resource reallocation can be performed according to the analysis result of the log data, but are not limited to this.
运维管控示例d、自主地对至少一个边缘云节点进行流量监控,并针对监控到的流量攻击事件进行阻断处理。Operation and maintenance control example d. Autonomously monitor the traffic of at least one edge cloud node, and block the monitored traffic attack events.
运维管控示例e:自主地对至少一个边缘云节点进行网络安全漏洞扫描,并针对扫描到的网络安全漏洞问题进行修复。Operation and maintenance control example e: Autonomously scan for network security vulnerabilities on at least one edge cloud node, and fix the scanned network security vulnerabilities.
运维管控示例f:自主地监控至少一个边缘云节点中的资源用量,并根据监控结果对至少一个边缘云节点进行资源扩容或减容。这里的资源包括但不限于:物理机等设备资源,内存、磁盘等存储资源,CPU、GPU等计算资源,带宽等网络资源。对这些资源来说,用量较高时,可以针对这些资源进行扩容,用量较低时,可以针对这些资源进行减容。Operation and maintenance control example f: autonomously monitor the resource usage in at least one edge cloud node, and perform resource expansion or reduction on at least one edge cloud node according to the monitoring result. The resources here include but are not limited to: equipment resources such as physical machines, storage resources such as memory and disks, computing resources such as CPU and GPU, and network resources such as bandwidth. For these resources, when the usage is high, the capacity can be expanded for these resources, and when the usage is low, the capacity can be reduced for these resources.
进一步,若每个边缘云节点中均部署有边缘管控设备,则每个边缘管控设备可以在中心管控设备不对或无法对其所属边缘云节点进行运维管控的情况下,自主地对其所属边缘云节点进行运维管控。Further, if edge management and control equipment is deployed in each edge cloud node, each edge management and control device can autonomously belong to its edge when the central management and control device is incorrect or cannot perform operation and maintenance control on its edge cloud node. Cloud nodes perform operation and maintenance management and control.
可选地,在上述边缘管控设备自主地对至少一个边缘云节点进行运维管控的示例a-示例e中,边缘管控设备可以根据定时任务,周期性地对至少一个边缘云节点进行运维管控。例如,在示例d中,边缘管控设备可以根据定时任务,每隔10分钟对至少一个边缘云节点进行流量监控,并针对监控到的流量攻击事件进行阻断处理。又例如,在示例e中,边缘管控设备可以根据定时任务,每隔5分钟对至少一个边缘云节点进行网络安全漏洞扫描,并针对扫描到的网络安全漏洞问题进行修复。当然,边缘管控设备也可以根据其它方式的自主策略,自主地对至少一个边缘云节点进行运维管控,例如可以在每天某个固定的时间点,自主地对至少一个边缘云节点进行运维管控。Optionally, in the foregoing example a-example e in which the edge management and control device autonomously controls the operation and maintenance of at least one edge cloud node, the edge management and control device may periodically perform the operation, maintenance, management and control on at least one edge cloud node according to a timing task . For example, in example d, the edge management and control device can monitor the traffic of at least one edge cloud node every 10 minutes according to the scheduled task, and block the monitored traffic attack event. For another example, in Example e, the edge management and control device may scan for network security vulnerabilities on at least one edge cloud node every 5 minutes according to a scheduled task, and fix the scanned network security vulnerabilities. Of course, the edge management and control device can also autonomously control the operation and maintenance of at least one edge cloud node according to other independent strategies. For example, it can autonomously control the operation and maintenance of at least one edge cloud node at a fixed time every day. .
结合上述示例1-6以及示例a-f可知,在本实施例中,中心管控设备与边缘管控设备相结合,中心管控设备可在边缘管控设备的协助下对至少一个边缘云节点进行运维管控,除此之外,边缘管控设备也具备一定的自行运维管控的能力,可以在中心管控设备不对或无法对边缘云节点进行运维管控的情况下,自主地对边缘云节点进行运维管控,实现两级运维管控,可以更加充分、全面地对边缘云节点进行运维管控,为“将云计算放到距离终端更近的边缘云节点中处理”提供了条件,进而可借助边缘云节点中的资源为用户提供云计算服务,有利于降低响应时延,减轻中心云或传统云计算平台的压力,降低带宽成本。Combining the above examples 1-6 and example af, it can be seen that in this embodiment, the central management and control device is combined with the edge management and control device, and the central management and control device can perform operation, maintenance, management and control on at least one edge cloud node with the assistance of the edge management and control device, except In addition, the edge management and control equipment also has a certain ability of self-operation, maintenance, and control. When the central management and control equipment is not correct or the edge cloud node cannot be operated and maintained, the edge cloud node can be independently operated and maintained to achieve Two-level operation and maintenance management and control can more fully and comprehensively control the operation and maintenance of edge cloud nodes, and provide conditions for "putting cloud computing in edge cloud nodes closer to the terminal for processing", and then can use edge cloud nodes The resources to provide users with cloud computing services are conducive to reducing response delays, reducing the pressure on the central cloud or traditional cloud computing platforms, and reducing bandwidth costs.
综上可知,在本申请实施例提供的网络系统中,基于集中管控的方式对边缘云节点的资源,镜像,实例,运维等进行统一管控,可以最大程度的对边缘云节点进行管理和 协调,可降低出现单点自制或全网信息不同步而导致的错误,而且可以利用集中管控的特性达到资源调度的最优化,避免出现边缘局部资源浪费的情况。In summary, in the network system provided by the embodiments of the present application, the resources, mirroring, instances, operation and maintenance of edge cloud nodes are uniformly controlled based on centralized management and control, and the edge cloud nodes can be managed and coordinated to the greatest extent. , It can reduce errors caused by single-point self-control or unsynchronized information of the entire network, and can use the characteristics of centralized management to achieve the optimization of resource scheduling, avoiding the waste of local resources at the edge.
除了上述网络系统之外,本申请实施例从中心管控设备的角度提供了实例管控方法,下面进行详细描述。In addition to the foregoing network system, the embodiments of the present application provide example management and control methods from the perspective of central management and control equipment, which are described in detail below.
图2a为本申请示例性实施例提供的一种实例管控方法的流程示意图。如图2a所示,该方法包括:Fig. 2a is a schematic flowchart of an example management and control method provided by an exemplary embodiment of this application. As shown in Figure 2a, the method includes:
21a、确定部署于网络系统中至少一个边缘云节点中的至少一个实例,其中,至少一个实例可为服务需求方提供云计算服务。21a. Determine at least one instance deployed in at least one edge cloud node in the network system, where at least one instance can provide cloud computing services for the service demander.
22a、对至少一个实例进行管控,以供至少一个实例为服务需求方提供云计算服务。22a. Manage and control at least one instance, so that at least one instance provides cloud computing services for service demanders.
在本实施例中,网络系统包括至少一个边缘云节点,至少一个边缘云节点中部署有至少一个实例,至少一个实例可为服务需求方提供云计算服务。中心管控设备确定边缘云节点中的至少一个实例,对至少一个实例进行管控,以供至少一个实例为服务需求方提供云计算服务。这里的服务需求方可以任何需要使用边缘云节点中的实例提供的云计算服务的设备、应用、系统或另一服务。以系统为例,服务需求方可以是但不限于:在线视频系统、风险管控系统、客户信息管理系统或数据分发系统等。In this embodiment, the network system includes at least one edge cloud node, at least one instance is deployed in the at least one edge cloud node, and at least one instance can provide cloud computing services for service demanders. The central management and control device determines at least one instance in the edge cloud node, and controls the at least one instance so that the at least one instance provides cloud computing services for the service demander. The service demander here may be any device, application, system or another service that needs to use the cloud computing service provided by the instance in the edge cloud node. Taking the system as an example, the service demander can be, but not limited to: online video systems, risk management systems, customer information management systems, or data distribution systems.
可选地,中心管控设备可以对至少一个实例进行各种管控,例如可以包括升级、迁移、关停、重启和释放等中的至少一种,但不限于此。Optionally, the central management and control device may perform various management and control on at least one instance, for example, it may include at least one of upgrade, migration, shutdown, restart, and release, but is not limited thereto.
如图2b所示,中心管控设备对实例进行升级管控的过程包括以下步骤:As shown in Figure 2b, the process for the central control equipment to upgrade and control an instance includes the following steps:
21b、从至少一个实例中确定待升级实例;21b. Determine the instance to be upgraded from at least one instance;
22b、向服务需求方发送升级请求,以供服务需求方结合待升级实例上的业务情况为待升级实例确定升级策略;22b. Send an upgrade request to the service demander for the service demander to determine an upgrade strategy for the instance to be upgraded in combination with the business situation on the instance to be upgraded;
23b、接收服务需求方返回的升级策略,依据升级策略对待升级实例进行升级。23b. Receive the upgrade strategy returned by the service demander, and upgrade the instance to be upgraded according to the upgrade strategy.
在实际应用中,随着业务需求的变化或镜像版本的更新,有可能对镜像或相应实例进行升级。中心管控设备可以从至少一个实例中确定待升级实例,待升级实例可以是一个或多个;向服务需求方发送升级请求,以供服务需求方结合待升级实例上的业务情况为待升级实例确定升级策略。该升级请求携带有待升级实例的标识类信息,例如待升级实例的ID、名称等,也可以是待升级实例对应服务的ID、名称等,还可以是待升级实例对应镜像的ID、名称等信息。服务需求方在接收到升级请求后,可根据该升级请求确定待升级实例,结合待升级实例上的业务情况,例如待升级实例上的业务请求及响应状态 等,判断待升级实例是否适合升级,什么时间适合升级,采用什么方法进行升级等,进而为该待升级实例生成升级策略并返回给中心管控设备。中心管控设备接收服务需求方发送的升级策略,依据升级策略对待升级实例进行升级。In actual applications, as business requirements change or the mirror version is updated, it is possible to upgrade the mirror or the corresponding instance. The central control device can determine the instance to be upgraded from at least one instance, and there can be one or more instances to be upgraded; send an upgrade request to the service demander, so that the service demander can determine the instance to be upgraded based on the business situation of the instance to be upgraded Upgrade strategy. The upgrade request carries the identification information of the instance to be upgraded, such as the ID and name of the instance to be upgraded. It can also be the ID and name of the service corresponding to the instance to be upgraded. It can also be the ID, name and other information of the image corresponding to the instance to be upgraded. . After receiving the upgrade request, the service demander can determine the instance to be upgraded according to the upgrade request, and combine the business conditions on the instance to be upgraded, such as the business request and response status on the instance to be upgraded, to determine whether the instance to be upgraded is suitable for upgrade. What time is suitable for upgrading, what method to use for upgrading, etc., and then generate an upgrade strategy for the instance to be upgraded and return it to the central control device. The central control equipment receives the upgrade strategy sent by the service demander, and upgrades the instance to be upgraded according to the upgrade strategy.
在一可选实施例中,服务需求方可结合待升级实例上的业务情况,例如已接收到且尚未完成的业务请求(简称为存量业务请求)的数量,是否还有新增的业务请求(增量业务请求)等,判断什么时间可以对待升级实例进行升级,也就是说,升级策略中可以包括升级时间。基于此,中心管控设备可以在升级策略中指定的升级时间开始对待升级实例进行升级。除此之外,升级策略可以包括升级方法,基于此,中心管控设备可以采用升级策略中指定的升级方法对待升级实例进行升级。可选地,升级策略可以包括升级时间和升级方法,则中心管控设备可以采用升级策略中指定的升级方法,在升级策略中指定的升级时间开始对待升级实例进行升级。可选地,升级策略还可以包括是否升级等信息,并在升级的情况下,进一步包括升级时间和/或升级方法。In an optional embodiment, the service demander can combine the business conditions on the instance to be upgraded, such as the number of business requests that have been received and not yet completed (referred to as inventory business requests), and whether there are any new business requests ( Incremental service request), etc., to determine when the instance to be upgraded can be upgraded, that is, the upgrade strategy can include the upgrade time. Based on this, the central control device can start to upgrade the instance to be upgraded at the upgrade time specified in the upgrade policy. In addition, the upgrade strategy may include an upgrade method. Based on this, the central control device may use the upgrade method specified in the upgrade strategy to upgrade the instance to be upgraded. Optionally, the upgrade strategy may include an upgrade time and an upgrade method, and the central management and control device may adopt the upgrade method specified in the upgrade strategy, and upgrade the instance to be upgraded at the upgrade time specified in the upgrade strategy. Optionally, the upgrade strategy may also include information such as whether to upgrade, and in the case of upgrade, it further includes the upgrade time and/or the upgrade method.
在一可选实施例中,对实例进行升级,可由中心管控设备发起。例如,中心管控设备可以监控各实例对应镜像的版本信息,当发现新版本的镜像时,可以确定需要对与该新版本的镜像对应的实例进行升级;或者,也可以监控各实例的运行状态、生命周期等信息,当发现实例运行过程中出现漏洞、不稳定、功能不全、CPU或内存资源消耗过大等问题时,可以确定需要对出现这些问题的实例进行升级。In an optional embodiment, upgrading the instance can be initiated by the central control device. For example, the central control device can monitor the version information of the mirror corresponding to each instance, and when a new version of the mirror is found, it can determine that the instance corresponding to the new version of the mirror needs to be upgraded; or, it can also monitor the running status of each instance, Life cycle and other information. When problems such as vulnerabilities, instability, insufficiency, excessive consumption of CPU or memory resources are found during the running of an instance, it can be determined that the instance with these problems needs to be upgraded.
在一可选实施例中,对实例进行升级,也可以由服务需求方发起。例如,根据业务需求,需要对实例进行升级时,服务需求方可以向中心管控设备发送升级描述信息,该升级描述信息包括实例过滤条件。在该情况下,步骤21b包括:接收服务需求方发送的升级描述信息;根据实例过滤条件,从至少一个实例中确定待升级实例。In an optional embodiment, upgrading the instance may also be initiated by the service demander. For example, when an instance needs to be upgraded according to business requirements, the service demander can send upgrade description information to the central management and control device, and the upgrade description information includes instance filter conditions. In this case, step 21b includes: receiving upgrade description information sent by the service demander; and determining the instance to be upgraded from at least one instance according to the instance filter condition.
其中,对待升级实例进行升级主要是指:关停待升级实例,根据相应版本(一般是指新版本)的镜像对待升级实例进行更新,更新完后再重启实例。其中,对待升级实例进行升级所需的镜像版本可以由中心管控设备确定,例如将相应镜像的最新版本作为升级所需的镜像版本,也可以由服务需求方指定。可选地,服务需求方可以将升级所需的镜像版本携带在升级描述信息中提供给中心管控设备,例如该升级描述信息可以包括“对所有或指定实例进行镜像版本A到镜像版本B的升级”等信息。基于此,依据升级策略对待升级实例进行升级,包括:依据升级策略,利用镜像版本对应的镜像对待升级实例进行升级。Among them, upgrading the instance to be upgraded mainly refers to: shutting down the instance to be upgraded, updating the instance to be upgraded according to the mirror image of the corresponding version (generally, the new version), and restarting the instance after the update. The image version required for upgrading the instance to be upgraded can be determined by the central management and control device. For example, the latest version of the corresponding image can be used as the image version required for the upgrade, or it can be specified by the service demander. Optionally, the service demander can carry the image version required for the upgrade in the upgrade description information and provide it to the central management and control device. For example, the upgrade description information can include "Upgrade from mirror version A to mirror version B for all or specified instances. "And other information. Based on this, upgrading the instance to be upgraded according to the upgrade strategy includes: according to the upgrade strategy, using the mirror corresponding to the mirror version to upgrade the instance to be upgraded.
更进一步,在网络系统包括边缘管控设备的情况下,依据升级策略,利用镜像版本 对应的镜像对待升级实例进行升级可以为:将升级策略和镜像版本对应的镜像发送给网络系统中的边缘管控设备,以供边缘管控设备依据升级策略,利用镜像版本对应的镜像对待升级实例进行升级。Furthermore, in the case that the network system includes edge management and control equipment, according to the upgrade strategy, using the mirror corresponding to the mirror version to upgrade the instance to be upgraded can be: sending the upgrade strategy and the mirror corresponding to the mirror version to the edge management and control equipment in the network system , So that the edge management and control device uses the mirror corresponding to the mirror version to upgrade the instance to be upgraded according to the upgrade strategy.
如图2c所示,中心管控设备对实例进行迁移管控的过程包括以下步骤:As shown in Figure 2c, the process of the central management and control device for instance migration management and control includes the following steps:
21c、从至少一个实例中确定待迁移实例,待迁移实例属于第一边缘云节点;21c. Determine the instance to be migrated from at least one instance, and the instance to be migrated belongs to the first edge cloud node;
22c、判断第一边缘云节点满足节点内迁移条件;若判断结果为是,即第一边缘云节点满足节点内迁移条件,执行步骤23c;若判断结果为否,即第一边缘云节点不满足节点内迁移条件,执行步骤24c。22c. Determine that the first edge cloud node meets the intra-node migration condition; if the judgment result is yes, that is, the first edge cloud node meets the intra-node migration condition, go to step 23c; if the judgment result is no, the first edge cloud node does not meet the For intra-node migration conditions, go to step 24c.
23c、对待迁移实例进行边缘云节点内的迁移。23c. The instance to be migrated is migrated within the edge cloud node.
24c、对待迁移实例进行跨边缘云节点的迁移。24c. Perform cross-edge cloud node migration for the instance to be migrated.
在一些情况下需要对实例进行迁移。例如,在整个边缘云节点故障或不可用的情况下,需要将该边缘云节点中的实例迁移到其它边缘云节点中。又例如,在承载某个实例的物理机出现故障或宕机的情况下,需要将该物理机上的实例迁移到其它物理机上。又例如,可能因为业务需要,需要将某个或某些实例从一个边缘云节点迁移到其它边缘云节点中。又例如,在需要进行资源归并的情况下,也需要对某个或某些实例进行迁移。In some cases, the instance needs to be migrated. For example, in the case that the entire edge cloud node is faulty or unavailable, the instances in the edge cloud node need to be migrated to other edge cloud nodes. For another example, in the case of a failure or downtime of a physical machine hosting an instance, the instance on the physical machine needs to be migrated to another physical machine. For another example, it may be necessary to migrate one or some instances from one edge cloud node to other edge cloud nodes due to business needs. For another example, when resources need to be merged, one or some instances need to be migrated.
在中心管控设备的管控下,可对边缘云节点中的实例进行迁移。中心管控设备从至少一个实例中确定待迁移实例。待迁移实例可以是一个或多个;若待迁移实例是多个,多个待迁移实例可部署于同一边缘云节点中,也可以部署于不同边缘云节点中。Under the control of the central control device, instances in the edge cloud node can be migrated. The central control device determines the instance to be migrated from at least one instance. There may be one or more instances to be migrated; if there are multiple instances to be migrated, the multiple instances to be migrated can be deployed in the same edge cloud node or in different edge cloud nodes.
可选地,中心管控设备可以监控至少一个边缘云节点中部署的至少一个实例的状态,根据至少一个实例的状态,获取出现故障的实例和/或运行中发生指定事件的实例作为待迁移实例。其中,出现故障的实例是指不能正常运行的实例,例如可以是发生宕机的物理机上的实例,也可以是本身宕机的实例等。指定事件主要是指一些出现后实例仍能正常运行的事件,可以根据应用需求灵活设定,对此不做限定。举例说明,指定事件可以是一些预警或告警事件等,虽然发生一些预警或告警事件,但实例并未产生实际问题,仍可运行(即未故障),但有故障隐患,可在故障前进行迁移。另外,中心管控设备维护有各边缘云节点的信息以及各边缘云节点中部署的各实例的信息,基于此,可以确定待迁移实例所属的边缘云节点,为便于描述和区分,将待迁移实例在迁移前所属的边缘云节点记为第一边缘云节点。Optionally, the central management and control device may monitor the state of at least one instance deployed in at least one edge cloud node, and obtain a failed instance and/or an instance in which a specified event occurs during operation as an instance to be migrated according to the state of at least one instance. Among them, a failed instance refers to an instance that cannot operate normally, for example, it can be an instance on a physical machine where the downtime occurs, or an instance that itself is down. The designated event mainly refers to some events that the instance can still run normally after occurrence, which can be flexibly set according to application requirements, and there is no restriction on this. For example, the specified event can be some early warning or alarm events, etc. Although some early warning or alarm events occur, the instance does not produce actual problems and can still run (that is, no failure), but there are hidden dangers of failure and can be migrated before failure. . In addition, the central control equipment maintains the information of each edge cloud node and the information of each instance deployed in each edge cloud node. Based on this, the edge cloud node to which the instance to be migrated belongs can be determined. For ease of description and distinction, the instance to be migrated The edge cloud node to which it belongs before the migration is recorded as the first edge cloud node.
可选地,中心管控设备可以根据资源归并需求,从至少一个实例中确定待迁移实例,进而对待迁移实例进行迁移。其中,资源归并主要是通过实例迁移对资源碎片进行整合 的过程,经过整合后,边缘云节点中的资源碎片会减少甚至不存在,这有利于提高边缘云节点中的资源利用率。值得说明的是,资源归并需求可以是系统级的,也可以节点级的。系统级的资源归并是指从整个网络系统的角度考虑,通过实例迁移对整个网络系统中的资源碎片进行整合;节点级的资源归并是指从边缘云节点的角度考虑,通过实例迁移对边缘云节点中的资源碎片进行整合。Optionally, the central management and control device may determine the instance to be migrated from at least one instance according to resource merging requirements, and then migrate the instance to be migrated. Among them, resource merging is mainly the process of integrating resource fragments through instance migration. After integration, the resource fragments in edge cloud nodes will be reduced or even nonexistent, which is conducive to improving resource utilization in edge cloud nodes. It is worth noting that resource merging requirements can be system-level or node-level. System-level resource merging refers to the integration of resource fragments in the entire network system from the perspective of the entire network system through instance migration; node-level resource merging refers to the perspective of edge cloud nodes and the use of instance migration The resource fragments in the node are integrated.
可选地,资源归并需求可以是服务需求方提供的。例如,服务需求方需要部署一个新的实例时,若为其服务的边缘云节点中各资源设备上的可用资源均不足以承载该新实例,可以对该边缘云节点中的实例进行迁移实现资源整合,从而为新实例提供足够的资源。或者,资源归并需求也可以是中心管控设备的定期行为。例如,中心管控设备定期执行资源碎片检查,当发现碎片率达到一定的阈值并可以执行实例迁移时,对各边缘云节点中的资源碎片进行整合,提高边缘云节点中的资源利用率。Optionally, the resource consolidation requirement may be provided by the service demander. For example, when a service demander needs to deploy a new instance, if the available resources on each resource device in the edge cloud node it serves are not enough to carry the new instance, the instance in the edge cloud node can be migrated to implement resources Integration to provide sufficient resources for new instances. Or, the resource consolidation requirement can also be the regular behavior of the central control equipment. For example, the central management and control equipment regularly performs resource fragmentation checks. When the fragmentation rate reaches a certain threshold and instance migration can be performed, the resource fragmentation in each edge cloud node is integrated to improve resource utilization in the edge cloud node.
其中,资源归并需求中包含有与资源归并相关的信息。例如,资源归并需求中可以包含为了达到资源归并目的需要迁移的实例的信息,基于此,可根据资源归并需求,直接确定待迁移实例。又例如,资源归并需求中可以包含需要资源归并的边缘云节点的信息。基于此,可根据资源归并需求,确定需要进行资源归并的边缘云节点,本实施例中将需要资源归并的边缘云节点称为第一边缘云节点;进而可以结合第一边缘云节点中各资源设备上剩余的可用资源和第一边缘云节点中各实例需要的资源,确定待迁移实例。Among them, the resource consolidation requirements contain information related to resource consolidation. For example, the resource merging requirements may include information about instances that need to be migrated to achieve the purpose of resource merging. Based on this, the instances to be migrated can be directly determined according to the resource merging requirements. For another example, the resource merging requirements may include information about edge cloud nodes that need to be merged. Based on this, the edge cloud node that needs to be merged can be determined according to the resource merging requirements. In this embodiment, the edge cloud node that needs to be merged is called the first edge cloud node; in turn, the resources in the first edge cloud node can be combined The remaining available resources on the device and the resources required by each instance in the first edge cloud node determine the instance to be migrated.
无论是哪种应用场景,在确定待迁移实例后,中心管控设备可以判断待迁移实例所属的第一边缘云节点是否满足节点内迁移条件;若第一边缘云节点满足节点内迁移条件,则对待迁移实例进行边缘云节点内的迁移;若第一边缘云节点不满足节点内迁移条件,则对待迁移实例进行跨边缘云节点的迁移。Regardless of the application scenario, after determining the instance to be migrated, the central control device can determine whether the first edge cloud node to which the instance to be migrated belongs meets the intra-node migration conditions; if the first edge cloud node meets the intra-node migration conditions, it will be treated The migration instance performs intra-edge cloud node migration; if the first edge cloud node does not meet the intra-node migration condition, the migration instance to be migrated is migrated across edge cloud nodes.
可选地,中心管控设备可以判断第一边缘云节点当前是否处于可用状态;若第一边缘云节点当前处于可用状态,判断第一边缘云节点的可用资源是否足够承载待迁移实例;若第一边缘云节点的可用资源足够承载待迁移实例,确定第一边缘云节点满足节点内迁移条件;若第一边缘云节点当前处于不可用状态,或者第一边缘云节点的可用资源不足以承载待迁移实例,确定第一边缘云节点不满足节点内迁移条件。在本申请实施例中,将实例的迁移划分为两种类型:节点内迁移和跨节点迁移。其中,第一边缘云节点的可用资源主要是指第一边缘云节点中各台资源设备上的可用资源;相应地,判断第一边缘云节点的可用资源是否足够承载待迁移实例主要是指判断第一边缘云节点中是否存在可用资源足以承载待迁移实例的资源设备。Optionally, the central management and control device may determine whether the first edge cloud node is currently available; if the first edge cloud node is currently available, determine whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated; The available resources of the edge cloud node are sufficient to carry the instance to be migrated, and it is determined that the first edge cloud node meets the migration conditions within the node; if the first edge cloud node is currently in an unavailable state, or the available resources of the first edge cloud node are insufficient to carry the instance to be migrated For example, it is determined that the first edge cloud node does not meet the intra-node migration condition. In the embodiments of the present application, the migration of instances is divided into two types: intra-node migration and cross-node migration. Among them, the available resources of the first edge cloud node mainly refer to the available resources on each resource device in the first edge cloud node; accordingly, judging whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated mainly refers to judging Whether there is a resource device with sufficient resources available in the first edge cloud node to carry the instance to be migrated.
值得说明的是,在资源归并场景中,为了实现资源归并的实例迁移主要是节点内迁移,当然,也可以是跨节点迁移。可选地,在根据第一边缘云节点中各资源设备上剩余的可用资源和第一边缘云节点中各实例需要的资源确定待迁移实例的过程中,还可以确定待迁移实例需要迁移到的资源设备,该资源设备是第一边缘云节点中剩余的可用资源可以承载待迁移实例的资源设备。当然,若第一边缘云节点中不存在剩余的可用资源可以承载待迁移实例的资源设备,可以针对待迁移实例进行跨节点迁移。鉴于资源归并的目的,在针对待迁移实例进行跨节点迁移的过程中,优先考虑将待迁移实例迁移到其它边缘云节点中已经被使用且剩余的可用资源可以承载待迁移实例的资源设备上;进一步,在有多个已经被使用且剩余的可用资源可以承载待迁移实例的资源设备的情况下,可以以资源碎片最小为原则,从中选择剩余的可用资源与待迁移实例需要的资源的匹配度较高的资源设备,尽量产生较少的资源碎片或不产生资源碎片。It is worth noting that in the resource merging scenario, the instance migration for resource merging is mainly intra-node migration, and of course, it can also be cross-node migration. Optionally, in the process of determining the instance to be migrated according to the remaining available resources on each resource device in the first edge cloud node and the resources required by each instance in the first edge cloud node, the instance to be migrated may also be determined A resource device, where the resource device is a resource device whose remaining available resources in the first edge cloud node can carry the instance to be migrated. Of course, if there are no remaining available resources in the first edge cloud node that can carry resource devices of the instance to be migrated, cross-node migration can be performed for the instance to be migrated. In view of the purpose of resource merging, in the process of cross-node migration for the instances to be migrated, priority is given to migrating the instances to be migrated to other edge cloud nodes that have been used and the remaining available resources can carry the resource devices of the instances to be migrated; Further, in the case that there are multiple resources that have been used and remaining available resources can carry the resource equipment of the instance to be migrated, the principle of minimum resource fragmentation can be used to select the matching degree between the remaining available resources and the resources required by the instance to be migrated. Higher resource equipment, try to produce less resource fragments or no resource fragments.
对于节点内迁移:可选地,可以通过热迁移技术保证实例所提供云计算服务的连续性,关于热迁移技术可参见现有技术,在此不再赘述。For intra-node migration: Optionally, the continuity of the cloud computing service provided by the instance can be ensured through the hot migration technology. For the hot migration technology, please refer to the prior art, which will not be repeated here.
对于跨节点迁移:中心管控设备可以从至少一个边缘云节点选择第二边缘云节点,第二边缘云节点不同于第一边缘云节点,且第二边缘云节点中的可用资源足够承载待迁移实例,即有足够资源;将待迁移实例迁移到第二边缘云节点中,并将待迁移实例在第二边缘云节点中的属性信息发送给服务需求方,以供服务需求方基于该属性信息针对待迁移实例进行业务调度。其中,待迁移实例在第二边缘云节点中的属性信息是指在待迁移实例迁移到第二边缘云节点之后,外部(例如服务需求方或服务需求方授权的第三方)针对待迁移实例进行业务调度所需的信息,例如可以包括但不限于:第二边缘云节点所在的地区、运营商信息和/或公网IP等信息。For cross-node migration: the central control device can select a second edge cloud node from at least one edge cloud node, the second edge cloud node is different from the first edge cloud node, and the available resources in the second edge cloud node are sufficient to carry the instance to be migrated , That is, sufficient resources; migrate the instance to be migrated to the second edge cloud node, and send the attribute information of the instance to be migrated in the second edge cloud node to the service demander, so that the service demander can target The instance to be migrated performs business scheduling. Among them, the attribute information of the instance to be migrated in the second edge cloud node means that after the instance to be migrated is migrated to the second edge cloud node, an external (for example, a service demander or a third party authorized by the service demander) conducts an operation on the instance to be migrated. Information required for service scheduling may include, but is not limited to, for example, information such as the area where the second edge cloud node is located, operator information, and/or public network IP.
可选地,在选择第二边缘云节点时,可以采用但不限于以下方式:Optionally, when selecting the second edge cloud node, the following methods can be used but not limited to:
方式1:可以根据其它边缘云节点与第一边缘云节点之间的距离,选择与第一边缘云节点的距离小于设定距离阈值的边缘云节点,或者选择与第一边缘云节点距离最近的边缘云节点,或者从与第一边缘云节点距离最近的N个边缘云节点中任意选择一个边缘云节点,作为第二边缘云节点。在方式1中,第二边缘云节点距离第一边缘云节点距离最近或较近,可节约数据传输时间,有利于提高迁移效率。Method 1: According to the distance between other edge cloud nodes and the first edge cloud node, select the edge cloud node whose distance from the first edge cloud node is less than the set distance threshold, or select the closest distance to the first edge cloud node An edge cloud node, or an edge cloud node arbitrarily selected from the N edge cloud nodes closest to the first edge cloud node as the second edge cloud node. In Manner 1, the second edge cloud node is closest or relatively close to the first edge cloud node, which can save data transmission time and help improve migration efficiency.
方式2:可以根据其它边缘云节点的带宽资源,从中选择带宽资源相对充足的边缘云节点,例如选择带宽资源最大的,或者选择带宽资源大于设定带宽阈值的,或者选择带宽使用率较低的边缘云节点,作为第二边缘云节点。在方式2中,第二边缘云节点的 带宽资源充足,可提高数据传输速率,有利于提高迁移效率。Method 2: You can select edge cloud nodes with relatively sufficient bandwidth resources according to the bandwidth resources of other edge cloud nodes. For example, select the edge cloud node with the largest bandwidth resource, or select the bandwidth resource greater than the set bandwidth threshold, or select the bandwidth utilization rate lower The edge cloud node serves as the second edge cloud node. In method 2, the bandwidth resources of the second edge cloud node are sufficient, which can increase the data transmission rate and help improve migration efficiency.
方式3:可以根据其它边缘云节点当前的负载情况,从中选择负载相对较轻的边缘云节点,例如选择负载量最小的,或者选择负载量小于设定负载量阈值的边缘云节点,作为第二边缘云节点。在方式3中,第二边缘云节点的负载较轻,可有足够资源且能够及时处理实例迁移,有利于提高迁移效率。Method 3: According to the current load situation of other edge cloud nodes, select the edge cloud node with relatively light load, for example, select the edge cloud node with the smallest load, or select the edge cloud node with the load less than the set load threshold as the second Edge cloud node. In mode 3, the load of the second edge cloud node is lighter, it has sufficient resources and can handle instance migration in time, which is beneficial to improve migration efficiency.
可选地,在将待迁移实例迁移到第二边缘云节点时,中心管控设备可根据待迁移实例的资源需求,在第二边缘云节点中为待迁移实例进行资源预留或分配;在资源预留或分配成功后,将待迁移实例迁移到第二边缘云节点中预留或分配的资源上。例如,可结合待迁移实例的资源需求,确定待迁移实例需要的资源类型、资源量和/或对资源设备的性能要求等信息,根据这些信息在第二边缘云节点中进行资源预留或分配,可为实例成功迁移提供资源保障。Optionally, when migrating the instance to be migrated to the second edge cloud node, the central management and control device may reserve or allocate resources for the instance to be migrated in the second edge cloud node according to the resource requirements of the instance to be migrated; After the reservation or allocation is successful, the instance to be migrated is migrated to the resources reserved or allocated in the second edge cloud node. For example, the resource requirements of the instances to be migrated can be combined to determine the type of resources, the amount of resources and/or the performance requirements of the resource equipment required by the instances to be migrated, and resource reservation or allocation can be performed in the second edge cloud node based on this information , Which can provide resource guarantee for successful instance migration.
可选地,若待迁移实例是出现故障的实例,即不可正常运行的实例,中心管控设备还可以将该迁移事件通知给服务需求方,服务需求方可以做出合适的响应动作,比如更新该实例在服务需求方中的信息,或针对实例迁移过程中的宕机情况做出容灾响应。进一步,可在通知迁移事件的过程中,一并将待迁移实例在第二边缘云节点中的属性信息提供给服务需求方。当然,也可以在将待迁移实例成功迁移至第二边缘云节点之后,将待迁移实例在第二边缘云节点中的属性信息提供给服务需求方。Optionally, if the instance to be migrated is a failed instance, that is, an instance that is not functioning normally, the central control device can also notify the service demander of the migration event, and the service demander can make appropriate response actions, such as updating the The information of the instance in the service demander, or the disaster recovery response to the downtime during the instance migration. Further, in the process of notifying the migration event, the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander. Of course, after the instance to be migrated is successfully migrated to the second edge cloud node, the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
可选地,若待迁移实例是运行过程中发生指定事件的实例,即虽发生指定事件但仍可正常运行的实例,中心管控设备还可以向服务需求方发送迁移请求,以供服务需求方结合待迁移实例上的业务情况为待迁移实例确定迁移策略;接收服务需求方发送的迁移策略,依据迁移策略将待迁移实例迁移到第二边缘云节点中。该迁移策略主要包括是否迁移、迁移时间以及迁移方式中的至少一个信息。Optionally, if the instance to be migrated is an instance in which a specified event occurs during operation, that is, an instance that can run normally despite the occurrence of a specified event, the central control device can also send a migration request to the service demander for the service demander to combine The business situation on the instance to be migrated determines the migration strategy for the instance to be migrated; the migration strategy sent by the service demander is received, and the instance to be migrated is migrated to the second edge cloud node according to the migration strategy. The migration strategy mainly includes at least one information of whether to migrate, migration time, and migration mode.
进一步可选地,中心管控设备可以将待迁移实例在第二边缘云节点中的属性信息连同上述迁移请求一并发送给服务需求方。或者,也可以在将待迁移实例成功迁移至第二边缘云节点之后,将待迁移实例在第二边缘云节点中的属性信息提供给服务需求方。Further optionally, the central management and control device may send the attribute information of the instance to be migrated in the second edge cloud node together with the migration request to the service demander. Alternatively, after the instance to be migrated is successfully migrated to the second edge cloud node, the attribute information of the instance to be migrated in the second edge cloud node may be provided to the service demander.
进一步可选地,若待迁移实例是发生指定事件但仍可正常运行的实例,在迁移过程中,待迁移实例继续运行在第一边缘云节点中,这样迁移过程中的业务请可继续调度到第一边缘云节点中的待迁移实例上,保证业务连续性。在将待迁移实例成功迁移到第二边缘云节点中,且服务需求方确保将新的业务请求全部调度到已迁移到第二边缘云节点中,且第一边缘云节点中的业务请求逐步减少最终没有新的业务请求,即运行于第一边 缘云节点中的待迁移实例上不再有任何业务请求的情况下,中心管控设备可将第一边缘云节点中的待迁移实例释放掉。可选地,服务需求方在确定运行于第一边缘云节点中的待迁移实例上不再有任何业务请求,既没有存量业务请求也没有增量业务请求之后,可以向中心管控设备发送释放通知;中心管控设备接收服务需求方发送的释放通知,根据该释放通知将运行在第一边缘云节点中的待迁移实例释放掉。进一步,中心管控设备还可以将运行在第一边缘云节点中的待迁移实例的运行状态同步给第二边缘云节点中的待待迁移实例。Further optionally, if the instance to be migrated is an instance that has a specified event but can still run normally, during the migration process, the instance to be migrated continues to run on the first edge cloud node, so that the business during the migration process can continue to be scheduled to Ensure business continuity on the instances to be migrated in the first edge cloud node. After successfully migrating the instance to be migrated to the second edge cloud node, and the service demander ensures that all new business requests are scheduled to the migrated second edge cloud node, and the business requests in the first edge cloud node are gradually reduced In the end, there is no new service request, that is, when there are no longer any service requests on the instance to be migrated running in the first edge cloud node, the central management and control device can release the instance to be migrated in the first edge cloud node. Optionally, after determining that the service requester no longer has any service requests on the instance to be migrated running on the first edge cloud node, and there is neither an inventory service request nor an incremental service request, it may send a release notice to the central control device ; The central control device receives the release notification sent by the service demander, and releases the instance to be migrated running in the first edge cloud node according to the release notification. Further, the central management and control device may also synchronize the running state of the instance to be migrated running in the first edge cloud node to the instance to be migrated in the second edge cloud node.
进一步,无论待迁移实例是哪种实例,将待迁移实例迁移到第二边缘云节点中,主要是控制第二边缘云节点中相应资源设备根据待迁移实例对应的镜像或快照在预留或分配的资源上创建待迁移实例的过程。Further, regardless of the instance to be migrated, migrating the instance to be migrated to the second edge cloud node is mainly to control the corresponding resource equipment in the second edge cloud node to reserve or allocate according to the mirror or snapshot corresponding to the instance to be migrated The process of creating an instance to be migrated on the resource.
进一步,在网络系统中包括边缘管控设备的情况下,中心管控设备可以根据待迁移实例的资源需求,确定第二边缘云节点中被调度的资源信息,将该资源信息发送给边缘管控设备,由边缘管控设备根据该资源信息,控制第二边缘云节点中相应资源设备为待迁移实例进行资源预留或分配。然后,中心管控设备可以向边缘管控设备发送迁移指令,该迁移指令指示边缘管控设备获取待迁移实例对应的镜像或实例快照并提供给第二边缘云节点中相应资源设备,供第二边缘云节点中相应资源设备根据该镜像或实例快照在预留或分配的资源上创建待迁移实例。进一步,若第二边缘云节点中部署有边缘管控设备,则中心管控设备可以向第二边缘云节点中的边缘管控设备发送迁移指令,指示第二边缘云节点中的边缘管控设备获取待迁移实例对应的镜像或快照并提供给第二边缘云节点中相应资源设备,供第二边缘云节点中相应资源设备根据该镜像或快照在预留或分配的资源上创建待迁移实例。Further, in the case that the edge management and control device is included in the network system, the central management and control device may determine the scheduled resource information in the second edge cloud node according to the resource requirements of the instance to be migrated, and send the resource information to the edge management and control device, and The edge management and control device controls the corresponding resource device in the second edge cloud node to reserve or allocate resources for the instance to be migrated according to the resource information. Then, the central management and control device can send a migration instruction to the edge management and control device. The migration instruction instructs the edge management and control device to obtain the image or instance snapshot corresponding to the instance to be migrated and provide it to the corresponding resource device in the second edge cloud node for the second edge cloud node According to the image or instance snapshot, the corresponding resource device creates an instance to be migrated on the reserved or allocated resources. Further, if an edge management and control device is deployed in the second edge cloud node, the central management and control device may send a migration instruction to the edge management and control device in the second edge cloud node to instruct the edge management and control device in the second edge cloud node to obtain the instance to be migrated The corresponding image or snapshot is provided to the corresponding resource device in the second edge cloud node for the corresponding resource device in the second edge cloud node to create an instance to be migrated on the reserved or allocated resources according to the image or snapshot.
在本申请方法实施例中,在中心管控设备的管控下,边缘云节点中的实例可以为服务需求方提供云计算服务,达到了借助边缘云节点中的资源为用户提供服务的目的,使得“将云计算放到距离终端更近的边缘云节点中处理”成为现实,有利于降低响应时延,减轻与边缘云节点对应的中心云或传统的云计算平台等的压力,降低带宽成本。In the method embodiment of the present application, under the control of the central management and control device, the instance in the edge cloud node can provide cloud computing services to the service demander, achieving the purpose of providing services to users by using the resources in the edge cloud node, so that " It has become a reality to place cloud computing in edge cloud nodes closer to the terminal, which will help reduce response delays, reduce the pressure on the central cloud or traditional cloud computing platforms corresponding to edge cloud nodes, and reduce bandwidth costs.
需要说明的是,在上述实施例及附图中的描述的一些流程中,包含了按照特定顺序出现的多个操作,但是应该清楚了解,这些操作可以不按照其在本文中出现的顺序来执行或并行执行,操作的序号如21a、22a等,仅仅是用于区分开各个不同的操作,序号本身不代表任何的执行顺序。另外,这些流程可以包括更多或更少的操作,并且这些操作可以按顺序执行或并行执行。需要说明的是,本文中的“第一”、“第二”等描述,是 用于区分不同的消息、设备、模块等,不代表先后顺序,也不限定“第一”和“第二”是不同的类型。It should be noted that some processes described in the above embodiments and drawings include multiple operations appearing in a specific order, but it should be clearly understood that these operations may not be performed in the order in which they appear in this article. Or execute in parallel. The sequence numbers of operations, such as 21a, 22a, etc., are only used to distinguish different operations. The sequence number itself does not represent any execution order. In addition, these processes may include more or fewer operations, and these operations may be executed sequentially or in parallel. It should be noted that the descriptions of "first" and "second" in this article are used to distinguish different messages, devices, modules, etc., and do not represent a sequence, nor do they limit the "first" and "second" Are different types.
图3为本申请示例性实施例提供的一种中心管控设备的结构示意图。如图3所示,该中心管控设备包括:存储器31和处理器32。FIG. 3 is a schematic structural diagram of a central management and control device provided by an exemplary embodiment of this application. As shown in FIG. 3, the central management and control device includes: a memory 31 and a processor 32.
存储器31,用于存储计算机程序,并可被配置为存储其它各种数据以支持在中心管控设备上的操作。这些数据的示例包括用于在中心管控设备上操作的任何应用程序或方法的指令,消息,图片,视频等。The memory 31 is used to store computer programs, and can be configured to store various other data to support operations on the central control device. Examples of these data include instructions, messages, pictures, videos, etc. used to operate any application or method on the central control device.
处理器32,与存储器31耦合,用于执行存储器31中的计算机程序,以用于:确定部署于网络系统中至少一个边缘云节点中的至少一个实例,至少一个实例可为服务需求方提供云计算服务;对至少一个实例进行管控,以供至少一个实例为服务需求方提供云计算服务。The processor 32 is coupled with the memory 31 and is configured to execute the computer program in the memory 31 to determine at least one instance deployed in at least one edge cloud node in the network system, and the at least one instance can provide the cloud for the service demander Computing services: At least one instance is managed and controlled so that at least one instance provides cloud computing services for the service demander.
可选地,对至少一个实例进行的管控包括:升级、迁移、关停、重启和释放中的至少一种。Optionally, the management and control of at least one instance includes: at least one of upgrade, migration, shutdown, restart, and release.
在一可选实施例中,如图3所示,该中心管控设备还包括:通信组件33。基于此,处理器32在对至少一个实例进行升级时,具体用于:从至少一个实例中确定待升级实例;通过通信组件33向服务需求方发送升级请求,以供服务需求方结合待升级实例上的业务情况为待升级实例确定升级策略;通过通信组件33接收服务需求方返回的升级策略,依据升级策略对待升级实例进行升级。In an optional embodiment, as shown in FIG. 3, the central management and control device further includes: a communication component 33. Based on this, when the processor 32 upgrades at least one instance, it is specifically configured to: determine the instance to be upgraded from the at least one instance; send an upgrade request to the service demander through the communication component 33, so that the service demander can combine the instance to be upgraded The above business situation determines the upgrade strategy for the instance to be upgraded; the communication component 33 receives the upgrade strategy returned by the service demander, and upgrades the instance to be upgraded according to the upgrade strategy.
进一步,处理器32在从至少一个实例中确定待升级实例时,具体用于:通过通信组件33接收服务需求方发送的升级描述信息,升级描述信息包括实例过滤条件;根据实例过滤条件,从至少一个实例中确定待升级实例。Further, when the processor 32 determines the instance to be upgraded from at least one instance, it is specifically configured to: receive the upgrade description information sent by the service demander through the communication component 33, where the upgrade description information includes instance filter conditions; according to the instance filter conditions, from at least In one instance, the instance to be upgraded is determined.
进一步,升级描述信息还包括:升级所需的镜像版本。则,处理器32在依据升级策略对待升级实例进行升级时,具体用于:依据升级策略,利用镜像版本对应的镜像对待升级实例进行升级。Further, the upgrade description information also includes: the image version required for the upgrade. Then, when the processor 32 upgrades the instance to be upgraded according to the upgrade strategy, it is specifically configured to: according to the upgrade strategy, use the mirror corresponding to the mirror version to upgrade the instance to be upgraded.
更进一步,处理器32在依据升级策略,利用镜像版本对应的镜像对待升级实例进行升级时,具体用于:将升级策略和镜像版本对应的镜像发送给网络系统中的边缘管控设备,以供边缘管控设备依据升级策略,利用镜像版本对应的镜像对待升级实例进行升级。其中,升级策略包括但不限于:是否升级、升级时间和升级方法中的至少一个信息。Furthermore, when the processor 32 uses the mirror corresponding to the mirror version to upgrade the instance to be upgraded according to the upgrade strategy, it is specifically used to: send the mirror corresponding to the upgrade strategy and the mirror version to the edge management and control device in the network system for the edge According to the upgrade strategy, the control equipment uses the mirror corresponding to the mirror version to upgrade the instance to be upgraded. Wherein, the upgrade strategy includes but is not limited to: at least one piece of information in whether to upgrade, upgrade time, and upgrade method.
在一可选实施例中,处理器32在对至少一个实例进行迁移时,具体用于:从至少一个实例中确定待迁移实例,待迁移实例所属的边缘云节点记为第一边缘云节点;若第一 边缘云节点满足节点内迁移条件,对待迁移实例进行边缘云节点内的迁移;若第一边缘云节点不满足节点内迁移条件,对待迁移实例进行跨边缘云节点的迁移。In an optional embodiment, when the processor 32 migrates at least one instance, it is specifically configured to: determine the instance to be migrated from the at least one instance, and the edge cloud node to which the instance to be migrated belongs is recorded as the first edge cloud node; If the first edge cloud node meets the intra-node migration condition, the instance to be migrated is migrated within the edge cloud node; if the first edge cloud node does not meet the intra-node migration condition, the instance to be migrated is migrated across edge cloud nodes.
可选地,处理器32在从至少一个实例中确定待迁移实例时,具体用于:根据至少一个实例的状态,将出现故障的实例和/或运行过程中发生指定事件的实例作为待迁移实例。Optionally, when the processor 32 determines the instance to be migrated from the at least one instance, it is specifically configured to: according to the state of the at least one instance, use the failed instance and/or the instance in which a specified event occurs during operation as the instance to be migrated .
可选地,处理器32在从至少一个实例中确定待迁移实例时,具体用于:根据资源归并需求,从至少一个实例中确定待迁移实例。Optionally, when the processor 32 determines the instance to be migrated from the at least one instance, it is specifically configured to: determine the instance to be migrated from the at least one instance according to resource merging requirements.
进一步,处理器32在根据资源归并需求确定待迁移实例时,具体用于:根据资源归并需求,确定需要资源归并的第一边缘云节点;结合所述第一边缘云节点中各资源设备上剩余的可用资源和所述第一边缘云节点中各实例需要的资源,确定所述待迁移实例。Further, when the processor 32 determines the instance to be migrated according to the resource merging demand, it is specifically configured to: determine the first edge cloud node that needs resource merging according to the resource merging demand; combine the remaining resources on each resource device in the first edge cloud node The available resources of and the resources required by each instance in the first edge cloud node determine the instance to be migrated.
可选地,处理器32还用于:判断第一边缘云节点是否处于可用状态;若第一边缘云节点处于可用状态,判断第一边缘云节点的可用资源是否足够承载待迁移实例;若第一边缘云节点的可用资源足够承载待迁移实例,则确定第一边缘云节点满足节点内迁移条件;若第一边缘云节点处于不可用状态,或者第一边缘云节点的可用资源不足以承载待迁移实例,则确定第一边缘云节点不满足节点内迁移条件。Optionally, the processor 32 is further configured to: determine whether the first edge cloud node is in an available state; if the first edge cloud node is in an available state, determine whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated; If the available resources of an edge cloud node are sufficient to carry the instance to be migrated, it is determined that the first edge cloud node meets the migration conditions within the node; if the first edge cloud node is in an unavailable state, or the available resources of the first edge cloud node are insufficient to carry the instance For the migration instance, it is determined that the first edge cloud node does not meet the intra-node migration condition.
可选地,处理器32在对待迁移实例进行跨边缘云节点的迁移时,具体用于:从至少一个边缘云节点选择第二边缘云节点,第二边缘云节点不同于第一边缘云节点;将待迁移实例迁移到第二边缘云节点中,并将待迁移实例在第二边缘云节点中的属性信息发送给服务需求方,以供服务需求方基于该属性信息针对待迁移实例进行业务调度。Optionally, the processor 32 is specifically configured to select a second edge cloud node from at least one edge cloud node when the instance to be migrated is migrated across edge cloud nodes, where the second edge cloud node is different from the first edge cloud node; Migrate the instance to be migrated to the second edge cloud node, and send the attribute information of the instance to be migrated in the second edge cloud node to the service demander, so that the service demander can perform business scheduling for the instance to be migrated based on the attribute information .
可选地,若待迁移实例是运行过程中发生指定事件的实例,即发生指定事件但仍可正常运行的示例,则,处理器32在将待迁移实例迁移到第二边缘云节点中时,具体用于:通过通信组件33向服务需求方发送迁移请求,以供服务需求方结合待迁移实例上的业务情况为待迁移实例确定迁移策略;通过通信组件33接收服务需求方发送的迁移策略,依据迁移策略,将待迁移实例迁移到第二边缘云节点中。Optionally, if the instance to be migrated is an instance in which a specified event occurs during operation, that is, an example in which a specified event occurs but can still run normally, then when the processor 32 migrates the instance to be migrated to the second edge cloud node, It is specifically used for: sending a migration request to the service demander through the communication component 33, so that the service demander can determine the migration strategy for the instance to be migrated in combination with the business situation on the instance to be migrated; receiving the migration strategy sent by the service demander through the communication component 33, According to the migration strategy, the instance to be migrated is migrated to the second edge cloud node.
可选地,处理器32在将待迁移实例迁移到第二边缘云节点中时,具体用于:根据待迁移实例的资源需求,控制第二边缘云节点中相应资源设备为待迁移实例进行资源预留或分配;在资源预留或分配成功后,将待迁移实例迁移到第二边缘云节点中相应资源设备预留或分配的资源上。Optionally, when the processor 32 migrates the instance to be migrated to the second edge cloud node, it is specifically configured to: according to the resource requirements of the instance to be migrated, control the corresponding resource device in the second edge cloud node to perform resources for the instance to be migrated Reservation or allocation: After the resource reservation or allocation is successful, the instance to be migrated is migrated to the resource reserved or allocated by the corresponding resource device in the second edge cloud node.
可选地,处理器32在将待迁移实例迁移到第二边缘云节点中相应资源设备预留或分配的资源上时,具体用于:控制第二边缘云节点中相应资源设备根据待迁移实例对应的 镜像或快照在预留或分配的资源上创建待迁移实例。Optionally, when the processor 32 migrates the instance to be migrated to the resource reserved or allocated by the corresponding resource device in the second edge cloud node, it is specifically configured to: control the corresponding resource device in the second edge cloud node according to the instance to be migrated The corresponding image or snapshot creates an instance to be migrated on the reserved or allocated resources.
进一步可选地,处理器32具体用于:向网络系统中的边缘管控设备发送迁移指令,该迁移指令指示所述边缘管控设备获取所述待迁移实例对应的镜像或实例快照并提供给第二边缘云节点中相应资源设备,以供相应资源设备在预留或分配的资源上创建所述待迁移实例。Further optionally, the processor 32 is specifically configured to send a migration instruction to the edge management and control device in the network system, and the migration instruction instructs the edge management and control device to obtain the image or instance snapshot corresponding to the instance to be migrated and provide it to the second The corresponding resource device in the edge cloud node allows the corresponding resource device to create the instance to be migrated on the reserved or allocated resources.
可选地,处理器32还用于:通过通信组件33接收服务需求方发送的释放通知,并依据释放通知将运行在第一边缘云节点中的待迁移实例释放掉;其中,在迁移过程中,待迁移实例继续运行在第一边缘云节点中;其中,释放通知是服务需求方在确定运行于第一边缘云节点中的待迁移实例上不再有任何业务请求之后发送的。Optionally, the processor 32 is further configured to: receive a release notification sent by the service demander through the communication component 33, and release the instance to be migrated running in the first edge cloud node according to the release notification; wherein, during the migration process , The instance to be migrated continues to run in the first edge cloud node; wherein the release notification is sent after the service demander determines that there is no longer any service request on the instance to be migrated running in the first edge cloud node.
进一步,如图3所示,该中心管控设备还包括:显示器34、电源组件35和音频组件36等其它组件。图3中仅示意性给出部分组件,并不意味着中心管控设备只包括图3所示组件。另外,图3中虚线框内的组件为可选组件,具体可视中心管控设备实现形态而定。如果中心管控设备是服务器形态的设备,可选地,可以不包括显示器34和音频组件36;若中心管控设备是终端设备形态的设备,可选地,可以包括显示器34和音频组件36。Furthermore, as shown in FIG. 3, the central management and control device further includes: a display 34, a power supply component 35, an audio component 36 and other components. Only some of the components are schematically shown in FIG. 3, which does not mean that the central control equipment only includes the components shown in FIG. In addition, the components in the dashed box in Figure 3 are optional components, which may be determined by the implementation of the central control equipment. If the central management and control device is a server-shaped device, it may optionally not include the display 34 and the audio component 36; if the central management and control device is a terminal device-type device, it may optionally include the display 34 and the audio component 36.
相应地,本申请实施例还提供一种存储有计算机程序的计算机可读存储介质,计算机程序被一个或多个处理器执行时,致使一个或多个处理器实现上述方法实施例中可由中心管控设备执行的各步骤或操作。Correspondingly, an embodiment of the present application also provides a computer-readable storage medium storing a computer program. When the computer program is executed by one or more processors, the one or more processors can implement the above method in the above-mentioned method embodiments. Steps or operations performed by the equipment.
上述图3中的通信组件被配置为便于通信组件所在设备和其他设备之间有线或无线方式的通信。通信组件所在设备可以接入基于通信标准的无线网络,如WiFi,2G或3G,或它们的组合。在一个示例性实施例中,通信组件经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中,所述通信组件还可以包括近场通信(NFC)模块,射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术等。The communication component in FIG. 3 is configured to facilitate wired or wireless communication between the device where the communication component is located and other devices. The device where the communication component is located can access a wireless network based on communication standards, such as WiFi, 2G or 3G, or a combination of them. In an exemplary embodiment, the communication component receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component may further include a near field communication (NFC) module, radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, and Bluetooth (BT) technology Wait.
上述图3中的显示器包括屏幕,其屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板,屏幕可以被实现为触摸屏,以接收来自用户的输入信号。触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。所述触摸传感器可以不仅感测触摸或滑动动作的边界,而且还检测与所述触摸或滑动操作相关的持续时间和压力。The display in FIG. 3 described above includes a screen, and the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touch, sliding, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure related to the touch or slide operation.
上述图3中的电源组件,为电源组件所在设备的各种组件提供电力。电源组件可以 包括电源管理系统,一个或多个电源,及其他与为电源组件所在设备生成、管理和分配电力相关联的组件。The power supply components in Figure 3 above provide power for various components of the equipment where the power supply components are located. The power supply component may include a power management system, one or more power supplies, and other components associated with the generation, management, and distribution of power for the equipment where the power supply component is located.
上述图3中的音频组件,可被配置为输出和/或输入音频信号。例如,音频组件包括一个麦克风(MIC),当音频组件所在设备处于操作模式,如呼叫模式、记录模式和语音识别模式时,麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器或经由通信组件发送。在一些实施例中,音频组件还包括一个扬声器,用于输出音频信号。The audio component in FIG. 3 may be configured to output and/or input audio signals. For example, the audio component includes a microphone (MIC). When the device where the audio component is located is in an operating mode, such as call mode, recording mode, and voice recognition mode, the microphone is configured to receive external audio signals. The received audio signal can be further stored in a memory or sent via a communication component. In some embodiments, the audio component further includes a speaker for outputting audio signals.
本领域内的技术人员应明白,本发明的实施例可提供为方法、系统、或计算机程序产品。因此,本发明可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present invention may be provided as methods, systems, or computer program products. Therefore, the present invention may adopt the form of a complete hardware embodiment, a complete software embodiment, or an embodiment combining software and hardware. Moreover, the present invention may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program codes.
本发明是参照根据本发明实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the present invention. It should be understood that each process and/or block in the flowchart and/or block diagram, and the combination of processes and/or blocks in the flowchart and/or block diagram can be implemented by computer program instructions. These computer program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or other programmable data processing equipment to generate a machine, so that the instructions executed by the processor of the computer or other programmable data processing equipment are generated It is a device that realizes the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions can also be stored in a computer-readable memory that can guide a computer or other programmable data processing equipment to work in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including the instruction device. The device implements the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing equipment, so that a series of operation steps are executed on the computer or other programmable equipment to produce computer-implemented processing, so as to execute on the computer or other programmable equipment. The instructions provide steps for implementing functions specified in a flow or multiple flows in the flowchart and/or a block or multiple blocks in the block diagram.
在一个典型的配置中,计算设备包括一个或多个处理器(CPU)、输入/输出接口、网络接口和内存。In a typical configuration, the computing device includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.
内存可能包括计算机可读介质中的非永久性存储器,随机存取存储器(RAM)和/或 非易失性内存等形式,如只读存储器(ROM)或闪存(flash RAM)。内存是计算机可读介质的示例。The memory may include non-permanent memory in computer readable media, random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of computer readable media.
计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定,计算机可读介质不包括暂存电脑可读媒体(transitory media),如调制的数据信号和载波。Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology. The information can be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices. According to the definition in this article, computer-readable media does not include transitory media, such as modulated data signals and carrier waves.
还需要说明的是,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、商品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、商品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、商品或者设备中还存在另外的相同要素。It should also be noted that the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, product or equipment including a series of elements not only includes those elements, but also includes Other elements that are not explicitly listed, or include elements inherent to this process, method, commodity, or equipment. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, method, commodity, or equipment that includes the element.
以上所述仅为本申请的实施例而已,并不用于限制本申请。对于本领域技术人员来说,本申请可以有各种更改和变化。凡在本申请的精神和原理之内所作的任何修改、等同替换、改进等,均应包含在本申请的权利要求范围之内。The above descriptions are only examples of this application and are not used to limit this application. For those skilled in the art, this application can have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of this application shall be included in the scope of the claims of this application.
Claims (25)
- 一种实例管控方法,其特征在于,包括:An example management and control method, characterized in that it includes:确定部署于网络系统中至少一个边缘云节点中的至少一个实例,所述至少一个实例可为服务需求方提供云计算服务;Determine at least one instance deployed in at least one edge cloud node in the network system, where the at least one instance can provide cloud computing services for the service demander;对所述至少一个实例进行管控,以供所述至少一个实例为所述服务需求方提供云计算服务。The at least one instance is managed and controlled, so that the at least one instance provides cloud computing services for the service demander.
- 根据权利要求1所述的方法,其特征在于,对所述至少一个实例进行的管控包括:升级、迁移、关停、重启和释放中的至少一种。The method according to claim 1, wherein the management and control of the at least one instance includes at least one of upgrade, migration, shutdown, restart, and release.
- 根据权利要求2所述的方法,其特征在于,对所述至少一个实例进行升级,包括:The method of claim 2, wherein upgrading the at least one instance comprises:从所述至少一个实例中确定待升级实例;Determine the instance to be upgraded from the at least one instance;向所述服务需求方发送升级请求,以供所述服务需求方结合所述待升级实例上的业务情况为所述待升级实例确定升级策略;Sending an upgrade request to the service demander, so that the service demander determines an upgrade strategy for the instance to be upgraded in combination with the business situation on the instance to be upgraded;接收所述服务需求方返回的升级策略,依据所述升级策略对所述待升级实例进行升级。The upgrade strategy returned by the service demander is received, and the instance to be upgraded is upgraded according to the upgrade strategy.
- 根据权利要求3所述的方法,其特征在于,从所述至少一个实例中确定待升级实例,包括:The method according to claim 3, wherein determining the instance to be upgraded from the at least one instance comprises:接收所述服务需求方发送的升级描述信息,所述升级描述信息包括实例过滤条件;Receiving upgrade description information sent by the service demander, where the upgrade description information includes instance filter conditions;根据所述实例过滤条件,从所述至少一个实例中确定所述待升级实例。According to the instance filter condition, the instance to be upgraded is determined from the at least one instance.
- 根据权利要求4所述的方法,其特征在于,所述升级描述信息还包括:升级所需的镜像版本;则,依据所述升级策略对所述待升级实例进行升级,包括:The method according to claim 4, wherein the upgrade description information further includes: an image version required for the upgrade; then, upgrading the instance to be upgraded according to the upgrade strategy includes:依据所述升级策略,利用所述镜像版本对应的镜像对所述待升级实例进行升级。According to the upgrade strategy, the instance to be upgraded is upgraded using the image corresponding to the image version.
- 根据权利要求5所述的方法,其特征在于,依据所述升级策略,利用所述镜像版本对应的镜像对所述待升级实例进行升级,包括:The method according to claim 5, wherein, according to the upgrade strategy, upgrading the instance to be upgraded using the image corresponding to the image version comprises:将所述升级策略和所述镜像版本对应的镜像提供给所述网络系统中的边缘管控设备,以供所述边缘管控设备依据所述升级策略,利用所述镜像版本对应的镜像对所述待升级实例进行升级。The upgrade strategy and the mirror corresponding to the mirror version are provided to the edge management and control device in the network system, so that the edge management and control device uses the mirror corresponding to the mirror version to perform the processing on the waiting device according to the upgrade strategy. Upgrade the instance to upgrade.
- 根据权利要求3-6任一项所述的方法,其特征在于,所述升级策略包括:是否升级、升级时间和升级方法中的至少一个信息。The method according to any one of claims 3-6, wherein the upgrade strategy includes at least one of information of whether to upgrade, upgrade time, and upgrade method.
- 根据权利要求2所述的方法,其特征在于,对所述至少一个实例进行迁移,包括:The method according to claim 2, wherein migrating the at least one instance comprises:从所述至少一个实例中确定待迁移实例,所述待迁移实例属于第一边缘云节点;Determine an instance to be migrated from the at least one instance, where the instance to be migrated belongs to the first edge cloud node;若所述第一边缘云节点满足节点内迁移条件,对所述待迁移实例进行边缘云节点内的迁移;If the first edge cloud node meets the intra-node migration condition, perform intra-edge cloud node migration on the instance to be migrated;若所述第一边缘云节点不满足节点内迁移条件,对所述待迁移实例进行跨边缘云节点的迁移。If the first edge cloud node does not meet the intra-node migration condition, perform cross-edge cloud node migration on the instance to be migrated.
- 根据权利要求8所述的方法,其特征在于,从所述至少一个实例中确定待迁移实例,包括:The method according to claim 8, wherein determining the instance to be migrated from the at least one instance comprises:根据所述至少一个实例的状态,将出现故障的实例和/或运行过程中发生指定事件的实例作为所述待迁移实例;或者According to the state of the at least one instance, use the failed instance and/or the instance in which a specified event occurs during operation as the instance to be migrated; or根据资源归并需求,从至少一个实例中确定待迁移实例。According to resource merging requirements, an instance to be migrated is determined from at least one instance.
- 根据权利要求9所述的方法,其特征在于,根据资源归并需求,从至少一个实例中确定待迁移实例,包括:The method according to claim 9, characterized in that, determining the instance to be migrated from at least one instance according to resource merging requirements comprises:根据资源归并需求,确定需要资源归并的第一边缘云节点;According to resource merging requirements, determine the first edge cloud node that needs resource merging;结合所述第一边缘云节点中各资源设备上剩余的可用资源和所述第一边缘云节点中各实例需要的资源,确定所述待迁移实例。Combining the remaining available resources on each resource device in the first edge cloud node and the resources required by each instance in the first edge cloud node, determine the instance to be migrated.
- 根据权利要求8所述的方法,其特征在于,还包括:The method according to claim 8, further comprising:判断所述第一边缘云节点是否处于可用状态;Determine whether the first edge cloud node is in an available state;若所述第一边缘云节点处于可用状态,判断所述第一边缘云节点的可用资源是否足够承载所述待迁移实例;If the first edge cloud node is in an available state, determining whether the available resources of the first edge cloud node are sufficient to carry the instance to be migrated;若所述第一边缘云节点的可用资源足够承载所述待迁移实例,则确定所述第一边缘云节点满足节点内迁移条件;If the available resources of the first edge cloud node are sufficient to carry the instance to be migrated, determining that the first edge cloud node meets the intra-node migration condition;若所述第一边缘云节点处于不可用状态,或者所述第一边缘云节点的可用资源不足以承载所述待迁移实例,则确定所述第一边缘云节点不满足节点内迁移条件。If the first edge cloud node is in an unavailable state, or the available resources of the first edge cloud node are insufficient to carry the instance to be migrated, it is determined that the first edge cloud node does not meet the intra-node migration condition.
- 根据权利要求8-11任一项所述的方法,其特征在于,对所述待迁移实例进行跨边缘云节点的迁移,包括:The method according to any one of claims 8-11, wherein the migration of the instance to be migrated across edge cloud nodes comprises:从所述至少一个边缘云节点选择第二边缘云节点,所述第二边缘云节点不同于所述第一边缘云节点;Selecting a second edge cloud node from the at least one edge cloud node, where the second edge cloud node is different from the first edge cloud node;将所述待迁移实例迁移到所述第二边缘云节点中,并将所述待迁移实例在所述第二边缘云节点中的属性信息发送给所述服务需求方,以供所述服务需求方基于所述属性信息针对所述待迁移实例进行业务调度。Migrate the instance to be migrated to the second edge cloud node, and send the attribute information of the instance to be migrated in the second edge cloud node to the service demander for the service demand The party performs service scheduling for the instance to be migrated based on the attribute information.
- 根据权利要求12所述的方法,其特征在于,若所述待迁移实例是运行过程中发 生指定事件的实例,则,将所述待迁移实例迁移到所述第二边缘云节点中,包括:The method according to claim 12, wherein if the instance to be migrated is an instance in which a specified event occurs during operation, migrating the instance to be migrated to the second edge cloud node comprises:向所述服务需求方发送迁移请求,以供所述服务需求方结合所述待迁移实例上的业务情况为所述待迁移实例确定迁移策略;Sending a migration request to the service demander, so that the service demander determines a migration strategy for the instance to be migrated in combination with the business situation on the instance to be migrated;接收所述服务需求方发送的迁移策略,依据所述迁移策略,将所述待迁移实例迁移到所述第二边缘云节点中。Receive the migration strategy sent by the service demander, and migrate the instance to be migrated to the second edge cloud node according to the migration strategy.
- 根据权利要求12所述的方法,其特征在于,将所述待迁移实例迁移到所述第二边缘云节点中,包括:The method according to claim 12, wherein migrating the instance to be migrated to the second edge cloud node comprises:根据所述待迁移实例的资源需求,控制所述第二边缘云节点中相应资源设备为所述待迁移实例进行资源预留或分配;According to the resource requirements of the instance to be migrated, controlling the corresponding resource device in the second edge cloud node to reserve or allocate resources for the instance to be migrated;在资源预留或分配成功后,将所述待迁移实例迁移到所述第二边缘云节点中相应资源设备预留或分配的资源上。After the resource reservation or allocation is successful, the instance to be migrated is migrated to the resource reserved or allocated by the corresponding resource device in the second edge cloud node.
- 根据权利要求14所述的方法,其特征在于,将所述待迁移实例迁移到所述第二边缘云节点中相应资源设备预留或分配的资源上,包括:The method according to claim 14, wherein the migrating the instance to be migrated to the resource reserved or allocated by the corresponding resource device in the second edge cloud node comprises:控制所述第二边缘云节点中相应资源设备根据所述待迁移实例对应的镜像或实例快照在预留或分配的资源上创建所述待迁移实例。Control the corresponding resource device in the second edge cloud node to create the instance to be migrated on the reserved or allocated resources according to the image or instance snapshot corresponding to the instance to be migrated.
- 根据权利要求15所述的方法,其特征在于,控制所述第二边缘云节点中相应资源设备根据所述待迁移实例对应的镜像或实例快照在预留或分配的资源上创建所述待迁移实例,包括:The method according to claim 15, wherein the corresponding resource device in the second edge cloud node is controlled to create the to-be-migrated resource on the reserved or allocated resource according to the mirror image or instance snapshot corresponding to the to-be-migrated instance. Examples include:向所述网络系统中的边缘管控设备发送迁移指令,所述迁移指令指示所述边缘管控设备获取所述待迁移实例对应的镜像或实例快照并提供给所述第二边缘云节点中相应资源设备,以供所述相应资源设备在预留或分配的资源上创建所述待迁移实例。Send a migration instruction to the edge management and control device in the network system, where the migration instruction instructs the edge management and control device to obtain the image or instance snapshot corresponding to the instance to be migrated and provide it to the corresponding resource device in the second edge cloud node , So that the corresponding resource device can create the instance to be migrated on the reserved or allocated resource.
- 根据权利要求12所述的方法,其特征在于,还包括:The method according to claim 12, further comprising:接收所述服务需求方发送的释放通知,并依据所述释放通知将运行在所述第一边缘云节点中的所述待迁移实例释放掉;其中,在迁移过程中,所述待迁移实例继续运行在所述第一边缘云节点中;Receive a release notification sent by the service demander, and release the instance to be migrated running in the first edge cloud node according to the release notification; wherein, during the migration process, the instance to be migrated continues Running in the first edge cloud node;其中,所述释放通知是所述服务需求方在确定运行于所述第一边缘云节点中的所述待迁移实例上不再有任何业务请求之后发送的。Wherein, the release notification is sent by the service demander after determining that there is no longer any service request on the instance to be migrated running in the first edge cloud node.
- 一种网络系统,其特征在于,包括:中心管控设备,以及至少一个边缘云节点;A network system, characterized by comprising: a central management and control device, and at least one edge cloud node;所述至少一个边缘云节点中部署有至少一个实例,所述至少一个实例可为服务需求方提供云计算服务;At least one instance is deployed in the at least one edge cloud node, and the at least one instance can provide cloud computing services for service demanders;所述中心管控设备,用于对所述至少一个实例进行管控,以供所述至少一个实例为所述服务需求方提供云计算服务。The central management and control device is configured to manage and control the at least one instance, so that the at least one instance provides cloud computing services for the service demander.
- 根据权利要求18所述的网络系统,其特征在于,所述中心管控设备对所述至少一个实例的管控包括:升级、迁移、关停、重启和释放中的至少一种。The network system according to claim 18, wherein the management and control of the at least one instance by the central management and control device includes at least one of upgrade, migration, shutdown, restart, and release.
- 根据权利要求19所述的网络系统,其特征在于,所述中心管控设备在对所述至少一个实例进行升级时,具体用于:The network system according to claim 19, wherein, when the central management and control device upgrades the at least one instance, it is specifically configured to:从所述至少一个实例中确定待升级实例;Determine the instance to be upgraded from the at least one instance;向所述服务需求方发送升级请求,以供所述服务需求方结合所述待升级实例上的业务情况为所述待升级实例确定升级策略;Sending an upgrade request to the service demander, so that the service demander determines an upgrade strategy for the instance to be upgraded in combination with the business situation on the instance to be upgraded;接收所述服务需求方发送的升级策略,依据所述升级策略对所述待升级实例进行升级。The upgrade strategy sent by the service demander is received, and the instance to be upgraded is upgraded according to the upgrade strategy.
- 根据权利要求19所述的网络系统,其特征在于,所述中心管控设备在对所述至少一个实例进行迁移时,具体用于:The network system according to claim 19, wherein, when the central management and control device migrates the at least one instance, it is specifically configured to:从所述至少一个实例中确定待迁移实例,并确定所述待迁移实例所属的第一边缘云节点;Determine the instance to be migrated from the at least one instance, and determine the first edge cloud node to which the instance to be migrated belongs;若所述第一边缘云节点满足节点内迁移条件,对所述待迁移实例进行边缘云节点内的迁移;If the first edge cloud node meets the intra-node migration condition, perform intra-edge cloud node migration on the instance to be migrated;若所述第一边缘云节点不满足节点内迁移条件,对所述待迁移实例进行跨边缘云节点的迁移。If the first edge cloud node does not meet the intra-node migration condition, perform cross-edge cloud node migration on the instance to be migrated.
- 根据权利要求21所述的网络系统,其特征在于,所述中心管控设备在对所述待迁移实例进行跨边缘云节点的迁移时,具体用于:The network system according to claim 21, wherein the central management and control device is specifically configured to: when migrating the instance to be migrated across edge cloud nodes:从所述至少一个边缘云节点选择第二边缘云节点,所述第二边缘云节点不同于所述第一边缘云节点;Selecting a second edge cloud node from the at least one edge cloud node, where the second edge cloud node is different from the first edge cloud node;将所述待迁移实例迁移到所述第二边缘云节点中,并将所述待迁移实例在所述第二边缘云节点中的属性信息发送给所述服务需求方,以供所述服务需求方基于所述属性信息针对所述待迁移实例进行业务调度。Migrate the instance to be migrated to the second edge cloud node, and send the attribute information of the instance to be migrated in the second edge cloud node to the service demander for the service demand The party performs service scheduling for the instance to be migrated based on the attribute information.
- 根据权利要求18-22任一项所述的网络系统,其特征在于,还包括:边缘管控设备;The network system according to any one of claims 18-22, further comprising: edge management and control equipment;所述边缘管控设备,用于配合所述中心管控设备对所述至少一个实例进行管控。The edge management and control device is used to cooperate with the central management and control device to manage and control the at least one instance.
- 一种中心管控设备,其特征在于,包括:存储器和处理器;A central management and control device, which is characterized by comprising: a memory and a processor;所述存储器,用于存储计算机程序;当所述计算机程序被所述处理器执行时,致使所述处理器实现权利要求1-17任一项所述方法中的步骤。The memory is configured to store a computer program; when the computer program is executed by the processor, the processor is caused to implement the steps in the method of any one of claims 1-17.
- 一种存储有计算机程序的计算机可读存储介质,其特征在于,当所述计算机程序被一个或多个处理器执行时,致使所述一个或多个处理器实现权利要求1-17任一项所述方法中的步骤。A computer-readable storage medium storing a computer program, wherein when the computer program is executed by one or more processors, the one or more processors are caused to implement any one of claims 1-17 The steps in the method.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910277465.4A CN111800282B (en) | 2019-04-08 | 2019-04-08 | Network system, instance management and control method, device and storage medium |
CN201910277465.4 | 2019-04-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020207266A1 true WO2020207266A1 (en) | 2020-10-15 |
Family
ID=72751930
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/081570 WO2020207266A1 (en) | 2019-04-08 | 2020-03-27 | Network system, instance management method, device, and storage medium |
Country Status (2)
Country | Link |
---|---|
CN (2) | CN116170316A (en) |
WO (1) | WO2020207266A1 (en) |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113992499A (en) * | 2021-11-16 | 2022-01-28 | 中国电信集团系统集成有限责任公司 | Disaster recovery method, storage medium and system based on dynamic migration of services |
CN113992675A (en) * | 2021-10-26 | 2022-01-28 | 云知声(上海)智能科技有限公司 | IOT cloud platform and edge gateway cooperative work method, system and storage medium |
CN114301775A (en) * | 2021-12-31 | 2022-04-08 | 中国联合网络通信集团有限公司 | Inventory business nano-management method and device and computer readable storage medium |
CN114553726A (en) * | 2022-02-23 | 2022-05-27 | 深圳市众功软件有限公司 | Network security operation and maintenance method and system based on function and resource level |
CN114598654A (en) * | 2022-01-30 | 2022-06-07 | 阿里巴巴(中国)有限公司 | Content delivery network CDN-based flow equalization processing method and device |
CN114760304A (en) * | 2022-03-30 | 2022-07-15 | 中国电信股份有限公司 | Computing power information processing method and system and computing power gateway |
CN115002681A (en) * | 2021-03-02 | 2022-09-02 | 中国移动通信有限公司研究院 | Computing power sensing network and using method and storage medium thereof |
CN115361389A (en) * | 2022-10-20 | 2022-11-18 | 阿里巴巴(中国)有限公司 | Cloud computing instance creation method and device |
CN115514817A (en) * | 2021-06-23 | 2022-12-23 | 中国移动通信有限公司研究院 | Information processing method, information processing equipment and computer readable storage medium |
WO2023082749A1 (en) * | 2021-11-15 | 2023-05-19 | 中电信数智科技有限公司 | Service recovery method and system based on mec edge cloud, and storage medium |
WO2024037439A1 (en) * | 2022-08-17 | 2024-02-22 | 维沃移动通信有限公司 | Computing power task migration method and apparatus, and device |
US12088460B1 (en) * | 2023-09-20 | 2024-09-10 | Verizon Patent And Licensing Inc. | Systems and methods for seamless edge service transfer |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112486667B (en) * | 2020-11-03 | 2022-03-18 | 深圳市中博科创信息技术有限公司 | Method and device for accurately processing data based on edge calculation |
CN112769897B (en) * | 2020-12-21 | 2023-04-18 | 北京百度网讯科技有限公司 | Synchronization method and device of edge calculation message, electronic equipment and storage medium |
CN114760313B (en) * | 2020-12-29 | 2023-11-24 | 中国联合网络通信集团有限公司 | Service scheduling method and service scheduling device |
CN113190378B (en) * | 2020-12-31 | 2024-04-02 | 华数云科技有限公司 | Edge cloud disaster recovery method based on distributed cloud platform |
CN113296903A (en) * | 2021-02-01 | 2021-08-24 | 阿里巴巴集团控股有限公司 | Edge cloud system, edge control method, control node and storage medium |
CN112995682B (en) * | 2021-04-21 | 2021-08-03 | 军事科学院系统工程研究院网络信息研究所 | Method and device for deploying and migrating video cloud service |
CN113259359B (en) * | 2021-05-21 | 2022-08-02 | 重庆紫光华山智安科技有限公司 | Edge node capability supplementing method, system, medium and electronic terminal |
CN113572821B (en) * | 2021-07-05 | 2024-06-04 | 山东师范大学 | Edge cloud node task cooperative processing method and system |
CN113342478B (en) * | 2021-08-04 | 2022-02-01 | 阿里云计算有限公司 | Resource management method, device, network system and storage medium |
CN114338166B (en) * | 2021-12-29 | 2024-07-02 | 支付宝(杭州)信息技术有限公司 | Edge equipment risk processing method, device, equipment and cloud server |
CN114301809B (en) * | 2021-12-31 | 2024-02-09 | 郑州云海信息技术有限公司 | Edge computing platform architecture |
CN114401183A (en) * | 2022-01-17 | 2022-04-26 | 杭州瑞网广通信息技术有限公司 | Edge cloud disaster recovery system, method and device based on distributed cloud platform |
CN116094923B (en) * | 2023-01-30 | 2023-08-25 | 杭州优云科技有限公司 | Gateway updating method and device after cloud instance migration and electronic equipment |
CN116887220B (en) * | 2023-08-10 | 2024-05-24 | 谷梵科技(青田)有限公司 | V2X service high availability method, system, device and storage medium based on cloud edge cooperation |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101977242A (en) * | 2010-11-16 | 2011-02-16 | 西安电子科技大学 | Layered distributed cloud computing architecture and service delivery method |
US20130311551A1 (en) * | 2010-04-07 | 2013-11-21 | Limelight Networks, Inc. | Edge-based resource spin-up for cloud computing |
WO2019042000A1 (en) * | 2017-08-31 | 2019-03-07 | 华为技术有限公司 | Instance switching method and associated device |
US20190087231A1 (en) * | 2017-09-19 | 2019-03-21 | University-Industry Cooperation Group Of Kyung-Hee University | System of cloud computing and method for detaching load in cloud computing system |
CN110266744A (en) * | 2019-02-27 | 2019-09-20 | 中国联合网络通信集团有限公司 | Location-based edge cloud resource dispatching method and system |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9141487B2 (en) * | 2013-01-15 | 2015-09-22 | Microsoft Technology Licensing, Llc | Healing cloud services during upgrades |
CN107018539A (en) * | 2016-01-27 | 2017-08-04 | 中兴通讯股份有限公司 | The ambulant processing method and processing device of application |
CN107295699A (en) * | 2016-03-30 | 2017-10-24 | 中兴通讯股份有限公司 | The terminating approach and device of application example, using, edge calculations platform, node |
CN113194157B (en) * | 2017-06-30 | 2022-10-28 | 华为技术有限公司 | Method and device for converting application instance address |
CN108632813B (en) * | 2018-05-21 | 2021-05-28 | 北京邮电大学 | Mobility management method and system for mobile edge computing |
CN109302483B (en) * | 2018-10-17 | 2021-02-02 | 网宿科技股份有限公司 | Application program management method and system |
-
2019
- 2019-04-08 CN CN202310139017.4A patent/CN116170316A/en active Pending
- 2019-04-08 CN CN201910277465.4A patent/CN111800282B/en active Active
-
2020
- 2020-03-27 WO PCT/CN2020/081570 patent/WO2020207266A1/en active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130311551A1 (en) * | 2010-04-07 | 2013-11-21 | Limelight Networks, Inc. | Edge-based resource spin-up for cloud computing |
CN101977242A (en) * | 2010-11-16 | 2011-02-16 | 西安电子科技大学 | Layered distributed cloud computing architecture and service delivery method |
WO2019042000A1 (en) * | 2017-08-31 | 2019-03-07 | 华为技术有限公司 | Instance switching method and associated device |
US20190087231A1 (en) * | 2017-09-19 | 2019-03-21 | University-Industry Cooperation Group Of Kyung-Hee University | System of cloud computing and method for detaching load in cloud computing system |
CN110266744A (en) * | 2019-02-27 | 2019-09-20 | 中国联合网络通信集团有限公司 | Location-based edge cloud resource dispatching method and system |
Non-Patent Citations (2)
Title |
---|
ALIBABA CLOUD COMPUTING CO., LTD. ET AL.: "Non-official translation: Information Technology, Cloud Computing, General Technical Requirements for Edge Cloud Computing", NON-OFFICIAL TRANSLATION: CHINA OPEN SOURCE CLOUD LEAGUE STANDARD COSCL XXXX-2019, 23 September 2019 (2019-09-23), DOI: 20200618134028PX * |
ETRI ET AL.: "FS_CAV – Update requirements for massive wireless sensor networks use case", S1-174488, 3GPP TSG-SA WG1 MEETING #80, 1 December 2017 (2017-12-01), XP051379105, DOI: 20200617010703A * |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115002681A (en) * | 2021-03-02 | 2022-09-02 | 中国移动通信有限公司研究院 | Computing power sensing network and using method and storage medium thereof |
CN115514817A (en) * | 2021-06-23 | 2022-12-23 | 中国移动通信有限公司研究院 | Information processing method, information processing equipment and computer readable storage medium |
CN113992675A (en) * | 2021-10-26 | 2022-01-28 | 云知声(上海)智能科技有限公司 | IOT cloud platform and edge gateway cooperative work method, system and storage medium |
WO2023082749A1 (en) * | 2021-11-15 | 2023-05-19 | 中电信数智科技有限公司 | Service recovery method and system based on mec edge cloud, and storage medium |
CN113992499B (en) * | 2021-11-16 | 2023-08-15 | 中电信数智科技有限公司 | Disaster recovery method, storage medium and system based on service dynamic migration |
CN113992499A (en) * | 2021-11-16 | 2022-01-28 | 中国电信集团系统集成有限责任公司 | Disaster recovery method, storage medium and system based on dynamic migration of services |
CN114301775A (en) * | 2021-12-31 | 2022-04-08 | 中国联合网络通信集团有限公司 | Inventory business nano-management method and device and computer readable storage medium |
CN114301775B (en) * | 2021-12-31 | 2023-07-28 | 中国联合网络通信集团有限公司 | Method and device for managing stock service and computer readable storage medium |
CN114598654A (en) * | 2022-01-30 | 2022-06-07 | 阿里巴巴(中国)有限公司 | Content delivery network CDN-based flow equalization processing method and device |
CN114553726A (en) * | 2022-02-23 | 2022-05-27 | 深圳市众功软件有限公司 | Network security operation and maintenance method and system based on function and resource level |
CN114760304A (en) * | 2022-03-30 | 2022-07-15 | 中国电信股份有限公司 | Computing power information processing method and system and computing power gateway |
CN114760304B (en) * | 2022-03-30 | 2024-09-27 | 中国电信股份有限公司 | Processing method, processing system and computing gateway of computing information |
WO2024037439A1 (en) * | 2022-08-17 | 2024-02-22 | 维沃移动通信有限公司 | Computing power task migration method and apparatus, and device |
CN115361389A (en) * | 2022-10-20 | 2022-11-18 | 阿里巴巴(中国)有限公司 | Cloud computing instance creation method and device |
CN115361389B (en) * | 2022-10-20 | 2023-04-11 | 阿里巴巴(中国)有限公司 | Cloud computing instance creating method and device |
US12088460B1 (en) * | 2023-09-20 | 2024-09-10 | Verizon Patent And Licensing Inc. | Systems and methods for seamless edge service transfer |
Also Published As
Publication number | Publication date |
---|---|
CN111800282B (en) | 2023-03-28 |
CN111800282A (en) | 2020-10-20 |
CN116170316A (en) | 2023-05-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020207266A1 (en) | Network system, instance management method, device, and storage medium | |
WO2020207265A1 (en) | Network system, management and control method and device, and storage medium | |
WO2020207264A1 (en) | Network system, service provision and resource scheduling method, device, and storage medium | |
CN115633050B (en) | Mirror image management method, device and storage medium | |
US11658916B2 (en) | Simple integration of an on-demand compute environment | |
WO2022161430A1 (en) | Edge cloud system, edge management and control method, management and control node, and storage medium | |
CN113169952B (en) | Container cloud management system based on block chain technology | |
WO2022007552A1 (en) | Processing node management method, configuration method and related apparatus | |
US11356385B2 (en) | On-demand compute environment | |
WO2020147330A1 (en) | Data stream processing method and system | |
CN111800285B (en) | Instance migration method and device and electronic equipment | |
CN113296882A (en) | Container arranging method, device, system and storage medium | |
WO2013104217A1 (en) | Cloud infrastructure based management system and method for performing maintenance and deployment for application system | |
CN113301078A (en) | Network system, service deployment and network division method, device and storage medium | |
WO2020063550A1 (en) | Policy decision method, apparatus and system, and storage medium, policy decision unit and cluster | |
WO2024164894A1 (en) | Method for traffic control and data replication, node, system, and storage medium | |
CN114296891A (en) | Task scheduling method, system, computing device, storage medium and program product | |
CN114301909B (en) | Edge distributed management and control system, method, equipment and storage medium | |
CN113138717B (en) | Node deployment method, device and storage medium | |
CN113918297A (en) | Distributed scheduling system, distributed scheduling method, device and medium | |
CN114327752A (en) | Micro-service configuration method, device and equipment | |
CN118659974A (en) | Method, apparatus, computer device, readable storage medium, and computer program product for expanding and contracting capacity of data collector | |
CN117596243A (en) | Big data component monitoring operation and maintenance platform construction method, system, equipment and medium | |
CN115098259A (en) | Resource management method and device, cloud platform, equipment and storage medium | |
JP2020113148A (en) | Virtual base management device, method for managing virtual base, and virtual base management program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20787874 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20787874 Country of ref document: EP Kind code of ref document: A1 |