CN110968444A - Data processing method and device of cloud computing platform - Google Patents

Data processing method and device of cloud computing platform Download PDF

Info

Publication number
CN110968444A
CN110968444A CN201811162037.9A CN201811162037A CN110968444A CN 110968444 A CN110968444 A CN 110968444A CN 201811162037 A CN201811162037 A CN 201811162037A CN 110968444 A CN110968444 A CN 110968444A
Authority
CN
China
Prior art keywords
cloud computing
computing platform
physical machine
control node
virtual machine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811162037.9A
Other languages
Chinese (zh)
Inventor
张崛起
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201811162037.9A priority Critical patent/CN110968444A/en
Publication of CN110968444A publication Critical patent/CN110968444A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0709Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a distributed system consisting of a plurality of standalone computer nodes, e.g. clusters, client-server systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45562Creating, deleting, cloning virtual machine instances
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/815Virtual

Abstract

The invention discloses a data processing method and device of a cloud computing platform. The method comprises the following steps: detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down or not, wherein a component of the control node is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine; under the condition of downtime, a physical machine which controls normal operation in the cloud computing platform acquires an image file of a control node through a network file system and installs a virtual machine, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform; and running the service of the control node of the cloud computing platform on the operating system of the newly installed virtual machine. By the method and the device, the effect of automatically recovering the management capability after the cloud platform controls the node to be down is achieved.

Description

Data processing method and device of cloud computing platform
Technical Field
The invention relates to the field of computers, in particular to a data processing method and device of a cloud computing platform.
Background
Cloud computing platforms, such as OpenStack cloud computing platforms, have been widely applied to the construction of private cloud platforms of various large companies, and currently used deployment architectures are mainly divided into two types, one type is to deploy all components of a control node to the same host, and the other type is to deploy each component to multiple hosts, both types of architectures have defects, and once any host that deploys the control node components goes down, the cloud platform is not manageable.
Aiming at the problem that a cloud platform is unmanageable due to down of a control node in the related art, an effective solution is not provided at present.
Disclosure of Invention
The invention mainly aims to provide a data processing method and device for a cloud computing platform, and aims to solve the problem that the cloud platform is unmanageable due to down of a control node.
In order to achieve the above object, according to an aspect of the present invention, there is provided a data processing method of a cloud computing platform, the method including: detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down, wherein components of the control node are deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine; under the condition of downtime, a physical machine which controls normal operation in the cloud computing platform acquires an image file of the control node through a network file system and installs a virtual machine, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and the files in the preset directory are set to be read and written by other physical machines in the cloud computing platform; and running the service of the control node of the cloud computing platform on the operating system of the newly installed virtual machine.
Further, before detecting whether a first physical machine in a cloud computing platform is down or whether a control node installed on the first physical machine is down, the method further includes: building a network file system on the second physical machine; creating a virtual machine on a first physical machine, and storing an image file generated by the virtual machine in a preset directory of the network file system; deploying a component of a control node of the cloud computing platform onto an operating system of a virtual machine of the first physical machine; deploying components of computing nodes of the cloud computing platform onto each physical machine of the cloud computing platform.
Further, before controlling a physical machine normally running in the cloud computing platform to acquire an image file of the control node through a network file system and install a virtual machine, the method further includes: and storing the operation data file generated in the operation process of the cloud computing platform in a preset directory of the network file system in real time.
Further, before detecting whether a first physical machine in a cloud computing platform is down or whether a control node installed on the first physical machine is down, the method further includes: deploying a python program to any physical machine in the cloud computing platform except the first physical machine; the method for detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down comprises the following steps: detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down or not through a python program.
Further, the cloud computing platform comprises an OpenStack cloud computing platform, and the virtual machines comprise centros 7 virtual machines created using a system virtualization module.
In order to achieve the above object, according to another aspect of the present invention, there is also provided a data processing apparatus of a cloud computing platform, the apparatus including: the cloud computing system comprises a detection unit, a processing unit and a control unit, wherein the detection unit is used for detecting whether a first physical machine in a cloud computing platform is down or not or detecting whether a control node installed on the first physical machine is down, wherein a component of the control node is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine; the acquiring unit is used for controlling a physical machine which normally runs in the cloud computing platform to acquire an image file of the control node through a network file system and install a virtual machine under the condition of downtime, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and the files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform; the first processing unit is used for running the service of the control node of the cloud computing platform on the operating system of the newly installed virtual machine.
Further, the apparatus further comprises: the device comprises a building unit and a control unit, wherein the building unit is used for building a network file system on a second physical machine before detecting whether a first physical machine in a cloud computing platform is down or not or detecting whether a control node installed on the first physical machine is down; the system comprises a creating unit, a storage unit and a processing unit, wherein the creating unit is used for creating a virtual machine on a first physical machine and storing an image file generated by the virtual machine in a preset directory of the network file system; a second processing unit, configured to deploy a component of a control node of the cloud computing platform onto an operating system of a virtual machine of the first physical machine; a third processing unit to deploy components of the compute nodes of the cloud computing platform onto each physical machine of the cloud computing platform.
Further, the apparatus further comprises: and the storage unit is used for storing the operation data file generated in the operation process of the cloud computing platform in a preset directory of the network file system in real time before the physical machine which controls the normal operation of the cloud computing platform acquires the image file of the control node through the network file system and installs the virtual machine.
In order to achieve the above object, according to another aspect of the present invention, there is also provided a storage medium including a stored program, where when the program runs, a device where the storage medium is located is controlled to execute the data processing method of the cloud computing platform according to the present invention.
In order to achieve the above object, according to another aspect of the present invention, there is also provided a processor configured to execute a program, where the program executes the data processing method of the cloud computing platform according to the present invention.
The method comprises the steps of detecting whether a first physical machine in a cloud computing platform is down or not or detecting whether a control node installed on the first physical machine is down or not, wherein a component of the control node is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine; under the condition of downtime, a physical machine which controls normal operation in the cloud computing platform acquires an image file of a control node through a network file system and installs a virtual machine, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform; the service of the control node of the cloud computing platform is operated on the operating system of the newly installed virtual machine, the problem that the cloud platform is unmanageable due to the fact that the control node is down is solved, and the effect of automatically recovering the management capacity after the control node of the cloud platform is down is achieved.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this application, illustrate embodiments of the invention and, together with the description, serve to explain the invention and not to limit the invention. In the drawings:
fig. 1 is a flowchart of a data processing method of a cloud computing platform according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a cloud platform according to an embodiment of the invention;
FIG. 3 is a flow diagram of the cloud platform operating principle according to an embodiment of the present invention;
fig. 4 is a schematic diagram of a data processing apparatus of a cloud computing platform according to an embodiment of the present invention.
Detailed Description
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present invention will be described in detail below with reference to the embodiments with reference to the attached drawings.
In order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances such that embodiments of the application described herein may be used. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
For convenience of description, several terms referred to in the embodiments of the present application are explained below:
the OpenStack cloud computing platform: the OpenStack is a cloud computing management platform project, and is formed by combining several main components to complete specific work. OpenStack supports almost all types of cloud environments, and the project aims to provide a cloud computing management platform which is simple to implement, can be expanded in a large scale, is rich and has a unified standard. OpenStack provides a solution for infrastructure as a service (IaaS) through various complementary services, each providing an API for integration.
OpenStack control node (control node for short): the method is used for deploying management class components in the OpenStack cluster, and the management class components comprise api interfaces, management interfaces and the like of various services.
OpenStack compute node (compute node for short): the method is used for deploying the computing components in the OpenStack cluster and providing virtual machine resources.
KVM: the short name of Kernel-based Virtual Machine is an open-source system virtualization module, and is integrated in each main release version of Linux after Linux 2.6.20.
Virtual machine: a Virtual Machine (Virtual Machine) refers to a complete computer system with complete hardware system functionality, operating in a completely isolated environment, simulated by software.
NFS: a network file system that allows sharing of resources between computers in a network over a TCP/IP network.
The embodiment of the invention provides a data processing method of a cloud computing platform.
Fig. 1 is a flowchart of a data processing method of a cloud computing platform according to an embodiment of the present invention, and as shown in fig. 1, the method includes the following steps:
step S102: detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down or not, wherein a component of the control node is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine;
step S104: under the condition of downtime, a physical machine which controls normal operation in the cloud computing platform acquires an image file of a control node through a network file system and installs a virtual machine, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform;
step S106: and running the service of the control node of the cloud computing platform on the operating system of the newly installed virtual machine.
The embodiment adopts the steps that whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down is detected, wherein a component of the control node is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine; under the condition of downtime, a physical machine which controls normal operation in the cloud computing platform acquires an image file of a control node through a network file system and installs a virtual machine, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform; the service of the control node of the cloud computing platform is operated on the operating system of the newly installed virtual machine, the problem that the cloud platform is unmanageable due to the fact that the control node is down is solved, and the effect of automatically recovering the management capacity after the control node of the cloud platform is down is achieved.
In the embodiment of the invention, the cloud computing platform can be an OpenStack cloud computing platform, the cloud computing platform is provided with a plurality of physical machines, the control node is installed on the first physical machine, the network file system is installed on the second physical machine, files in the network file system can be read and written by all the physical machines on the same cloud computing platform, when the first physical machine is down or the control node installed on the first physical machine is down, the image files of the control nodes can be obtained on other physical machines which normally run through the pre-installed network file system, the virtual machines are re-installed according to the image files to obtain new control nodes, and running the service of the control node of the cloud computing platform on the operating system of the newly installed virtual machine, so that the cloud platform can be managed on the physical machine which runs normally through the new control node. As long as the cloud computing platform is provided with a normally-running physical machine, the management capability can be automatically recovered after the control node is in fact, and the problem that the cloud computing platform is unmanageable after the control node is down in the related technology is solved.
Optionally, before detecting whether a first physical machine in the cloud computing platform is down or not or detecting whether a control node installed on the first physical machine is down, a network file system is set up on a second physical machine in the cloud computing platform; creating a virtual machine on a first physical machine, and storing an image file generated by the virtual machine in a preset directory; deploying a component of a control node of a cloud computing platform onto an operating system of a virtual machine of a first physical machine; deploying components of a compute node of a cloud computing platform onto each physical machine of the cloud computing platform.
When the platform is built, a virtual machine needs to be built on a first physical machine, a network file system is built on a second physical machine, an image file generated in the installation process of the virtual machine is stored in a preset directory of the network file system, a component of a control node is deployed on a virtual machine operating system of the first physical machine, and a component of a computing node is deployed on each physical machine on the cloud computing platform.
Optionally, before the mirror image file of the control node is acquired through the network file system on the physical machine which normally runs in the cloud computing platform and the virtual machine is reinstalled, the running data file generated in the running process of the cloud computing platform is stored in the preset directory in real time.
In the normal operation process of the cloud computing platform, the operation data file generated by the operation of the platform is stored in the preset directory in real time, and because the file in the network file system can be visible by all servers in the platform, when a certain server writes or modifies data, all other servers can be synchronously visible, so that the storage of the data file can be regarded as real-time, and after an exception occurs, the virtual machine is reinstalled to continue to operate by using the stored operation data, so that the normal operation of the data after the cloud computing platform recovers management can be ensured.
Optionally, before detecting whether a first physical machine in the cloud computing platform is down or not or detecting whether a control node installed on the first physical machine is down, deploying the python program to any physical machine in the cloud computing platform except the first physical machine; the detecting whether a first physical machine in the cloud computing platform is down or not or whether a control node installed on the first physical machine is down comprises the following steps: detecting whether a first physical machine in the cloud computing platform is down or not or whether a control node installed on the first physical machine is down or not through a python program.
When detecting whether the physical machine is down, the detection can be performed through a python program deployed on a physical machine other than the first physical machine, or through programs of other language types.
Optionally, the cloud computing platform comprises an OpenStack cloud computing platform, and the virtual machine may be a centros 7 virtual machine created using a system virtualization module.
The embodiment of the invention uses a network sharing file system (NFS) to share the image by deploying all components of the control node to the same virtual machine (generating an image file), and the sharing is real-time. The states of the physical machines are detected in real time through the programs, when the physical machine with the control node is down or is stopped for maintenance, the programs call any physical machine which normally runs to start the virtual machine through the control node image, the cloud platform can continue to manage at the moment, and the problem that the cloud platform is unmanageable due to the down of the control node is solved.
The embodiment of the present invention also provides a preferred implementation manner, and the following describes the technical solution of the embodiment of the present invention with reference to the preferred implementation manner.
Fig. 2 is a schematic diagram of a cloud platform according to an embodiment of the present invention, as shown in fig. 2, the cloud platform includes: the system comprises four physical machines (10.202.60.50-10.202.60.53), wherein the four physical machines are all provided with computing nodes, one of the four physical machines is provided with a control node, and the other physical machine is provided with a Python program for detecting whether the physical machine provided with the control node normally operates.
Fig. 3 is a flowchart of a working principle of a cloud platform according to an embodiment of the present invention, and as shown in fig. 3, a python program continuously detects states of a physical machine and a virtual machine installed on the physical machine, determines whether a virtual machine where a control node is located is alive, if the virtual machine is alive, continuously detects, if the virtual machine is not alive, detects whether a control node is alive, if the virtual machine is alive, continuously detects, if the virtual machine is not alive, it indicates that a current control node has a fault and needs to be reinstalled, acquires an image file of the control node from an NFS, and starts the control node on another physical machine using a KVM, so as to implement continuous management of the cloud platform.
Specifically, the detailed steps are as follows:
1. the NFS is built on physical machine 10.202.60.53 and allows other servers (physical machines) to be read and written.
2. All physical machines mount 10.202.60.53 the NFS to/data directory and create a vms directory under/data for storing the image files generated by the virtual machines.
3. Kvm is used on the physical machine 10.202.60.50 to create a Centos7 virtual machine, and images generated by the virtual machine are stored in the/data/vms directory.
4. Deploying all components of the control node to the virtual machine operating system, such as: keystone, neutron, glance, mysql, nova-api, etc., which are not described herein.
5. All components of the computing node are respectively deployed on each physical machine, and the description is not expanded here.
6. The python program is deployed to 10.202.60.53.
7. When the python program detects that a physical machine (10.202.60.50) or a control node is down, the program automatically acquires the control node image on other physical machines through NFS, and starts a brand new control node by using KVM. And at the moment, the management capability of the OpenStack cluster is automatically recovered.
In the technical scheme of the embodiment of the invention, the virtual machine for creating the control node can use other virtualization technologies (such as vmware, hyperV and the like), the detection program can use other languages except python, the state of the control node is periodically (or in real time) detected by using the automation program, and the state is adjusted in real time, so that the OpenStack cloud platform can be flexibly deployed, and the problem that the cloud platform is unmanagable due to downtime of the control node is solved.
It should be noted that the steps illustrated in the flowcharts of the figures may be performed in a computer system such as a set of computer-executable instructions and that, although a logical order is illustrated in the flowcharts, in some cases, the steps illustrated or described may be performed in an order different than presented herein.
The embodiment of the invention provides a data processing device of a cloud computing platform, which can be used for executing the data processing method of the cloud computing platform.
Fig. 4 is a schematic diagram of a data processing apparatus of a cloud computing platform according to an embodiment of the present invention, as shown in fig. 4, the apparatus includes:
the system comprises a detection unit 10, a processing unit and a control unit, wherein the detection unit is used for detecting whether a first physical machine in a cloud computing platform is down or not or detecting whether a control node installed on the first physical machine is down, wherein a component of the control node is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine;
the acquiring unit 20 is configured to, in the event of a downtime, control a physical machine, which normally operates in the cloud computing platform, to acquire an image file of a control node through a network file system and install a virtual machine, where the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform;
and the first processing unit 30 is configured to run a service of a control node of the cloud computing platform on an operating system of the newly installed virtual machine.
The embodiment adopts a detection unit 10, configured to detect whether a first physical machine in a cloud computing platform is down or not, or whether a control node installed on the first physical machine is down, where a component of the control node installed on the first physical machine is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine; the obtaining unit 20 is configured to, when a shutdown occurs, control a physical machine in the cloud computing platform to normally operate to obtain an image file of the control node through a network file system and install a virtual machine, where the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform, so that a problem that the cloud platform is unmanagable due to shutdown of the control node is solved, and an effect of automatically recovering management capability after the shutdown of the control node of the cloud platform is achieved.
Optionally, the apparatus further comprises: the device comprises a building unit, a storage unit and a control unit, wherein the building unit is used for building a network file system on a second physical machine before detecting whether a first physical machine in the cloud computing platform is down or not or whether a control node installed on the first physical machine is down or not; the system comprises a creating unit, a storage unit and a processing unit, wherein the creating unit is used for creating a virtual machine on a first physical machine and storing an image file generated by the virtual machine in a preset directory of a network file system; the second processing unit is used for deploying the components of the control node of the cloud computing platform to the operating system of the virtual machine of the first physical machine; and the third processing unit is used for deploying the components of the computing nodes of the cloud computing platform to each physical machine of the cloud computing platform.
Optionally, the apparatus further comprises: and the storage unit is used for storing the operation data file generated in the operation process of the cloud computing platform in a preset directory of the network file system in real time before the physical machine which controls the normal operation of the cloud computing platform acquires the image file of the control node through the network file system and installs the virtual machine.
The data processing device of the cloud computing platform comprises a processor and a memory, the detection unit, the acquisition unit, the first processing unit and the like are stored in the memory as program units, and the processor executes the program units stored in the memory to realize corresponding functions.
The processor comprises a kernel, and the kernel calls the corresponding program unit from the memory. One or more than one kernel can be set, and the management capability is automatically recovered after the cloud platform controls the nodes to be down by adjusting kernel parameters.
The memory may include volatile memory in a computer readable medium, Random Access Memory (RAM) and/or nonvolatile memory such as Read Only Memory (ROM) or flash memory (flash RAM), and the memory includes at least one memory chip.
An embodiment of the present invention provides a storage medium on which a program is stored, and the program implements a data processing method of the cloud computing platform when executed by a processor.
The embodiment of the invention provides a processor, which is used for running a program, wherein the data processing method of a cloud computing platform is executed when the program runs.
The embodiment of the invention provides equipment, which comprises a processor, a memory and a program which is stored on the memory and can run on the processor, wherein the processor executes the program and realizes the following steps: detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down or not, wherein a component of the control node is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine; under the condition of downtime, a physical machine which controls normal operation in the cloud computing platform acquires an image file of a control node through a network file system and installs a virtual machine, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform; the service of the control node of the cloud computing platform is operated on the operating system of the newly installed virtual machine, and the problem that the cloud platform is unmanageable due to downtime of the control node is solved. The device herein may be a server, a PC, a PAD, a mobile phone, etc.
The present application further provides a computer program product adapted to perform a program for initializing the following method steps when executed on a data processing device: detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down or not, wherein a component of the control node is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine; under the condition of downtime, a physical machine which controls normal operation in the cloud computing platform acquires an image file of a control node through a network file system and installs a virtual machine, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform; the service of the control node of the cloud computing platform is operated on the operating system of the newly installed virtual machine, and the problem that the cloud platform is unmanageable due to downtime of the control node is solved.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include forms of volatile memory in a computer readable medium, Random Access Memory (RAM) and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). The memory is an example of a computer-readable medium.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, a computer readable medium does not include a transitory computer readable medium such as a modulated data signal and a carrier wave.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The above are merely examples of the present application and are not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.

Claims (10)

1. A data processing method of a cloud computing platform is characterized by comprising the following steps:
detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down, wherein components of the control node are deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine;
under the condition of downtime, a physical machine which controls normal operation in the cloud computing platform acquires an image file of the control node through a network file system and installs a virtual machine, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and the files in the preset directory are set to be read and written by other physical machines in the cloud computing platform;
and running the service of the control node of the cloud computing platform on the operating system of the newly installed virtual machine.
2. The method of claim 1, wherein prior to detecting whether a first physical machine in a cloud computing platform or a control node installed on the first physical machine is down, the method further comprises:
building a network file system on the second physical machine;
creating a virtual machine on a first physical machine, and storing an image file generated by the virtual machine in a preset directory of the network file system;
deploying a component of a control node of the cloud computing platform onto an operating system of a virtual machine of the first physical machine;
deploying components of computing nodes of the cloud computing platform onto each physical machine of the cloud computing platform.
3. The method of claim 1,
before controlling a physical machine which normally runs in the cloud computing platform to acquire the image file of the control node through a network file system and install a virtual machine, the method further comprises the following steps: and storing the operation data file generated in the operation process of the cloud computing platform in a preset directory of the network file system in real time.
4. The method of claim 1,
before detecting whether a first physical machine in a cloud computing platform is down or whether a control node installed on the first physical machine is down, the method further comprises: deploying a python program to any physical machine in the cloud computing platform except the first physical machine;
detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down comprises: detecting whether a first physical machine in a cloud computing platform is down or not or whether a control node installed on the first physical machine is down or not through a python program.
5. The method of claim 1, wherein the cloud computing platform comprises an OpenStack cloud computing platform, and wherein the virtual machine comprises a Centos7 virtual machine created using a system virtualization module.
6. A data processing apparatus of a cloud computing platform, comprising:
the cloud computing system comprises a detection unit, a processing unit and a control unit, wherein the detection unit is used for detecting whether a first physical machine in a cloud computing platform is down or not or detecting whether a control node installed on the first physical machine is down, wherein a component of the control node is deployed on an operating system of a virtual machine, and the virtual machine is installed on the first physical machine;
the acquiring unit is used for controlling a physical machine which normally runs in the cloud computing platform to acquire an image file of the control node through a network file system and install a virtual machine under the condition of downtime, wherein the network file system is installed on a second physical machine in the cloud computing platform, files in the network file system are stored in a preset directory, and the files in the preset directory are set to be readable and writable by other physical machines in the cloud computing platform;
the first processing unit is used for running the service of the control node of the cloud computing platform on the operating system of the newly installed virtual machine.
7. The apparatus of claim 6, further comprising:
the device comprises a building unit and a control unit, wherein the building unit is used for building a network file system on a second physical machine before detecting whether a first physical machine in a cloud computing platform is down or not or detecting whether a control node installed on the first physical machine is down;
the system comprises a creating unit, a storage unit and a processing unit, wherein the creating unit is used for creating a virtual machine on a first physical machine and storing an image file generated by the virtual machine in a preset directory of the network file system;
a second processing unit, configured to deploy a component of a control node of the cloud computing platform onto an operating system of a virtual machine of the first physical machine;
a third processing unit to deploy components of the compute nodes of the cloud computing platform onto each physical machine of the cloud computing platform.
8. The apparatus of claim 6, further comprising:
and the storage unit is used for storing the operation data file generated in the operation process of the cloud computing platform in a preset directory of the network file system in real time before the physical machine which controls the normal operation of the cloud computing platform acquires the image file of the control node through the network file system and installs the virtual machine.
9. A storage medium, characterized in that the storage medium includes a stored program, and when the program runs, the device where the storage medium is located is controlled to execute the data processing method of the cloud computing platform according to any one of claims 1 to 5.
10. A processor, wherein the processor is configured to execute a program, wherein the program executes the data processing method of the cloud computing platform according to any one of claims 1 to 5.
CN201811162037.9A 2018-09-30 2018-09-30 Data processing method and device of cloud computing platform Pending CN110968444A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811162037.9A CN110968444A (en) 2018-09-30 2018-09-30 Data processing method and device of cloud computing platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811162037.9A CN110968444A (en) 2018-09-30 2018-09-30 Data processing method and device of cloud computing platform

Publications (1)

Publication Number Publication Date
CN110968444A true CN110968444A (en) 2020-04-07

Family

ID=70029267

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811162037.9A Pending CN110968444A (en) 2018-09-30 2018-09-30 Data processing method and device of cloud computing platform

Country Status (1)

Country Link
CN (1) CN110968444A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113157499A (en) * 2021-03-25 2021-07-23 北京交通大学 Safe computer platform based on cloud computing
CN113312064A (en) * 2021-06-01 2021-08-27 中国民航信息网络股份有限公司 Installation configuration method and device of physical machine and computer readable medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103440160A (en) * 2013-08-15 2013-12-11 华为技术有限公司 Virtual machine recovering method and virtual machine migration method , device and system
CN104679579A (en) * 2015-03-19 2015-06-03 杭州华三通信技术有限公司 Virtual machine migration method and device in cluster system
CN105335214A (en) * 2015-11-12 2016-02-17 国云科技股份有限公司 Virtual machine failure detection and recovery method
CN106972970A (en) * 2017-03-31 2017-07-21 山东超越数控电子有限公司 A kind of method for avoiding oVirt cloud platform management node Single Point of Faliures

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103440160A (en) * 2013-08-15 2013-12-11 华为技术有限公司 Virtual machine recovering method and virtual machine migration method , device and system
CN104679579A (en) * 2015-03-19 2015-06-03 杭州华三通信技术有限公司 Virtual machine migration method and device in cluster system
CN105335214A (en) * 2015-11-12 2016-02-17 国云科技股份有限公司 Virtual machine failure detection and recovery method
CN106972970A (en) * 2017-03-31 2017-07-21 山东超越数控电子有限公司 A kind of method for avoiding oVirt cloud platform management node Single Point of Faliures

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113157499A (en) * 2021-03-25 2021-07-23 北京交通大学 Safe computer platform based on cloud computing
CN113157499B (en) * 2021-03-25 2023-12-01 北京交通大学 Secure computer platform based on cloud computing
CN113312064A (en) * 2021-06-01 2021-08-27 中国民航信息网络股份有限公司 Installation configuration method and device of physical machine and computer readable medium
CN113312064B (en) * 2021-06-01 2024-03-26 中国民航信息网络股份有限公司 Method and device for installing and configuring physical machine and computer readable medium

Similar Documents

Publication Publication Date Title
CN113711541B (en) Method and system for changing containerized workload isolation
US11146620B2 (en) Systems and methods for instantiating services on top of services
US10216587B2 (en) Scalable fault tolerant support in a containerized environment
CN107608757B (en) Container-based isolation processing method and related equipment
US9766945B2 (en) Virtual resource scheduling for containers with migration
US8874954B1 (en) Compatibility of high availability clusters supporting application failover with shared storage in a virtualization environment without sacrificing on virtualization features
JP6516730B2 (en) Method and apparatus for backing up and restoring applications
US10860336B2 (en) Managing multi-single-tenant SaaS services
CN108804189B (en) Cloud desktop management method and system
US9959157B1 (en) Computing instance migration
CN107463426B (en) Method and device for cloning virtual machine under KVM virtualization
CN109857516B (en) Cluster migration method and device based on container
JP6663995B2 (en) System and method for backing up a large-scale distributed scale-out data system
US20200241910A1 (en) Methods and apparatus for rack nesting in virtualized server systems
CN109558143B (en) Method and device for deploying applications in cluster
CN110968444A (en) Data processing method and device of cloud computing platform
CN103473113A (en) Universal virtual-machine adopting method
US10175886B1 (en) Systems and methods for handling missing storage image layers while provisioning containers in computer clusters
US10684895B1 (en) Systems and methods for managing containerized applications in a flexible appliance platform
CN113918274A (en) Docker container-based KVM virtual machine migration method
CN112698979A (en) Method and device for processing zookeeper double nodes, storage medium and processor
CN104461757A (en) Method and device for restoring virtual machines
CN111198694A (en) Software installation method and device
US9798571B1 (en) System and method for optimizing provisioning time by dynamically customizing a shared virtual machine
US11593103B1 (en) Anti-pattern detection in extraction and deployment of a microservice

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination