CN109284168B

CN109284168B - Method and system for separating and managing environment configuration and service data of big data platform

Info

Publication number: CN109284168B
Application number: CN201811047918.6A
Authority: CN
Inventors: 黄桥藩
Original assignee: Fujian Sinoregal Software Co ltd
Current assignee: Fujian Sinoregal Software Co ltd
Priority date: 2018-09-10
Filing date: 2018-09-10
Publication date: 2022-12-06
Anticipated expiration: 2038-09-10
Also published as: CN109284168A

Abstract

The invention provides a method for separately managing environment configuration and service data of a big data platform, which comprises the steps of generating a virtual machine mirror image, and sending the virtual machine mirror image to each server through a mirror image version management server; the virtual machine mirror image generates a container through instantiation, the container is operated in a server, and the file of the server is directly mapped to the virtual machine through a file mapping mechanism of the virtual machine, so that the virtual machine directly accesses the file on the server; the invention also provides a system for separating and managing the environment configuration and the service data of the big data platform, so that the light weight of the big data based on the virtualization computing platform is realized.

Description

Method and system for separating and managing environment configuration and service data of big data platform

Technical Field

The invention relates to a method and a system for separating and managing environment configuration and service data of a big data platform.

Background

In a traditional virtual machine, an operating system environment and a service data storage space are both in the virtual machine, that is, data in a hard disk of the whole service is stored in the virtual machine. In an application environment of a server, data is often as high as hundreds of TBs, so a traditional virtual machine image file storage mode is not suitable for large data occupation space and large requirements, and once a server has a failure problem, a file with hundreds of TBs needs to be restored by one image file, which is also a complex and risky operation.

Disclosure of Invention

The technical problem to be solved by the invention is to provide a method and a system for environment configuration and service data separation management of a big data platform, so that light weight and quick iterative switching of a big data configuration environment are realized under the condition of not performing any service data migration, and light weight of big data based on a virtualization computing platform is realized.

One of the present invention is realized by: a method for separating and managing environment configuration and service data of a big data platform comprises the following steps:

step 1, generating a virtual machine image, and sending the virtual machine image to each server through an image version management server;

and 2, generating a container by the virtual machine mirror image through instantiation, operating the container in the server, and directly mapping the file of the server to the virtual machine through a file mapping mechanism of the virtual machine to realize that the virtual machine directly accesses the file on the server.

Further, the generating the virtual machine image in the step 1 is further specifically: instantiating the virtual machine image into a container in the virtual machine of the server, then running the container on the virtual machine of the server, and finally storing the virtual machine as the virtual machine image.

Further, the generating of a container by the virtual machine image through instantiation further specifically includes: the virtual machine image instantiates a container in the virtual machine.

The second invention is realized by the following steps: a big data platform environment configuration and business data separation management system comprises:

the system comprises a generation issuing module, a storage module and a management module, wherein the generation issuing module is used for generating a virtual machine image, the virtual machine image is used for storing system environment configuration data and configuration of a system release version and sending the virtual machine image to each server through an image version management server;

and the operation module generates a container by instantiating the virtual machine mirror image, operates the container in the server, and directly maps the file of the server to the virtual machine through a file mapping mechanism of the virtual machine so as to realize that the virtual machine directly accesses the file on the server.

Further, the generating the virtual machine image in the generating and issuing module is further specifically: instantiating the virtual machine image into a container in the virtual machine of the server, then running the container on the virtual machine of the server, and finally storing the virtual machine as the virtual machine image.

The invention has the following advantages:

1) Lightweight large data cluster mirroring: big data cluster configuration and big data service configuration are separated, and only configuration files of big data are reserved in cluster mirror images, so that light management of mirror image files is achieved.

2) Mirror image management big data configuration version: the big data management mode of the original script plus configuration file is changed, the mirror image management big data configuration is realized, and the iterative update of the big data configuration file is realized by issuing a mirror image version through the mirror image server.

3) Configuration and environmental consistency of big data platforms: the system of the whole big data cluster runs in a container generated by the mirror image of the virtual machine of the same version, the environmental consistency of the whole cluster is realized, and the problem of the difference of cluster servers in the operation and maintenance of the big data in the past is avoided.

Drawings

The invention will be further described with reference to the following examples with reference to the accompanying drawings.

FIG. 1 is a flow chart of the method of the present invention.

Detailed Description

As shown in fig. 1, the method for separating and managing environment configuration and service data of a big data platform of the present invention includes:

step 1, instantiating a virtual machine image into a container in a virtual machine of a server, then operating the container on the virtual machine of the server, finally storing the virtual machine image as the virtual machine image, and sending the virtual machine image to each server through an image version management server;

step 2, instantiating the virtual machine mirror image in the virtual machine to generate a container, operating the container in the server, and directly mapping the file of the server to the virtual machine through a file mapping mechanism of the virtual machine to realize that the virtual machine directly accesses the file on the server.

The invention relates to a big data platform environment configuration and service data separation management system, which comprises:

the method comprises the steps that an issuing module is generated, a virtual machine image is instantiated into a container in a virtual machine of a server, then the container is operated on the virtual machine of the server, finally the virtual machine is stored as the virtual machine image, and the virtual machine image is sent to each server through an image version management server;

the virtual machine mirror image is instantiated in the virtual machine to generate a container, the container is operated in the server, and the file of the server is directly mapped to the virtual machine through a file mapping mechanism of the virtual machine, so that the virtual machine directly accesses the file on the server.

One embodiment of the present invention:

the invention realizes that the virtual machine ISO can directly access the host machine hard disk directory space, the data of the big data platform is left in the host machine, the system environment of big data operation, the big data configuration file and the like are put on the virtual machine platform, the configuration version of the big data platform can be managed as long as the virtual machine image file is managed, the configuration version of the big data platform is separated from the big data service data, and the hierarchical management is realized. The method realizes light-weight rapid iterative switching of a big data configuration environment under the condition of not carrying out any business data migration, realizes light weight of big data based on a virtualization computing platform, and can rapidly carry out switching iterative upgrading of a virtualization container.

The scheme structure is as follows: mirror image layering realization, mirror image and container conversion and mirror image management big data configuration.

The method mainly comprises the following steps:

a, mirror layering implementation

Layering a Linux kernel and a system release version: in a pure Linux system environment, the kernel-related bottom-layer driver configuration and the dependent environment package occupy most of the system files, the system issues version configuration files, and the system kernel occupies hardware resources of a host machine after the system is started.

Layering of service data and big data configuration: the file of the host machine is directly mapped to the virtual machine by realizing a file mapping mechanism of the virtual machine, so that the virtual machine can directly access the file of the host machine, the virtual machine mirror image does not need to store user service data of a big data platform, and the virtual machine mirror image only needs to store system environment configuration data of the big data, so that the layering of the service data of the user big data and the system environment configuration is realized.

Mirror image and container conversion

The mirror image can generate a container to be operated in the host machine through instantiation, the file with the static mirror image is issued by the mirror image management server, the mirror image can be operated on the host machine after the virtual machine of the host machine instantiates the mirror image into the container, and the container operated in the virtual machine can be stored as the mirror image and stored in the mirror image server.

Mirror image management big data configuration

Saving the configuration file as a mirror image: configuration parameters and the like of big data are directly configured in an operating container, and then the container is saved as a mirror image, so that the configuration file can be saved in the mirror image.

Iterative management of big data configuration: the distribution of the configuration files of the large-scale big data clusters can be realized by issuing the updated mirror image through the mirror image version management server, so that the configuration files take effect in each cluster of the big data, the consistency of the environment and the configuration files of the whole big data cluster is ensured, and the problem of system difference is avoided.

Although specific embodiments of the invention have been described above, it will be understood by those skilled in the art that the specific embodiments described are illustrative only and are not limiting upon the scope of the invention, and that equivalent modifications and variations can be made by those skilled in the art without departing from the spirit of the invention, which is to be limited only by the appended claims.

Claims

1. A big data platform environment configuration and service data separation management method is characterized in that: the method comprises the following steps:

step 1, generating a virtual machine image, and sending the virtual machine image to each server through an image version management server; the virtual machine mirror image only stores system environment configuration data of big data;

step 2, generating a container by the virtual machine mirror image through instantiation, operating the container in a server, and directly mapping the file of the server to the virtual machine through a file mapping mechanism of the virtual machine to realize that the virtual machine directly accesses the file on the server;

step 3, iterative management of big data configuration: the virtual machine mirror image which is subjected to the updating configuration is issued through the mirror image version management server, the distribution of the configuration file of the big data cluster is realized, the configuration file is enabled to take effect in each cluster of the big data, and the consistency of the environment and the configuration file of the whole big data cluster is ensured.

2. The big data platform environment configuration and service data separation management method according to claim 1, wherein: the generating the virtual machine image in the step 1 is further specifically: instantiating the virtual machine image into a container in the virtual machine of the server, then running the container on the virtual machine of the server, and finally storing the virtual machine as the virtual machine image.

3. The big data platform environment configuration and service data separation management method according to claim 1, wherein: the generating of a container by the virtual machine image through instantiation further specifically comprises: the virtual machine image instantiates a container in the virtual machine.

4. A big data platform environment configuration and service data separation management system is characterized in that: the method comprises the following steps:

the system comprises a generation issuing module, a storage module and a management module, wherein the generation issuing module is used for generating a virtual machine image, the virtual machine image is used for storing system environment configuration data and configuration of a system release version and sending the virtual machine image to each server through an image version management server; the virtual machine mirror image only stores system environment configuration data of big data;

the operation module generates a container by instantiating the virtual machine mirror image, operates the container in the server, and directly maps the file of the server to the virtual machine through a file mapping mechanism of the virtual machine so as to realize that the virtual machine directly accesses the file on the server;

an iteration management module: the virtual machine mirror image which is subjected to the updating configuration is issued through the mirror image version management server, the distribution of the configuration file of the big data cluster is realized, the configuration file is enabled to take effect in each cluster of the big data, and the consistency of the environment and the configuration file of the whole big data cluster is ensured.

5. The big data platform environment configuration and service data separation management system according to claim 4, wherein: the generating of the virtual machine image in the generating and issuing module further includes: instantiating the virtual machine image into a container in the virtual machine of the server, then running the container on the virtual machine of the server, and finally storing the virtual machine as the virtual machine image.

6. The big data platform environment configuration and service data separation management system according to claim 4, wherein: the generating of a container by the virtual machine image through instantiation further specifically comprises: the virtual machine image is instantiated in a virtual machine to generate a container.