WO2016127756A1 - Flexible deployment method for cluster and management system - Google Patents

Flexible deployment method for cluster and management system Download PDF

Info

Publication number
WO2016127756A1
WO2016127756A1 PCT/CN2016/070999 CN2016070999W WO2016127756A1 WO 2016127756 A1 WO2016127756 A1 WO 2016127756A1 CN 2016070999 W CN2016070999 W CN 2016070999W WO 2016127756 A1 WO2016127756 A1 WO 2016127756A1
Authority
WO
WIPO (PCT)
Prior art keywords
machine
cluster
deployed
configuration
resource pool
Prior art date
Application number
PCT/CN2016/070999
Other languages
French (fr)
Chinese (zh)
Inventor
唐平
Original Assignee
北京京东尚科信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京京东尚科信息技术有限公司 filed Critical 北京京东尚科信息技术有限公司
Publication of WO2016127756A1 publication Critical patent/WO2016127756A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • G06F15/177Initialisation or configuration control

Definitions

  • the present invention relates to the field of computer technologies, and in particular, to a method and a management system for cluster elastic deployment in a distributed system.
  • HBase In response to massive data storage, more and more companies are adopting open source databases such as HBase.
  • cluster deployment and operation and maintenance of HBase are very troublesome. For example, if the existing cluster is expanded or the machine with hardware and software failures is removed, it depends on the manual operation of the operation and maintenance personnel. These operations are not only repetitive, error-prone, but also make it difficult to manage them manually by increasing the number of clusters and the size of the cluster itself.
  • HBase cluster management operations
  • Cloudera Manager and Ambari.
  • Cloudera Manager When deploying a cluster, users only need to select the version of the package that HBase depends on, configure the role of each machine through the tool interface, and adjust the configuration of HBase, Zookeeper, and Hadoop, and the specific cluster deployment operations are managed by these. The tool is complete.
  • management tools have many restrictions due to commercial authorization. For example, Cloudera Manager restricts cluster size to less than 100 machine nodes; it is not convenient for multi-cluster management, and management tools are targeted at Installation and management of a cluster, it is difficult to increase or decrease the flexibility of multiple clusters, etc.; lack of audit trails, do not make necessary records for the operation of machines in the production environment, can not help users to trace the causes and responsible persons in case of problems .
  • the present invention provides a method and a management system for cluster elastic deployment.
  • a resource pool By introducing a resource pool and putting all the machines that are to be used for cluster deployment into the resource pool, the maintenance and management of existing resource pools and clusters can be simplified.
  • a method of cluster elastic deployment is provided.
  • the method for cluster elastic deployment of the present invention includes the steps of: defining basic information of a cluster; selecting, from a resource pool consisting of machines to be used for generating a cluster, a machine to be used to generate the cluster; Determining a version of the software of the machine and modifying configuration information of the machine; selecting a service process to be deployed by the machine; and performing an environmental correctness check on the machine according to a detection script included in the machine, wherein The environmental correctness check passes, the image file configuring the machine is obtained from the cloud storage, and the service process to be deployed is enabled on the machine.
  • the basic information for defining the cluster includes: defining a cluster name, a cluster machine type, a use validity period, and a cluster size of the cluster.
  • the performing an environmental correctness check on the machine includes: checking one or more of an operating system version, a JDK version, a host name, a firewall, and a time synchronization configuration of the machine.
  • the method of cluster resilient deployment of the present invention further comprises the step of adding a machine to the cluster, the step comprising: selecting an idle machine from the resource pool and specifying a service process to be deployed thereon; Performing an environmental correctness check on the machine according to the detection script included in the machine, wherein if the environment correctness check passes, obtaining an image file for configuring the machine from the cloud storage, The configuration of the machine is updated and the service process to be deployed is enabled.
  • the method of cluster elastic deployment of the present invention further comprises the step of dynamically deleting the machine to the cluster, the step comprising: selecting a machine that needs to be stopped and stopping the deployment a service process thereon; and detecting a working state of the cluster according to a detection script included in the machine, wherein if the cluster is working properly, resetting configuration information of the machine to an initial status.
  • the method for cluster elastic deployment of the present invention further includes the step of dynamically modifying the configuration of the cluster, the step comprising: backing up an original configuration item of a machine for which configuration item modification is to be performed; a configuration item to be changed of the machine; and reviewing the legality of the modified configuration item according to the detection script included in the machine, wherein the modification is performed if the review of the legality of the configuration item is passed Subsequent configuration items are pushed to the machine.
  • a cluster resilient deployment management system comprising a central management server; a resource pool consisting of machines intended to be used to generate a cluster; and by the central management server, according to the present invention
  • the cluster deployed by the cluster elastic deployment method.
  • the machine in the resource pool that is to be used to generate the cluster can cooperate with the central management server to complete related operations of receiving a command and sending a response message.
  • the machine in the resource pool that is to be used to generate the cluster includes a detection script.
  • the central management server is further configured to acquire a cluster to which the machine belongs, a machine working state, and a machine type.
  • the central management server records an operation log of the cluster and the resource pool.
  • a cluster of elastic computing frameworks that can be dynamically managed is provided by introducing a resource pool in a cluster deployment and putting all the machines to be used for the cluster deployment into the resource pool.
  • FIG. 1 is a schematic diagram of a management system for describing a cluster elastic deployment of the present invention
  • 2A-2D are schematic diagrams for describing a method of cluster elastic deployment of the present invention.
  • FIG. 1 is a schematic diagram of a management system for describing a cluster elastic deployment of the present invention. The working mechanism of the cluster elastic deployment management system of the present invention will be described below with reference to FIG.
  • the cluster elastic deployment management system of the present invention includes a central management server 10 , a cluster 20 , a cloud storage 30 , and a database 40 , as in the prior art cluster management system.
  • Cloud storage 30 is used to store functional resource files such as Hadoop, Zookeeper, HBase image files, and configuration files.
  • the database 40 is used to store related metadata information such as machines, clusters, and status maintenance information.
  • Cluster 20 includes multiple machines and is used to handle tasks from accessing business applications.
  • the central management server 10 reads the files and information stored in the cloud storage 30 and the database 40, and maintains the list information of the machines in the cluster 20 according to the files and information to acquire the cluster to which the machine belongs, the machine working state, and the machine source, and then the cluster.
  • the cluster 20 Perform configuration maintenance, image file management, machine management, and security control operations.
  • the difference from the cluster management system of the prior art is that the cluster 20 in the cluster elastic deployment management system of the present invention is a cluster deployed through a flexible solution, and the management system of the cluster elastic deployment further includes The resource pool 50 consisting of the machines that generate the cluster,
  • the heart management server 10 is also responsible for maintaining the machines in the resource pool 50.
  • the central management server 10 can communicate with the cluster 20 and machines in the resource pool 50 via various communication protocols, such as by the TCP/IP protocol.
  • Each of the resources in the resource pool 50 is capable of cooperating with the central management server 10 to perform operations related to receiving commands and echoing response messages.
  • each machine in the resource pool 50 may be equipped with an HAgent for cooperating with the central management server 10 to perform related operations of receiving commands and echoing response messages.
  • Each machine in resource pool 50 may include a detection script for checking for machine environmental correctness checks and whether related services are functioning properly.
  • the above environmental correctness check includes but is not limited to the check of the operating system version, JDK version, host name, firewall, and time synchronization configuration.
  • the above related services include, but are not limited to, remote storage (RS) and data node (Datanode) service processes.
  • the central management server 10 is also responsible for maintaining the list information of the machines in the resource pool 50, obtaining the cluster to which the machine belongs, the machine working state, and the machine type (physical machine or virtual machine) to prevent the machine from being repeatedly deployed into multiple clusters. After the HAgent is installed on each machine in the resource pool 50, each machine is registered in the central management server 10.
  • the machine For the created cluster 20, if the machine needs to be dynamically added, one or several joins the cluster from the machines of the resource pool 50 through the central management server 10, and then the service process to be deployed on the selected machine is enabled; if the cluster If there is an idle condition in the machine in 20, the partially idle machine is released back to the resource pool 50 by the central management server 10.
  • the access service application When the access service application needs to access the cluster 20, the user of the service obtains the configuration information of the corresponding used cluster, and then sends the configuration information to the access service application so that it can pass an interface such as an HBase application programming interface (API).
  • API application programming interface
  • the cluster 20 is accessed, and the associated access service application of the cluster 20 can be recorded in the central management server 10.
  • the central management server 10 approves the permission of the root authority operation, and only the trial is performed. The batch can only be operated after it has passed.
  • the preferred way is to put the root username and password into the HAgent configuration file.
  • the central management server 10 operates, the root user name and password are used to perform permission verification and log recording, thereby tracking the operator and preventing maliciousness. The operation provides the user with the necessary audit trails.
  • FIGS. 2A-2D are schematic diagrams for describing a method of cluster elastic deployment of the present invention.
  • the core processes of the cluster elastic deployment method of the present invention include creating a cluster, a cluster dynamic adding machine, a cluster dynamic deleting machine, and a cluster dynamic modifying configuration, which will be described one by one with reference to FIGS. 2A-2D.
  • the method for creating a cluster may include: defining basic information of a cluster, including a cluster name, a cluster machine type (physical machine or virtual machine), a usage validity period, and a cluster size; In the resource pool composed of the clustered machines, select the machine to be used to generate the cluster; select the version of the software to be used for the machine such as Hadoop, HBase, Zookeeper and modify the configuration information of the machine; select the service to be deployed for each machine.
  • the machine is checked for environmental correctness; if the check is passed, the image file configured for the machine is obtained from the cloud storage, and the service process to be deployed is enabled through the HAgent.
  • the check fails, leave the machine in the resource pool; or repeat the above selection to use the version of the software for the machine and modify the configuration information of the machine, select the service process to be deployed, and perform the environment on the machine. The steps to check for correctness.
  • a method for dynamically adding a machine according to an embodiment of the present invention may include: selecting an idle machine from a resource pool and designating a service process to be deployed thereon; and performing an environment correct for the machine according to a detection script included in the machine Sex check; if the check passes, get the image file configured for the machine from the cloud storage, update its configuration, and enable the service process to be deployed. Optionally, if the check fails, repeat the step of checking the environmental correctness of the machine; or select another idle machine from the resource pool, and Repeat the above steps.
  • the method for dynamically deleting a machine may include: selecting, by an interface of the central management server, a machine that needs to be stopped and stopping a service process deployed thereon; working on the cluster according to a detection script included in the machine The status is detected; if the cluster is working properly, the configuration information of the machine is reset to the initial state, so that its configuration information is returned to the state when it is put into the resource pool, thereby completing the classification of the machine into the resource pool. operating. Optionally, if the cluster is not working properly, re-enable the stopped service process on that machine.
  • the method for dynamically modifying a configuration of a cluster according to the embodiment of the present invention may include: backing up an original configuration item of a machine for which configuration item modification is to be performed; and modifying a configuration item to be changed through an interface of the central management server; The included detection script reviews the legality of the modified configuration item; if the review is passed, the modified configuration item is pushed to the machine. Optionally, if the review fails, the modified configuration item is overwritten in the machine with the original configuration item of the backup.
  • the objects of the invention can also be achieved by running a program or a set of programs on any computing device.
  • the computing device can be a well-known general purpose device.
  • the object of the present invention can also be achieved by merely providing a program product comprising program code for implementing the method or apparatus. That is to say, such a program product also constitutes the present invention, and a storage medium storing such a program product also constitutes the present invention. It will be apparent that the storage medium may be any known storage medium or any storage medium developed in the future.

Abstract

A flexible deployment method for a cluster (20) and a management system. The method comprises: defining basic information of the cluster (20); selecting, from a resource pool (50) comprising machines intended to be used for generating the cluster (20), a machine intended to be used for generating the cluster (20); selecting a version of software to be used by the machine, and changing configuration information of the machine; selecting a service process to be deployed on the machine; and performing environment accuracy check on the machine according to detection script comprised in the machine, and if the environment accuracy check is successful, acquiring, from a cloud storage (30), a mirror file for configuring the machine, and enabling the to-be-deployed service process on the machine.

Description

集群弹性部署的方法和管理系统Method and management system for cluster elastic deployment 技术领域Technical field
本发明涉及计算机技术领域,特别地涉及分布式系统中的集群弹性部署的方法和管理系统。The present invention relates to the field of computer technologies, and in particular, to a method and a management system for cluster elastic deployment in a distributed system.
背景技术Background technique
为了应对海量数据存储,越来越多的企业采用诸如HBase的开源数据库。然而,作为分布式系统,对于HBase的集群部署和运维非常麻烦,比如对现有集群进行扩容或剔除出现软硬件故障的机器的情况下,要依赖于运维人员的手工操作。这些操作不仅具有重复性,极易出错,而且随着集群个数和集群自身规模的增加,使得难以通过手工操作对其进行管理。In response to massive data storage, more and more companies are adopting open source databases such as HBase. However, as a distributed system, cluster deployment and operation and maintenance of HBase are very troublesome. For example, if the existing cluster is expanded or the machine with hardware and software failures is removed, it depends on the manual operation of the operation and maintenance personnel. These operations are not only repetitive, error-prone, but also make it difficult to manage them manually by increasing the number of clusters and the size of the cluster itself.
为简化诸如HBase的集群管理操作,业界也出现了一些管理工具,比如Cloudera Manager和Ambari。在部署集群时,用户仅需选定好HBase所依赖的软件包版本,通过工具界面配置每个机器的角色,对HBase、Zookeeper和Hadoop的配置进行调整,而具体的集群部署操作则由这些管理工具完成。To simplify cluster management operations such as HBase, there are also some management tools in the industry, such as Cloudera Manager and Ambari. When deploying a cluster, users only need to select the version of the package that HBase depends on, configure the role of each machine through the tool interface, and adjust the configuration of HBase, Zookeeper, and Hadoop, and the specific cluster deployment operations are managed by these. The tool is complete.
即便如上所述的管理工具来实现集群管理操作,仍存在以下弊端:管理工具由于商业授权而存在诸多限制,比如Cloudera Manager限制集群规模小于100机器节点;不便于多集群管理,管理工具均是针对一个集群的安装和管理,而难以对多个集群的弹性增减机器等;缺乏审计追踪,对生产环境中的机器的操作没有做必要的记录,无法帮助用户在出现问题时追查原因及责任人。Even if the management tools described above implement cluster management operations, the following drawbacks exist: management tools have many restrictions due to commercial authorization. For example, Cloudera Manager restricts cluster size to less than 100 machine nodes; it is not convenient for multi-cluster management, and management tools are targeted at Installation and management of a cluster, it is difficult to increase or decrease the flexibility of multiple clusters, etc.; lack of audit trails, do not make necessary records for the operation of machines in the production environment, can not help users to trace the causes and responsible persons in case of problems .
发明内容Summary of the invention
行鉴于此,本发明提供一种集群弹性部署的方法和管理系统,通 过引入资源池,将所有拟用于集群部署的机器放入资源池内,能够简化对现有资源池和集群的维护管理。In view of this, the present invention provides a method and a management system for cluster elastic deployment. By introducing a resource pool and putting all the machines that are to be used for cluster deployment into the resource pool, the maintenance and management of existing resource pools and clusters can be simplified.
为实现上述目的,根据本发明的一个方面,提供了一种集群弹性部署的方法。To achieve the above object, according to an aspect of the present invention, a method of cluster elastic deployment is provided.
本发明的集群弹性部署的方法包括步骤:定义集群的基本信息;从包括由拟用于生成集群的机器所组成的资源池中,选择拟用于生成所述集群的机器;选择将用于所述机器的软件的版本并修改所述机器的配置信息;选择所述机器要部署的服务进程;以及根据所述机器中所包括的检测脚本,对所述机器进行环境正确性检查,其中,如果所述环境正确性检查通过,则从云存储获取对所述机器进行配置的镜像文件,以及在所述机器上启用所述要部署的服务进程。The method for cluster elastic deployment of the present invention includes the steps of: defining basic information of a cluster; selecting, from a resource pool consisting of machines to be used for generating a cluster, a machine to be used to generate the cluster; Determining a version of the software of the machine and modifying configuration information of the machine; selecting a service process to be deployed by the machine; and performing an environmental correctness check on the machine according to a detection script included in the machine, wherein The environmental correctness check passes, the image file configuring the machine is obtained from the cloud storage, and the service process to be deployed is enabled on the machine.
可选地,所述定义集群的基本信息包括:定义所述集群的集群名称、集群机器类型、使用有效期和集群规模。Optionally, the basic information for defining the cluster includes: defining a cluster name, a cluster machine type, a use validity period, and a cluster size of the cluster.
可选地,所述对所述机器进行环境正确性检查包括:对所述机器的操作系统版本、JDK版本、主机名、防火墙、时间同步配置中的一项或多项进行检查。Optionally, the performing an environmental correctness check on the machine includes: checking one or more of an operating system version, a JDK version, a host name, a firewall, and a time synchronization configuration of the machine.
可选地,本发明的集群弹性部署的方法进一步包括向所述集群添加机器的步骤,所述步骤包括:从所述资源池中选择空闲的机器并且指定在其上要部署的服务进程;以及根据所述机器中所包括的检测脚本,对所述机器进行环境正确性检查,其中,如果所述环境正确性检查通过,则从所述云存储获取对所述机器进行配置的镜像文件,对所述机器的配置进行更新,以及启用所述要部署的服务进程。Optionally, the method of cluster resilient deployment of the present invention further comprises the step of adding a machine to the cluster, the step comprising: selecting an idle machine from the resource pool and specifying a service process to be deployed thereon; Performing an environmental correctness check on the machine according to the detection script included in the machine, wherein if the environment correctness check passes, obtaining an image file for configuring the machine from the cloud storage, The configuration of the machine is updated and the service process to be deployed is enabled.
可选地,本发明的集群弹性部署的方法进一步包括向所述集群动态删除机器的步骤,所述步骤包括:选择需要停止的机器并且停止部署 在其上的服务进程;以及根据所述机器中所包括的检测脚本,对所述集群工作状态进行检测,其中,如果所述集群能够工作正常,则将所述机器的配置信息重置为初始状态。Optionally, the method of cluster elastic deployment of the present invention further comprises the step of dynamically deleting the machine to the cluster, the step comprising: selecting a machine that needs to be stopped and stopping the deployment a service process thereon; and detecting a working state of the cluster according to a detection script included in the machine, wherein if the cluster is working properly, resetting configuration information of the machine to an initial status.
可选地,本发明的集群弹性部署的方法进一步包括对所述集群进行动态修改配置的步骤,所述步骤包括:对欲对其进行配置项修改的机器的原配置项进行备份;修改所述机器的要更改的配置项;以及根据所述机器中所包括的检测脚本,对修改后的配置项的合法性进行审查,其中,如果所述配置项的合法性的审查通过,则所述修改后的配置项被推送到所述机器。Optionally, the method for cluster elastic deployment of the present invention further includes the step of dynamically modifying the configuration of the cluster, the step comprising: backing up an original configuration item of a machine for which configuration item modification is to be performed; a configuration item to be changed of the machine; and reviewing the legality of the modified configuration item according to the detection script included in the machine, wherein the modification is performed if the review of the legality of the configuration item is passed Subsequent configuration items are pushed to the machine.
根据本发明的另一方面,提供了一种集群弹性部署的管理系统,包括中心管理服务器;由拟用于生成集群的机器所组成的资源池;以及由所述中心管理服务器,根据本发明的集群弹性部署的方法所部署的集群。According to another aspect of the present invention, there is provided a cluster resilient deployment management system comprising a central management server; a resource pool consisting of machines intended to be used to generate a cluster; and by the central management server, according to the present invention The cluster deployed by the cluster elastic deployment method.
可选地,所述资源池中的拟用于生成集群的机器能够与所述中心管理服务器协同来完成接收命令和回送响应消息的相关操作。Optionally, the machine in the resource pool that is to be used to generate the cluster can cooperate with the central management server to complete related operations of receiving a command and sending a response message.
可选地,所述资源池中的拟用于生成集群的机器包括检测脚本Optionally, the machine in the resource pool that is to be used to generate the cluster includes a detection script.
可选地,所述中心管理服务器还用于获取所述机器的所属集群、机器工作状态以及机器类型。Optionally, the central management server is further configured to acquire a cluster to which the machine belongs, a machine working state, and a machine type.
可选地,所述中心管理服务器记录所述集群和所述资源池的操作日志。Optionally, the central management server records an operation log of the cluster and the resource pool.
根据本发明的技术方案,通过在集群部署中引入资源池,而将所有拟用于集群部署的机器放入资源池内,提供了能够动态管理的弹性计算框架的集群。 According to the technical solution of the present invention, a cluster of elastic computing frameworks that can be dynamically managed is provided by introducing a resource pool in a cluster deployment and putting all the machines to be used for the cluster deployment into the resource pool.
附图说明DRAWINGS
附图用于更好地理解本发明,不构成对本发明的不当限定。其中:The drawings are intended to provide a better understanding of the invention and are not intended to limit the invention. among them:
图1是用于描述本发明的集群弹性部署的管理系统的示意图;1 is a schematic diagram of a management system for describing a cluster elastic deployment of the present invention;
图2A-2D是用于描述本发明的集群弹性部署的方法的示意图。2A-2D are schematic diagrams for describing a method of cluster elastic deployment of the present invention.
具体实施方式detailed description
以下结合附图对本发明的示范性实施例做出说明,其中包括本发明实施例的各种细节以助于理解,应当将它们认为仅仅是示范性的。为了便于描述,以HBase为例做出以下具体说明,然而本领域普通技术人员应当认识到,本发明的方法和管理系统并不限于HBase,而是可以对这里描述的实施例做出各种改变和修改,而不会背离本发明的范围和精神。同样,为了清楚和简明,以下的描述中省略了对公知功能和结构的描述。The exemplary embodiments of the present invention are described with reference to the accompanying drawings, and are in the For convenience of description, the following specific description is made by taking HBase as an example, but those skilled in the art should recognize that the method and management system of the present invention are not limited to HBase, but various changes can be made to the embodiments described herein. And modifications may be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
图1是用于描述本发明的集群弹性部署的管理系统的示意图。如下将参考图1描述本发明的集群弹性部署的管理系统的工作机制。1 is a schematic diagram of a management system for describing a cluster elastic deployment of the present invention. The working mechanism of the cluster elastic deployment management system of the present invention will be described below with reference to FIG.
如图1所示,与现有技术的集群管理系统相同,本发明的集群弹性部署的管理系统包括中心管理服务器10、集群20、云存储30、数据库40。云存储30用于存放功能资源文件,例如Hadoop、Zookeeper、HBase镜像文件和配置文件。数据库40用于存放机器、集群等相关元数据信息以及状态维护信息。集群20包括多台机器并且用于处理来自接入业务应用的任务。中心管理服务器10读取存放在云存储30和数据库40中的文件和信息,根据该文件和信息维护集群20中机器的列表信息,以获取机器所属集群、机器工作状态和机器来源,进而对集群20进行配置维护、镜像文件管理、机器管理、安全控制操作。与现有技术的集群管理系统的不同之处在于,本发明的集群弹性部署的管理系统中的集群20是通过弹性方案所部署的集群,并且该集群弹性部署的管理系统还包括由拟用于生成集群的机器所组成的资源池50,中 心管理服务器10还负责维护资源池50中机器。中心管理服务器10可通过各种通信协议与集群20和资源池50中机器通信,例如通过TCP/IP协议。As shown in FIG. 1 , the cluster elastic deployment management system of the present invention includes a central management server 10 , a cluster 20 , a cloud storage 30 , and a database 40 , as in the prior art cluster management system. Cloud storage 30 is used to store functional resource files such as Hadoop, Zookeeper, HBase image files, and configuration files. The database 40 is used to store related metadata information such as machines, clusters, and status maintenance information. Cluster 20 includes multiple machines and is used to handle tasks from accessing business applications. The central management server 10 reads the files and information stored in the cloud storage 30 and the database 40, and maintains the list information of the machines in the cluster 20 according to the files and information to acquire the cluster to which the machine belongs, the machine working state, and the machine source, and then the cluster. 20 Perform configuration maintenance, image file management, machine management, and security control operations. The difference from the cluster management system of the prior art is that the cluster 20 in the cluster elastic deployment management system of the present invention is a cluster deployed through a flexible solution, and the management system of the cluster elastic deployment further includes The resource pool 50 consisting of the machines that generate the cluster, The heart management server 10 is also responsible for maintaining the machines in the resource pool 50. The central management server 10 can communicate with the cluster 20 and machines in the resource pool 50 via various communication protocols, such as by the TCP/IP protocol.
资源池50中的每台机器都能够与中心管理服务器10协同来完成接收命令和回送响应消息的相关操作。可选地,资源池50中的每台机器可以安装有用于与中心管理服务器10协同来完成接收命令和回送响应消息的相关操作的HAgent。资源池50中的每台机器可以包括用于完成机器环境正确性检查以及相关服务是否正常运行的检查的检测脚本。上述环境正确性检查包括但不限于对于操作系统版本、JDK版本、主机名、防火墙、时间同步配置的检查。上述相关服务包括但不限于远程存储(RS)和数据节点(Datanode)服务进程。Each of the resources in the resource pool 50 is capable of cooperating with the central management server 10 to perform operations related to receiving commands and echoing response messages. Alternatively, each machine in the resource pool 50 may be equipped with an HAgent for cooperating with the central management server 10 to perform related operations of receiving commands and echoing response messages. Each machine in resource pool 50 may include a detection script for checking for machine environmental correctness checks and whether related services are functioning properly. The above environmental correctness check includes but is not limited to the check of the operating system version, JDK version, host name, firewall, and time synchronization configuration. The above related services include, but are not limited to, remote storage (RS) and data node (Datanode) service processes.
中心管理服务器10还负责维护资源池50中的机器的列表信息,获取机器所属集群、机器工作状态以及机器类型(实体机或虚拟机),以防止机器被重复部署到多个集群中。在资源池50中每台机器上安装HAgent后,将在中心管理服务器10中对每台机器进行注册。The central management server 10 is also responsible for maintaining the list information of the machines in the resource pool 50, obtaining the cluster to which the machine belongs, the machine working state, and the machine type (physical machine or virtual machine) to prevent the machine from being repeatedly deployed into multiple clusters. After the HAgent is installed on each machine in the resource pool 50, each machine is registered in the central management server 10.
对于已创建的集群20,如果需要动态添加机器,则通过中心管理服务器10从资源池50的机器中挑选一台或几台加入集群,然后启用拟部署在所选机器上的服务进程;如果集群20中的机器存在空闲情况,则通过中心管理服务器10将部分空闲的机器释放回资源池50。For the created cluster 20, if the machine needs to be dynamically added, one or several joins the cluster from the machines of the resource pool 50 through the central management server 10, and then the service process to be deployed on the selected machine is enabled; if the cluster If there is an idle condition in the machine in 20, the partially idle machine is released back to the resource pool 50 by the central management server 10.
当接入业务应用需要接入集群20时,业务的使用方获取对应的使用的集群的配置信息,然后发送给该接入业务应用,使其可以通过诸如HBase应用程序编程接口(API)的接口访问集群20,同时可以在中心管理服务器10中记录集群20的关联接入业务应用。When the access service application needs to access the cluster 20, the user of the service obtains the configuration information of the corresponding used cluster, and then sends the configuration information to the access service application so that it can pass an interface such as an HBase application programming interface (API). The cluster 20 is accessed, and the associated access service application of the cluster 20 can be recorded in the central management server 10.
另外,当HAgent进行的诸如修改ulimit、hosts需要root权限的操作时,通过中心管理服务器10对允许root权限操作进行审批,仅当审 批通过后才能进行操作。对于权限审批,优选的方式是将root用户名和密码放入HAgent配置文件中,通过中心管理服务器10做操作的时候基于root用户名和密码做权限验证以及日志记录,由此能够追踪操作人员并且防止恶意操作,为用户提供了必要的审计追踪。In addition, when the HAgent performs operations such as modifying the ulimit and the hosts requiring root privileges, the central management server 10 approves the permission of the root authority operation, and only the trial is performed. The batch can only be operated after it has passed. For permission approval, the preferred way is to put the root username and password into the HAgent configuration file. When the central management server 10 operates, the root user name and password are used to perform permission verification and log recording, thereby tracking the operator and preventing maliciousness. The operation provides the user with the necessary audit trails.
图2A-2D是用于描述本发明的集群弹性部署的方法的示意图。本发明的集群弹性部署的方法的核心流程包括创建集群、集群动态添加机器、集群动态删除机器、集群动态修改配置,如下将参考图2A-2D对其逐一进行描述。2A-2D are schematic diagrams for describing a method of cluster elastic deployment of the present invention. The core processes of the cluster elastic deployment method of the present invention include creating a cluster, a cluster dynamic adding machine, a cluster dynamic deleting machine, and a cluster dynamic modifying configuration, which will be described one by one with reference to FIGS. 2A-2D.
图2A是根据本发明实施例的创建集群的方法的示意图。根据本发明实施例的创建集群的方法可以包括:定义集群的基本信息,该基本信息包括集群名称、集群机器类型(实体机或虚拟机)、使用有效期和集群规模;从包括由拟用于生成集群的机器所组成的资源池中,选择拟用于生成集群的机器;选择将用于机器的诸如Hadoop、HBase、Zookeeper的软件的版本并修改机器的配置信息;选择每个机器要部署的服务进程;根据机器中所包括的检测脚本,对机器进行环境正确性检查;如果检查通过,则从云存储获取对机器进行配置的镜像文件,通过HAgent启用要部署的服务进程。可选地,如果检查未通过,则将该机器留在资源池中;或重复上述选择将用于机器的软件的版本并修改机器的配置信息,选择要部署的服务进程,以及对机器进行环境正确性检查的步骤。2A is a schematic diagram of a method of creating a cluster in accordance with an embodiment of the present invention. The method for creating a cluster according to an embodiment of the present invention may include: defining basic information of a cluster, including a cluster name, a cluster machine type (physical machine or virtual machine), a usage validity period, and a cluster size; In the resource pool composed of the clustered machines, select the machine to be used to generate the cluster; select the version of the software to be used for the machine such as Hadoop, HBase, Zookeeper and modify the configuration information of the machine; select the service to be deployed for each machine. Process; according to the detection script included in the machine, the machine is checked for environmental correctness; if the check is passed, the image file configured for the machine is obtained from the cloud storage, and the service process to be deployed is enabled through the HAgent. Optionally, if the check fails, leave the machine in the resource pool; or repeat the above selection to use the version of the software for the machine and modify the configuration information of the machine, select the service process to be deployed, and perform the environment on the machine. The steps to check for correctness.
图2B是根据本发明实施例的集群动态添加机器的方法的示意图。根据本发明实施例的集群动态添加机器的方法可以包括:从资源池中选择一台空闲的机器并且指定其上要部署的服务进程;根据机器中所包括的检测脚本,对该机器进行环境正确性检查;如果检查通过,则从云存储获取对该机器进行配置的镜像文件,对其配置进行更新,启用要部署的服务进程。可选地,如果检查未通过,则对该机器重复环境正确性检查的步骤;或从资源池中另外选择一台空闲的机器,并重 复上述步骤。2B is a schematic diagram of a method of dynamically adding a machine to a cluster according to an embodiment of the present invention. A method for dynamically adding a machine according to an embodiment of the present invention may include: selecting an idle machine from a resource pool and designating a service process to be deployed thereon; and performing an environment correct for the machine according to a detection script included in the machine Sex check; if the check passes, get the image file configured for the machine from the cloud storage, update its configuration, and enable the service process to be deployed. Optionally, if the check fails, repeat the step of checking the environmental correctness of the machine; or select another idle machine from the resource pool, and Repeat the above steps.
图2C是根据本发明实施例的集群动态删除机器的方法的示意图。根据本发明实施例的集群动态删除机器的方法可以包括:通过中心管理服务器的界面,选择需要停止的机器并且停止部署在其上的服务进程;根据该机器中所包括的检测脚本,对集群工作状态进行检测;如果集群能够工作正常,则将该机器的配置信息重置为初始状态,以使其配置信息返回到被放入到资源池时的状态,由此完成该机器归入资源池的操作。可选地,如果集群不能工作正常,则重新启用该机器上的被停止的服务进程。2C is a schematic diagram of a method of cluster dynamic deletion of a machine in accordance with an embodiment of the present invention. The method for dynamically deleting a machine according to an embodiment of the present invention may include: selecting, by an interface of the central management server, a machine that needs to be stopped and stopping a service process deployed thereon; working on the cluster according to a detection script included in the machine The status is detected; if the cluster is working properly, the configuration information of the machine is reset to the initial state, so that its configuration information is returned to the state when it is put into the resource pool, thereby completing the classification of the machine into the resource pool. operating. Optionally, if the cluster is not working properly, re-enable the stopped service process on that machine.
图2D是根据本发明实施例的集群动态修改配置的方法的示意图。根据本发明实施例的集群动态修改配置的方法可以包括:对欲对其进行配置项修改的机器的原配置项进行备份;通过中心管理服务器的界面,修改要更改的配置项;根据该机器中所包括的检测脚本,对修改后的配置项的合法性进行审查;如果审查通过,则修改后的配置项被推送到该机器。可选地,如果审查未通过,则在该机器中用备份的原配置项覆盖修改后的配置项。2D is a schematic diagram of a method for dynamically modifying a configuration of a cluster according to an embodiment of the present invention. The method for dynamically modifying a configuration of a cluster according to the embodiment of the present invention may include: backing up an original configuration item of a machine for which configuration item modification is to be performed; and modifying a configuration item to be changed through an interface of the central management server; The included detection script reviews the legality of the modified configuration item; if the review is passed, the modified configuration item is pushed to the machine. Optionally, if the review fails, the modified configuration item is overwritten in the machine with the original configuration item of the backup.
根据本发明实施例的技术方案,通过在集群部署中引入资源池,而将所有拟用于集群部署的机器放入资源池内,提供了能够动态管理的弹性计算框架的集群,其能够实现可扩展性强、可维护性强、资源利用率高以及人力成本低的效果。According to the technical solution of the embodiment of the present invention, by introducing a resource pool in a cluster deployment, all the machines to be used for the cluster deployment are put into the resource pool, and a cluster capable of dynamically managing the elastic computing framework is provided, which can be expanded. Strong performance, maintainability, high resource utilization and low labor costs.
以上结合具体实施例描述了本发明的基本原理,但是,需要指出的是,对本领域的普通技术人员而言,能够理解本发明的方法和设备的全部或者任何步骤或者部件,可以在任何计算装置(包括处理器、存储介质等)或者计算装置的网络中,以硬件、固件、软件或者它们的组合加以实现,这是本领域普通技术人员在阅读了本发明的说明的情况下运用他们的基本编程技能就能实现的。 The basic principles of the present invention have been described above in connection with the specific embodiments, but it should be noted that those skilled in the art can understand that all or any of the steps or components of the method and apparatus of the present invention may be in any computing device. (including a processor, a storage medium, etc.) or a network of computing devices implemented in hardware, firmware, software, or a combination thereof, which is used by those of ordinary skill in the art in view of the description of the present invention. Programming skills can be achieved.
因此,本发明的目的还可以通过在任何计算装置上运行一个程序或者一组程序来实现。所述计算装置可以是公知的通用装置。因此,本发明的目的也可以仅仅通过提供包含实现所述方法或者装置的程序代码的程序产品来实现。也就是说,这样的程序产品也构成本发明,并且存储有这样的程序产品的存储介质也构成本发明。显然,所述存储介质可以是任何公知的存储介质或者将来开发出的任何存储介质。Thus, the objects of the invention can also be achieved by running a program or a set of programs on any computing device. The computing device can be a well-known general purpose device. Accordingly, the object of the present invention can also be achieved by merely providing a program product comprising program code for implementing the method or apparatus. That is to say, such a program product also constitutes the present invention, and a storage medium storing such a program product also constitutes the present invention. It will be apparent that the storage medium may be any known storage medium or any storage medium developed in the future.
还需要指出的是,在本发明的装置和方法中,显然,各部件或各步骤是可以分解和/或重新组合的。这些分解和/或重新组合应视为本发明的等效方案。并且,执行上述系列处理的步骤可以自然地按照说明的顺序按时间顺序执行,但是并不需要一定按照时间顺序执行。某些步骤可以并行或彼此独立地执行。It should also be noted that in the apparatus and method of the present invention, it is apparent that the various components or steps may be decomposed and/or recombined. These decompositions and/or recombinations should be considered as equivalents to the invention. Also, the steps of performing the series of processes described above may naturally be performed in chronological order in the order illustrated, but need not necessarily be performed in chronological order. Certain steps may be performed in parallel or independently of one another.
上述具体实施方式,并不构成对本发明保护范围的限制。本领域技术人员应该明白的是,取决于设计要求和其他因素,可以发生各种各样的修改、组合、子组合和替代。任何在本发明的精神和原则之内所作的修改、等同替换和改进等,均应包含在本发明保护范围之内。 The above specific embodiments do not constitute a limitation of the scope of the present invention. Those skilled in the art will appreciate that a wide variety of modifications, combinations, sub-combinations and substitutions can occur depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and scope of the invention are intended to be included within the scope of the invention.

Claims (11)

  1. 一种集群弹性部署的方法,包括步骤:A method for cluster elastic deployment includes the steps of:
    定义集群的基本信息;Define basic information about the cluster;
    从包括由拟用于生成集群的机器所组成的资源池中,选择拟用于生成所述集群的机器;Selecting the machine to be used to generate the cluster from a pool of resources consisting of machines intended to be used to generate the cluster;
    选择将用于所述机器的软件的版本并修改所述机器的配置信息;Selecting a version of software to be used for the machine and modifying configuration information for the machine;
    选择所述机器要部署的服务进程;以及Select the service process to be deployed by the machine; and
    根据所述机器中所包括的检测脚本,对所述机器进行环境正确性检查,其中,如果所述环境正确性检查通过,则从云存储获取对所述机器进行配置的镜像文件,以及在所述机器上启用所述要部署的服务进程。Performing an environmental correctness check on the machine according to the detection script included in the machine, wherein if the environmental correctness check passes, obtaining an image file for configuring the machine from the cloud storage, and Enable the service process to be deployed on the machine.
  2. 根据权利要求1所述的方法,其中,所述定义集群的基本信息包括:定义所述集群的集群名称、集群机器类型、使用有效期和集群规模。The method of claim 1, wherein the defining the basic information of the cluster comprises: defining a cluster name of the cluster, a cluster machine type, a usage validity period, and a cluster size.
  3. 根据权利要求1所述的方法,其中,所述对所述机器进行环境正确性检查包括:对所述机器的操作系统版本、JDK版本、主机名、防火墙、时间同步配置中的一项或多项进行检查。The method of claim 1, wherein said performing an environmental correctness check on said machine comprises: one or more of an operating system version, a JDK version, a host name, a firewall, a time synchronization configuration of said machine Check the item.
  4. 根据权利要求1所述的方法,进一步包括向所述集群添加机器的步骤,所述步骤包括:The method of claim 1 further comprising the step of adding a machine to said cluster, said steps comprising:
    从所述资源池中选择空闲的机器并且指定在其上要部署的服务进程;以及Selecting an idle machine from the resource pool and specifying the service process to be deployed on it;
    根据所述机器中所包括的检测脚本,对所述机器进行环境正确性检查,其中,如果所述环境正确性检查通过,则从所述云存储获取对所述机器进行配置的镜像文件,对所述机器的配置进行更新,以及启用所述要部署的服务进程。 Performing an environmental correctness check on the machine according to the detection script included in the machine, wherein if the environment correctness check passes, obtaining an image file for configuring the machine from the cloud storage, The configuration of the machine is updated and the service process to be deployed is enabled.
  5. 根据权利要求1所述的方法,进一步包括向所述集群动态删除机器的步骤,所述步骤包括:The method of claim 1 further comprising the step of dynamically deleting the machine to said cluster, said steps comprising:
    选择需要停止的机器并且停止部署在其上的服务进程;以及Select the machine that needs to be stopped and stop the service process deployed on it;
    根据所述机器中所包括的检测脚本,对所述集群工作状态进行检测,其中,如果所述集群能够工作正常,则将所述机器的配置信息重置为初始状态。The cluster working state is detected according to a detection script included in the machine, wherein if the cluster can work normally, the configuration information of the machine is reset to an initial state.
  6. 根据权利要求1所述的方法,进一步包括对所述集群进行动态修改配置的步骤,所述步骤包括:The method of claim 1 further comprising the step of dynamically modifying said cluster, said steps comprising:
    对欲对其进行配置项修改的机器的原配置项进行备份;Back up the original configuration items of the machine for which you want to modify the configuration items;
    修改所述机器的要更改的配置项;以及Modifying the configuration item of the machine to be changed;
    根据所述机器中所包括的检测脚本,对修改后的配置项的合法性进行审查,其中,如果所述配置项的合法性的审查通过,则所述修改后的配置项被推送到所述机器。Examining the legality of the modified configuration item according to the detection script included in the machine, wherein if the review of the legality of the configuration item passes, the modified configuration item is pushed to the machine.
  7. 一种集群弹性部署的管理系统,包括:A management system for cluster elastic deployment, including:
    中心管理服务器;Central management server;
    云存储;Cloud storage
    数据库;database;
    由拟用于生成集群的机器所组成的资源池;以及a pool of resources consisting of the machines that are intended to be used to generate the cluster;
    由所述中心管理服务器,根据权利要求1-6中任意一项所述的方法所部署的集群。A cluster deployed by the central management server, according to the method of any of claims 1-6.
  8. 根据权利要求7所述的管理系统,其中,所述资源池中的拟用于生成集群的机器能够与所述中心管理服务器协同来完成接收命令和回送响应消息的相关操作。The management system of claim 7, wherein the machine in the resource pool to be used to generate the cluster is capable of collaborating with the central management server to perform related operations of receiving commands and echoing response messages.
  9. 根据权利要求7所述的管理系统,其中,所述资源池中的拟用于生成集群的机器包括检测脚本。 The management system of claim 7, wherein the machine in the resource pool to be used to generate the cluster comprises a detection script.
  10. 根据权利要求7所述的管理系统,其中,所述中心管理服务器还用于获取所述机器的所属集群、机器工作状态以及机器类型。The management system according to claim 7, wherein the central management server is further configured to acquire a cluster to which the machine belongs, a machine working state, and a machine type.
  11. 根据权利要求7所述的管理系统,其中,所述中心管理服务器记录所述集群和所述资源池的操作日志。 The management system according to claim 7, wherein said central management server records an operation log of said cluster and said resource pool.
PCT/CN2016/070999 2015-02-15 2016-01-15 Flexible deployment method for cluster and management system WO2016127756A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510084932.3A CN104679717B (en) 2015-02-15 2015-02-15 The method and management system of cluster elasticity deployment
CN201510084932.3 2015-02-15

Publications (1)

Publication Number Publication Date
WO2016127756A1 true WO2016127756A1 (en) 2016-08-18

Family

ID=53314783

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/070999 WO2016127756A1 (en) 2015-02-15 2016-01-15 Flexible deployment method for cluster and management system

Country Status (2)

Country Link
CN (1) CN104679717B (en)
WO (1) WO2016127756A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110688130A (en) * 2019-10-14 2020-01-14 天津卓朗科技发展有限公司 Physical machine deployment method, physical machine deployment device, readable storage medium and electronic equipment
CN110728372A (en) * 2019-09-27 2020-01-24 达而观信息科技(上海)有限公司 Cluster design method and cluster architecture for dynamic loading of artificial intelligence model
CN111026723A (en) * 2019-11-21 2020-04-17 望海康信(北京)科技股份公司 Big data cluster management control method and device, computer equipment and storage medium
CN112130871A (en) * 2020-09-27 2020-12-25 平安医疗健康管理股份有限公司 Method and device for remotely deploying middleware, computer equipment and storage medium
CN112162702A (en) * 2020-09-25 2021-01-01 科大讯飞股份有限公司 Distributed storage system, virtual machine creation method and data volume expansion method
CN113722057A (en) * 2021-03-12 2021-11-30 京东数字科技控股股份有限公司 Big data cluster processing method and system, electronic device and storage medium
CN114584489A (en) * 2022-03-08 2022-06-03 浪潮云信息技术股份公司 Ssh channel-based remote environment information and configuration detection method and system
CN114745376A (en) * 2020-12-24 2022-07-12 网联清算有限公司 ZooKeeper cluster operation and maintenance method and device, electronic equipment and storage medium

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104679717B (en) * 2015-02-15 2018-11-27 北京京东尚科信息技术有限公司 The method and management system of cluster elasticity deployment
CN106657167B (en) * 2017-03-10 2020-05-26 联想(北京)有限公司 Management server, server cluster, and management method
CN107332873A (en) * 2017-05-24 2017-11-07 杭州沃趣科技股份有限公司 A kind of method of automatic discovery cluster resource
CN107241445A (en) * 2017-07-31 2017-10-10 郑州云海信息技术有限公司 A kind of distributed server cluster dispositions method and system
CN110138577B (en) * 2018-02-08 2021-09-21 腾讯科技(深圳)有限公司 Cluster creation method, first server and readable storage medium
CN108900356A (en) * 2018-07-25 2018-11-27 郑州云海信息技术有限公司 A kind of cloud service dispositions method and system
CN109783212A (en) * 2018-12-26 2019-05-21 亚信科技(中国)有限公司 Background process distributed scheduling device, method, computer equipment and storage medium
CN110119312A (en) * 2019-04-30 2019-08-13 广州微算互联信息技术有限公司 A kind of dispatching method based on cloud storage, system, device and storage medium
CN110389766B (en) * 2019-06-21 2022-12-27 深圳市汇川技术股份有限公司 HBase container cluster deployment method, system, equipment and computer readable storage medium
CN111339055B (en) * 2020-02-07 2023-08-01 浪潮软件股份有限公司 Big data cluster capacity expansion method and device
CN111835563A (en) * 2020-07-03 2020-10-27 紫光云技术有限公司 Method for modifying configuration of mongodb database cluster parameters on cloud service platform
CN112667337A (en) * 2020-12-28 2021-04-16 紫光云技术有限公司 Method for modifying Kafka cluster parameters on cloud management platform

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005096736A2 (en) * 2004-03-31 2005-10-20 Unisys Corporation Clusterization with automated deployment of a cluster-unaware application
CN102130938A (en) * 2010-12-03 2011-07-20 中国科学院软件研究所 Resource supply method oriented to Web application host platform
CN103036927A (en) * 2011-09-29 2013-04-10 中国电信股份有限公司 Method, device and system of intelligent traffic control
CN104317610A (en) * 2014-10-11 2015-01-28 福建新大陆软件工程有限公司 Method and device for automatic installation and deployment of hadoop platform
CN104679717A (en) * 2015-02-15 2015-06-03 北京京东尚科信息技术有限公司 Method and management system of elastic cluster deployment

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8732191B2 (en) * 2011-06-27 2014-05-20 Oracle International Corporation System and method for improving application connectivity in a clustered database environment
CN102843438A (en) * 2012-09-29 2012-12-26 深圳市博瑞得科技有限公司 Method and system for cloud computation node management
CN103561061B (en) * 2013-10-17 2016-08-10 南京邮电大学 A kind of cloud data mining platform dispositions method of elasticity
CN104158910B (en) * 2014-08-29 2017-12-15 金石易诚(北京)科技有限公司 A kind of high in the clouds Web applications automatically dispose system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2005096736A2 (en) * 2004-03-31 2005-10-20 Unisys Corporation Clusterization with automated deployment of a cluster-unaware application
CN102130938A (en) * 2010-12-03 2011-07-20 中国科学院软件研究所 Resource supply method oriented to Web application host platform
CN103036927A (en) * 2011-09-29 2013-04-10 中国电信股份有限公司 Method, device and system of intelligent traffic control
CN104317610A (en) * 2014-10-11 2015-01-28 福建新大陆软件工程有限公司 Method and device for automatic installation and deployment of hadoop platform
CN104679717A (en) * 2015-02-15 2015-06-03 北京京东尚科信息技术有限公司 Method and management system of elastic cluster deployment

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110728372A (en) * 2019-09-27 2020-01-24 达而观信息科技(上海)有限公司 Cluster design method and cluster architecture for dynamic loading of artificial intelligence model
CN110728372B (en) * 2019-09-27 2023-04-25 达而观信息科技(上海)有限公司 Cluster design method and cluster system for dynamic loading of artificial intelligent model
CN110688130A (en) * 2019-10-14 2020-01-14 天津卓朗科技发展有限公司 Physical machine deployment method, physical machine deployment device, readable storage medium and electronic equipment
CN111026723A (en) * 2019-11-21 2020-04-17 望海康信(北京)科技股份公司 Big data cluster management control method and device, computer equipment and storage medium
CN111026723B (en) * 2019-11-21 2023-08-11 望海康信(北京)科技股份公司 Big data cluster management control method and device, computer equipment and storage medium
CN112162702A (en) * 2020-09-25 2021-01-01 科大讯飞股份有限公司 Distributed storage system, virtual machine creation method and data volume expansion method
CN112130871A (en) * 2020-09-27 2020-12-25 平安医疗健康管理股份有限公司 Method and device for remotely deploying middleware, computer equipment and storage medium
CN114745376A (en) * 2020-12-24 2022-07-12 网联清算有限公司 ZooKeeper cluster operation and maintenance method and device, electronic equipment and storage medium
CN114745376B (en) * 2020-12-24 2023-12-15 网联清算有限公司 ZooKeeper cluster operation and maintenance method and device, electronic equipment and storage medium
CN113722057A (en) * 2021-03-12 2021-11-30 京东数字科技控股股份有限公司 Big data cluster processing method and system, electronic device and storage medium
CN114584489A (en) * 2022-03-08 2022-06-03 浪潮云信息技术股份公司 Ssh channel-based remote environment information and configuration detection method and system

Also Published As

Publication number Publication date
CN104679717A (en) 2015-06-03
CN104679717B (en) 2018-11-27

Similar Documents

Publication Publication Date Title
WO2016127756A1 (en) Flexible deployment method for cluster and management system
US11770381B2 (en) Managing security groups for data instances
US10817410B2 (en) Application programming interface for providing access to computing platform definitions
US20220413979A1 (en) Control service for data management
US9565260B2 (en) Account state simulation service for cloud computing environments
US20200067791A1 (en) Client account versioning metadata manager for cloud computing environments
US8805971B1 (en) Client-specified schema extensions in cloud computing environments
US8856077B1 (en) Account cloning service for cloud computing environments
US10409622B2 (en) Orchestration pipeline for providing and operating segmented computing resources
US9210178B1 (en) Mixed-mode authorization metadata manager for cloud computing environments
US7383271B2 (en) Centralized configuration data management for distributed clients
US11436099B2 (en) Backup client agent
US10469315B2 (en) Using computing platform definitions to provide segmented computing platforms in a computing system
US10778810B2 (en) Staging and deployment to multiple service clouds
US10911299B2 (en) Multiuser device staging
JP6382705B2 (en) Virtual device test apparatus, virtual device test method, and virtual device test program
US10572805B2 (en) Service modeling and execution
US9178867B1 (en) Interacting with restricted environments
US10862747B2 (en) Single user device staging
US20240118907A1 (en) Scalable specification and self-governance for autonomous databases, cluster databases and multi-tenant databases in cloud and on-prem environment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16748557

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 05/12/2017)

122 Ep: pct application non-entry in european phase

Ref document number: 16748557

Country of ref document: EP

Kind code of ref document: A1