WO2020192065A1 - Method for achieving cross-cluster high availability, apparatus, system, and device - Google Patents

Method for achieving cross-cluster high availability, apparatus, system, and device Download PDF

Info

Publication number
WO2020192065A1
WO2020192065A1 PCT/CN2019/109106 CN2019109106W WO2020192065A1 WO 2020192065 A1 WO2020192065 A1 WO 2020192065A1 CN 2019109106 W CN2019109106 W CN 2019109106W WO 2020192065 A1 WO2020192065 A1 WO 2020192065A1
Authority
WO
WIPO (PCT)
Prior art keywords
cluster
information
status
data service
available
Prior art date
Application number
PCT/CN2019/109106
Other languages
French (fr)
Chinese (zh)
Inventor
赵云
王志强
徐根林
孙迁
Original Assignee
苏宁云计算有限公司
苏宁易购集团股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 苏宁云计算有限公司, 苏宁易购集团股份有限公司 filed Critical 苏宁云计算有限公司
Priority to CA3176454A priority Critical patent/CA3176454A1/en
Publication of WO2020192065A1 publication Critical patent/WO2020192065A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1095Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network

Definitions

  • the present invention relates to the technical field of big data, in particular to a method, device, system and equipment for implementing cross-cluster high availability.
  • High-availability cluster the original English is High Availability Cluster, referred to as HA Cluster.
  • HA Cluster a cluster is a set of computers that provide users with a set of network resources as a whole. These individual computer systems are the nodes of the cluster.
  • a high-availability cluster (HA cluster) refers to a host group that runs like a single system and supports (computers) to continue normal operation.
  • High-availability cluster refers to the server cluster technology for the purpose of reducing service interruption time. It minimizes the impact of software/hardware/man-made failures on the business by protecting users' business programs to provide uninterrupted services to the outside world.
  • multi-clusters are built in the industry, mainly used for read-write separation and data backup, but they do not have the ability to automatically switch access and concurrent access between multiple clusters, which are specifically reflected in the following aspects: 1. Switching clusters requires changes The client is restarted after configuration; 2. The client cannot be controlled to switch clusters on the server; 3. When the waiting time of the client request is too long, it cannot request other clusters at the same time.
  • the embodiments of the present invention provide a method, device, system, and equipment for implementing cross-cluster high availability, which solves many shortcomings in the prior art in cross-cluster.
  • the technical solution is as follows:
  • a method for implementing cross-cluster high availability includes:
  • Monitor cluster information of at least two data service clusters configured through data synchronization, where the cluster information includes at least cluster status information and cluster address information; obtain the cluster information in real time through a distributed coordination service; perform corresponding data based on the cluster information Request access to the service cluster.
  • the monitoring cluster information of at least two data service clusters configured through data synchronization includes: regularly monitoring the cluster status information of the at least two data service clusters; The cluster state information of the changed state is notified to the distributed coordination service.
  • the data synchronization configuration includes: configuring the mutual replication relationship between the at least two data service clusters, and carrying the source cluster The data service request of the ID information is synchronized between the at least two data service clusters.
  • the real-time acquisition of the cluster information through the distributed coordination service includes: real-time monitoring of the cluster information, and real-time response to the cluster status information when the cluster status changes Notice.
  • the requesting access to the corresponding data service cluster according to the cluster information includes: : According to the cluster information, priority is given to accessing the clusters in the order of the available cluster status.
  • prioritizing access to clusters in the order of available cluster status includes: According to the cluster information, the first cluster address is obtained to determine whether the current cluster state is available, and when the judgment result is unavailable, the next cluster address is obtained, and the step of judging whether the current cluster state is available is jumped to; when the judgment result is available, Request access to the current cluster, and further determine whether the current cluster processing time exceeds the preset threshold, if it does not exceed, obtain the cluster feedback result; if it exceeds, concurrently request to access the next available cluster to obtain the priority return cluster feedback result.
  • a device for implementing cross-cluster high availability includes a monitoring module for monitoring cluster information of at least two data service clusters configured through data synchronization, where the cluster information includes at least cluster status Information and cluster address information; a distributed coordination service module for obtaining the cluster information in real time through a distributed coordination service; a request access control module for requesting access to a corresponding data service cluster according to the cluster information.
  • the monitoring module regularly monitors the cluster status information of at least two data service clusters, and notifies the distributed coordination service of the cluster status information of the cluster status change Module.
  • the distributed coordination service module is used for real-time monitoring of the cluster information, and real-time notification of the cluster information when the cluster status changes.
  • the device further includes a data synchronization configuration module that configures the mutual replication relationship between the at least two data service clusters, and includes The data service request of the source cluster ID information is synchronized between the at least two data service clusters.
  • the request access control module preferentially accesses the cluster status according to the cluster information The cluster with the highest available order.
  • the request access control module preferentially accesses the clusters whose cluster status is available in order according to the cluster information. , Including: obtaining the first cluster address according to the cluster information, judging whether the current cluster state is available, when the judgment result is unavailable, obtaining the next cluster address, and jumping to the step of judging whether the current cluster state is available; If the result is available, request access to the current cluster, and further determine whether the current cluster processing time exceeds the preset threshold, if not, obtain the cluster feedback result; if it exceeds, concurrently request access to the next available cluster to obtain the priority return cluster feedback result.
  • a system for realizing cross-cluster high availability includes: at least two data service clusters configured through data synchronization for realizing client request access to data services; the server side includes monitoring A device and a distributed coordination server, the monitoring device is used to monitor the cluster information of the at least two data service clusters, the cluster information includes at least cluster status information and cluster address information; the distributed coordination server is used to pass The distributed coordination service obtains the cluster information in real time; the client includes a controller, and the controller is used to request access to the corresponding data service cluster according to the cluster information received from the server.
  • the monitoring device regularly monitors the cluster status information of at least two data service clusters, and notifies the distributed coordination server of the cluster status information of the cluster status change;
  • the controller obtains the cluster information from the distributed coordination server, obtains the first cluster address according to the cluster information, determines whether the current cluster status is available, and when the judgment result is unavailable, obtains the next cluster address, Jump to the step of judging whether the current cluster status is available; when the judgment result is available, request access to the current cluster, and further judge whether the current cluster processing time exceeds the preset threshold, if not, get the cluster feedback result; if it exceeds, request access concurrently The next available cluster gets the cluster feedback result that is returned first.
  • a device for implementing cross-cluster high availability includes: a processor; a memory for storing executable instructions of the processor; wherein the processor is configured to pass through the The instructions can be executed to execute the steps of the method for achieving cross-cluster high availability described in any one of the above solutions.
  • the cross-cluster high availability implementation method, device, system, and equipment provided by the embodiments of the present invention monitor the cluster information of at least two data service clusters that are synchronized with data, and then obtain the cluster information in real time through the distributed coordination service, and then according to The cluster information obtained in real time is used to request access to the corresponding data service cluster, and the following beneficial effects are achieved:
  • the server cluster when the server cluster performs operation and maintenance operations, it does not affect client access and reduces the abnormal rate of the client; when the request time exceeds the threshold, the client can concurrently request other clusters, reducing the waiting time of the client and reducing the client Maximum request delay;
  • the server has the ability to adjust different client tenants to access different servers, and realizes the ability of the client to access multiple clusters in parallel to reduce the client response time, and has the advantages of efficient and precise control.
  • Figure 1 is a flowchart of a method for implementing cross-cluster high availability provided by an embodiment of the present invention
  • FIG. 2 is an example of the overall framework of a method for implementing cross-cluster high availability provided by an embodiment of the present invention
  • 3 is an example of 103 steps in the method for implementing cross-cluster high availability provided by an embodiment of the present invention
  • FIG. 4 is a schematic structural diagram of a device for implementing cross-cluster high availability provided by an embodiment of the present invention.
  • FIG. 5 is an example of a request sequence for a cross-cluster high-availability implementation device provided by an embodiment of the present invention to perform a cross-cluster high-availability service;
  • FIG. 6 is a schematic diagram of the system structure for implementing cross-cluster high availability provided by an embodiment of the present invention.
  • Fig. 7 is a schematic diagram of a device structure for implementing cross-cluster high availability provided by an embodiment of the present invention.
  • the embodiment of the present invention provides a cross-cluster high-availability implementation method, device, system, and equipment, by monitoring the cluster information of at least two data service clusters that are synchronized with data, and then obtaining the cluster information in real time through a distributed coordination service, Then, request access to the corresponding data service cluster based on the cluster information obtained in real time.
  • This achieves smoother cluster migration and switching between multiple clusters, that is, cluster migration and switching without the client's perception, and the cluster can be seamlessly switched on the server side manually to meet the needs of cluster operation and maintenance; in addition, It can reduce the maximum request delay of the client and has the advantages of high efficiency and precise control.
  • the cross-cluster high-availability implementation method, device, system and equipment not only realize the high availability of multiple clusters of data services such as big data real-time reading and writing to improve overall availability, but also realize the ability of clients to access multiple clusters in parallel. Reduce client response time and can be widely used in big data fields involving high-availability cluster operation and maintenance.
  • FIG. 1 is a flowchart of a method for implementing cross-cluster high availability provided by an embodiment of the present invention
  • FIG. 2 is an example of the overall framework of a method for implementing cross-cluster high availability provided by an embodiment of the present invention.
  • the method for implementing cross-cluster high availability provided by the embodiment of the present invention includes the following steps:
  • the data service types of the data service cluster here include real-time data reading and writing services, etc.
  • the timing frequency can be set to a frequency of 1 to 10 seconds.
  • a service with distributed coordination functions needs to be built for Record each cluster information, especially cluster status information, where the cluster status information mainly includes cluster availability information and status information such as cluster availability changes.
  • step 101 the process of monitoring the cluster information of at least two data service clusters configured through data synchronization can be implemented in other ways besides the method described in the above steps.
  • the embodiment of the present invention is specific The method is not limited.
  • the cluster information is monitored in real time, and the cluster status information of the cluster status change is notified accordingly in real time.
  • the cluster status information of the cluster status change is notified, such as to the client, so that the client after the access configuration accesses the distributed coordination service to obtain the cluster information of each data service cluster.
  • step 102 the process of obtaining cluster information in real time through the distributed coordination service in step 102 may be implemented in other ways in addition to the manner described in the foregoing steps, and the embodiment of the present invention does not limit the specific manner.
  • cluster information priority is given to accessing the clusters with the highest cluster status as available.
  • a distributed coordination service is requested to obtain all configured cluster addresses of the current client tenant.
  • the embodiment of the present invention does not impose special restrictions on the time point for obtaining the configuration cluster address, and it can be selected according to actual conditions.
  • Fig. 3 is an example of step 103 in the method for implementing cross-cluster high availability provided by an embodiment of the present invention. As shown in Figure 3, Step 103 includes the following sub-steps:
  • Step 1 The client requests the distributed coordination service to obtain all the configured cluster addresses of the current client tenant;
  • Step 2 Get the first cluster
  • Step 3 Determine whether the current cluster status is available, if not available, go to Step 4, and adjust to Step 5 if available;
  • Step 4 Get the next cluster and jump to Step 3;
  • Step 5 Request the current cluster
  • Step 6 Determine whether the current cluster processing time exceeds the preset threshold. If it exceeds, go to Step 7, and adjust to Step 9;
  • Step 7 Concurrently request the next available cluster
  • Step 8 Obtain the cluster feedback results that are returned first, and end the current process
  • Step 9 Obtain the cluster feedback result and end the current process.
  • step 103 the process of requesting access to the corresponding data service cluster according to the cluster information can be implemented in other ways besides the method described in the above steps.
  • the specific method is not described in the embodiment of the present invention. limited.
  • FIG. 4 is a schematic structural diagram of a device for implementing cross-cluster high availability provided by an embodiment of the present invention.
  • FIG. 5 is an example of a request sequence for a cross-cluster high-availability implementation device provided by an embodiment of the present invention to perform a cross-cluster high-availability service.
  • the device 2 for implementing cross-cluster high availability provided by the embodiment of the present invention includes a monitoring module 21, a distributed coordination service module 22 and a request access control module 23.
  • the monitoring module 21 is configured to monitor cluster information of at least two data service clusters configured through data synchronization, and the cluster information includes at least cluster state information and cluster address information. Specifically, the monitoring module 21 regularly monitors the cluster status information of at least two data service clusters, and notifies the distributed coordination service module 22 of the cluster status information of the cluster status change.
  • the distributed coordination service module 22 is used to obtain cluster information in real time through the distributed coordination service. Specifically, the distributed coordination service module 22 is used for real-time monitoring of cluster information, and real-time notification of cluster information that has changed in the cluster state.
  • the request access control module 23 is used for requesting access to the corresponding data service cluster according to the cluster information. Specifically, requesting the access control module 23 to preferentially access the clusters whose cluster status is available in order according to the cluster information further includes the following sub-steps:
  • the above-mentioned cross-cluster high-availability implementation device further includes a data synchronization configuration module 24.
  • the data synchronization configuration module 24 configures the mutual replication relationship between at least two data service clusters, and requests the data service with the source cluster ID information in Synchronize between at least two data service clusters.
  • the monitoring module 21, the distributed coordination service module 22, and the data synchronization configuration module 24 are set on the client, and the request access control module 23 is set on the server.
  • the request access control module 23 is set on the server.
  • how to perform terminal settings It can be determined according to actual conditions, and is not particularly limited in the embodiment of the present invention.
  • Figure 6 is a schematic diagram of the system structure for implementing cross-cluster high availability provided by an embodiment of the present invention.
  • the cross-cluster high availability implementation system provided by an embodiment of the present invention includes at least two data service clusters configured through data synchronization , Client 31 and Server 32. Among them, at least two data service clusters are used to implement client requests to access data services.
  • the server 31 includes a monitoring device 311 and a distributed coordination server 312.
  • the monitoring device 311 is used to monitor cluster information of at least two data service clusters.
  • the cluster information includes at least cluster status information and cluster address information.
  • the distributed coordination server 312 is configured to obtain cluster information in real time through the distributed coordination service.
  • the monitoring device 311 regularly monitors the cluster status information of at least two data service clusters, and notifies the distributed coordination server 312 of the cluster status information of the cluster status change.
  • the distributed coordination server 312 is used for real-time monitoring of cluster information, and real-time notification of cluster information that has changed in the cluster state.
  • the client 32 includes a controller 321.
  • the controller 321 is used to request access to the corresponding data service cluster according to the cluster information received from the server 31. Preferably, according to the cluster information, priority is given to accessing the clusters in the order of the available cluster status. .
  • the controller 321 obtains cluster information from the distributed coordination server 312, obtains the first cluster address according to the cluster information, determines whether the current cluster status is available, and when the judgment result is unavailable, obtains the next cluster address and jumps to Steps to determine whether the current cluster status is available; when the result of the determination is available, request access to the current cluster, and further determine whether the current cluster processing time exceeds the preset threshold, if not, obtain the cluster feedback result; if it exceeds, concurrently request access to the next available Cluster, get the cluster feedback result that is returned first.
  • FIG. 7 is a schematic diagram of the structure of a device for implementing cross-cluster high availability provided by an embodiment of the present invention.
  • the device 4 for implementing cross-cluster high availability provided by an embodiment of the present invention includes a processor 41 and a memory 42; the memory 42 is used
  • the executable instructions of the processor 41 are stored therein; wherein the processor 41 is configured to execute the steps of the method for realizing cross-cluster high availability provided by the embodiments of the present invention via the executable instructions.
  • cross-cluster high-availability implementation devices, systems, and equipment provided in the above embodiments trigger cross-cluster high-availability implementation services
  • only the division of the above-mentioned functional modules is used as an example. In actual applications, it can be as required
  • the above-mentioned function allocation is completed by different functional modules, that is, the internal structure of the device, system, or equipment is divided into different functional modules to complete all or part of the functions described above.
  • the cross-cluster high-availability implementation device, system, and equipment provided in the above-mentioned embodiments belong to the same concept as the cross-cluster high-availability implementation method embodiment.
  • the specific implementation process please refer to the method embodiment, which will not be repeated here.
  • the cross-cluster high-availability implementation method, device, system, and equipment provided by the embodiments of the present invention monitor the cluster information of at least two data service clusters for data synchronization, and then obtain real-time data through distributed coordination services Cluster information, and then request access to the corresponding data service cluster according to the cluster information obtained in real time, which has the following beneficial effects compared with the prior art:
  • the server cluster when the server cluster performs operation and maintenance operations, it does not affect client access and reduces the abnormal rate of the client; when the request time exceeds the threshold, the client can concurrently request other clusters, reducing the waiting time of the client and reducing the client Maximum request delay;
  • the server has the ability to adjust different client tenants to access different servers, and realizes the ability of the client to access multiple clusters in parallel to reduce the client response time, and has the advantages of efficient and precise control.
  • These computer program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or other programmable data processing equipment to generate a machine, so that the instructions executed by the processor of the computer or other programmable data processing equipment are generated It is a device that realizes the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
  • These computer program instructions can also be stored in a computer-readable memory that can guide a computer or other programmable data processing equipment to work in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including the instruction device.
  • the device implements the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
  • These computer program instructions can also be loaded on a computer or other programmable data processing equipment, so that a series of operation steps are executed on the computer or other programmable equipment to produce computer-implemented processing, so as to execute on the computer or other programmable equipment.
  • the instructions provide steps for implementing functions specified in a flow or multiple flows in the flowchart and/or a block or multiple blocks in the block diagram.

Abstract

The present invention relates to the technical field of big data, and disclosed are a method for achieving cross-cluster high availability, an apparatus, a system, and a device. The method comprises: monitoring cluster information of at least two data service clusters configured by means of data synchronization, the cluster information at least comprising cluster state information and cluster address information; obtaining the cluster information in real time by means of a distributed coordination service; and requesting access to the corresponding data service clusters according to the cluster information. According to the present invention, cluster information of at least two data service clusters having data synchronized is monitored, the cluster information is then obtained in real time by means of a distributed coordination service, and access to the corresponding data service clusters is requested according to the cluster information obtained in real time, so that many shortcomings concerning cross-cluster in the prior art are overcome.

Description

一种跨集群高可用的实现方法、装置、系统及设备Method, device, system and equipment for implementing cross-cluster high availability 技术领域Technical field
本发明涉及大数据技术领域,特别涉及一种跨集群高可用的实现方法、装置、系统及设备。The present invention relates to the technical field of big data, in particular to a method, device, system and equipment for implementing cross-cluster high availability.
背景技术Background technique
高可用集群,英文原文为High Availability Cluster,简称HA Cluster,简单的说,集群(cluster)就是一组计算机,它们作为一个整体向用户提供一组网络资源。这些单个的计算机系统就是集群的节点(node)。高可用性集群(HA cluster)是指如单系统一样地运行并支持(计算机)持续正常运行的一个主机群。高可用集群是指以减少服务中断时间为目的的服务器集群技术,它通过保护用户的业务程序对外不间断提供的服务,把因软件/硬件/人为造成的故障对业务的影响降低到最小程度。High-availability cluster, the original English is High Availability Cluster, referred to as HA Cluster. Simply put, a cluster is a set of computers that provide users with a set of network resources as a whole. These individual computer systems are the nodes of the cluster. A high-availability cluster (HA cluster) refers to a host group that runs like a single system and supports (computers) to continue normal operation. High-availability cluster refers to the server cluster technology for the purpose of reducing service interruption time. It minimizes the impact of software/hardware/man-made failures on the business by protecting users' business programs to provide uninterrupted services to the outside world.
因此,目前在大数据领域,提供数据实时读写能力的组件,比如HBase和ES等服务,集群内各角色均为高可用设计,多集群之间采用数据同步技术。然而行业内搭建多集群,主要用于读写分离和数据备份使用,但并不具备在多个集群之间自动切换访问和并发访问的能力,具体体现在如下方面:1、切换集群,需要更改客户端配置后重启;2、无法在服务端控制客户端切换集群;3、当客户端请求等待时间过长时,无法同时请求其他集群。Therefore, currently in the field of big data, components that provide real-time data read and write capabilities, such as HBase and ES services, are designed with high availability for each role in the cluster, and data synchronization technology is adopted between multiple clusters. However, multi-clusters are built in the industry, mainly used for read-write separation and data backup, but they do not have the ability to automatically switch access and concurrent access between multiple clusters, which are specifically reflected in the following aspects: 1. Switching clusters requires changes The client is restarted after configuration; 2. The client cannot be controlled to switch clusters on the server; 3. When the waiting time of the client request is too long, it cannot request other clusters at the same time.
发明内容Summary of the invention
为了解决现有技术的问题,本发明实施例提供了一种跨集群高可用的实现方法、装置、系统及设备,解决了现有技术中跨集群方面存在的诸多弊端。所述 技术方案如下:In order to solve the problems of the prior art, the embodiments of the present invention provide a method, device, system, and equipment for implementing cross-cluster high availability, which solves many shortcomings in the prior art in cross-cluster. The technical solution is as follows:
第一方面,提供了一种跨集群高可用的实现方法,所述方法包括:In the first aspect, a method for implementing cross-cluster high availability is provided, and the method includes:
监控通过数据同步配置的至少两个数据服务集群的集群信息,所述集群信息至少包括集群状态信息和集群地址信息;通过分布式协调服务实时获取所述集群信息;根据所述集群信息进行相应数据服务集群的请求访问。Monitor cluster information of at least two data service clusters configured through data synchronization, where the cluster information includes at least cluster status information and cluster address information; obtain the cluster information in real time through a distributed coordination service; perform corresponding data based on the cluster information Request access to the service cluster.
结合第一方面,在第一种可能的实现方式中,所述监控通过数据同步配置的至少两个数据服务集群的集群信息,包括:定时监控至少两个数据服务集群的集群状态信息;将集群状态发生变更的集群状态信息通知至所述分布式协调服务。With reference to the first aspect, in a first possible implementation manner, the monitoring cluster information of at least two data service clusters configured through data synchronization includes: regularly monitoring the cluster status information of the at least two data service clusters; The cluster state information of the changed state is notified to the distributed coordination service.
结合第一方面的第一种可能实现方式,在第二种可能实现方式中,所述数据同步配置包括:配置所述至少两个数据服务集群之间的相互复制关系,并将带有源集群ID信息的数据服务请求在所述至少两个数据服务集群之间进行同步。With reference to the first possible implementation manner of the first aspect, in the second possible implementation manner, the data synchronization configuration includes: configuring the mutual replication relationship between the at least two data service clusters, and carrying the source cluster The data service request of the ID information is synchronized between the at least two data service clusters.
结合第一方面,在第三种可能实现方式中,所述通过分布式协调服务实时获取所述集群信息,包括:实时监听所述集群信息,并实时对集群状态发生变更的集群状态信息进行相应通知。With reference to the first aspect, in a third possible implementation manner, the real-time acquisition of the cluster information through the distributed coordination service includes: real-time monitoring of the cluster information, and real-time response to the cluster status information when the cluster status changes Notice.
结合第一方面及第一方面的第一至三种可能实现方式的任一种,在第四至七种可能实现方式中,所述根据所述集群信息进行相应数据服务集群的请求访问,包括:根据所述集群信息,优先访问集群状态为可用的顺序靠前的集群。Combining the first aspect and any one of the first to three possible implementation manners of the first aspect, in the fourth to seventh possible implementation manners, the requesting access to the corresponding data service cluster according to the cluster information includes: : According to the cluster information, priority is given to accessing the clusters in the order of the available cluster status.
结合第一方面的第四至七种可能实现方式,在第八至十一种可能实现方式中,所述根据所述集群信息,优先访问集群状态为可用的顺序靠前的集群,包括:根据所述集群信息,获取第一个集群地址,判断当前集群状态是否可用,当判断结果为不可用,获取下一个集群地址,跳转至判断当前集群状态是否可用的步骤;当判断结果为可用,请求访问当前集群,进一步判断当前集群处理时间是否超过预设阈值,若未超过,获取集群反馈结果;若超过,并发请求访问下一个可用集群,获取优先返回的集群反馈结果。With reference to the fourth to seventh possible implementation manners of the first aspect, in the eighth to eleventh possible implementation manners, according to the cluster information, prioritizing access to clusters in the order of available cluster status includes: According to the cluster information, the first cluster address is obtained to determine whether the current cluster state is available, and when the judgment result is unavailable, the next cluster address is obtained, and the step of judging whether the current cluster state is available is jumped to; when the judgment result is available, Request access to the current cluster, and further determine whether the current cluster processing time exceeds the preset threshold, if it does not exceed, obtain the cluster feedback result; if it exceeds, concurrently request to access the next available cluster to obtain the priority return cluster feedback result.
第二方面,提供了一种跨集群高可用的实现装置,所述装置包括:监控模块,用于监控通过数据同步配置的至少两个数据服务集群的集群信息,所述集群信 息至少包括集群状态信息和集群地址信息;分布式协调服务模块,用于通过分布式协调服务实时获取所述集群信息;请求访问控制模块,用于根据所述集群信息进行相应数据服务集群的请求访问。In a second aspect, a device for implementing cross-cluster high availability is provided. The device includes a monitoring module for monitoring cluster information of at least two data service clusters configured through data synchronization, where the cluster information includes at least cluster status Information and cluster address information; a distributed coordination service module for obtaining the cluster information in real time through a distributed coordination service; a request access control module for requesting access to a corresponding data service cluster according to the cluster information.
结合第一方面,在第一种可能的实现方式中,所述监控模块定时监控至少两个数据服务集群的集群状态信息,并将集群状态发生变更的集群状态信息通知至所述分布式协调服务模块。With reference to the first aspect, in a first possible implementation manner, the monitoring module regularly monitors the cluster status information of at least two data service clusters, and notifies the distributed coordination service of the cluster status information of the cluster status change Module.
结合第一方面的第一种可能实现方式,在第二种可能实现方式中,所述分布式协调服务模块用于实时监听所述集群信息,并实时对集群状态发生变更的集群信息进行相应通知。In combination with the first possible implementation of the first aspect, in the second possible implementation, the distributed coordination service module is used for real-time monitoring of the cluster information, and real-time notification of the cluster information when the cluster status changes. .
结合第一方面,在第三种可能实现方式中,所述装置还包括数据同步配置模块,所述数据同步配置模块配置所述至少两个数据服务集群之间的相互复制关系,并将带有源集群ID信息的数据服务请求在所述至少两个数据服务集群之间进行同步。With reference to the first aspect, in a third possible implementation manner, the device further includes a data synchronization configuration module that configures the mutual replication relationship between the at least two data service clusters, and includes The data service request of the source cluster ID information is synchronized between the at least two data service clusters.
结合第一方面及第一方面的第一至三种可能实现方式的任一种,在第四至七种可能实现方式中,所述请求访问控制模块根据所述集群信息,优先访问集群状态为可用的顺序靠前的集群。Combining the first aspect and any one of the first to three possible implementation manners of the first aspect, in the fourth to seventh possible implementation manners, the request access control module preferentially accesses the cluster status according to the cluster information The cluster with the highest available order.
结合第一方面的第四至七种可能实现方式,在第八至十一种可能实现方式中,所述请求访问控制模块根据所述集群信息,优先访问集群状态为可用的顺序靠前的集群,包括:根据所述集群信息,获取第一个集群地址,判断当前集群状态是否可用,当判断结果为不可用,获取下一个集群地址,跳转至判断当前集群状态是否可用的步骤;当判断结果为可用,请求访问当前集群,进一步判断当前集群处理时间是否超过预设阈值,若未超过,获取集群反馈结果;若超过,并发请求访问下一个可用集群,获取优先返回的集群反馈结果。In combination with the fourth to seventh possible implementation manners of the first aspect, in the eighth to eleventh possible implementation manners, the request access control module preferentially accesses the clusters whose cluster status is available in order according to the cluster information. , Including: obtaining the first cluster address according to the cluster information, judging whether the current cluster state is available, when the judgment result is unavailable, obtaining the next cluster address, and jumping to the step of judging whether the current cluster state is available; If the result is available, request access to the current cluster, and further determine whether the current cluster processing time exceeds the preset threshold, if not, obtain the cluster feedback result; if it exceeds, concurrently request access to the next available cluster to obtain the priority return cluster feedback result.
第三方面,提供了一种跨集群高可用的实现系统,所述系统包括:通过数据同步配置的至少两个数据服务集群,用于实现客户端对数据服务的请求访问;服务端,包括监控装置和分布式协调服务器,所述监控装置用于监控所述至少两个 数据服务集群的集群信息,所述集群信息至少包括集群状态信息和集群地址信息;所述分布式协调服务器,用于通过分布式协调服务实时获取所述集群信息;客户端,包括控制器,所述控制器用于根据从所述服务端接收的所述集群信息进行相应数据服务集群的请求访问。In a third aspect, a system for realizing cross-cluster high availability is provided. The system includes: at least two data service clusters configured through data synchronization for realizing client request access to data services; the server side includes monitoring A device and a distributed coordination server, the monitoring device is used to monitor the cluster information of the at least two data service clusters, the cluster information includes at least cluster status information and cluster address information; the distributed coordination server is used to pass The distributed coordination service obtains the cluster information in real time; the client includes a controller, and the controller is used to request access to the corresponding data service cluster according to the cluster information received from the server.
结合第三方面,在第一种可能实现方式中,所述监控装置定时监控至少两个数据服务集群的集群状态信息,并将集群状态发生变更的集群状态信息通知至所述分布式协调服务器;所述控制器从所述分布式协调服务器获取所述集群信息,根据所述集群信息,获取第一个集群地址,判断当前集群状态是否可用,当判断结果为不可用,获取下一个集群地址,跳转至判断当前集群状态是否可用的步骤;当判断结果为可用,请求访问当前集群,进一步判断当前集群处理时间是否超过预设阈值,若未超过,获取集群反馈结果;若超过,并发请求访问下一个可用集群,获取优先返回的集群反馈结果。With reference to the third aspect, in a first possible implementation manner, the monitoring device regularly monitors the cluster status information of at least two data service clusters, and notifies the distributed coordination server of the cluster status information of the cluster status change; The controller obtains the cluster information from the distributed coordination server, obtains the first cluster address according to the cluster information, determines whether the current cluster status is available, and when the judgment result is unavailable, obtains the next cluster address, Jump to the step of judging whether the current cluster status is available; when the judgment result is available, request access to the current cluster, and further judge whether the current cluster processing time exceeds the preset threshold, if not, get the cluster feedback result; if it exceeds, request access concurrently The next available cluster gets the cluster feedback result that is returned first.
第四方面,提供了一种跨集群高可用的实现设备,所述设备包括:处理器;存储器,用于存储有所述处理器的可执行指令;其中,所述处理器配置为经由所述可执行指令来执行上述方案任一项所述的实现跨集群高可用的方法的步骤。In a fourth aspect, a device for implementing cross-cluster high availability is provided. The device includes: a processor; a memory for storing executable instructions of the processor; wherein the processor is configured to pass through the The instructions can be executed to execute the steps of the method for achieving cross-cluster high availability described in any one of the above solutions.
本发明实施例提供的跨集群高可用的实现方法、装置、系统及设备,通过对数据同步的至少两个数据服务集群的集群信息进行监控,然后通过分布式协调服务实时获取集群信息,然后根据实时获取的集群信息进行相应数据服务集群的请求访问,实现了以下几个有益效果:The cross-cluster high availability implementation method, device, system, and equipment provided by the embodiments of the present invention monitor the cluster information of at least two data service clusters that are synchronized with data, and then obtain the cluster information in real time through the distributed coordination service, and then according to The cluster information obtained in real time is used to request access to the corresponding data service cluster, and the following beneficial effects are achieved:
1、首先,实现了在多集群间更流畅地进行集群迁移和切换,即进行客户端无感知的集群迁移和切换,并且是可以手工在服务端无缝切换集群,满足集群运维需要,实现了大数据实时读写等数据服务的多集群高可用以提升整体可用性;1. First of all, it realizes smoother cluster migration and switching between multiple clusters, that is, cluster migration and switching that the client does not perceive, and the cluster can be seamlessly switched on the server side manually to meet the needs of cluster operation and maintenance. Multi-cluster high availability of data services such as big data real-time reading and writing to improve overall availability;
2、其次,当服务端集群进行运维操作时,不影响客户端访问,减少客户端异常率;当请求时间超过阈值时,客户端可以并发请求其他集群,减少客户端等待时间,减少客户端最大请求时延;2. Secondly, when the server cluster performs operation and maintenance operations, it does not affect client access and reduces the abnormal rate of the client; when the request time exceeds the threshold, the client can concurrently request other clusters, reducing the waiting time of the client and reducing the client Maximum request delay;
3、再者,使得服务端具备调整不同客户端租户访问不同服务端的能力,实 现了客户端并行访问多集群的能力以减少客户端响应时间,具有高效、精准控制的优点。3. In addition, the server has the ability to adjust different client tenants to access different servers, and realizes the ability of the client to access multiple clusters in parallel to reduce the client response time, and has the advantages of efficient and precise control.
附图说明Description of the drawings
为了更清楚地说明本发明实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly describe the technical solutions in the embodiments of the present invention, the following will briefly introduce the accompanying drawings used in the description of the embodiments. Obviously, the accompanying drawings in the following description are only some embodiments of the present invention. For those of ordinary skill in the art, other drawings can be obtained from these drawings without creative work.
图1是本发明实施例提供的跨集群高可用的实现方法流程图;Figure 1 is a flowchart of a method for implementing cross-cluster high availability provided by an embodiment of the present invention;
图2是本发明实施例提供的跨集群高可用的实现方法的整体框架示例;FIG. 2 is an example of the overall framework of a method for implementing cross-cluster high availability provided by an embodiment of the present invention;
图3是本发明实施例提供的跨集群高可用的实现方法中103步骤示例;3 is an example of 103 steps in the method for implementing cross-cluster high availability provided by an embodiment of the present invention;
图4是本发明实施例提供的跨集群高可用的实现装置结构示意图;FIG. 4 is a schematic structural diagram of a device for implementing cross-cluster high availability provided by an embodiment of the present invention;
图5是本发明实施例提供的跨集群高可用的实现装置进行跨集群高可用业务的请求时序示例;5 is an example of a request sequence for a cross-cluster high-availability implementation device provided by an embodiment of the present invention to perform a cross-cluster high-availability service;
图6是本发明实施例提供的跨集群高可用的实现系统结构示意图;6 is a schematic diagram of the system structure for implementing cross-cluster high availability provided by an embodiment of the present invention;
图7是本发明实施例提供的跨集群高可用的实现设备结构示意图。Fig. 7 is a schematic diagram of a device structure for implementing cross-cluster high availability provided by an embodiment of the present invention.
具体实施方式detailed description
为使本发明的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。需要说明的是,在本发明的描述中,“多个”的含义是两个以上,除非另有明确具体的限定。In order to make the objectives, technical solutions, and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are merely Some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention. It should be noted that, in the description of the present invention, "a plurality of" means two or more, unless otherwise specifically defined.
本发明实施例提供的一种跨集群高可用的实现方法、装置、系统及设备,通过对数据同步的至少两个数据服务集群的集群信息进行监控,然后通过分布式 协调服务实时获取集群信息,然后根据实时获取的集群信息进行相应数据服务集群的请求访问。由此实现了在多集群间更流畅地进行集群迁移和切换,即进行客户端无感知的集群迁移和切换,并且是可以手工在服务端无缝切换集群,满足集群运维需要;另外,还能减少客户端最大请求时延,具有高效、精准控制的优点。因此,该跨集群高可用的实现方法、装置、系统及设备,不仅实现了大数据实时读写等数据服务的多集群高可用以提升整体可用性,还实现了客户端并行访问多集群的能力以减少客户端响应时间,可广泛应用于涉及高可用集群运维的大数据领域。The embodiment of the present invention provides a cross-cluster high-availability implementation method, device, system, and equipment, by monitoring the cluster information of at least two data service clusters that are synchronized with data, and then obtaining the cluster information in real time through a distributed coordination service, Then, request access to the corresponding data service cluster based on the cluster information obtained in real time. This achieves smoother cluster migration and switching between multiple clusters, that is, cluster migration and switching without the client's perception, and the cluster can be seamlessly switched on the server side manually to meet the needs of cluster operation and maintenance; in addition, It can reduce the maximum request delay of the client and has the advantages of high efficiency and precise control. Therefore, the cross-cluster high-availability implementation method, device, system and equipment not only realize the high availability of multiple clusters of data services such as big data real-time reading and writing to improve overall availability, but also realize the ability of clients to access multiple clusters in parallel. Reduce client response time and can be widely used in big data fields involving high-availability cluster operation and maintenance.
下面结合具体实施例及附图,对本发明实施例提供的跨集群高可用的实现方法、装置、系统及设备作详细说明。The method, device, system, and equipment for implementing cross-cluster high availability provided by the embodiments of the present invention will be described in detail below in conjunction with specific embodiments and drawings.
实施例1Example 1
图1是本发明实施例提供的跨集群高可用的实现方法流程图,图2是本发明实施例提供的跨集群高可用的实现方法的整体框架示例。如图1和图2所示,本发明实施例提供的跨集群高可用的实现方法,包括以下步骤:FIG. 1 is a flowchart of a method for implementing cross-cluster high availability provided by an embodiment of the present invention, and FIG. 2 is an example of the overall framework of a method for implementing cross-cluster high availability provided by an embodiment of the present invention. As shown in Figure 1 and Figure 2, the method for implementing cross-cluster high availability provided by the embodiment of the present invention includes the following steps:
101、监控通过数据同步配置的至少两个数据服务集群的集群信息,集群信息至少包括集群状态信息和集群地址信息。101. Monitor cluster information of at least two data service clusters configured through data synchronization, where the cluster information includes at least cluster state information and cluster address information.
这里,可以利用已有或临时搭建的至少两个(即两个或多个)数据服务集群,并在各集群间配置数据同步功能,即配置至少两个数据服务集群之间的相互复制关系,并将带有源集群ID信息的数据服务请求在至少两个数据服务集群之间进行同步,例如,当某一个集群有数据写请求时,将写请求同步至其他集群,请求中记录源集群ID,防止请求回传至源系统。另外,这里数据服务集群的数据服务种类包括数据实时读写服务等。Here, you can use at least two (ie, two or more) data service clusters that have been or temporarily built, and configure the data synchronization function between each cluster, that is, configure the mutual replication relationship between at least two data service clusters, Synchronize the data service request with the source cluster ID information between at least two data service clusters. For example, when a data write request is requested from a certain cluster, the write request is synchronized to other clusters, and the source cluster ID is recorded in the request. To prevent the request from being passed back to the source system. In addition, the data service types of the data service cluster here include real-time data reading and writing services, etc.
具体的,上述101步骤可以按照以下方式进行:Specifically, the above 101 steps can be performed in the following manner:
定时监控至少两个数据服务集群的集群状态信息;将集群状态发生变更的集群状态信息通知至分布式协调服务。Regularly monitor the cluster status information of at least two data service clusters; notify the distributed coordination service of the cluster status information of the cluster status change.
为了定时监控各集群状态信息,需要部署一个监控程序或监控模块,一般定 时频率可以设置为1~10秒一次的频率,除此之外,还需要搭建一个具备分布式协调功能的服务,用于记录各集群信息,尤其是集群状态信息,这里的集群状态信息主要包括集群可用性信息以及集群可用性发生变更等状态信息。In order to monitor the status information of each cluster regularly, a monitoring program or monitoring module needs to be deployed. Generally, the timing frequency can be set to a frequency of 1 to 10 seconds. In addition, a service with distributed coordination functions needs to be built for Record each cluster information, especially cluster status information, where the cluster status information mainly includes cluster availability information and status information such as cluster availability changes.
值得注意的是,步骤101监控通过数据同步配置的至少两个数据服务集群的集群信息的过程,除了上述步骤所述的方式之外,还可以通过其他方式实现该过程,本发明实施例对具体的方式不加以限定。It is worth noting that in step 101, the process of monitoring the cluster information of at least two data service clusters configured through data synchronization can be implemented in other ways besides the method described in the above steps. The embodiment of the present invention is specific The method is not limited.
102、通过分布式协调服务实时获取集群信息。102. Obtain cluster information in real time through the distributed coordination service.
具体的,实时监听集群信息,并实时对集群状态发生变更的集群状态信息进行相应通知。通过分布式协调服务将集群状态发生变更的集群状态信息进行通知,如通知至客户端,从而由经过访问配置的客户端访问分布式协调服务,获取各数据服务集群的集群信息。Specifically, the cluster information is monitored in real time, and the cluster status information of the cluster status change is notified accordingly in real time. Through the distributed coordination service, the cluster status information of the cluster status change is notified, such as to the client, so that the client after the access configuration accesses the distributed coordination service to obtain the cluster information of each data service cluster.
值得注意的是,步骤102通过分布式协调服务实时获取集群信息的过程,除了上述步骤所述的方式之外,还可以通过其他方式实现该过程,本发明实施例对具体的方式不加以限定。It is worth noting that the process of obtaining cluster information in real time through the distributed coordination service in step 102 may be implemented in other ways in addition to the manner described in the foregoing steps, and the embodiment of the present invention does not limit the specific manner.
103、根据集群信息进行相应数据服务集群的请求访问。103. Perform request access to the corresponding data service cluster according to the cluster information.
根据集群信息,优先访问集群状态为可用的顺序靠前的集群。According to the cluster information, priority is given to accessing the clusters with the highest cluster status as available.
具体的,包括以下子步骤:Specifically, it includes the following sub-steps:
根据集群信息,获取第一个集群地址,According to the cluster information, get the first cluster address,
判断当前集群状态是否可用,当判断结果为不可用,获取下一个集群地址,跳转至判断当前集群状态是否可用的步骤;Determine whether the current cluster status is available. When the result of the determination is unavailable, get the next cluster address and jump to the step of determining whether the current cluster status is available;
当判断结果为可用,请求访问当前集群,进一步判断当前集群处理时间是否超过预设阈值,若未超过,获取集群反馈结果;若超过,并发请求访问下一个可用集群,获取优先返回的集群反馈结果。When the judgment result is available, request to access the current cluster, and further judge whether the current cluster processing time exceeds the preset threshold, if not, obtain the cluster feedback result; if it exceeds, concurrently request to access the next available cluster, and obtain the priority return cluster feedback result .
这里,优选地,在获取第一个集群地址之前,请求分布式协调服务,获取当前客户端租户的所有配置的集群地址。事实上,本发明实施例对获取配置集群地址的时间点不加以特别限制,可以根据实际情况进行选择。Here, preferably, before obtaining the first cluster address, a distributed coordination service is requested to obtain all configured cluster addresses of the current client tenant. In fact, the embodiment of the present invention does not impose special restrictions on the time point for obtaining the configuration cluster address, and it can be selected according to actual conditions.
图3是本发明实施例提供的跨集群高可用的实现方法中103步骤示例。如图3所示,103步骤包括以下子步骤:Fig. 3 is an example of step 103 in the method for implementing cross-cluster high availability provided by an embodiment of the present invention. As shown in Figure 3, Step 103 includes the following sub-steps:
Step 1:客户端请求分布式协调服务,获取当前客户端租户的所有配置的集群地址;Step 1: The client requests the distributed coordination service to obtain all the configured cluster addresses of the current client tenant;
Step 2:获取第一个集群;Step 2: Get the first cluster;
Step 3:判断当前集群状态是否可用,不可用跳转至Step 4,可用调整至Step 5;Step 3: Determine whether the current cluster status is available, if not available, go to Step 4, and adjust to Step 5 if available;
Step 4:获取下一个集群,跳转至Step 3;Step 4: Get the next cluster and jump to Step 3;
Step 5:请求当前集群;Step 5: Request the current cluster;
Step 6:判断当前集群处理时间是否超过预设阈值,超过跳转至Step 7,可用调整至Step 9;Step 6: Determine whether the current cluster processing time exceeds the preset threshold. If it exceeds, go to Step 7, and adjust to Step 9;
Step 7:并发请求下一个可用集群;Step 7: Concurrently request the next available cluster;
Step 8:获取优先返回的集群反馈结果,结束当前流程;Step 8: Obtain the cluster feedback results that are returned first, and end the current process;
Step 9:获取集群反馈结果,结束当前流程。Step 9: Obtain the cluster feedback result and end the current process.
值得注意的是,步骤103根据集群信息进行相应数据服务集群的请求访问的过程,除了上述步骤所述的方式之外,还可以通过其他方式实现该过程,本发明实施例对具体的方式不加以限定。It is worth noting that in step 103, the process of requesting access to the corresponding data service cluster according to the cluster information can be implemented in other ways besides the method described in the above steps. The specific method is not described in the embodiment of the present invention. limited.
实施例2Example 2
图4是本发明实施例提供的跨集群高可用的实现装置结构示意图。图5是本发明实施例提供的跨集群高可用的实现装置进行跨集群高可用业务的请求时序示例。如图4和图5所示,本发明实施例提供的跨集群高可用的实现装置2包括监控模块21、分布式协调服务模块22和请求访问控制模块23。Fig. 4 is a schematic structural diagram of a device for implementing cross-cluster high availability provided by an embodiment of the present invention. FIG. 5 is an example of a request sequence for a cross-cluster high-availability implementation device provided by an embodiment of the present invention to perform a cross-cluster high-availability service. As shown in FIG. 4 and FIG. 5, the device 2 for implementing cross-cluster high availability provided by the embodiment of the present invention includes a monitoring module 21, a distributed coordination service module 22 and a request access control module 23.
其中,监控模块21,用于监控通过数据同步配置的至少两个数据服务集群的集群信息,集群信息至少包括集群状态信息和集群地址信息。具体的,监控模块21定时监控至少两个数据服务集群的集群状态信息,并将集群状态发生变更的集群状态信息通知至分布式协调服务模块22。The monitoring module 21 is configured to monitor cluster information of at least two data service clusters configured through data synchronization, and the cluster information includes at least cluster state information and cluster address information. Specifically, the monitoring module 21 regularly monitors the cluster status information of at least two data service clusters, and notifies the distributed coordination service module 22 of the cluster status information of the cluster status change.
分布式协调服务模块22,用于通过分布式协调服务实时获取集群信息。具体的,分布式协调服务模块22用于实时监听集群信息,并实时对集群状态发生变更的集群信息进行相应通知。The distributed coordination service module 22 is used to obtain cluster information in real time through the distributed coordination service. Specifically, the distributed coordination service module 22 is used for real-time monitoring of cluster information, and real-time notification of cluster information that has changed in the cluster state.
请求访问控制模块23,用于根据集群信息进行相应数据服务集群的请求访问。具体的,请求访问控制模块23根据集群信息,优先访问集群状态为可用的顺序靠前的集群,进一步包括以下子步骤:The request access control module 23 is used for requesting access to the corresponding data service cluster according to the cluster information. Specifically, requesting the access control module 23 to preferentially access the clusters whose cluster status is available in order according to the cluster information further includes the following sub-steps:
根据集群信息,获取第一个集群地址,According to the cluster information, get the first cluster address,
判断当前集群状态是否可用,当判断结果为不可用,获取下一个集群地址,跳转至判断当前集群状态是否可用的步骤;Determine whether the current cluster status is available. When the result of the determination is unavailable, get the next cluster address and jump to the step of determining whether the current cluster status is available;
当判断结果为可用,请求访问当前集群,进一步判断当前集群处理时间是否超过预设阈值,若未超过,获取集群反馈结果;若超过,并发请求访问下一个可用集群,获取优先返回的集群反馈结果。When the judgment result is available, request to access the current cluster, and further judge whether the current cluster processing time exceeds the preset threshold, if not, obtain the cluster feedback result; if it exceeds, concurrently request to access the next available cluster, and obtain the priority return cluster feedback result .
另外,上述跨集群高可用的实现装置还包括数据同步配置模块24,数据同步配置模块24配置至少两个数据服务集群之间的相互复制关系,并将带有源集群ID信息的数据服务请求在至少两个数据服务集群之间进行同步。In addition, the above-mentioned cross-cluster high-availability implementation device further includes a data synchronization configuration module 24. The data synchronization configuration module 24 configures the mutual replication relationship between at least two data service clusters, and requests the data service with the source cluster ID information in Synchronize between at least two data service clusters.
优选地,将监控模块21、分布式协调服务模块22和数据同步配置模块24设置在客户端,将请求访问控制模块23设置在服务端。事实上,只要不脱离本发明的发明构思,对于跨集群高可用的实现装置中的监控模块21、分布式协调服务模块22、请求访问控制模块23和数据同步配置模块24,如何进行终端设置,可以根据实际情况来确定,本发明实施例不对其加以特别限定。Preferably, the monitoring module 21, the distributed coordination service module 22, and the data synchronization configuration module 24 are set on the client, and the request access control module 23 is set on the server. In fact, as long as it does not deviate from the inventive concept of the present invention, for the monitoring module 21, the distributed coordination service module 22, the request access control module 23, and the data synchronization configuration module 24 in the cross-cluster high-availability implementation device, how to perform terminal settings, It can be determined according to actual conditions, and is not particularly limited in the embodiment of the present invention.
实施例3Example 3
图6是本发明实施例提供的跨集群高可用的实现系统结构示意图,如图6所示,本发明实施例提供的跨集群高可用的实现系统包括通过数据同步配置的至少两个数据服务集群、客户端31及服务端32。其中,至少两个数据服务集群用于实现客户端对数据服务的请求访问。Figure 6 is a schematic diagram of the system structure for implementing cross-cluster high availability provided by an embodiment of the present invention. As shown in Figure 6, the cross-cluster high availability implementation system provided by an embodiment of the present invention includes at least two data service clusters configured through data synchronization , Client 31 and Server 32. Among them, at least two data service clusters are used to implement client requests to access data services.
服务端31包括监控装置311和分布式协调服务器312,监控装置311用于 监控至少两个数据服务集群的集群信息,集群信息至少包括集群状态信息和集群地址信息。分布式协调服务器312,用于通过分布式协调服务实时获取集群信息。监控装置311定时监控至少两个数据服务集群的集群状态信息,并将集群状态发生变更的集群状态信息通知至分布式协调服务器312。分布式协调服务器312用于实时监听集群信息,并实时对集群状态发生变更的集群信息进行相应通知。The server 31 includes a monitoring device 311 and a distributed coordination server 312. The monitoring device 311 is used to monitor cluster information of at least two data service clusters. The cluster information includes at least cluster status information and cluster address information. The distributed coordination server 312 is configured to obtain cluster information in real time through the distributed coordination service. The monitoring device 311 regularly monitors the cluster status information of at least two data service clusters, and notifies the distributed coordination server 312 of the cluster status information of the cluster status change. The distributed coordination server 312 is used for real-time monitoring of cluster information, and real-time notification of cluster information that has changed in the cluster state.
客户端32包括控制器321,控制器321用于根据从服务端31接收的集群信息进行相应数据服务集群的请求访问,优选地地根据集群信息,优先访问集群状态为可用的顺序靠前的集群。具体地,控制器321从分布式协调服务器312获取集群信息,根据集群信息,获取第一个集群地址,判断当前集群状态是否可用,当判断结果为不可用,获取下一个集群地址,跳转至判断当前集群状态是否可用的步骤;当判断结果为可用,请求访问当前集群,进一步判断当前集群处理时间是否超过预设阈值,若未超过,获取集群反馈结果;若超过,并发请求访问下一个可用集群,获取优先返回的集群反馈结果。The client 32 includes a controller 321. The controller 321 is used to request access to the corresponding data service cluster according to the cluster information received from the server 31. Preferably, according to the cluster information, priority is given to accessing the clusters in the order of the available cluster status. . Specifically, the controller 321 obtains cluster information from the distributed coordination server 312, obtains the first cluster address according to the cluster information, determines whether the current cluster status is available, and when the judgment result is unavailable, obtains the next cluster address and jumps to Steps to determine whether the current cluster status is available; when the result of the determination is available, request access to the current cluster, and further determine whether the current cluster processing time exceeds the preset threshold, if not, obtain the cluster feedback result; if it exceeds, concurrently request access to the next available Cluster, get the cluster feedback result that is returned first.
实施例4Example 4
图7是本发明实施例提供的跨集群高可用的实现设备结构示意图,如图7所示本发明实施例提供的跨集群高可用的实现设备4包括处理器41和存储器42;存储器42,用于存储有处理器41的可执行指令;其中,处理器41配置为经由可执行指令来执行本发明实施例提供的实现跨集群高可用的方法的步骤。FIG. 7 is a schematic diagram of the structure of a device for implementing cross-cluster high availability provided by an embodiment of the present invention. As shown in FIG. 7, the device 4 for implementing cross-cluster high availability provided by an embodiment of the present invention includes a processor 41 and a memory 42; the memory 42 is used The executable instructions of the processor 41 are stored therein; wherein the processor 41 is configured to execute the steps of the method for realizing cross-cluster high availability provided by the embodiments of the present invention via the executable instructions.
需要说明的是:上述实施例提供的跨集群高可用的实现装置、系统及设备在触发跨集群高可用实现业务时,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将装置、系统或设备的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。另外,上述实施例提供的跨集群高可用的实现装置、系统及设备与跨集群高可用的实现方法实施例属于同一构思,其具体实现过程详见方法实施例,这里不再赘述。It should be noted that when the cross-cluster high-availability implementation devices, systems, and equipment provided in the above embodiments trigger cross-cluster high-availability implementation services, only the division of the above-mentioned functional modules is used as an example. In actual applications, it can be as required The above-mentioned function allocation is completed by different functional modules, that is, the internal structure of the device, system, or equipment is divided into different functional modules to complete all or part of the functions described above. In addition, the cross-cluster high-availability implementation device, system, and equipment provided in the above-mentioned embodiments belong to the same concept as the cross-cluster high-availability implementation method embodiment. For the specific implementation process, please refer to the method embodiment, which will not be repeated here.
上述所有可选技术方案,可以采用任意结合形成本发明的可选实施例,在此不再一一赘述。All the above-mentioned optional technical solutions can be combined in any way to form an optional embodiment of the present invention, which will not be repeated here.
综上所述,本发明实施例提供的跨集群高可用的实现方法、装置、系统及设备,通过对数据同步的至少两个数据服务集群的集群信息进行监控,然后通过分布式协调服务实时获取集群信息,然后根据实时获取的集群信息进行相应数据服务集群的请求访问,与现有技术相比具有以下有益效果:In summary, the cross-cluster high-availability implementation method, device, system, and equipment provided by the embodiments of the present invention monitor the cluster information of at least two data service clusters for data synchronization, and then obtain real-time data through distributed coordination services Cluster information, and then request access to the corresponding data service cluster according to the cluster information obtained in real time, which has the following beneficial effects compared with the prior art:
1、首先,实现了在多集群间更流畅地进行集群迁移和切换,即进行客户端无感知的集群迁移和切换,并且是可以手工在服务端无缝切换集群,满足集群运维需要,实现了大数据实时读写等数据服务的多集群高可用以提升整体可用性;1. First of all, it realizes smoother cluster migration and switching between multiple clusters, that is, cluster migration and switching that the client does not perceive, and the cluster can be seamlessly switched on the server side manually to meet the needs of cluster operation and maintenance. Multi-cluster high availability of data services such as big data real-time reading and writing to improve overall availability;
2、其次,当服务端集群进行运维操作时,不影响客户端访问,减少客户端异常率;当请求时间超过阈值时,客户端可以并发请求其他集群,减少客户端等待时间,减少客户端最大请求时延;2. Secondly, when the server cluster performs operation and maintenance operations, it does not affect client access and reduces the abnormal rate of the client; when the request time exceeds the threshold, the client can concurrently request other clusters, reducing the waiting time of the client and reducing the client Maximum request delay;
3、再者,使得服务端具备调整不同客户端租户访问不同服务端的能力,实现了客户端并行访问多集群的能力以减少客户端响应时间,具有高效、精准控制的优点。3. In addition, the server has the ability to adjust different client tenants to access different servers, and realizes the ability of the client to access multiple clusters in parallel to reduce the client response time, and has the advantages of efficient and precise control.
本领域普通技术人员可以理解实现上述实施例的全部或部分步骤可以通过硬件来完成,也可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,上述提到的存储介质可以是只读存储器,磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the foregoing embodiments can be implemented by hardware, or by a program instructing relevant hardware to be completed. The program can be stored in a computer-readable storage medium. The storage medium mentioned can be a read-only memory, a magnetic disk or an optical disk, etc.
本申请实施例中是参照根据本申请实施例中实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The embodiments of the present application are described with reference to the flowcharts and/or block diagrams of the methods, equipment (systems), and computer program products according to the embodiments of the present application. It should be understood that each process and/or block in the flowchart and/or block diagram, and the combination of processes and/or blocks in the flowchart and/or block diagram can be implemented by computer program instructions. These computer program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or other programmable data processing equipment to generate a machine, so that the instructions executed by the processor of the computer or other programmable data processing equipment are generated It is a device that realizes the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions can also be stored in a computer-readable memory that can guide a computer or other programmable data processing equipment to work in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including the instruction device. The device implements the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing equipment, so that a series of operation steps are executed on the computer or other programmable equipment to produce computer-implemented processing, so as to execute on the computer or other programmable equipment. The instructions provide steps for implementing functions specified in a flow or multiple flows in the flowchart and/or a block or multiple blocks in the block diagram.
尽管已描述了本申请实施例中的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本申请实施例中范围的所有变更和修改。Although the preferred embodiments of the embodiments of the present application have been described, those skilled in the art can make additional changes and modifications to these embodiments once they learn the basic creative concept. Therefore, the appended claims are intended to be interpreted as including the preferred embodiments and all changes and modifications falling within the scope of the embodiments of the present application.
显然,本领域的技术人员可以对本发明进行各种改动和变型而不脱离本发明的精神和范围。这样,倘若本发明的这些修改和变型属于本发明权利要求及其等同技术的范围之内,则本发明也意图包含这些改动和变型在内。Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from the spirit and scope of the present invention. In this way, if these modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalent technologies, the present invention is also intended to include these modifications and variations.
以上所述仅为本发明的较佳实施例,并不用以限制本发明,凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included in the protection of the present invention. Within range.
本说明书中未作详细描述的内容属于本领域专业技术人员公知的现有技术。The content not described in detail in this specification belongs to the prior art known to those skilled in the art.

Claims (15)

  1. 一种跨集群高可用的实现方法,其特征在于,所述方法包括:A method for implementing cross-cluster high availability, characterized in that the method includes:
    监控通过数据同步配置的至少两个数据服务集群的集群信息,所述集群信息至少包括集群状态信息和集群地址信息;Monitoring cluster information of at least two data service clusters configured through data synchronization, where the cluster information includes at least cluster state information and cluster address information;
    通过分布式协调服务实时获取所述集群信息;Obtain the cluster information in real time through a distributed coordination service;
    根据所述集群信息进行相应数据服务集群的请求访问。Request access to the corresponding data service cluster according to the cluster information.
  2. 根据权利要求1所述的方法,其特征在于,所述监控通过数据同步配置的至少两个数据服务集群的集群信息,包括:The method according to claim 1, wherein the monitoring cluster information of at least two data service clusters configured through data synchronization comprises:
    定时监控至少两个数据服务集群的集群状态信息;Regularly monitor the cluster status information of at least two data service clusters;
    将集群状态发生变更的集群状态信息通知至所述分布式协调服务。Notifying the distributed coordination service of the cluster state information of the cluster state change.
  3. 根据权利要求2所述的方法,其特征在于,所述数据同步配置包括:The method according to claim 2, wherein the data synchronization configuration comprises:
    配置所述至少两个数据服务集群之间的相互复制关系,并将带有源集群ID信息的数据服务请求在所述至少两个数据服务集群之间进行同步。Configure the mutual replication relationship between the at least two data service clusters, and synchronize the data service request with the source cluster ID information between the at least two data service clusters.
  4. 根据权利要求1所述的方法,其特征在于,所述通过分布式协调服务实时获取所述集群信息,包括:The method according to claim 1, wherein said obtaining said cluster information in real time through a distributed coordination service comprises:
    实时监听所述集群信息,并实时对集群状态发生变更的集群状态信息进行相应通知。Monitor the cluster information in real time, and notify the cluster status information of the cluster status change in real time.
  5. 根据权利要求1至4任一项所述的方法,其特征在于,所述根据所述集群信息进行相应数据服务集群的请求访问,包括:The method according to any one of claims 1 to 4, wherein the requesting access to a corresponding data service cluster according to the cluster information comprises:
    根据所述集群信息,优先访问集群状态为可用的顺序靠前的集群。According to the cluster information, priority is given to accessing the clusters in the order of the available cluster status.
  6. 根据权利要求5所述的方法,其特征在于,所述根据所述集群信息,优先访问集群状态为可用的顺序靠前的集群,包括:The method according to claim 5, characterized in that, according to the cluster information, prioritizing access to the clusters whose cluster status is available in the first order comprises:
    根据所述集群信息,获取第一个集群地址,According to the cluster information, obtain the first cluster address,
    判断当前集群状态是否可用,当判断结果为不可用,获取下一个集群地址,跳转至判断当前集群状态是否可用的步骤;Determine whether the current cluster status is available. When the result of the determination is unavailable, get the next cluster address and jump to the step of determining whether the current cluster status is available;
    当判断结果为可用,请求访问当前集群,进一步判断当前集群处理时间是否超过预设阈值,若未超过,获取集群反馈结果;若超过,并发请求访问下一个可用集群,获取优先返回的集群反馈结果。When the judgment result is available, request to access the current cluster, and further judge whether the current cluster processing time exceeds the preset threshold, if not, obtain the cluster feedback result; if it exceeds, concurrently request to access the next available cluster, and obtain the priority return cluster feedback result .
  7. 一种跨集群高可用的实现装置,其特征在于,所述装置包括:A device for implementing cross-cluster high availability, characterized in that the device includes:
    监控模块,用于监控通过数据同步配置的至少两个数据服务集群的集群信息,所述集群信息至少包括集群状态信息和集群地址信息;A monitoring module, configured to monitor cluster information of at least two data service clusters configured through data synchronization, where the cluster information includes at least cluster status information and cluster address information;
    分布式协调服务模块,用于通过分布式协调服务实时获取所述集群信息;The distributed coordination service module is used to obtain the cluster information in real time through the distributed coordination service;
    请求访问控制模块,用于根据所述集群信息进行相应数据服务集群的请求访问。The request access control module is used for requesting access to the corresponding data service cluster according to the cluster information.
  8. 根据权利要求7所述的装置,其特征在于,所述监控模块定时监控至少两个数据服务集群的集群状态信息,并将集群状态发生变更的集群状态信息通知至所述分布式协调服务模块。The device according to claim 7, wherein the monitoring module regularly monitors the cluster status information of at least two data service clusters, and notifies the cluster status information of the cluster status changes to the distributed coordination service module.
  9. 根据权利要求7所述的装置,其特征在于,所述分布式协调服务模块用于实时监听所述集群信息,并实时对集群状态发生变更的集群信息进行相应通知。8. The device according to claim 7, wherein the distributed coordination service module is used to monitor the cluster information in real time, and notify the cluster information of the cluster status change in real time.
  10. 根据权利要求7所述的装置,其特征在于,所述装置还包括数据同步配置模块,所述数据同步配置模块配置所述至少两个数据服务集群之间的相互复制关系,并将带有源集群ID信息的数据服务请求在所述至少两个数据服务集群之间进行同步。7. The device according to claim 7, wherein the device further comprises a data synchronization configuration module, the data synchronization configuration module configures the mutual replication relationship between the at least two data service clusters, and carries a source The data service request of the cluster ID information is synchronized between the at least two data service clusters.
  11. 根据权利要求7至10任一项所述的装置,其特征在于,所述请求访问控制模块根据所述集群信息,优先访问集群状态为可用的顺序靠前的集群。The device according to any one of claims 7 to 10, wherein the request access control module preferentially accesses the clusters in the order of the available cluster status according to the cluster information.
  12. 根据权利要求11所述的装置,其特征在于,所述请求访问控制模块根据所述集群信息,优先访问集群状态为可用的顺序靠前的集群,包括:The device according to claim 11, wherein the request access control module, based on the cluster information, preferentially accesses the cluster with the cluster status available in the first order, comprising:
    根据所述集群信息,获取第一个集群地址,According to the cluster information, obtain the first cluster address,
    判断当前集群状态是否可用,当判断结果为不可用,获取下一个集群地址,跳转至判断当前集群状态是否可用的步骤;Determine whether the current cluster status is available. When the result of the determination is unavailable, get the next cluster address and jump to the step of determining whether the current cluster status is available;
    当判断结果为可用,请求访问当前集群,进一步判断当前集群处理时间是否超过预设阈值,若未超过,获取集群反馈结果;若超过,并发请求访问下一个可用集群,获取优先返回的集群反馈结果。When the judgment result is available, request to access the current cluster, and further judge whether the current cluster processing time exceeds the preset threshold, if not, obtain the cluster feedback result; if it exceeds, concurrently request to access the next available cluster, and obtain the priority return cluster feedback result .
  13. 一种跨集群高可用的实现系统,其特征在于,所述系统包括:A system for implementing cross-cluster high availability, characterized in that the system includes:
    通过数据同步配置的至少两个数据服务集群,用于实现客户端对数据服务的请求访问;At least two data service clusters configured through data synchronization are used to implement client requests to access data services;
    服务端,包括监控装置和分布式协调服务器,所述监控装置用于监控所述至少两个数据服务集群的集群信息,所述集群信息至少包括集群状态信息和集群地址信息;所述分布式协调服务器,用于通过分布式协调服务实时获取所述集群信息;The server side includes a monitoring device and a distributed coordination server, the monitoring device is used to monitor the cluster information of the at least two data service clusters, and the cluster information includes at least cluster status information and cluster address information; the distributed coordination A server for obtaining the cluster information in real time through a distributed coordination service;
    客户端,包括控制器,所述控制器用于根据从所述服务端接收的所述集群信息进行相应数据服务集群的请求访问。The client includes a controller, and the controller is configured to request access to a corresponding data service cluster according to the cluster information received from the server.
  14. 根据权利要求13所述的系统,其特征在于,The system according to claim 13, wherein:
    所述监控装置定时监控至少两个数据服务集群的集群状态信息,并将集群状态发生变更的集群状态信息通知至所述分布式协调服务器;The monitoring device regularly monitors the cluster status information of at least two data service clusters, and notifies the distributed coordination server of the cluster status information of the cluster status change;
    所述控制器从所述分布式协调服务器获取所述集群信息,根据所述集群信息,获取第一个集群地址,判断当前集群状态是否可用,当判断结果为不可用,获取下一个集群地址,跳转至判断当前集群状态是否可用的步骤;当判断结果为可用,请求访问当前集群,进一步判断当前集群处理时间是否超过预设阈值,若未超过,获取集群反馈结果;若超过,并发请求访问下一个可用集群,获取优先返回的集群反馈结果。The controller obtains the cluster information from the distributed coordination server, obtains the first cluster address according to the cluster information, determines whether the current cluster status is available, and when the judgment result is unavailable, obtains the next cluster address, Jump to the step of judging whether the current cluster status is available; when the judgment result is available, request access to the current cluster, and further judge whether the current cluster processing time exceeds the preset threshold, if not, get the cluster feedback result; if it exceeds, request access concurrently The next available cluster gets the cluster feedback result that is returned first.
  15. 一种跨集群高可用的实现设备,其特征在于,所述设备包括:A device for realizing cross-cluster high availability, characterized in that the device includes:
    处理器;processor;
    存储器,用于存储有所述处理器的可执行指令;A memory for storing executable instructions of the processor;
    其中,所述处理器配置为经由所述可执行指令来执行权利要求1至6中任一项所述的实现跨集群高可用的方法的步骤。Wherein, the processor is configured to execute the steps of the method for achieving cross-cluster high availability according to any one of claims 1 to 6 via the executable instructions.
PCT/CN2019/109106 2019-03-22 2019-09-29 Method for achieving cross-cluster high availability, apparatus, system, and device WO2020192065A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA3176454A CA3176454A1 (en) 2019-03-22 2019-09-29 Method for achieving cross-cluster high availability, apparatus, system, and device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910225014.6A CN111726388A (en) 2019-03-22 2019-03-22 Cross-cluster high-availability implementation method, device, system and equipment
CN201910225014.6 2019-03-22

Publications (1)

Publication Number Publication Date
WO2020192065A1 true WO2020192065A1 (en) 2020-10-01

Family

ID=72563824

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/109106 WO2020192065A1 (en) 2019-03-22 2019-09-29 Method for achieving cross-cluster high availability, apparatus, system, and device

Country Status (3)

Country Link
CN (1) CN111726388A (en)
CA (1) CA3176454A1 (en)
WO (1) WO2020192065A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114422567A (en) * 2021-12-09 2022-04-29 阿里巴巴(中国)有限公司 Data request processing method, device, system, computer equipment and medium
CN116431430A (en) * 2023-03-01 2023-07-14 合芯科技有限公司 System and method for monitoring and analyzing Zookeeper cluster

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113746887A (en) * 2020-11-05 2021-12-03 北京沃东天骏信息技术有限公司 Cross-cluster data request processing method, device and storage medium
CN114257606B (en) * 2021-12-13 2024-03-29 阿里巴巴(中国)有限公司 Data processing method, file management system, storage medium, and program product

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130204948A1 (en) * 2012-02-07 2013-08-08 Cloudera, Inc. Centralized configuration and monitoring of a distributed computing cluster
CN107463468A (en) * 2016-06-02 2017-12-12 北京京东尚科信息技术有限公司 Buffer memory management method and its equipment
CN107491343A (en) * 2017-09-08 2017-12-19 中国电子科技集团公司第二十八研究所 A kind of across cluster resource scheduling system based on cloud computing
CN107621973A (en) * 2016-07-13 2018-01-23 阿里巴巴集团控股有限公司 A kind of method for scheduling task and device across cluster

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103581276B (en) * 2012-08-09 2017-06-09 北京亿赞普网络技术有限公司 Cluster management device, system, service customer end and correlation method
CN104504128B (en) * 2014-12-31 2017-09-12 天津南大通用数据技术股份有限公司 A kind of construction method of the asymmetric deployment of Database Systems based on federal cluster
CN107451147B (en) * 2016-05-31 2020-07-31 北京京东尚科信息技术有限公司 Method and device for dynamically switching kafka clusters
CN106101212A (en) * 2016-06-08 2016-11-09 四川新环佳科技发展有限公司 Big data access method under cloud platform
KR20180086791A (en) * 2017-01-23 2018-08-01 한국전자통신연구원 Cloud system for surpporting big data process and managing method thereof
CN107391633A (en) * 2017-06-30 2017-11-24 北京奇虎科技有限公司 Data-base cluster Automatic Optimal processing method, device and server
CN108199912B (en) * 2017-12-15 2020-09-22 北京奇艺世纪科技有限公司 Method and device for managing and consuming distributed messages of multiple activities in different places
CN108768790A (en) * 2018-06-29 2018-11-06 北京奇虎科技有限公司 Distributed search cluster monitoring method and device, computing device, storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130204948A1 (en) * 2012-02-07 2013-08-08 Cloudera, Inc. Centralized configuration and monitoring of a distributed computing cluster
CN107463468A (en) * 2016-06-02 2017-12-12 北京京东尚科信息技术有限公司 Buffer memory management method and its equipment
CN107621973A (en) * 2016-07-13 2018-01-23 阿里巴巴集团控股有限公司 A kind of method for scheduling task and device across cluster
CN107491343A (en) * 2017-09-08 2017-12-19 中国电子科技集团公司第二十八研究所 A kind of across cluster resource scheduling system based on cloud computing

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114422567A (en) * 2021-12-09 2022-04-29 阿里巴巴(中国)有限公司 Data request processing method, device, system, computer equipment and medium
CN116431430A (en) * 2023-03-01 2023-07-14 合芯科技有限公司 System and method for monitoring and analyzing Zookeeper cluster

Also Published As

Publication number Publication date
CN111726388A (en) 2020-09-29
CA3176454A1 (en) 2020-10-01

Similar Documents

Publication Publication Date Title
WO2020192065A1 (en) Method for achieving cross-cluster high availability, apparatus, system, and device
US11249815B2 (en) Maintaining two-site configuration for workload availability between sites at unlimited distances for products and services
US10084858B2 (en) Managing continuous priority workload availability and general workload availability between sites at unlimited distances for products and services
US10983880B2 (en) Role designation in a high availability node
US9141491B2 (en) Highly available server system based on cloud computing
WO2017114213A1 (en) Method and apparatus for upgrading distributed storage system
US8930527B2 (en) High availability enabler
WO2019152122A1 (en) Systems and methods for performing computing cluster node switchover
US9659078B2 (en) System and method for supporting failover during synchronization between clusters in a distributed data grid
US10785350B2 (en) Heartbeat in failover cluster
US9984139B1 (en) Publish session framework for datastore operation records
US10075326B2 (en) Monitoring file system operations between a client computer and a file server
CN108304296A (en) A kind of server monitoring method, system, equipment and computer readable storage medium
US8958420B1 (en) Methods and apparatus for virtualizing switch control plane engine
CN113821168A (en) Shared storage migration system and method, electronic equipment and storage medium
CN105487946A (en) Fault computer automatic switching method and device
EP3701374A1 (en) A system and method for high-performance general-purpose parallel computing with fault tolerance and tail tolerance
TWI735521B (en) Method and device for upgrading distributed storage system
WO2016108945A1 (en) Cluster arbitration
US10049011B2 (en) Continuing operation of a quorum based system after failures
WO2017114170A1 (en) Control method and device for real-time calculation tasks, and real-time calculation system using same
CN114661522A (en) Storage system and disaster recovery method thereof, disaster recovery device, server and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19921105

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19921105

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 19921105

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 3176454

Country of ref document: CA