CN107018041B - Data migration method and device in cluster - Google Patents

Data migration method and device in cluster Download PDF

Info

Publication number
CN107018041B
CN107018041B CN201710210145.8A CN201710210145A CN107018041B CN 107018041 B CN107018041 B CN 107018041B CN 201710210145 A CN201710210145 A CN 201710210145A CN 107018041 B CN107018041 B CN 107018041B
Authority
CN
China
Prior art keywords
detection
cluster
node
subdomain
corosync
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710210145.8A
Other languages
Chinese (zh)
Other versions
CN107018041A (en
Inventor
郝滨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Dt Dream Technology Co Ltd
Original Assignee
Hangzhou Dt Dream Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Dt Dream Technology Co Ltd filed Critical Hangzhou Dt Dream Technology Co Ltd
Priority to CN201710210145.8A priority Critical patent/CN107018041B/en
Priority to CN201910252182.4A priority patent/CN109981412B/en
Publication of CN107018041A publication Critical patent/CN107018041A/en
Application granted granted Critical
Publication of CN107018041B publication Critical patent/CN107018041B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0668Management of faults, events, alarms or notifications using network fault recovery by dynamic selection of recovery network elements, e.g. replacement by the most appropriate element after failure
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/10Active monitoring, e.g. heartbeat, ping or trace-route
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • H04L67/563Data redirection of data network streams

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Cardiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Environmental & Geological Engineering (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The present invention proposes data migration method and device in a kind of cluster, and this method is applied on the OpenStack controller of cluster, and data migration method includes the detecting domains obtained in cluster in the cluster, and carries out multidomain treat-ment to detecting domains, obtains multiple detection subdomains;Determine the host node in each detection subdomain, and in the master node by Corosync software configuration;Based on the clustered node in each detection subdomain, heartbeat detection is carried out using Corosync software, obtains testing result, and will test result and be sent to OpenStack controller;The data in cluster are migrated according to testing result.It can effectively avoid cluster centers to jump the scale that detection is limited to cluster interior joint through the invention, reduce heartbeat detection time delay and its shared network bandwidth, effectively promote scheduling of resource efficiency in cluster.

Description

Data migration method and device in cluster
Technical field
The present invention relates to data migration method and devices in Internet technical field more particularly to a kind of cluster.
Background technique
Cluster refers to the server (referred to as node) of one group of collaborative work, they as a whole externally provide resource or Service.Each node is mutually communicated by internal lan in cluster, when a nodes break down, is run on the node Task will be migrated to another node, and to prevent single point failure, which can be referred to as HA (High Availability, high availability).OpenStack is the cloud computing management platform project of an open source, and provide the foundation facility Service the solution of (Infrastructure AS A Service, IAAS), wherein calculate node cluster is responsible for offer Calculate virtualization, the i.e. virtual machine of tenant.
In the related technology, the Heartbeat Agent run in each calculate node, such as open source software corosync, Heartbeat detection between responsible node, wherein host node connects OpenStack controller.If a certain nodes break down, host node Can by heartbeat detection software perceive there are the nodes of delay machine, the information that the node of delay machine can then will be present is sent to OpenStack controller.OpenStack controller upon receiving this information, will be on the delay machine node according to dispatching algorithm Task immigration is to other nodes, to guarantee that the virtual machine of all tenants in cluster is not influenced by node failure.
Under this mode, due to needing to carry out heartbeat detection and election host node between node by network interaction message, With the expansion of cluster scale, the interaction delay of heartbeat detection increases, and network bandwidth resources occupancy is excessive, and, corosync's Node specification is limited in 128, surveys even more small.
Summary of the invention
The present invention is directed to solve at least some of the technical problems in related technologies.
For this purpose, can effectively avoid cluster an object of the present invention is to provide data migration method in a kind of cluster Middle heartbeat detection is limited to the scale of cluster interior joint, reduces heartbeat detection time delay and its shared network bandwidth, effectively promotes collection Scheduling of resource efficiency in group.
It is another object of the present invention to propose data migration device in a kind of cluster.
It is another object of the present invention to propose a kind of non-transitorycomputer readable storage medium.
It is another object of the present invention to propose a kind of computer program product.
In order to achieve the above objectives, data migration method in the cluster that first aspect present invention embodiment proposes, the method Applied on the OpenStack controller of cluster, comprising: obtain the detecting domains in cluster, and carry out subregion to the detecting domains Processing, obtains multiple detection subdomains;Determine the host node in each detection subdomain, and by the Corosync software configuration In the host node;Based on the clustered node in each detection subdomain, heartbeat detection is carried out using Corosync software, is obtained Testing result, and the testing result is sent to the OpenStack controller;According to the testing result to the cluster In data migrated.
Data migration method in the cluster that first aspect present invention embodiment proposes, by obtaining the detecting domains in cluster, And multidomain treat-ment is carried out to detecting domains, multiple detection subdomains are obtained, by the clustered node being then based in each detection subdomain, are adopted Heartbeat detection is carried out with Corosync software, and the data in cluster are carried out according to testing result by OpenStack controller Migration can effectively avoid cluster centers from jumping the scale that detection is limited to cluster interior joint, reduce heartbeat detection time delay and its institute Network bandwidth is accounted for, scheduling of resource efficiency in cluster is effectively promoted.
In order to achieve the above objectives, data migration device in the cluster that second aspect of the present invention embodiment proposes, described device Applied on the OpenStack controller of cluster, comprising: module is obtained, for obtaining the detecting domains in cluster;Multidomain treat-ment mould Block obtains multiple detection subdomains for carrying out multidomain treat-ment to the detecting domains;First determining module, it is described every for determining Host node in a detection subdomain, and by the Corosync software configuration in the host node;Detection module, for being based on Clustered node in each detection subdomain carries out heartbeat detection using Corosync software, obtains testing result, and by the inspection It surveys result and is sent to the OpenStack controller;Transferring module, for according to the testing result to the number in the cluster According to being migrated.
Data migration device in the cluster that second aspect of the present invention embodiment proposes, by obtaining the detecting domains in cluster, And multidomain treat-ment is carried out to detecting domains, multiple detection subdomains are obtained, by the clustered node being then based in each detection subdomain, are adopted Heartbeat detection is carried out with Corosync software, and the data in cluster are carried out according to testing result by OpenStack controller Migration can effectively avoid cluster centers from jumping the scale that detection is limited to cluster interior joint, reduce heartbeat detection time delay and its institute Network bandwidth is accounted for, scheduling of resource efficiency in cluster is effectively promoted.
In order to achieve the above objectives, data migration device in the cluster that third aspect present invention embodiment proposes, feature exist In, comprising: processor;Memory for storage processor executable instruction;Wherein, the processor is configured to: obtain Detecting domains in cluster, and multidomain treat-ment is carried out to the detecting domains, obtain multiple detection subdomains;Determine each detection Host node in domain, and by the Corosync software configuration in the host node;Based on the cluster in each detection subdomain Node carries out heartbeat detection using Corosync software, obtains testing result, and the testing result is sent to described OpenStack controller;The data in the cluster are migrated according to the testing result.
Data migration device in the cluster that third aspect present invention embodiment proposes, by obtaining the detecting domains in cluster, And multidomain treat-ment is carried out to detecting domains, multiple detection subdomains are obtained, by the clustered node being then based in each detection subdomain, are adopted Heartbeat detection is carried out with Corosync software, and the data in cluster are carried out according to testing result by OpenStack controller Migration can effectively avoid cluster centers from jumping the scale that detection is limited to cluster interior joint, reduce heartbeat detection time delay and its institute Network bandwidth is accounted for, scheduling of resource efficiency in cluster is effectively promoted.
In order to achieve the above objectives, the non-transitorycomputer readable storage medium that fourth aspect present invention embodiment proposes, When the instruction in the storage medium is performed by the processor of mobile terminal, so that mobile terminal is able to carry out a kind of cluster Middle data migration method, the method are applied on the OpenStack controller of cluster, which comprises obtain in cluster Detecting domains, and to the detecting domains carry out multidomain treat-ment, obtain multiple detection subdomains;It determines in each detection subdomain Host node, and by the Corosync software configuration in the host node;Based on the clustered node in each detection subdomain, adopt Heartbeat detection is carried out with Corosync software, obtains testing result, and the testing result is sent to the OpenStack and is controlled Device processed;The data in the cluster are migrated according to the testing result.
The non-transitorycomputer readable storage medium that fourth aspect present invention embodiment proposes, by obtaining in cluster Detecting domains, and multidomain treat-ment is carried out to detecting domains, multiple detection subdomains are obtained, by the cluster being then based in each detection subdomain Node carries out heartbeat detection using Corosync software, and by OpenStack controller according to testing result to the number in cluster According to being migrated, cluster centers can effectively be avoided to jump the scale that detection is limited to cluster interior joint, reduce heartbeat detection time delay And its shared network bandwidth, effectively promote scheduling of resource efficiency in cluster.
In order to achieve the above objectives, the computer program product that fifth aspect present invention embodiment proposes, when the computer When instruction processing unit in program product executes, data migration method in a kind of cluster is executed, the method is applied to cluster On OpenStack controller, which comprises the detecting domains in cluster are obtained, and multidomain treat-ment is carried out to the detecting domains, Obtain multiple detection subdomains;Determine the host node in each detection subdomain, and by the Corosync software configuration in institute It states in host node;Based on the clustered node in each detection subdomain, heartbeat detection is carried out using Corosync software, is detected As a result, and the testing result is sent to the OpenStack controller;According to the testing result in the cluster Data are migrated.
The computer program product that fifth aspect present invention embodiment proposes, by obtaining the detecting domains in cluster, and it is right Detecting domains carry out multidomain treat-ment, obtain multiple detection subdomains, by the clustered node being then based in each detection subdomain, use Corosync software carries out heartbeat detection, and is moved according to testing result to the data in cluster by OpenStack controller It moves, can effectively avoid cluster centers from jumping detection and be limited to the scale of cluster interior joint, reduce heartbeat detection time delay and its shared Network bandwidth effectively promotes scheduling of resource efficiency in cluster.
The additional aspect of the present invention and advantage will be set forth in part in the description, and will partially become from the following description Obviously, or practice through the invention is recognized.
Detailed description of the invention
Above-mentioned and/or additional aspect and advantage of the invention will become from the following description of the accompanying drawings of embodiments Obviously and it is readily appreciated that, in which:
Fig. 1 is the flow diagram of data migration method in the cluster of one embodiment of the invention proposition;
Fig. 2 (a) is the schematic diagram of a scenario of data migration method in a cluster in the embodiment of the present invention;
Fig. 2 (b) is the schematic diagram of a scenario of data migration method in another cluster in the embodiment of the present invention;
Fig. 3 is the flow diagram of data migration method in the cluster of another embodiment of the present invention proposition;
Fig. 4 is the flow diagram of data migration method in the cluster of another embodiment of the present invention proposition;
Fig. 5 is the flow diagram of data migration method in the cluster of another embodiment of the present invention proposition;
Fig. 6 is the structural schematic diagram of data migration device in the cluster of one embodiment of the invention proposition;
Fig. 7 is the structural schematic diagram of data migration device in the cluster of another embodiment of the present invention proposition.
Specific embodiment
The embodiment of the present invention is described below in detail, examples of the embodiments are shown in the accompanying drawings, wherein from beginning to end Same or similar label indicates same or similar element or element with the same or similar functions.Below with reference to attached The embodiment of figure description is exemplary, and for explaining only the invention, and is not considered as limiting the invention.On the contrary, this The embodiment of invention includes all changes fallen within the scope of the spiritual and intension of attached claims, modification and is equal Object.
Fig. 1 is the flow diagram of data migration method in the cluster of one embodiment of the invention proposition.
The present embodiment is configured as in cluster illustrating in data migration device with data migration method in the cluster.
Data migration method is applied on the OpenStack controller of cluster in the cluster.
Referring to Fig. 1, data migration method includes: in the cluster
S11: the detecting domains in cluster are obtained, and multidomain treat-ment is carried out to detecting domains, obtain multiple detection subdomains.
The executing subject of the present embodiment is OpenStack controller.
In an embodiment of the present invention, detecting domains can be, for example, and be responsible for data in cluster in OpenStack controller to move The virtual logical space of shifting.Referring to fig. 2, Fig. 2 (a) is the scene signal of data migration method in a cluster in the embodiment of the present invention Figure, Fig. 2 (b) are the schematic diagram of a scenario of data migration method in another cluster in the embodiment of the present invention.In Fig. 2 (a), The virtual logical space for being responsible for Data Migration in cluster in OpenStack controller is divided into two detections subdomains, i.e. Fig. 2 (a) detection subdomain 1 in and detection subdomain 2 include multiple nodes (node server) in each detection subdomain, can be from every Host node is elected in a detection subdomain, and configures Corosync software on the primary node.In the related technology, Corosync software Heartbeat detection is carried out between whole node servers, and in the embodiment of the present invention, Corosync software is based only on detection Node server in subdomain carries out heartbeat detection, referring to fig. 2 (b), wherein Corosync software is based on right in detection subdomain 1 The heartbeat detection data frame of each node is monitored, and is interrupted in the heartbeat detection data frame transfer of detection egress 21, is determined section 21 delay machines of point, then the mark of the node 21 is sent to OpenStack controller by Corosync software, is controlled by OpenStack Device migrates the data in node 21, due to being monitored based on detection subdomain heartbeat detection data frame, thus, effectively Cluster centers are avoided to jump the scale that detection is limited to cluster interior joint.
The embodiment of the present invention is not on the basis of improving and optimizating heartbeat detection agreement, in OpenStack controller The virtual logical space of heartbeat detection between responsible node carries out multidomain treat-ment, obtains multiple detection subdomains, subsequent to be based on Node in each detection subdomain carries out heartbeat detection, since the number of nodes in detection subdomain is smaller, also, in multiple detections Between subdomain, the mutually isolated configuration of the process of heartbeat detection, and in each detection subdomain, node size is controlled, reduces heartbeat Detect time delay and its shared network bandwidth.
Further, in an embodiment of the present invention, the heart can be configured on the host node in each detection subdomain The Corosync software for jumping detection can be compatible with using Corosync software more general at present, both ensure that the network operation Efficiency, and take into account the realization for meeting customer service.
S12: the host node in each detection subdomain is determined, and in the master node by Corosync software configuration.
It is understood that being that one disposed in the cluster is used for according to the working principle of Corosync software On the node of calculating, and then the heartbeat detection data frame between each node is detected to realize the heartbeat detection of cluster interior joint.
In an embodiment of the present invention, multidomain treat-ment, after obtaining multiple detection subdomains, Ke Yi are being carried out to detecting domains The application of heartbeat detection class is configured in each detection subdomain, for example, it may be determined that the node for calculating in each detection subdomain, The node can be referred to as host node, on the primary node configure Corosync software, in turn, by the Corosync software with OpenStack controller is connected, and is controlled by OpenStack controller to the Corosync software in each detection subdomain System is realized and carries out heartbeat detection based on the clustered node in each detection subdomain.
In some embodiments, referring to Fig. 3, step S11 is specifically included:
S31: the detecting domains in cluster are obtained, and predetermined number is determined according to clustering performance.
Wherein, clustering performance can be, for example, network broadband performance, GPU performance etc..
Optionally, as clustered node scale constantly expands, the efficiency that cluster centers jump detection will exponentially decline, Therefore, predetermined number can be determined according to clustering performance, wherein predetermined number can by cluster management personnel depaly, alternatively, It can be preset by the factory program of cluster interior joint server, with no restriction to this.
S32: multidomain treat-ment is carried out to detecting domains based on predetermined number, obtains the detection subdomain of predetermined number.
Optionally, by determining predetermined number according to clustering performance, and detecting domains are carried out at subregion based on predetermined number Reason, obtains the detection subdomain of predetermined number, due to carrying out subregion to detecting domains based on clustering performance, wherein clustering performance is net Network broadband performance, GPU performance etc., therefore, can effective guarantee to detecting domains carry out multidomain treat-ment reasonability, reduce heartbeat inspection Survey time delay and its consumed cluster resource.
In some embodiments, referring to fig. 4, after step s 12, data migration method in the cluster further include:
S41: judge whether host node is in delay machine state using Corosync software.
In an embodiment of the present invention, the inspection of the heartbeat between node whether can be received using Corosync software detection Measured data frame, and then judge whether host node is in delay machine state, and judging result is sent to OpenStack controller, by OpenStack controller is according to judging result triggering following step.
S42: when host node is in delay machine state, again out of detection subdomain in addition to the host node in delay machine state Node in determine host node.
S43: by Corosync software configuration in the host node redefined.
In an embodiment of the present invention, OpenStack controller is in the judgement knot for receiving host node and being in delay machine state After fruit, host node can be determined from the node in detection subdomain in addition to the host node in delay machine state again, and The application of heartbeat detection class is configured on the host node redefined, has ensured the high availability of heartbeat detection process.
In the present embodiment, by OpenStack controller to the Corosync software on host node in each detection subdomain It is controlled, realizes and heartbeat detection is carried out based on the clustered node in each detection subdomain, by judge host node in delaying When machine state, host node can be determined from the node in detection subdomain in addition to the host node in delay machine state again, And the application of heartbeat detection class is configured on the host node redefined, ensure the high availability of heartbeat detection process.
S13: based on the clustered node in each detection subdomain, heartbeat detection is carried out using Corosync software, is examined It surveys as a result, and will test result and be sent to OpenStack controller.
In an embodiment of the present invention, it can be adopted by the Corosync software in the host node in each detection subdomain Heartbeat detection data frame in collection detection subdomain between multiple nodes, to carry out heartbeat detection according to heartbeat detection data frame.
For example, Corosync software in host node can be periodically, or constantly multiple in acquisition testing subdomain Each node in node, the heartbeat detection data frame between host node, then, in a certain subnormal collection process, if The heartbeat detection data frame between some node and host node is not collected, then may determine that the node is in delay machine state, into And it will test result and be sent to OpenStack controller.
Heartbeat detection is carried out according to heartbeat detection data frame by Corosync software, is realized easy.
S14: the data in cluster are migrated according to testing result.
For example, Corosync software can obtain the mark of the node when determining some node for delay machine state in real time Know information, and the identification information is sent to OpenStack controller, preset schedule algorithm is called by OpenStack controller Data in the node are migrated, the data on delay machine node are scheduled by being then based in detection subdomain, and examines It is controllable to survey subdomain interior nodes scale, thus, it is possible to effectively promote scheduling of resource efficiency in cluster.
In the present embodiment, multidomain treat-ment is carried out by obtaining the detecting domains in cluster, and to detecting domains, obtains multiple detections Subdomain, by the clustered node being then based in each detection subdomain, using Corosync software progress heartbeat detection, and by OpenStack controller according to testing result migrates the data in cluster, and cluster centers can effectively be avoided to jump detection It is limited to the scale of cluster interior joint, heartbeat detection time delay and its shared network bandwidth is reduced, effectively promotes resource tune in cluster Spend efficiency.
Fig. 5 is the flow diagram of data migration method in the cluster of another embodiment of the present invention proposition.
The present embodiment is configured as in cluster illustrating in data migration device with data migration method in the cluster.
S501: the detecting domains in cluster are obtained, and predetermined number is determined according to clustering performance.
Wherein, clustering performance can be, for example, network broadband performance, GPU performance of cluster interior joint server etc..
For example, if there are 320 node servers in cluster, if not being responsible in cluster in OpenStack controller The virtual logical space of Data Migration carries out subregion, then OpenStack controller needs to carry out based on 320 node servers In Data Migration, network congestion is easily caused, and at this point it is possible to predefining initial predetermined number is 10, and then by the void Quasi- logical space is divided into 10 detection subdomains, and detects in each detection subdomain whether can have network congestion phenomenon, if not There are network congestion phenomenons, then can trigger S502 according to the predetermined number 10.
S502: multidomain treat-ment is carried out to detecting domains based on predetermined number, obtains the detection subdomain of predetermined number.
S503: the host node in each detection subdomain is determined, and in the master node by Corosync software configuration.
S504;Judge whether host node is in delay machine state using Corosync software, is otherwise held if executing S505 Row S506.
S505: determining host node from the node in detection subdomain in addition to the host node in delay machine state again, By Corosync software configuration in the host node redefined.
S506: based on the clustered node in each detection subdomain, heartbeat detection is carried out using Corosync software, is examined It surveys as a result, and will test result and be sent to OpenStack controller.
It is alternatively possible to by the Corosync software in the host node in each detection subdomain, in acquisition testing subdomain Heartbeat detection data frame between multiple nodes, to carry out heartbeat detection according to heartbeat detection data frame.
S507: according to testing result with the presence or absence of the node in delay machine state in judgement detection subdomain, if so, executing Otherwise S508 executes S510.
The implementation procedure of S502~S507 may refer to above-described embodiment, and therefore not to repeat here.
S508: preset schedule algorithm is called.
In an embodiment of the present invention, preset schedule algorithm be it is pre-set, the preset schedule algorithm can be used for by On clustered node in Data Migration to detection subdomain in delay machine node in addition to the node in delay machine state, preset algorithm It can be called by OpenStack controller.
S509: the section in delay machine state will be removed in the Data Migration in node to detection subdomain according to preset schedule algorithm On clustered node except point.
Optionally, by call directly preset schedule algorithm by the Data Migration in node to detection subdomain except in delaying On clustered node except the node of machine state, the instantaneity of scheduling of resource can be realized.
S510: with no treatment.
In the present embodiment, by the basis of not improved and optimizated to heartbeat detection agreement, to OpenStack controller The virtual logical space of heartbeat detection between middle responsible node carries out multidomain treat-ment, obtains multiple detection subdomains, it is subsequent can be with base Node in each detection subdomain carries out heartbeat detection, since the number of nodes in detection subdomain is smaller, also, in multiple inspections It surveys between subdomain, the mutually isolated configuration of the process of heartbeat detection, and in each detection subdomain, node size is controlled, Neng Gouyou Effect avoids cluster centers from jumping the scale that detection is limited to cluster interior joint, reduces heartbeat detection time delay and its shared network bandwidth. It can be configured to the Corosync software of heartbeat detection on the host node in each detection subdomain, can be compatible with using current More general Corosync software, not only ensure that the efficiency of the network operation, but also take into account the realization for meeting customer service.Pass through root Predetermined number is determined according to clustering performance, and multidomain treat-ment is carried out to detecting domains based on predetermined number, obtains the detection of predetermined number Subdomain, due to carrying out subregion to detecting domains based on clustering performance, wherein clustering performance is network broadband performance, GPU performance etc., Therefore, can effective guarantee detecting domains are carried out with the reasonability of multidomain treat-ment, reduce heartbeat detection time delay and its consumed collection Group's resource.The Corosync software on host node in each detection subdomain is controlled by OpenStack controller, is realized Heartbeat detection is carried out based on the clustered node in each detection subdomain, it, can be with by when judging that host node is in delay machine state Again host node is determined from the node in detection subdomain in addition to the host node in delay machine state, and redefining The application of heartbeat detection class is configured on host node, has ensured the high availability of heartbeat detection process.By obtaining the detection in cluster Domain, and multidomain treat-ment is carried out to detecting domains, multiple detection subdomains are obtained, by the cluster section being then based in each detection subdomain Point carries out heartbeat detection using Corosync software, and by OpenStack controller according to testing result to the data in cluster It is migrated, cluster centers can effectively be avoided to jump the scale that detection is limited to cluster interior joint, extended to when reducing heartbeat detection Its shared network bandwidth effectively promotes scheduling of resource efficiency in cluster.
Fig. 6 is the structural schematic diagram of data migration device in the cluster of one embodiment of the invention proposition.
Data migration device 600 being implemented in combination with by software, hardware or both in the cluster.
Referring to Fig. 6, data migration device 600 includes: to obtain module 601, multidomain treat-ment module 602, first in the cluster Determining module 603, detection module 604 and transferring module 605, wherein
Module 601 is obtained, for obtaining the detecting domains in cluster.
Multidomain treat-ment module 602 obtains multiple detection subdomains for carrying out multidomain treat-ment to detecting domains.
Optionally, in some embodiments, multidomain treat-ment module 602 is specifically included:
Submodule 6021 is determined, for determining predetermined number according to clustering performance.
Submodule 6022 is handled, for carrying out multidomain treat-ment to detecting domains based on predetermined number, obtains the inspection of predetermined number Survey subdomain.
First determining module 603 for determining the host node in each detection subdomain, and Corosync software configuration is existed In host node.
Detection module 604, for carrying out heartbeat using Corosync software based on the clustered node in each detection subdomain Detection, obtains testing result, and will test result and be sent to OpenStack controller.
Optionally, in some embodiments, detection module 604 is specifically used for:
By the Corosync software in the host node in each detection subdomain, in acquisition testing subdomain between multiple nodes Heartbeat detection data frame, with according to heartbeat detection data frame carry out heartbeat detection.
Transferring module 605, for being migrated according to testing result to the data in cluster.
Optionally, in some embodiments, referring to Fig. 7, transferring module 605 is specifically included:
Judging submodule 6051, for judging to detect according to testing result in subdomain with the presence or absence of the section in delay machine state Point.
Submodule 6052 is called, for calling preset schedule algorithm when there is the node for being in delay machine state.
Submodule 6053 is migrated, for according to preset schedule algorithm place will to be removed in the Data Migration in node to detection subdomain On clustered node except the node of delay machine state.
Optionally, in some embodiments, referring to Fig. 7, data migration device 600 includes: in the cluster
Judgment module 606, for judging whether host node is in delay machine state using Corosync software.
Second determining module 607, for being removed out of detection subdomain again and being in delay machine when host node is in delay machine state Host node is determined in node except the host node of state.
Configuration module 608, for by Corosync software configuration in the host node redefined.
It should be noted that the explanation in earlier figures 1- Fig. 5 embodiment to data migration method embodiment in cluster Data migration device 600 in the cluster of the embodiment are also applied for, realization principle is similar, and details are not described herein again.
In the present embodiment, multidomain treat-ment is carried out by obtaining the detecting domains in cluster, and to detecting domains, obtains multiple detections Subdomain, by the clustered node being then based in each detection subdomain, using Corosync software progress heartbeat detection, and by OpenStack controller according to testing result migrates the data in cluster, and cluster centers can effectively be avoided to jump detection It is limited to the scale of cluster interior joint, heartbeat detection time delay and its shared network bandwidth is reduced, effectively promotes resource tune in cluster Spend efficiency.
It should be noted that in the description of the present invention, term " first ", " second " etc. are used for description purposes only, without It can be interpreted as indication or suggestion relative importance.In addition, in the description of the present invention, unless otherwise indicated, the meaning of " multiple " It is two or more.
Any process described otherwise above or method description are construed as in flow chart or herein, and expression includes It is one or more for realizing specific logical function or process the step of executable instruction code module, segment or portion Point, and the range of the preferred embodiment of the present invention includes other realization, wherein can not press shown or discussed suitable Sequence, including according to related function by it is basic simultaneously in the way of or in the opposite order, Lai Zhihang function, this should be of the invention Embodiment person of ordinary skill in the field understood.
It should be appreciated that each section of the invention can be realized with hardware, software, firmware or their combination.Above-mentioned In embodiment, software that multiple steps or method can be executed in memory and by suitable instruction execution system with storage Or firmware is realized.It, and in another embodiment, can be under well known in the art for example, if realized with hardware Any one of column technology or their combination are realized: having a logic gates for realizing logic function to data-signal Discrete logic, with suitable combinational logic gate circuit specific integrated circuit, programmable gate array (PGA), scene Programmable gate array (FPGA) etc..
Those skilled in the art are understood that realize all or part of step that above-described embodiment method carries It suddenly is that relevant hardware can be instructed to complete by program, the program can store in a kind of computer-readable storage medium In matter, which when being executed, includes the steps that one or a combination set of embodiment of the method.
It, can also be in addition, each functional unit in each embodiment of the present invention can integrate in a processing module It is that each unit physically exists alone, can also be integrated in two or more units in a module.Above-mentioned integrated mould Block both can take the form of hardware realization, can also be realized in the form of software function module.The integrated module is such as Fruit is realized and when sold or used as an independent product in the form of software function module, also can store in a computer In read/write memory medium.
Storage medium mentioned above can be read-only memory, disk or CD etc..
In the description of this specification, reference term " one embodiment ", " some embodiments ", " example ", " specifically show The description of example " or " some examples " etc. means specific features, structure, material or spy described in conjunction with this embodiment or example Point is included at least one embodiment or example of the invention.In the present specification, schematic expression of the above terms are not Centainly refer to identical embodiment or example.Moreover, particular features, structures, materials, or characteristics described can be any One or more embodiment or examples in can be combined in any suitable manner.
Although the embodiments of the present invention has been shown and described above, it is to be understood that above-described embodiment is example Property, it is not considered as limiting the invention, those skilled in the art within the scope of the invention can be to above-mentioned Embodiment is changed, modifies, replacement and variant.

Claims (8)

1. data migration method in a kind of cluster, which is characterized in that the method is applied to the OpenStack controller of cluster On, comprising the following steps:
The detecting domains in cluster are obtained, and multidomain treat-ment is carried out to the detecting domains, obtain multiple detection subdomains;
Determine the host node in each detection subdomain, and by Corosync software configuration in the host node;
Based on the clustered node in each detection subdomain, heartbeat detection is carried out using Corosync software, obtains testing result, and The testing result is sent to the OpenStack controller;
The data in the cluster are migrated according to the testing result;
Wherein, described that multidomain treat-ment is carried out to the detecting domains, obtain multiple detection subdomains, comprising:
Predetermined number is determined according to clustering performance;
Pre- multidomain treat-ment is carried out to the detecting domains based on the predetermined number;
Judge in the corresponding each detection subdomain of the pre- multidomain treat-ment with the presence or absence of network blockage phenomenon;
Network blockage phenomenon if it does not exist then carries out multidomain treat-ment to the detecting domains based on the predetermined number, obtains described The detection subdomain of predetermined number.
2. data migration method in cluster as described in claim 1, which is characterized in that described based in each detection subdomain Clustered node carries out heartbeat detection using Corosync software, comprising:
By the Corosync software in the host node in each detection subdomain, multiple nodes in the detection subdomain are acquired Between heartbeat detection data frame, with according to the heartbeat detection data frame carry out heartbeat detection.
3. data migration method in cluster as described in claim 1, which is characterized in that it is described according to the testing result to institute The data stated in cluster are migrated, comprising:
Judged in the detection subdomain according to the testing result with the presence or absence of the node in delay machine state;
When there is the node for being in the delay machine state, preset schedule algorithm is called;
According to the preset schedule algorithm delay machine will be in except described in the Data Migration in the node to the detection subdomain On clustered node except the node of state.
4. data migration method in cluster as described in claim 1, which is characterized in that determination each detection subdomain Interior host node, and by the Corosync software configuration in the host node after, further includes:
Judge whether the host node is in delay machine state using the Corosync software;
When the host node is in the delay machine state, again except the host node in delay machine state out of described detection subdomain Except node in determine host node;
By the Corosync software configuration in the host node redefined.
5. data migration device in a kind of cluster, which is characterized in that described device is applied to the OpenStack controller of cluster On, comprising:
Module is obtained, for obtaining the detecting domains in cluster;
Multidomain treat-ment module obtains multiple detection subdomains for carrying out multidomain treat-ment to the detecting domains;
First determining module, for determining the host node in each detection subdomain, and by Corosync software configuration in institute It states in host node;
Detection module, for carrying out heartbeat detection using Corosync software, obtaining based on the clustered node in each detection subdomain The OpenStack controller is sent to testing result, and by the testing result;
Transferring module, for being migrated according to the testing result to the data in the cluster;
Wherein, the multidomain treat-ment module specifically includes:
Submodule is determined, for determining predetermined number according to clustering performance;
Submodule is pre-processed, for carrying out pre- multidomain treat-ment to the detecting domains based on the predetermined number;
First judging submodule, for judging in the corresponding each detection subdomain of the pre- multidomain treat-ment with the presence or absence of network Clogging;
Submodule is handled, for being based on described default when first judging submodule determines and network blockage phenomenon is not present Number carries out multidomain treat-ment to the detecting domains, obtains the detection subdomain of the predetermined number.
6. data migration device in cluster as claimed in claim 5, which is characterized in that the detection module is specifically used for:
By the Corosync software in the host node in each detection subdomain, multiple nodes in the detection subdomain are acquired Between heartbeat detection data frame, with according to the heartbeat detection data frame carry out heartbeat detection.
7. data migration device in cluster as claimed in claim 5, which is characterized in that the transferring module specifically includes:
Second judgment submodule, for being judged in the detection subdomain according to the testing result with the presence or absence of in delay machine state Node;
Submodule is called, for calling preset schedule algorithm when there is the node for being in the delay machine state;
Submodule is migrated, for will be in the Data Migration in the node to the detection subdomain according to the preset schedule algorithm Except it is described in the node of delay machine state in addition to clustered node on.
8. data migration device in cluster as claimed in claim 5, which is characterized in that further include:
Judgment module, for judging whether the host node is in delay machine state using the Corosync software;
Second determining module, for from when the host node is in the delay machine state, being removed out of described detection subdomain again Host node is determined in node except the host node of delay machine state;
Configuration module, for by the Corosync software configuration in the host node redefined.
CN201710210145.8A 2017-03-31 2017-03-31 Data migration method and device in cluster Active CN107018041B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201710210145.8A CN107018041B (en) 2017-03-31 2017-03-31 Data migration method and device in cluster
CN201910252182.4A CN109981412B (en) 2017-03-31 2017-03-31 Data migration method and device in cluster and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710210145.8A CN107018041B (en) 2017-03-31 2017-03-31 Data migration method and device in cluster

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN201910252182.4A Division CN109981412B (en) 2017-03-31 2017-03-31 Data migration method and device in cluster and storage medium

Publications (2)

Publication Number Publication Date
CN107018041A CN107018041A (en) 2017-08-04
CN107018041B true CN107018041B (en) 2019-05-17

Family

ID=59446531

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201710210145.8A Active CN107018041B (en) 2017-03-31 2017-03-31 Data migration method and device in cluster
CN201910252182.4A Active CN109981412B (en) 2017-03-31 2017-03-31 Data migration method and device in cluster and storage medium

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201910252182.4A Active CN109981412B (en) 2017-03-31 2017-03-31 Data migration method and device in cluster and storage medium

Country Status (1)

Country Link
CN (2) CN107018041B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107612787B (en) * 2017-11-06 2021-01-12 南京易捷思达软件科技有限公司 Cloud host fault detection method based on Openstack open source cloud platform
CN108092857A (en) * 2018-01-15 2018-05-29 郑州云海信息技术有限公司 A kind of distributed system heartbeat detecting method and relevant apparatus
CN108924244B (en) * 2018-07-24 2022-02-25 阿里巴巴(中国)有限公司 Distributed system and flow distribution method and device for same
CN110460489A (en) * 2019-07-02 2019-11-15 北京云迹科技有限公司 Industrial personal computer heartbeat monitor method and system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102355369A (en) * 2011-09-27 2012-02-15 华为技术有限公司 Virtual clustered system as well as processing method and processing device thereof
CN103942128A (en) * 2014-04-29 2014-07-23 浪潮电子信息产业股份有限公司 Double-computer reinforcing method for high-performance job scheduling management node
CN104935672A (en) * 2015-06-29 2015-09-23 杭州华三通信技术有限公司 High available realizing method and equipment of load balancing service

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7925923B1 (en) * 2008-01-31 2011-04-12 Hewlett-Packard Development Company, L.P. Migrating a virtual machine in response to failure of an instruction to execute
JP5190084B2 (en) * 2010-03-30 2013-04-24 株式会社日立製作所 Virtual machine migration method and system
CN102629906A (en) * 2012-03-30 2012-08-08 浪潮电子信息产业股份有限公司 Design method for improving cluster business availability by using cluster management node as two computers
CN105897507B (en) * 2016-03-31 2019-09-17 杭州数梦工场科技有限公司 The condition detection method and device of node device
CN106161090A (en) * 2016-07-12 2016-11-23 许继集团有限公司 The monitoring method of a kind of subregion group system and device
CN106445641B (en) * 2016-11-02 2020-11-06 深圳前海生生科技有限公司 Data migration method between secure virtual platforms on discrete computing nodes

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102355369A (en) * 2011-09-27 2012-02-15 华为技术有限公司 Virtual clustered system as well as processing method and processing device thereof
CN103942128A (en) * 2014-04-29 2014-07-23 浪潮电子信息产业股份有限公司 Double-computer reinforcing method for high-performance job scheduling management node
CN104935672A (en) * 2015-06-29 2015-09-23 杭州华三通信技术有限公司 High available realizing method and equipment of load balancing service

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
一种基于云计算的高可用性集群设计与实现;张龙兴等;《西安文理学院学报:自然科学版》;20140731;第17卷(第3期);9-13页

Also Published As

Publication number Publication date
CN109981412A (en) 2019-07-05
CN109981412B (en) 2020-11-17
CN107018041A (en) 2017-08-04

Similar Documents

Publication Publication Date Title
CN107018041B (en) Data migration method and device in cluster
CN108256002B (en) Cross-machine-room data synchronization method, device, system and server
EP2614436B1 (en) Controlled automatic healing of data-center services
US10715643B2 (en) Systems and/or methods for intelligent and resilient failover for cloud computing environments
US20080295095A1 (en) Method of monitoring performance of virtual computer and apparatus using the method
CN110865867A (en) Method, device and system for discovering application topological relation
US11190431B2 (en) Prioritized client-server communications based on server health
JP2016119064A (en) End-to-end datacenter performance control
KR20170046721A (en) Energy-saving control method, management server and network device
CN109697120B (en) Method for application migration and electronic device
JP2006338543A (en) Monitoring system and monitoring method
CN110413585B (en) Log processing device, method, electronic device, and computer-readable storage medium
CN104270322A (en) Self-adaptive load balance scheduling mechanism for internet-of-things device access processing platform
WO2023024410A1 (en) Task scheduling method and apparatus, device, storage medium, computer program product, and computer program
US9634885B2 (en) Control target flow specifying method and control target flow specifying device
WO2017075989A1 (en) Method, device and system for virtual machines migration
US10656985B2 (en) Heartbeat failure detection
US20150127816A1 (en) Datacenter health analysis using dns switching
US20190379577A1 (en) Automatic monitoring, correlation, and resolution of network alarm conditions
US20230094964A1 (en) Dynamic management of locations of modules of a platform hosted by a distributed system
CN109039795A (en) A kind of Cloud Server resource monitoring method and system
CN103827831A (en) Live module diagnostic testing
CN107562519A (en) Moving method, system and the server of virtual machine
CN110231981A (en) Service calling method and device
CN110224880A (en) A kind of heartbeat inspecting method and monitoring device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant