AU2018100381A4 - A physical resource scheduling method in cloud cluster - Google Patents
A physical resource scheduling method in cloud cluster Download PDFInfo
- Publication number
- AU2018100381A4 AU2018100381A4 AU2018100381A AU2018100381A AU2018100381A4 AU 2018100381 A4 AU2018100381 A4 AU 2018100381A4 AU 2018100381 A AU2018100381 A AU 2018100381A AU 2018100381 A AU2018100381 A AU 2018100381A AU 2018100381 A4 AU2018100381 A4 AU 2018100381A4
- Authority
- AU
- Australia
- Prior art keywords
- node
- cloud
- cpu
- task
- network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Abstract
Abstract for 2018100381 A cloud task scheduling algorithm and efficient physical resource scheduling method to improve the utilization of resources and enhance the reliability of their application, which includes the following steps: 1. When a new task arrives at the resource pool, the type and priority of the task must be determined; 2. Gather each node's resource utilization information in the cloud cluster, where the resources include CPU, memory, network and storage; 3. Calculate the score of each node; 4. Obtain a queue by sorting the scores of nodes in the cloud cluster; 5. Assign the pods required by the task to the optimum nodes. Created by Examiner 9/4/2018 Page 1 Task arriving aAnalysis module Computing Parameters m The configuration resource scheduling Scheduling system Resource decision information Collector Node 1 Node 2 Node 3 NodeN CPU1 CPU 2 CPU 3 CPUN The MEM1 MEM2 MEM3 ' MEMN cloud NET NET2 NET NETN custer DISK1 DISK 2 DISK3 DISKN Figure 1
Description
A physical resource scheduling method in cloud cluster
1 TECHNICAL FIELD
The present invention relates to the cloud computing, more specifically, it’s related to a cloud task scheduling algorithm.
2 BACKGROUND
With the development of cloud computing, lots of large-scale enterprises, research institutions, and governments have established their respective data center to provide the storage or/and computing services to public. Due to the large scale of resources, different types of service requirement, and the complexity of applications, it is significant important to develop a efficient physical resource scheduling method to improve the utilization of resources and enhance the reliability of application.
Kubcrnctcs is a portable, extensible opcn-sourcc platform for managing containerized workloads and services, that facilitates both declarative configuration and automation. It has a large, rapidly growing ecosystem. Kubcrnctcs services, support, and tools arc widely available.
But, the resource scheduling methods built in the Kubcrnctcs is quit simple, and can not meet the diverse need of users. At present, the main scheduling methods of Kubcrnctcs includes two kinds of methods: Predicates and Priorities. However, there exit some disadvantages to those approaches. Firstly, the scoring method of CPU and memory resources can not be applied in heterogeneous cluster. Secondly, Only the resources of CPU and memory arc considered whereas the resources of network and storage arc not involved. Thirdly, due to scheduling one single Pod each time, the consistency of several Pods in the same Service can not be guaranteed.
3 DESCRIPTION
The scheduling method proposed in this invention includes following steps: Step 1. When a new task arriving at the resource pool, the type T and priority P of this task must be determined, where T € {1, 2, 3,..., Tmax} and P € {1,2,3,..., Fmax}. More specifically, the type T can be determined through a /v-mcans clustering algorithm based on the resource requirements of task. And the priority P can also be determined through the A'-rricans clustering algorithm based on the request URL.
Step 2. Gather each node’s resource utilization information in the cloud cluster, where the resources include CPU, memory, network and storage.
Step 3. Calculate the score of each node as following, SJotai = wcpuS1cpv-\-wmemorySmemory + wnetWorkSnetwork+wciiSi~S^isli, where Stotal is the total score of node i, and SlCPU, Slmemory, Slnetwork, S‘llsk denote the resource scores of node i in CPU, memory, network and storage respectively, and wepu, wmemory, Wnetwork, Wdisk denote the weight of corresponding indexes. Moreover, wcpu, Wmemory, ^network, U7disk UTC pOSltlVC Constant, and \ Wrr,(-rn.ory | WrI(-f/work I 'didisk = 1-
In a hybrid cloud consisting of heterogeneous nodes, it is highly possible there arc variety of types of CPUs, memory items, network cards and disks. Although there arc so many indicators that can infect the performance of CPU, here we take the dominant frequency as its Key Performance Indicator (KPI) for simplify. Similarly, we take the capacity, bandwidth, and size as the KPIs of memory items, network cards and disks respectively.
Furthermore, we set the score of the node of which the CPU has the best performance in the hybrid cloud as 1 when its CPU is completely unoccupied. Then for any node i in this hybrid cloud, we have SlCPU = (1 — RjCPUi) ^ where f_CPUmax represents the highest dominant frequency of CPU in cloud, f-CPUi is the CPU frequency of node i, and R.CPU,, (0 < R.CPU, < 100%) is its utilization rates. Obviously, for any node i, 0 < SlCPU < 1, and the less utilization and the higher performance lead a higher score.
Similarly, we set the score of the node which has the maximum size of memory in the cloud as 1 when its memory is completely unoccupied. Then for any node i, we have Slmemory = (1 - R_memoryt) whcrc fjmemorymax represents the maximum capability of memory of one node in the cloud, fjmemoryi is the memory capability of node i, and Rjmemoryi (0 < Rjmemoryi < 100%) is its utilization rates. Obviously, for any node i, 0 < Φ <1 yj — ^memory — -*·
For the network resource score of node i, we similarly have Slnetwork = (1 — Rjnetworki) ’kki , where fjnetworkmax represents the maximum bandwidth of network cards in the cloud, f jaetworki is the network cards bandwidth of node i, and R^etworki (0 < Rjnetworki < 100%) is its utilization rates. Obviously, for any node i, 0 < Slnetwork < 1.
And for the storage resource score, we have Sldisk = (1 — Rjdisj) ^^fsikki where /_di.s/cmax represents the maximum storage capability of one node in the cloud, f jdiski is the disk storage capability of node i, and Rjdiski (0 < Rjdiski < 100%) is its utilization rates. Obviously, for any node i, 0 < Sldisk < 1.
Otherwise, different tasks commonly require different resources. For example, the compute-intensive task requires higher performance in computing resources such as CPU and memory, whereas data-transmission task requires higher performance in network resources. Hence, we initially define different combines of weights for different types of tasks. And the user also can dynamically changes the weights according the actual situation.
Step 4. Obtain a queue Q[A] by sorting the scores S\otrd of all N nodes in the cloud cluster, where Q[l] > Q[2] > Q[3] > ... > Q[JV].
Step 5. Assuming that the new task require m Pods in Kubcrnctcs, then they must be assigned to the m optimum nodes from Q[7V], where m « N. For a given priority P of task, Cp is define as its corresponding the consistency threshold. Then the m nodes can be selected through following several substeps. Substep 5.1. Set i = 1.
Substep 5.2. If z > N — m + 1, return empty and end the process there, else, choose the first m scores of Q[AT] from Q[i], i.c., Q[i], Q\i + 1], Q[i + 2], ..., Q[i + m- 1],
Substep 5.3. Calculate the consistency of the chosen m scores which is defined as their statistical variance, i.c., σ = ^ 1 {Q\j\ ~ Qi)2, where
Ql = ijyjL7^1Q[j}·
Substep 5.4. Compare σ with Cp. If sigma > Cp, set i i |1, and repeat the Substep 5.2-5.4, else, obtain the value of i and end this process.
As a result, we obtain the m nodes, i.c., node i, node i + 1,..., node i + rn — 1.
4 FIGURES
Figure 1 is the system framework of the proposed physical resource scheduling method.
Figure 2 briefly displays the proposed physical resource scheduling .
Claims (5)
1 .The scheduling method proposed in this invention includes following steps: Step 1. When a new task arriving at the resource pool, the type T and priority P of this task must be determined, where T € {1, 2,3,..., Tmax} and P € {1,2,3,..., Pmax}. Step
2. Gather each node’s resource utilization information in the cloud cluster, where the resources include CPU, memory, network and storage. Step
3. Calculate the score of each node as following, Sltotal = wcPljScpu I wmemorySmf,rnory-\-WnetworkSnetwor]t-\-W(iis]iSdisk, where Stotai IS the total SCOrC of node i, and SlCPU, Slmemory, Slnetwork, Sldisk denote the resource scores of node i in CPU, memory, network and storage respectively, and wepu, wmemory > wnetworki wchst, denote the weight of corresponding indexes. Moreover, w(:pij, Wmemory s mnetwork> Wdisk arc positive Constant, a,H(l W(jj>fj I I 'd]r)ff/woikp ^disk = 1 · Furthermore, we set the score of the node of which the CPU has the best performance in the hybrid cloud as 1 when its CPU is completely unoccupied. Then for any node i in this hybrid cloud, we have SlCPU = (1 — RjCPUi) j where /-CPUmax represents the highest dominant frequency of CPU in cloud, fJOPUi is the CPU frequency of node i, and R_CPUi (0 < R.CPUi < 100%) is its utilization rates. Obviously, for any node i, 0 < SlCPU < 1, and the less utilization and the higher performance lead a higher score. Similarly,Slmemory = (1 - Rjmemoryi) where fjnemorymax rep resents the maximum capability of memory of one node in the cloud, fjmemoryi is the memory capability of node i, and Rjmemoryi (0 < Rjmemoryi < 100%) is its utilization rates. Obviously, for any node i, 0 < Slmemory < 1. ^network (1 Rjnetworki) f[~eti™°knlKX ’ whcrc /.network^ represents the maximum bandwidth of network cards in the cloud, f jnetworki is the network cards bandwidth of node i, and R_networkt (0 < Rjnetworki < 100%) is its utilization rates. Obviously, for any node *, 0< $ network — Sdisk = (1 — R_disi) ^ ^dfgk^x whcrc /_flhsfcmax represents the maximum storage capability of one node in the cloud, f_diski is the disk storage capability of node i, and Rjdiski (0 < R,jdiskt < 100%) is its utilization rates. Step
4. Obtain a queue 0[Ar] by sorting the scores S\otal of all N nodes in the cloud cluster, whcrc Q[ 1] > Q\ 2] > Q[ 3] > ··· > Qm· Step
5. Assuming that the new task require m Pods in Kubcrnctcs, then they must be assigned to the m optimum nodes from Q[/V], whcrc m « N. For a given priority P of task, Cp is define as its corresponding the consistency threshold. Then the m nodes can be selected through following several substeps. Substep 5.1. Set i = 1. Substep 5.2. If i > N — m + 1, return empty and end the process there, else, choose the first m scores of Q[IV] from Q[i], i.c., Q[i\, Q[i+ 1], Q[i + 2], ..., Q[i + m - 1]. Substep 5.3. Calculate the consistency of the chosen m scores which is defined as their statistical variance, i.c., σ = ^ (Qbl Q*)2j where Qi = Substep 5.4. Compare σ with Cp. If sigma > Cp, set i = i + 1, and repeat the Substep 5.2-5.4, else, obtain the value of i and end this process. As a result, we obtain the m nodes, i.c., node i, node * +1,..., node i + m — l.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2018100381A AU2018100381A4 (en) | 2018-03-27 | 2018-03-27 | A physical resource scheduling method in cloud cluster |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2018100381A AU2018100381A4 (en) | 2018-03-27 | 2018-03-27 | A physical resource scheduling method in cloud cluster |
Publications (1)
Publication Number | Publication Date |
---|---|
AU2018100381A4 true AU2018100381A4 (en) | 2018-05-10 |
Family
ID=62089525
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2018100381A Ceased AU2018100381A4 (en) | 2018-03-27 | 2018-03-27 | A physical resource scheduling method in cloud cluster |
Country Status (1)
Country | Link |
---|---|
AU (1) | AU2018100381A4 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109471727A (en) * | 2018-10-29 | 2019-03-15 | 北京金山云网络技术有限公司 | A kind of task processing method, apparatus and system |
CN109799805A (en) * | 2019-01-17 | 2019-05-24 | 湖南大学 | A kind of high-performing car electronic schedule algorithm of reliability perception |
CN110502321A (en) * | 2019-07-11 | 2019-11-26 | 新华三大数据技术有限公司 | A kind of resource regulating method and system |
WO2020000944A1 (en) * | 2018-06-25 | 2020-01-02 | 星环信息科技(上海)有限公司 | Preemptive scheduling based resource sharing use method, system and |
US10601905B2 (en) | 2018-07-20 | 2020-03-24 | Red Hat Israel, Ltd. | Priority switching based on resource usage patterns |
CN111131440A (en) * | 2019-12-20 | 2020-05-08 | 腾讯科技(深圳)有限公司 | Resource scheduling method and device and server |
WO2021120550A1 (en) * | 2019-12-19 | 2021-06-24 | Huawei Technologies Co., Ltd. | Methods and apparatus for resource scheduling of resource nodes of a computing cluster or a cloud computing platform |
US11169856B2 (en) | 2019-01-31 | 2021-11-09 | Hewlett Packard Enterprise Development Lp | Container management |
CN114090220A (en) * | 2022-01-13 | 2022-02-25 | 南京信息工程大学 | Hierarchical CPU and memory resource scheduling method |
CN114356531A (en) * | 2022-01-12 | 2022-04-15 | 重庆邮电大学 | Edge calculation task classification scheduling method based on K-means clustering and queuing theory |
CN116225664A (en) * | 2023-04-25 | 2023-06-06 | 中国人民解放军军事科学院系统工程研究院 | Preemptive scheduling method based on task priority of cloud video system |
-
2018
- 2018-03-27 AU AU2018100381A patent/AU2018100381A4/en not_active Ceased
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020000944A1 (en) * | 2018-06-25 | 2020-01-02 | 星环信息科技(上海)有限公司 | Preemptive scheduling based resource sharing use method, system and |
US10601905B2 (en) | 2018-07-20 | 2020-03-24 | Red Hat Israel, Ltd. | Priority switching based on resource usage patterns |
CN109471727A (en) * | 2018-10-29 | 2019-03-15 | 北京金山云网络技术有限公司 | A kind of task processing method, apparatus and system |
CN109799805B (en) * | 2019-01-17 | 2020-09-29 | 湖南大学 | Reliability-aware high-performance automobile electronic scheduling algorithm |
CN109799805A (en) * | 2019-01-17 | 2019-05-24 | 湖南大学 | A kind of high-performing car electronic schedule algorithm of reliability perception |
US11169856B2 (en) | 2019-01-31 | 2021-11-09 | Hewlett Packard Enterprise Development Lp | Container management |
CN110502321A (en) * | 2019-07-11 | 2019-11-26 | 新华三大数据技术有限公司 | A kind of resource regulating method and system |
WO2021120550A1 (en) * | 2019-12-19 | 2021-06-24 | Huawei Technologies Co., Ltd. | Methods and apparatus for resource scheduling of resource nodes of a computing cluster or a cloud computing platform |
CN111131440A (en) * | 2019-12-20 | 2020-05-08 | 腾讯科技(深圳)有限公司 | Resource scheduling method and device and server |
CN114356531A (en) * | 2022-01-12 | 2022-04-15 | 重庆邮电大学 | Edge calculation task classification scheduling method based on K-means clustering and queuing theory |
CN114090220A (en) * | 2022-01-13 | 2022-02-25 | 南京信息工程大学 | Hierarchical CPU and memory resource scheduling method |
CN114090220B (en) * | 2022-01-13 | 2022-04-29 | 南京信息工程大学 | Hierarchical CPU and memory resource scheduling method |
CN116225664A (en) * | 2023-04-25 | 2023-06-06 | 中国人民解放军军事科学院系统工程研究院 | Preemptive scheduling method based on task priority of cloud video system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2018100381A4 (en) | A physical resource scheduling method in cloud cluster | |
US11681547B2 (en) | File operation task optimization | |
Wang et al. | Provision of data-intensive services through energy-and QoS-aware virtual machine placement in national cloud data centers | |
CN104461740B (en) | A kind of cross-domain PC cluster resource polymerization and the method for distribution | |
Choudhary et al. | A dynamic optimization algorithm for task scheduling in cloud environment | |
CN102521012B (en) | Virtual machine-based general processing unit (GPU) cluster management system | |
US9686152B2 (en) | Techniques for tracking resource usage statistics per transaction across multiple layers of protocols | |
CN104580538B (en) | A kind of method of raising Nginx server load balancing efficiency | |
CN104601664B (en) | A kind of control system of cloud computing platform resource management and scheduling virtual machine | |
CN110138732A (en) | Response method, device, equipment and the storage medium of access request | |
US20160034995A1 (en) | Method and system for automated indentification and engagement of service providers | |
CN104902001B (en) | Web request load-balancing method based on operating system virtualization | |
CN104572290A (en) | Method and device for controlling message processing threads | |
US20160134573A1 (en) | System and Method for Equitable Processing of Asynchronous Messages in a Multi-Tenant Platform | |
Patel et al. | Cloud analyst: an insight of service broker policy | |
WO2024016596A1 (en) | Container cluster scheduling method and apparatus, device, and storage medium | |
CN112783607A (en) | Task deployment method and device in container cluster | |
Adrian et al. | Analysis of K-means algorithm for VM allocation in cloud computing | |
CN102855157A (en) | Method for comprehensively scheduling load of servers | |
CN110347477B (en) | Service self-adaptive deployment method and device in cloud environment | |
Hu et al. | Research of scheduling strategy on OpenStack | |
CN103617084B (en) | Emergency command cloud service implementation method and system based on microkernel cluster | |
CN115766876A (en) | Resource scheduling method, device, equipment and storage medium | |
Kasula | Performance analysis of layered architecture to integrate mobile devices and grid computing with a resource scheduling algorithm | |
CN103647712A (en) | Distributed route processing business method and distributed route processing business system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FGI | Letters patent sealed or granted (innovation patent) | ||
MK22 | Patent ceased section 143a(d), or expired - non payment of renewal fee or expiry |