WO2017010922A1 - Attribution de ressources informatiques en nuage - Google Patents

Attribution de ressources informatiques en nuage Download PDF

Info

Publication number
WO2017010922A1
WO2017010922A1 PCT/SE2015/050824 SE2015050824W WO2017010922A1 WO 2017010922 A1 WO2017010922 A1 WO 2017010922A1 SE 2015050824 W SE2015050824 W SE 2015050824W WO 2017010922 A1 WO2017010922 A1 WO 2017010922A1
Authority
WO
WIPO (PCT)
Prior art keywords
cloud computing
resource utilization
hosts
weight
predicted
Prior art date
Application number
PCT/SE2015/050824
Other languages
English (en)
Inventor
Nicolas Seyvet
Ignacio Manuel MULAS VIELA
Tony Larsson
Original Assignee
Telefonaktiebolaget Lm Ericsson (Publ)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget Lm Ericsson (Publ) filed Critical Telefonaktiebolaget Lm Ericsson (Publ)
Priority to PCT/SE2015/050824 priority Critical patent/WO2017010922A1/fr
Publication of WO2017010922A1 publication Critical patent/WO2017010922A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/505Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5077Logical partitioning of resources; Management or configuration of virtualized resources

Definitions

  • Cloud computing components usually make use of different drivers, such as hypervisors and virtual switches, installed on hosts.
  • a hypervisor or virtual machine monitor (VMM) is a piece of computer software, firmware or hardware that creates and runs Virtual Machines (VMs).
  • a virtual switch is a logical switching fabric built into a VM infrastructure so that the Virtual Machines (VMs) can be networked wherever you need them.
  • VMs Virtual Machines
  • Cloud computing can run in bare-metal in some occasions in order to avoid virtualization overheads and achieve better performance using other kinds of isolation systems like containers. Virtualization can also be used in other contexts than cloud computing.
  • the weight for each of the plurality of hosts maybe calculated by adding a predicted resource utilization to a current resource utilization, wherein the predicted resource utilization has a reduced significance compared to the current resource utilization.
  • the weight may also comprise a plurality of predicted resource utilizations for a plurality of time instances, and a prediction further into the future has a reduced significance compared to a prediction nearer into the future.
  • the significance of each predicted resource utilization of the plurality of predicted resource utilizations may be set in dependence on accuracy of the predictions.
  • the significance of each predicted resource utilization of the plurality of predicted resource utilizations may be dynamically set by machine learning by building a model and extracting each predicted resource utilization of the plurality of predicted resource
  • the instructions may further cause the scheduler server to: cause prediction of a resource utilization for each of a plurality of a first type of cloud computing recourses; cause aggregation of the predicted resource utilizations of the first type for each of the plurality of hosts controlling the plurality of cloud computing resources; cause determination of a first weight for each of the plurality of hosts, in dependence on current resource utilization of the first type and predicted resource utilization of the first type; cause prediction of a resource utilization for each of a plurality of a second type of cloud computing recourses; cause aggregation of the predicted resource utilizations of the second type for each of the plurality of hosts controlling the plurality of cloud computing resources; cause determination of a second weight for each of the plurality of hosts, in dependence on current resource utilization of the second type and predicted resource utilization of the second type; and cause determination of a total weight for each of the plurality of hosts, in
  • Fig. 2 is a schematic diagram illustrating an environment
  • a scheduler needs to be aware at any time what the situation is for the cloud computing resources to be able to efficiently decide where to allocate new virtual resources.
  • the scheduler follows a weight mechanism in order to decide where to allocate newly incoming requests. It creates a weight for each of the hosts connected to the cluster and takes into account the currently consumed resources by the virtual resources allocated on it. In this way these resources are ordered in a prioritized list of hosts. The scheduler can then allocate the requested resources and comply with the available ones on a given host.
  • the prediction scores for all users are then sorted from high to low. This is illustrated in Fig. 7, which contains 18 users (each bar in the histogram) and their prediction score. Based on this sorting a number of user classes can be defined. This may for instance be "high”, “med” and “low”. The number of classes and the range for each class is configurable.
  • a high predictability class may be used define a stable user.
  • a low predictability class may be used to define a very dynamic user.
  • a med predictability class maybe used to define users having a predictability being between stable users and very dynamic users. Details for implementation of classification of predictability of users can be found in patent application PCT/EP2014/ 079143.
  • the weights a and bn are dynamically set based on how good/accurately the prediction model captures the resource utilization. Intuitively this means that the predicted resource utilization Yn will get a higher weight bn when the predictions are working well. This can for instance be when we have stable and long-running services that have a certain regularity in its resource utilization data pattern.
  • the machine learning takes into account how stable a host is so it can give more or less importance to the future predicted samples. However, if the host is unstable, the machine learning algorithm will give very little importance to the future samples as they might not be reliable.
  • the difference between the two processes above is that in the first one, the weights are static (for example the a and bn coefficients may be a negative exponential function), while in the second one, the coefficients are dynamic and depend on the accuracy of the predictions done before.
  • the two nodes may e.g. be a centralized scheduler node and a distributed prediction node.
  • This module can e.g. be implemented by the processor 60 of Fig. 8, when running the computer program.

Abstract

L'invention concerne un procédé pour attribuer des ressources informatiques en nuage (22a, 22b) dans un environnement informatique en nuage (21). Le procédé est réalisé par un serveur de planificateur (20) et comprend les étapes consistant : à entraîner la détermination (42) d'un poids pour chacun d'une pluralité d'hôtes (23a, 23b, 23c), en fonction d'une utilisation de ressource courante et d'une utilisation de ressource prédite ; et à entraîner l'attribution (43) d'une ressource informatique en nuage (22a, 22b) en fonction du poids déterminé. L'invention concerne également un serveur de planificateur, un programme d'ordinateur et un produit programme d'ordinateur.
PCT/SE2015/050824 2015-07-14 2015-07-14 Attribution de ressources informatiques en nuage WO2017010922A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/SE2015/050824 WO2017010922A1 (fr) 2015-07-14 2015-07-14 Attribution de ressources informatiques en nuage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/SE2015/050824 WO2017010922A1 (fr) 2015-07-14 2015-07-14 Attribution de ressources informatiques en nuage

Publications (1)

Publication Number Publication Date
WO2017010922A1 true WO2017010922A1 (fr) 2017-01-19

Family

ID=57758067

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SE2015/050824 WO2017010922A1 (fr) 2015-07-14 2015-07-14 Attribution de ressources informatiques en nuage

Country Status (1)

Country Link
WO (1) WO2017010922A1 (fr)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107783822A (zh) * 2017-11-10 2018-03-09 郑州云海信息技术有限公司 一种资源管理方法及装置
AU2018200013A1 (en) * 2017-05-05 2018-11-22 Servicenow, Inc. Shared machine learning
CN110502344A (zh) * 2019-08-26 2019-11-26 联想(北京)有限公司 一种数据调整方法及装置
WO2020019017A1 (fr) * 2018-07-24 2020-01-30 Joseph Matthew Appareil, système et procédé de détection de contraintes sans agents dans le nuage avec ia
CN111143050A (zh) * 2018-11-02 2020-05-12 中移(杭州)信息技术有限公司 一种容器集群调度的方法和设备
CN111404974A (zh) * 2019-01-02 2020-07-10 中国移动通信有限公司研究院 一种云计算效能评估方法、装置及评估设备
CN112667398A (zh) * 2020-12-28 2021-04-16 北京奇艺世纪科技有限公司 资源调度方法、装置、电子设备及存储介质
US11126541B2 (en) 2018-05-24 2021-09-21 Red Hat, Inc. Managing resources used during a development pipeline
US11150931B2 (en) 2018-10-30 2021-10-19 Hewlett Packard Enterprise Development Lp Virtual workload migrations
CN114629959A (zh) * 2022-03-22 2022-06-14 北方工业大学 一种云环境中上下文感知的IoT服务调度策略生成方法
US11489731B2 (en) 2016-09-30 2022-11-01 Salesforce.Com, Inc. Techniques and architectures for efficient allocation of under-utilized resources
CN115412467A (zh) * 2022-09-01 2022-11-29 山东正中信息技术股份有限公司 一种电子政务云中租户云资源利用率评估方法及系统
US11609796B2 (en) 2017-12-14 2023-03-21 Google Llc Dynamic capacity optimization for shared computing resources segmented into reservation zones
US11620571B2 (en) 2017-05-05 2023-04-04 Servicenow, Inc. Machine learning with distributed training
CN117608809A (zh) * 2024-01-18 2024-02-27 中国电子科技集团公司第十五研究所 基于梯度提升决策树的多任务计划进度预测系统

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001042908A2 (fr) * 1999-12-10 2001-06-14 Art Technology Group, Inc. Procede et systeme d'equilibrage et de gestion de la puissance
US6763372B1 (en) * 2000-07-06 2004-07-13 Nishant V. Dani Load balancing of chat servers based on gradients
US20060236324A1 (en) * 2005-04-14 2006-10-19 International Business Machines (Ibm) Corporation Method and system for performance balancing in a distributed computer system
US20110202657A1 (en) * 2010-02-12 2011-08-18 Elitegroup Computer Systems Co., Ltd. Method for scheduling cloud-computing resource and system applying the same
US20110295999A1 (en) * 2010-05-28 2011-12-01 James Michael Ferris Methods and systems for cloud deployment analysis featuring relative cloud resource importance
US20130111032A1 (en) * 2011-10-28 2013-05-02 International Business Machines Corporation Cloud optimization using workload analysis
US20130145364A1 (en) * 2011-12-01 2013-06-06 Tunghai University Method of dynamic resource allocation for a virtual machine cluster
US20130290539A1 (en) * 2012-04-30 2013-10-31 Alcatel-Lucent Usa Inc. Resource placement in networked cloud based on resource constraints
EP2706731A2 (fr) * 2012-08-13 2014-03-12 Verisign, Inc. Systèmes et procédés de répartition de charge par routage prédictif

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001042908A2 (fr) * 1999-12-10 2001-06-14 Art Technology Group, Inc. Procede et systeme d'equilibrage et de gestion de la puissance
US6763372B1 (en) * 2000-07-06 2004-07-13 Nishant V. Dani Load balancing of chat servers based on gradients
US20060236324A1 (en) * 2005-04-14 2006-10-19 International Business Machines (Ibm) Corporation Method and system for performance balancing in a distributed computer system
US20110202657A1 (en) * 2010-02-12 2011-08-18 Elitegroup Computer Systems Co., Ltd. Method for scheduling cloud-computing resource and system applying the same
US20110295999A1 (en) * 2010-05-28 2011-12-01 James Michael Ferris Methods and systems for cloud deployment analysis featuring relative cloud resource importance
US20130111032A1 (en) * 2011-10-28 2013-05-02 International Business Machines Corporation Cloud optimization using workload analysis
US20130145364A1 (en) * 2011-12-01 2013-06-06 Tunghai University Method of dynamic resource allocation for a virtual machine cluster
US20130290539A1 (en) * 2012-04-30 2013-10-31 Alcatel-Lucent Usa Inc. Resource placement in networked cloud based on resource constraints
EP2706731A2 (fr) * 2012-08-13 2014-03-12 Verisign, Inc. Systèmes et procédés de répartition de charge par routage prédictif

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11902102B2 (en) 2016-09-30 2024-02-13 Salesforce, Inc. Techniques and architectures for efficient allocation of under-utilized resources
US11489731B2 (en) 2016-09-30 2022-11-01 Salesforce.Com, Inc. Techniques and architectures for efficient allocation of under-utilized resources
AU2018200013A1 (en) * 2017-05-05 2018-11-22 Servicenow, Inc. Shared machine learning
US10445661B2 (en) 2017-05-05 2019-10-15 Servicenow, Inc. Shared machine learning
US11620571B2 (en) 2017-05-05 2023-04-04 Servicenow, Inc. Machine learning with distributed training
CN107783822A (zh) * 2017-11-10 2018-03-09 郑州云海信息技术有限公司 一种资源管理方法及装置
US11609796B2 (en) 2017-12-14 2023-03-21 Google Llc Dynamic capacity optimization for shared computing resources segmented into reservation zones
US11126541B2 (en) 2018-05-24 2021-09-21 Red Hat, Inc. Managing resources used during a development pipeline
WO2020019017A1 (fr) * 2018-07-24 2020-01-30 Joseph Matthew Appareil, système et procédé de détection de contraintes sans agents dans le nuage avec ia
US11150931B2 (en) 2018-10-30 2021-10-19 Hewlett Packard Enterprise Development Lp Virtual workload migrations
CN111143050A (zh) * 2018-11-02 2020-05-12 中移(杭州)信息技术有限公司 一种容器集群调度的方法和设备
CN111143050B (zh) * 2018-11-02 2023-09-19 中移(杭州)信息技术有限公司 一种容器集群调度的方法和设备
CN111404974A (zh) * 2019-01-02 2020-07-10 中国移动通信有限公司研究院 一种云计算效能评估方法、装置及评估设备
CN110502344A (zh) * 2019-08-26 2019-11-26 联想(北京)有限公司 一种数据调整方法及装置
CN112667398B (zh) * 2020-12-28 2023-09-01 北京奇艺世纪科技有限公司 资源调度方法、装置、电子设备及存储介质
CN112667398A (zh) * 2020-12-28 2021-04-16 北京奇艺世纪科技有限公司 资源调度方法、装置、电子设备及存储介质
CN114629959A (zh) * 2022-03-22 2022-06-14 北方工业大学 一种云环境中上下文感知的IoT服务调度策略生成方法
CN114629959B (zh) * 2022-03-22 2023-11-17 北方工业大学 一种云环境中上下文感知的IoT服务调度策略方法
CN115412467A (zh) * 2022-09-01 2022-11-29 山东正中信息技术股份有限公司 一种电子政务云中租户云资源利用率评估方法及系统
CN115412467B (zh) * 2022-09-01 2023-11-07 山东正中信息技术股份有限公司 一种电子政务云中租户云资源利用率评估方法及系统
CN117608809A (zh) * 2024-01-18 2024-02-27 中国电子科技集团公司第十五研究所 基于梯度提升决策树的多任务计划进度预测系统

Similar Documents

Publication Publication Date Title
WO2017010922A1 (fr) Attribution de ressources informatiques en nuage
Ghomi et al. Load-balancing algorithms in cloud computing: A survey
Duc et al. Machine learning methods for reliable resource provisioning in edge-cloud computing: A survey
CN112153700B (zh) 一种网络切片资源管理方法及设备
KR101782345B1 (ko) 엔드-투-엔드 데이터센터 성능 제어
Wuhib et al. Dynamic resource allocation with management objectives—Implementation for an OpenStack cloud
Jennings et al. Resource management in clouds: Survey and research challenges
Mohamed et al. Software-defined networks for resource allocation in cloud computing: A survey
US10289973B2 (en) System and method for analytics-driven SLA management and insight generation in clouds
US10671445B2 (en) Cost-optimal cluster configuration analytics package
Hummaida et al. Adaptation in cloud resource configuration: a survey
US8423646B2 (en) Network-aware virtual machine migration in datacenters
US8510747B2 (en) Method and device for implementing load balance of data center resources
CN111344688B (zh) 云计算中资源提供的方法及系统
Javadpour et al. An energy-optimized embedded load balancing using DVFS computing in cloud data centers
US10896059B2 (en) Dynamically allocating cache in a multi-tenant processing infrastructure
Sedaghat et al. Unifying cloud management: Towards overall governance of business level objectives
CN110221920B (zh) 部署方法、装置、存储介质及系统
Issawi et al. An efficient adaptive load balancing algorithm for cloud computing under bursty workloads
Mohammadi Bahram Abadi et al. Self-adaptive architecture for virtual machines consolidation based on probabilistic model evaluation of data centers in Cloud computing
Lu et al. InSTechAH: Cost-effectively autoscaling smart computing hadoop cluster in private cloud
Hassan et al. Efficient resource scheduling for big data processing in cloud platform
Bose et al. Sla management in cloud computing: A service provider's perspective
Wamser et al. Orchestration and monitoring in fog computing for personal edge cloud service support
Yousaf et al. RAVA—Resource aware VNF agnostic NFV orchestration method for virtualized networks

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15898413

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15898413

Country of ref document: EP

Kind code of ref document: A1