WO2011106725A2 - Virtual machine power consumption measurement and management - Google Patents

Virtual machine power consumption measurement and management Download PDF

Info

Publication number
WO2011106725A2
WO2011106725A2 PCT/US2011/026353 US2011026353W WO2011106725A2 WO 2011106725 A2 WO2011106725 A2 WO 2011106725A2 US 2011026353 W US2011026353 W US 2011026353W WO 2011106725 A2 WO2011106725 A2 WO 2011106725A2
Authority
WO
WIPO (PCT)
Prior art keywords
power
virtual machine
power consumption
activity
host server
Prior art date
Application number
PCT/US2011/026353
Other languages
French (fr)
Other versions
WO2011106725A3 (en
Inventor
Aman Kansal
Jie Liu
Douglas C. Burger
Arka Aloke Bhattacharya
Original Assignee
Microsoft Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corporation filed Critical Microsoft Corporation
Priority to EP18152419.0A priority Critical patent/EP3333668B1/en
Priority to CN201180011019.0A priority patent/CN102770826B/en
Priority to EP11748196.0A priority patent/EP2539791B1/en
Publication of WO2011106725A2 publication Critical patent/WO2011106725A2/en
Publication of WO2011106725A3 publication Critical patent/WO2011106725A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • G06F1/3234Power saving characterised by the action undertaken
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01RMEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
    • G01R21/00Arrangements for measuring electric power or power factor
    • G01R21/133Arrangements for measuring electric power or power factor by using digital technique
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • G06F1/3206Monitoring of events, devices or parameters that trigger a change in power modality
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3447Performance evaluation by modeling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • G06F1/3206Monitoring of events, devices or parameters that trigger a change in power modality
    • G06F1/3228Monitoring task completion, e.g. by use of idle timers, stop commands or wait commands
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • G06F1/3234Power saving characterised by the action undertaken
    • G06F1/329Power saving characterised by the action undertaken by task scheduling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • G06F11/3419Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment by assessing time
    • G06F11/3423Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment by assessing time where the assessed time is active or idle time
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466Performance evaluation by tracing or monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/815Virtual
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/865Monitoring of software
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/88Monitoring involving counting
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • a virtual machine is an emulation of a computing device that is strictly software yet acts like a physical computing device. This means that one or more virtual machines can reside on a single physical host server. It appears to clients and all software running on the virtual machines, however, that each virtual machine is an independent server.
  • Virtualization (or the use of virtual machines) allows multiple applications to be safely placed on a single physical host server because each application operates inside its "virtual" computer and does not affect other applications. This allows server resources to be better utilized and reduces the number of servers required as well as the operational costs of hosting them such as electricity and cooling. Hence, virtualization is an efficient technique for managing computational resources and is gaining widespread acceptance in data centers and large scale computing facilities.
  • Power consumption for a physical server can be measured by metering the electricity supplied to the server.
  • Virtual machines are created in software and there is no easy way to connect a power meter to a virtual machine. This makes it quite difficult to measure power consumption of virtual machines.
  • Techniques used for physical servers can only report the total power draw of the server. But they cannot be used to measure the individual power consumption of each of the multiple virtual machines that reside on the single physical host server. In fact, the power consumption of a virtual machine cannot be measured using purely hardware solutions.
  • the ability to measure the power consumption of a virtual machine is important. Users or server managers are unable to measure the power used by each virtual machine. Measuring this power is useful for several reasons.
  • a virtual machine may have to be shut down or migrated to a different power circuit with excess capacity.
  • software developers can measure how much energy their software consumes when executed on a physical server but cannot measure it when running on a virtual machine. Not being able to measure the power usage cripples the developer's ability to optimize the software for lower power consumption.
  • Another use of measuring a virtual machine's power consumption is to charge back users based on power use. Shared physical servers, such as used in cloud computing, allow multiple users to run virtual machines on the shared physical infrastructure. The owner of the physical servers may wish to charge the virtual machine users based on power use.
  • Embodiments of the virtual machine power metering system and method measure the power consumption of individual virtual machines. Power metering is often used on physical servers to make several power management and operational decisions.
  • Embodiments of the virtual machine power metering system and method allow such decisions to be made also on virtual machines.
  • the general idea is that a power meter measurement, which measures the power consumption of an entire physical host server, is converted into individual virtual machine power meters that measure the power consumption of each individual virtual machine residing on the host server.
  • Embodiments of the virtual machine power metering system and method will work with any number of virtual machines and work across multiple physical host servers.
  • the virtual machine power metering can be used for virtual machine power capping to allow power oversubscription in virtualized environments. This virtual machine power capping can be performed without violating the isolation property.
  • the individual virtual machine power consumption measurements also can be used to track the power consumption of an activity (such as a search query) that spans multiple virtual machines.
  • Embodiments of the virtual machine power metering system and method do not need the hardware to supply a power model. Any information needed is learned on the fly.
  • Embodiments of the system and method automatically learn a power model relating the hardware (or resource) usage (such as central processing unit (CPU) usage, disk usage, memory usage, network, usage, and so forth) to power consumption. Then the hardware usage of each virtual machine is tracked to infer the power consumption of the virtual machine.
  • hardware or resource usage
  • Embodiments of the system and method compute the power consumption in two broad steps.
  • a power model is generated and trained using the total power consumption of the physical host server and resource utilization for a specific virtual machine.
  • Resource utilization is the determination of how much of a resource the virtual machine is using. These resource utilizations are summed to find a total resource utilization for the virtual machine.
  • the power model is generated that relates the total resource utilization to the total power consumption.
  • Embodiments of the system and method also determine optimal power model coefficients during this step. This is achieved by determine the number of resources, N, being used by the virtual machine. A system of N equations having N unknowns is generated and solved to obtain the optimal power model coefficients.
  • Embodiments of the system and method also include a power model learning technique, where the model learned is unique to the workloads hosted within each virtual machine. This accounts for differences in power usage among workloads that can be seen even when the workloads have the same or similar appearing resource usage.
  • embodiments of the system and method calculate the energy used by the virtual machine.
  • Several embodiments may be used to calculate the energy.
  • One embodiment uses the percentages of resources used by the virtual machine and then sums these percentages to obtain the power consumption for the virtual machine.
  • the resources used may be measured in units other than percentages.
  • resources measured may include the CPU, memory, disks, network cards, and fans, to name a few.
  • the power model can be augmented by taking into account the frequency of the CPU of the host server.
  • Embodiments of the system and method can also be used to obtain the power consumption for a specific activity.
  • An activity can be a service, a request (such as a web search query), an e-mail delivery, a webpage retrieval, a database operation, or a file update.
  • the power consumption for the activity can be computed even if the activity is across multiple virtual machines and across multiple host servers.
  • An activity period which is the period of time that the activity is active on the computing infrastructure, is determined.
  • Embodiments of the system and method then compute the power
  • FIG. 1 is a block diagram illustrating a general overview of embodiments of the virtual machine power metering system and method implemented on a host server computing device.
  • FIG. 2 is a block diagram illustrating modules and data used in embodiments of the virtual machine power metering system and method shown in FIG. 1.
  • FIG. 3 is a flow diagram illustrating the general operation of embodiments of the virtual machine power metering system shown in FIGS. 1 and 2.
  • FIG. 4 is a flow diagram illustrating the operational details of embodiments of the power model training module shown in FIG. 2.
  • FIG. 5 is a flow diagram illustrating the operational details of a first embodiment of the energy calculation module shown in FIG. 2.
  • FIG. 6 is a flow diagram illustrating the operational details of a second
  • FIG. 7 is a flow diagram illustrating the operational details of embodiments of the power capping module shown in FIG. 2.
  • FIG. 8 illustrates an example of a suitable computing system environment in which embodiments of the virtual machine power metering system and method shown in FIGS. 1-7 may be implemented.
  • the virtual machine power metering system and method infers energy from resource usage.
  • the virtual machine power metering system and method tracks the resources used by each virtual machine in software and then converts the resource usage to energy by leveraging the power models of the individual resources.
  • the system and method track usage for all the hardware resources, such as the CPU cores, disk arrays, memory banks, network cards, and graphics cards. Tracking the resource usage includes determining what power state the resource was in and what virtual machine was it working for. If the system and method correctly label each resource according to which virtual machine was using it and also measure the power draw during that duration (typically a few microseconds or milliseconds), then the system and method can compute the active energy usage of each virtual machine.
  • the active energy may be defined as the energy consumed by a resource when working on behalf of some virtual machine or the system.
  • the system In addition to the active energy, the system also uses some idle energy and shared energy.
  • the idle energy could include the energy spent in a sleep state for a small duration between active states.
  • the shared energy includes energy used by a resource when the work performed benefits multiple virtual machines, such as the energy spent on disk spin-up before input/output operations from multiple virtual machines. Some of the idle energy is spent regardless of whether any virtual machine is running or not while part of it is affected by the presence of virtual machines.
  • the time durations between a virtual machine's idle periods may affect whether a deeper sleep state is used by the processor or not during the idle time.
  • the idle and shared energies can be reported separately or divided among virtual machines either equally or proportional to their active energy usage, depending on how the metering information is to be used. For the power capping application discussed in detail below, reporting separately works best.
  • embodiments of the system and method leverage power models that relate the software-observable state of a resource to its power usage. If the state changes can be observed at microsecond granularity then the power usage at that fine time scale can be inferred. Observing the power states is non-trivial and several approximations must be made. For instance, the resource states, such as the clock gating of sub-components within the processor or exact nature of mechanical motions within a disk array may not be completely known in some embodiments of the method and system. Moreover, there may be no visibility into power states of certain hardware resources within the platform, such as hardware-controlled device power states or graphics processor activity.
  • the next step is to determine the power usage in that state using its power model.
  • power models are not readily available.
  • Embodiments of the system and method build the power models in situ. To address the second challenge, embodiments of the system and method leverage the knowledge regarding scheduling of resources for the virtual machines. Again, complete visibility into all resources is lacking and trade-offs in instrumentation overhead and accuracy must be made.
  • FIG. 1 is a block diagram illustrating a general overview of embodiments of the virtual machine power metering system 100 and method implemented on a host server computing device 110.
  • embodiments of the virtual machine power metering system 100 and method determine the power consumption of individual virtual machines running on the host server computing device 110 without the need to special hardware.
  • Embodiments of the virtual machine power metering system 100 reside on the host server computing device 110.
  • the host server computing device 110 represents a physical server.
  • One or more virtual machines 120 are hosted by the host server computing device 110. These one or more virtual machines 120 can be represented as VMi, VM 2 , . . . to VMN, where N is the number of virtual machines.
  • a power tracing module 130 measure the full system power, or the total power consumed by the host server computing device 110. In some embodiments, the power tracing module 130 represents a software driver to read the full system power consumption from an available power sensor on the
  • a resource tracing module 140 represents hypervisor performance counters that trace the resource usage of virtual machines.
  • a hypervisor also known as a virtual machine manager
  • the resource tracing module 140 tracks the resources of the host server computing device 110.
  • embodiments of the system 100 use existing hypervisor performance counters. However, in some
  • instrumentation can be added to monitor resource usage where usability with existing hypervisors is not essential.
  • Embodiments of the system 100 process this information to compute and output power consumption for each virtual machine 150.
  • FIG. 2 is a block diagram illustrating modules and data used in embodiments of the virtual machine power metering system 100 and method shown in FIG. 1.
  • embodiments of the virtual machine power metering system 100 include modules for power measurement and management in virtual machines. More specifically,
  • embodiments of the virtual machine power metering system 100 include a power model training module 200 that generates a power model based on information from the power tracing module 130 and the resource tracing module 140.
  • the power model is further refined by taking into account the frequency of the central processing unit (CPU).
  • Embodiments of the system 100 also include an energy calculation module 210 that uses the power model to compute the power consumption for each of the one or more virtual machines 120. As explained below, embodiments of this module use at least two embodiments to obtain the power consumption. The result is the power consumption for each individual virtual machine 150. This information is used by a power capping module 220 to provide power limitations for each virtual machine as desired.
  • the power model training module 200 uses total power usage by the host server 230 and resource utilizations measurements 240 to generate a power model 250. As explained in detail below, a number of measurements are taken to obtain a system of power model equations. This system of equations is solved to find optimal power model coefficients 260.
  • the energy calculation module 210 uses the power tracing module 130
  • the energy calculation module 210 There are two embodiments of the energy calculation module 210, as shown by the dotted lines in FIG. 2.
  • a first embodiment of the energy calculation module 210 uses virtual machine resource percentages to make the calculations 270.
  • the coefficient used is the same for each of the virtual machines. For example, if two virtual machines both use 10% CPU (and no other resources) then their energy use will be the same.
  • a second embodiment of the module 210 uses virtual machine resource percentages and a virtual machine specific power model to make the energy calculations 280. In this embodiment, as an example, even with both virtual machines using 10% CPU each the energy use could be different because one virtual machine could be performing floating point operations while the other virtual machine could be performing fixed point operations (these two operations use different sub-units of the CPU and hence may use different amounts of energy).
  • the mathematical details of both embodiments are discussed below. Both of these embodiments are refined by adding slightly to the complexity of the power model by using a power model augmentation module 290.
  • the power model augmentation module 290 augments the power model by taking into account the frequency of the central processing unit (CPU) of the host server computing device 110.
  • the result is the power consumption for each individual virtual machine 150. This result then is used by the power capping module 220 to cap the power used by each virtual machine. The details of the power capping process are discussed in detail below.
  • Computational activities in a computing infrastructure often consist of multiple tasks performed by multiple virtual machines or server roles hosted on different physical machines.
  • a search query may be first received by a web front end service (also referred to as web frontend server role) and may be hosted inside a virtual machine.
  • the search query then may be processed by multiple search data retrieval servers, advertisement servers, and content-rendering servers. The results of the processing are collected and sent to the user.
  • an activity may consist of receiving an email for a particular user.
  • First an incoming-email server role (possibly hosted within a virtual machine) will receive the message and determine the correct user account for the message.
  • a virus- scanning server may check the contents of the email message.
  • a spam-filtering server may check the contents of the message as well.
  • a storage server may place the message in its storage.
  • the activity in each case involves expenditure of energy on multiple distributed server roles some of which could be hosted on the same machine and some on different machines. It is desirable to determine the energy used by the activity as a whole.
  • FIG. 3 is a flow diagram illustrating the general operation of embodiments of the virtual machine power metering system 100 shown in FIGS. 1 and 2.
  • embodiments of the virtual machine power metering system 100 and method to determine the energy usage of an individual activity that may be distributed over several virtual machines.
  • An activity is defined as a service or request.
  • a search query is an activity that has a verifiable beginning and end.
  • the method begins by selecting an activity for which to determine the power consumption (box 300).
  • the activity spans at least one virtual machine on each of a plurality of physical host servers.
  • a beginning event is generated when the activity is first initiated on the virtual machine (box 310). This beginning event marks the beginning of the tracking of each resource that is used by the activity (box 320).
  • the energy usage (or power consumption) of the relevant virtual machine is determined over the duration of the activity's use (box 330).
  • the activity could start and end on different virtual machines at different times. For example, in the e-mail example just given a virtual machine may start virus scanning the e-mail at time Ti and end the scanning at time T 2 . Another virtual machine may then start analyzing the same e-mail for spam at time T3 and end the analysis at time T4.
  • the power consumption of each virtual machine may be determined using the virtual machine power metering methods described in detail below. This generates power consumption measurements for each virtual machine working on the selected activity. As the activity progresses, the energy used across multiple server roles is added up to generate the total energy used by the activity.
  • Embodiments of the system 100 then determine and mark an ending event in which the activity was terminated (box 340).
  • This ending event may be the fulfillment of a search request, or some other event that marks the time at which the activity was completed or terminated and presented to a user.
  • An activity period is defined as the amount of time between the beginning event and the ending event (box 350).
  • Embodiments of the system 100 determine the total power consumption of devices that were used during the activity period but that were not tracked (box 360).
  • the total energy used by these devices is measured using hardware power metering or using known power models (based on specifications and data sheets or learned in situ). The portion of the total power consumption that was used by the selected activity during the activity period then is determined (box 370). The total energy may be apportioned equally among all activities, divided in the ratio of their execution times, or in the ratios of their previously computed energy spent within the virtual machines.
  • Embodiments of the system 100 then sum the power consumption used during the activity period for the selected activity to obtain the total activity power consumption (box 380). Specifically, the power consumption measurements for each virtual machine working on the activity during the activity period and the portion of the power
  • the total activity power consumption then is output (box 390).
  • the above discussion uses the terms "energy usage” and "power consumption.” These may be reported in units of energy (such as Joules or Kilowatt Hours). Power consumption may also be reported in terms of power drawn, using units of power (such as Watts). In the case where the reporting is in terms of power, a time series representing power drawn over unit time intervals may be provided. The total activity power consumption may be reported as a single number in units of energy per virtual machine activity, or as a single number in units of energy per virtual machine over the activity period. Alternatively, the total activity power consumption may be reported as a time series with multiple values in units of power over the activity period.
  • the power model training module 200 generates a power model that is used to determine the power consumption for a virtual machine.
  • FIG. 4 is a flow diagram illustrating the operational details of embodiments of the power model training module 200 shown in FIG. 2. The method begins by inputting the total power consumption (box 400). This is reported by the power tracing module 130. In other words, the host server computing device 110 reports the total power used to the power tracing module 120.
  • a “resource” can be any component that is part of the computer infrastructure of the host server computing device 110.
  • a resource may be processor (such as a central processing unit (CPU)), a disk, memory, and so forth.
  • the resource tracing module 140 determines how much of each resource is being utilized by a virtual machine (called “resource utilization”). The resource utilization for each resource then is summed to obtain a total resource utilization (box 420).
  • the host server computing device 110 includes a host operating system (not shown) that allocates these resources to any virtual machines residing thereon.
  • the host operating system may say that a virtual machine can get up to 30% of the CPU deposit.
  • the host operating system reserves this 30% it also tracks that the virtual machine is currently using 17% of its allocated 30% of the CPU deposit. So the host operating system both enforces that it never goes above the 30% and tracks what the virtual machine is currently using. This is done for each virtual machine residing on the host server computing device 110. In this way the host server computing device 110 gives the resource tracing module information about how much of a resource each virtual machine is using.
  • a power model is generated based on the resource utilization for each virtual machine. This is done by taking the utilization of each resource and summing over all the resources being used by the virtual machine to obtain the total resource utilization.
  • the power model then relates the resource utilization to total power consumption (TPC) as follows:
  • embodiments of the module 200 determine the M number of resources being used by the virtual machine.
  • a system of M equations can be solved to determine the M unknowns.
  • most embodiments of the system collect a N measurements, where N is greater than M.
  • Known methods to fit a model such as linear regression
  • the use of a N that is greater than M helps overcome
  • embodiments of the module 200 make N number of measurements of the total power consumption of the host server computing device 1 10 at N different combinations of resource utilizations (box 440). This gives a system of N equations and M unknowns. This system of N equations then is solved to obtain M number of optimal power model coefficients (box 450).
  • embodiments of the module 200 generate a power model that relates for each resource its utilization to power consumption (box 460). This is represented using M coefficients that relate the utilization to power consumption for each resource.
  • This power model is used to measure the power used by a virtual machine.
  • the power used by the virtual machine is measured by tracking the hardware resources (such as the CPU, disk, memory, network, and so forth) used by the virtual machine.
  • the output of embodiments of the module 200 is the optimal power model coefficients (box 470).
  • the following example demonstrates how the power models can be obtained and how the resource usage on each hardware resource can be tracked.
  • the CPU as a first resource, and suppose that multiple virtual machines, denoted as VMi, VM 2 , to VM N , are hosted on a single physical host server. Then, suppose that two cores of the N-core CPU are used by VMi for time period ti to Suppose each of these two cores is used to 70% of capacity.
  • PCPU Pidie + (xi/100)*P core + (x 2 /100)*P core + . .. + (x N /100)*P core .
  • the above equation is an example of a power model for the CPU.
  • the power model relates the usage (denoted by utilization percentages, x ls x 2 , . .. , x N ) to the power usage. Note that this is one example of a power model.
  • a more sophisticated model that includes not just the utilization of each core but also the types of instructions being executed by the processor and other more detailed information also may be used.
  • the power drawn by the virtual machine can be computed for each component. Then the power drawn by each component is added up to obtain the total power drawn by the virtual machine.
  • Pidie for the CPU was not assigned to the virtual machine.
  • the user may consider Pidie as wasted power regardless of whether a virtual machine is present or not and hence feel that it is not needed.
  • Other users may consider that Pid le is spent on maintaining a server which is needed because of the virtual machines.
  • the Pidie could be added to the power drawn by the virtual machines, either divided equally among all virtual machines using the server, or divided proportionally in the ratios of their other power drawn.
  • Embodiments of the power model training module 200 use these models to relate the resource utilization to the total power consumption of the host server computing device 1 10 and thereby measure the power used by a virtual machine.
  • the power usage of the CPU depends on several factors, such as the sub-units within the processor that are active, specific instructions executed, on-chip cache usage, if the processor has dynamic voltage frequency scaling (DVFS), sleep state usage over time, and the frequency used (denoted as P-state).
  • DVFS dynamic voltage frequency scaling
  • P-state the frequency used
  • P-state only DVFS frequency states are referred to as P-states and DFS states are referred to as Throttle (or T-states).
  • Throttle or T-states
  • P-state is used to represent all frequency levels as the distinction is not relevant in this context.
  • An accurate power estimation considering these factors can be achieved using a cycle accurate simulator.
  • a lighter weight alternative is to track processor states that capture the major power usage changes.
  • P0, PI, . . ., Pm representing progressively lower frequencies.
  • CO is used to denote the active state
  • CI, C2, . . ., Cn represent
  • n is the number of sleep states (typically n ⁇ ⁇ 3, ... , 6 ⁇ .
  • k pc represents the power used in P-state and C-state combination p, c. If it were possible to measure Pcpu(t) with every P-state and C-state change, then learning the values of k p,c would be easy. Measurement at microsecond granularity is feasible on specially instrumented motherboards but not on realistic platforms. Instead, power can be measured at coarser time granularity (on the order of a second). Hence, embodiments of the power model training module 200 integrate the above power model over a longer time duration, T, say:
  • AT pc represents the time spent in state ⁇ p, c) during the duration T and the summation is taken over all (p, c) combinations.
  • the above equation then can be expressed in terms of observable quantities that are available on real systems, as follows.
  • ucpu(p) represents the percentage of time the processor was active during T, in fixed P-state p.
  • a(p) (k P, co - k p,s i eep ).
  • ycpurfi e the sleep state energy in lowest frequency state
  • y(p) k ps i eep - kp m,s i eep .
  • T 1 second for convenience:
  • Ecpu ⁇ x ⁇ p)ucpu (p) 4 * lP) + />V..
  • ucpu(p) is readily available as an hypervisor performance counter and may thus be used on a real platform to monitor the processor state. If the model parameters are known, the processor energy can be estimated. The method to learn parameters a(p), y(p), and cpu, is described in detail below together with the power models for other resources.
  • Embodiments of the virtual machine power metering system 100 use a light weight approach that simply tracks when a virtual machine is active on a processor core (as is possible by tracking the context switches for every core). The energy used can be tracked using Equation (1) during the durations when the virtual machine is active on the processor. Processor context switch events are available on the many types of hypervisors. Re-writing in terms of a more readily available performance counter, the processor utilization of virtual machine A, denoted UC P U , A(P), and assuming that the P-state stays constant during interval T, the energy usage of a virtual machine A, denoted EC P U , can be written as
  • a true estimate of memory energy usage may use a cycle accurate simulation of its hardware design.
  • prior memory power models have found that the key factor that affects memory energy usage is the read and write throughput. While external instrumentation has been attempted to capture memory throughput accurately, a low overhead estimate of memory throughput is the last level cache (LLC) miss counter available in most processors.
  • LLC last level cache
  • memory power consumption may be written as: EMem i ) — m em ⁇ LLCM ( T) -f m m (7)
  • E Me m(T) represents the energy used by memory over time duration T
  • N LL C M (T) is the number of LLC misses during T
  • a mem and y mem are the linear model parameters.
  • embodiments of the module 200 use LLC misses as an approximation for throughput, misses corresponding to each virtual machine are tracked. On a single core system, embodiments of the module 200 can assign the LLC misses during the time a virtual machine has the active context on the processor as the LLC misses for that virtual machine. Thus, the memory energy used by a virtual machine A becomes:
  • the hypervisor can observe the number of bytes read and written as well as the service times for those reads and writes. However, for individual virtual machines, current hypervisors only track bytes read or written. These are used in the disk energy model, which can be written as follows:
  • E Dis k(T) represents the energy consumed by the disk over time duration T
  • b r and b w are the number of bytes read and written respectively during the interval T.
  • the a parameters and Ydis k are model parameters to be learned.
  • embodiments of the power model training module 200 track the disk usage parameters in equation (9) for individual virtual machines.
  • the time at which the disk activity occurs is usually not the same as when the virtual machine is active on the processor, since the hypervisor may batch 10 interrupts and buffer 10 operations. Thus instead of looking at storage system activity during the active context of a virtual machine, the 10 operations
  • EoiBk f A ::::::::::::: Ctrb * 3 ⁇ 4 ⁇ , + *wbbn* ,A (10) where b r and b W.
  • A represent the number of bytes read and written, respectively, by virtual machine A. Further, it has been found that the difference in energies for disk read and write to be negligible and hence a common parameter, say b- 0 , can be used to represent the sum of bytes read and written. This changes the model to:
  • Virtual machine disk energy can, for the embodiment described above, be computed using:
  • Equations (5), (7), and (11) use certain coefficients, denoted by oc's, and ' s, that need to be learned in situ on the servers.
  • Realistic platforms do not allow measuring EC P U(T), E Me m(T), and E Dis k(T) separately but only the full system power, denoted E sys (T).
  • Equation (13) The following points are worth noting regarding Equation (13) and lead to slight modifications for actual implementation.
  • the constants cpu die, mem, disk, and E static do not have any observable parameters that vary across observations, their individual values from cannot be learned from measurements of E sys . Their sum can hence be denoted as a single constant, .
  • the magnitude of ucpu(p) is a fraction between 0 and 1 while N LL C M and b io take values of the order of a hundred million.
  • N LL C M and b io can be normalized with respect to their maximum values observed on a system, such that the a parameters are scaled to similar magnitudes.
  • the final equation used in learning thus becomes:
  • the output of the power model training module 200 are the optimal power model coefficients 260.
  • the energy calculation module 210 uses the optimal power model coefficients from the power model training module 200 to obtain the power consumption for a virtual machine.
  • the learned coefficient values are used in Equations (6), (8), and (12) and the computed resource energies are summed up to determine the energy usage of each VM. In fact the energy used by the virtual machine on each resource can be reported separately, if desired.
  • FIG. 5 is a flow diagram illustrating the operational details of a first embodiment of the energy calculation module 210 shown in FIG. 2.
  • the method begins by inputting the optimal power model coefficients (box 500).
  • the method also inputs the percentages of each resource that is used by the virtual machine (box 510).
  • Each resource then is multiplied by its corresponding optimal power model coefficients to obtain intermediate resource results (box 520).
  • Each of the intermediate resource results then are summed to obtain a power consumption for the virtual machine (box 530). For example, for each virtual machine the percentage of the CPU resource used by the virtual machine plus the percentage of the disk resource used by the virtual machine, and so forth, and is summed over all of the resources.
  • the output of the module 210 is the power consumption of the virtual machine (box 540).
  • This first embodiment generates linearly independent combinations of power states in Equation (14) through controlled workloads that cause various resource power states to be used.
  • This approach is a basic approach and has the lowest overhead since one workload is used for training.
  • the controlled workloads may not use the processor and other resources in a manner representative of real workloads.
  • the second embodiment of the energy calculation tracks more specific information than the first embodiment. This second embodiment is a variation on the first
  • FIG. 6 is a flow diagram illustrating the operational details of a second
  • the method begins by inputting the optimal power model coefficients (box 600). In addition, the method inputs the percentages of the CPU working for a particular virtual machine (box 610). Next, embodiments of the module 210 compute virtual-machine specific coefficients for each resource using the optimal power model coefficients (box 620).
  • the second embodiment takes this into account.
  • the second embodiment also uses virtual machine-specific coefficients, such that the ( is the i value that works for VMi, ( 2 is the ( i value that works for VM2, and so forth.
  • each of the al 's are for a first resource (such as the CPU resource), and the a 2 's are for a second resource (such as the disk resource), and so forth.
  • the power model augmentation module 290 augments the above power models by taking into account the fact that CPUs can operate a different frequencies.
  • the power augmentation module 290 is used to overcome the accuracy limitations of the two embodiments set forth above.
  • the general idea of the power augmentation module 290 is that the model parameters (or optimal power model coefficients) for each virtual machine are learned separately. Since for a given workload, the unobserved power states are highly correlated to the observed ones, the model based on a small number of observed states will capture the power usage more accurately. In fact, with the power model augmentation module 290, even if the processor counter for LLC misses is not captured due to portability limitations across processors, that simply becomes another unobserved state that for a given workload may be correlated with observed states.
  • the virtual machine power metering system 100 and method includes the virtual machine power metering techniques that were set forth above. These virtual machine power metering techniques can be used for various power management decisions. One of these power management decisions and applications will now be discussed, namely, power capping.
  • the power capping module 220 uses the power consumption computed for each virtual machine using the above techniques for a power capping technique for virtual machines. This virtual machine power capping allows virtual machines to be hosted in power over-subscribed circuits, leading to significant advantages in power provisioning costs.
  • the above power capping method does not apply directly to virtual machines.
  • the physical server hosting multiple virtual machines is capped by throttling the physical resources (such as CPU frequency) in the server, it may cause multiple virtual machines to be throttled even when it is just one virtual machine that exceeded its power rating.
  • the power capping module 220 uses the virtual machine metering information described above to enable power capping for virtual machines.
  • FIG. 7 is a flow diagram illustrating the operational details of embodiments of the power capping module 220 shown in FIG. 2.
  • the power usage of each virtual machine is monitored (box 700).
  • the peak power drawn by that virtual machine is measured (box 710) and virtual machines are placed on physical host servers such that the sum of their peaks is less than or equal to the power capacity allocated to that server (box 720).
  • a first option is that the virtual machines that are exceeding their power budget are throttled to the power level that allows the server to be within its allocated capacity (box 765). Throttling may be implemented by reducing the amount of CPU time allocated to that VM.
  • a second option is that the virtual machines are allowed to operate at higher power but some other lower priority virtual machines are throttled to reduce overall power usage (box 770). For example, a revenue generating virtual machine may be allowed to run above its budget and some background tasks may be throttled temporarily.
  • a third option is that one or more virtual machines are migrated from this server to another server that has excess capacity (box 775).
  • a fourth option is that one or more virtual machines are shut down to be executed later (box 780). Additional options, which are not shown in FIG. 7, include reducing the number of requests being submitted to the virtual machines, and increasing the number of virtual machines on other servers where excess power capacity.
  • the power capping module 220 In other embodiments of the power capping module 220, a variation of the above method is used.
  • the virtual machines or the applications hosted on the virtual machines
  • the peak user load occurs only for a small duration of the day and at other times they are operating below peak.
  • the peak of the sum of the power usages of the virtual machine is lower than the sum of the peaks of the power usages.
  • the peak load of multiple virtual machines sharing a server may occur at different times.
  • some embodiments of the power capping module 220 allocate power capacity for a level lower than the peak. When the server power capacity is exceeded, an appropriate action similar to the above method (such as throttling individual VMs, migrating VMs to other servers, and so forth) is used.
  • Computational activities in a computing infrastructure often consist of multiple tasks performed by multiple virtual machines or server roles hosted on different physical machines.
  • a search query may be first received by a web front end service (also referred to as web frontend server role) and may be hosted inside a virtual machine.
  • the search query then may be processed by multiple search data retrieval servers, advertisement servers, and content-rendering servers.
  • the results of the processing are collected and sent to the user.
  • an activity may consist of receiving an email for a particular user. First an incoming-email server role (possibly hosted within a virtual machine) will receive the message and determine the correct user account for the message.
  • a virus-scanning server may check the contents of the email message.
  • a spam-filtering server may check the contents of the message as well. Then a storage server may place the message in its storage.
  • the activity in each case involves expenditure of energy on multiple distributed server roles some of which could be hosted on the same machine and some on different machines. It is desirable to determine the energy used by the activity as a whole.
  • the virtual machine power metering system 100 and method also includes a technique to determine the energy usage of an individual activity that may be distributed over several virtual machines.
  • An activity is defined as a service or request.
  • a search query is an activity that has a verifiable beginning and end.
  • an event is generated. This begins the tracking of each resource that is used by the activity. For example, when an activity uses a particular server role, the energy usage of the relevant server role is determined over the duration of the activity's use using previously described methods to monitor energy usage of virtual machines. As noted above, the activity could start and end on different virtual machines at different times. As the activity progresses, the energy used across multiple server roles is added up to generate the total energy used by the activity.
  • some energy may be spent by devices that are not part of the computing infrastructure as tracked by the virtual machine power metering method. These could include cooling equipment for the computing facility and such peripherals.
  • the energy used in these devices is estimated as follows. The total energy used by those devices is measured using hardware power metering or using known power models (based on specifications and data sheets or learned in situ). Then, the total activity workload served by the computational facility is tracked using only the methods that monitor computing infrastructure that are accessible to the virtual machine power metering method. The total power consumption is then apportioned among all activities that are performed by the computing infrastructure. The total energy may be apportioned equally among all activities, divided in the ratio of their execution times, or in the ratios of their previously computed energy spent within the virtual machines.
  • the techniques presented herein use the term "energy usage” or "power consumption.” These may be reported in units of energy (such as Joules or Kilowatt Hours). Power consumption may also be reported in terms of power drawn, using units of power (such as Watts). In the case where the reporting is in terms of power, a time series representing power drawn over unit time intervals may be provided.
  • Embodiments of the virtual machine power metering system 100 and method are designed to operate in a computing environment. The following discussion is intended to provide a brief, general description of a suitable computing environment in which embodiments of the virtual machine power metering system 100 and method may be implemented.
  • FIG. 8 illustrates an example of a suitable computing system environment in which embodiments of the virtual machine power metering system 100 and method shown in FIGS. 1-7 may be implemented.
  • the computing system environment 800 is only one example of a suitable computing environment and is not intended to suggest any limitation as to the scope of use or functionality of the invention. Neither should the computing environment 800 be interpreted as having any dependency or requirement relating to any one or combination of components illustrated in the exemplary operating environment.
  • Embodiments of the virtual machine power metering system 100 and method are operational with numerous other general purpose or special purpose computing system environments or configurations.
  • Examples of well known computing systems, environments, and/or configurations that may be suitable for use with embodiments of the virtual machine power metering system 100 and method include, but are not limited to, personal computers, server computers, hand-held (including smartphones), laptop or mobile computer or communications devices such as cell phones and PDA's,
  • Embodiments of the virtual machine power metering system 100 and method may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer.
  • program modules include routines, programs, objects, components, data structures, etc., that perform particular tasks or implement particular abstract data types.
  • Embodiments of the virtual machine power metering system 100 and method may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network.
  • program modules may be located in both local and remote computer storage media including memory storage devices.
  • an exemplary system for embodiments of the virtual machine power metering system 100 and method includes a general-purpose computing device in the form of a computer 810.
  • Components of the computer 810 may include, but are not limited to, a processing unit 820 (such as a central processing unit, CPU), a system memory 830, and a system bus 821 that couples various system components including the system memory to the processing unit 820.
  • the system bus 821 may be any of several types of bus structures including a memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus architectures.
  • a processing unit 820 such as a central processing unit, CPU
  • system memory 830 such as a central processing unit, RAM
  • system bus 821 that couples various system components including the system memory to the processing unit 820.
  • the system bus 821 may be any of several types of bus structures including a memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus architectures.
  • ISA Industry Standard Architecture
  • Micro Channel Micro Channel
  • MCA Management Entity Bus
  • EISA Enhanced ISA
  • VESA Video Electronics Standards Association
  • PCI Peripheral Component Interconnect
  • the computer 810 typically includes a variety of computer readable media.
  • Computer readable media can be any available media that can be accessed by the computer 810 and includes both volatile and nonvolatile media, removable and nonremovable media.
  • Computer readable media may comprise computer storage media and communication media.
  • Computer storage media includes volatile and nonvolatile removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data.
  • Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by the computer 810.
  • communication media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared and other wireless media. Combinations of any of the above should also be included within the scope of computer readable media.
  • the system memory 830 includes computer storage media in the form of volatile and/or nonvolatile memory such as read only memory (ROM) 831 and random access memory (RAM) 832.
  • ROM read only memory
  • RAM random access memory
  • BIOS basic input/output system 833
  • RAM 832 typically contains data and/or program modules that are immediately accessible to and/or presently being operated on by processing unit 820.
  • FIG. 8 illustrates operating system 834, application programs 835, other program modules 836, and program data 837.
  • the computer 810 may also include other removable/non-removable,
  • FIG. 8 illustrates a hard disk drive 841 that reads from or writes to non-removable, nonvolatile magnetic media, a magnetic disk drive 851 that reads from or writes to a removable, nonvolatile magnetic disk 852, and an optical disk drive 855 that reads from or writes to a removable, nonvolatile optical disk 856 such as a CD ROM or other optical media.
  • a removable, nonvolatile optical disk 856 such as a CD ROM or other optical media.
  • Other removable/non-removable, volatile/nonvolatile computer storage media that can be used in the exemplary operating environment include, but are not limited to, magnetic tape cassettes, flash memory cards, digital versatile disks, digital video tape, solid state RAM, solid state ROM, and the like.
  • the hard disk drive 841 is typically connected to the system bus 821 through a non-removable memory interface such as interface 840, and magnetic disk drive 851 and optical disk drive 855 are typically connected to the system bus 821 by a removable memory interface, such as interface 850.
  • the drives and their associated computer storage media discussed above and illustrated in FIG. 8, provide storage of computer readable instructions, data structures, program modules and other data for the computer 810.
  • hard disk drive 841 is illustrated as storing operating system 844, application programs 845, other program modules 846, and program data 847. Note that these components can either be the same as or different from operating system 834, application programs 835, other program modules 836, and program data 837.
  • Operating system 844, application programs 845, other program modules 846, and program data 847 are given different numbers here to illustrate that, at a minimum, they are different copies.
  • a user may enter commands and information (or data) into the computer 810 through input devices such as a keyboard 862, pointing device 861, commonly referred to as a mouse, trackball or touch pad, and a touch panel or touch screen (not shown).
  • Other input devices may include a microphone, joystick, game pad, satellite dish, scanner, radio receiver, or a television or broadcast video receiver, or the like. These and other input devices are often connected to the processing unit 820 through a user input interface 860 that is coupled to the system bus 821, but may be connected by other interface and bus structures, such as, for example, a parallel port, game port or a universal serial bus (USB).
  • a monitor 891 or other type of display device is also connected to the system bus 821 via an interface, such as a video interface 890.
  • computers may also include other peripheral output devices such as speakers 897 and printer 896, which may be connected through an output peripheral interface 895.
  • the computer 810 may operate in a networked environment using logical connections to one or more remote computers, such as a remote computer 880.
  • the remote computer 880 may be a personal computer, a server, a router, a network PC, a peer device or other common network node, and typically includes many or all of the elements described above relative to the computer 810, although only a memory storage device 881 has been illustrated in FIG. 8.
  • the logical connections depicted in FIG. 8 include a local area network (LAN) 871 and a wide area network (WAN) 873, but may also include other networks.
  • LAN local area network
  • WAN wide area network
  • Such networking environments are commonplace in offices, enterprise-wide computer networks, intranets and the Internet.
  • the computer 810 When used in a LAN networking environment, the computer 810 is connected to the LAN 871 through a network interface or adapter 870. When used in a WAN networking environment, the computer 810 typically includes a modem 872 or other means for establishing communications over the WAN 873, such as the Internet.
  • the modem 872 which may be internal or external, may be connected to the system bus 821 via the user input interface 860, or other appropriate mechanism.
  • program modules depicted relative to the computer 810, or portions thereof may be stored in the remote memory storage device.
  • FIG. 8 illustrates remote application programs 885 as residing on memory device 881. It will be appreciated that the network connections shown are exemplary and other means of establishing a communications link between the computers may be used.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Power Engineering (AREA)
  • Power Sources (AREA)
  • Debugging And Monitoring (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Embodiments of the virtual machine power metering system and method measure the power consumption of individual virtual machines. Power meter measurements for a physical host server are converted into individual virtual machine power meters that measure the power consumption of each individual virtual machine residing on the host server. The virtual machine power consumption is computed by generating a power model using the total power consumption of the host server and resource utilization for a virtual machine. Optimal power model coefficients are computed using the power model. The energy used by the virtual machine is computed using one of two embodiments. Embodiments of the system and method also can be used to obtain the power consumption for a specific activity (such as a service, request, or search query). In addition, the virtual machine power metering can be used for virtual machine power capping to allow power oversubscription in virtualized environments.

Description

VIRTUAL MACHINE POWER CONSUMPTION
MEASUREMENT AND MANAGEMENT
BACKGROUND
[0001] Data centers, cloud computing infrastructures, compute clusters, enterprise computational facilities, and high performance computing facilities are increasingly using virtual machines to isolate workloads while running the workloads on shared physical servers. A virtual machine is an emulation of a computing device that is strictly software yet acts like a physical computing device. This means that one or more virtual machines can reside on a single physical host server. It appears to clients and all software running on the virtual machines, however, that each virtual machine is an independent server.
[0002] Virtualization (or the use of virtual machines) allows multiple applications to be safely placed on a single physical host server because each application operates inside its "virtual" computer and does not affect other applications. This allows server resources to be better utilized and reduces the number of servers required as well as the operational costs of hosting them such as electricity and cooling. Hence, virtualization is an efficient technique for managing computational resources and is gaining widespread acceptance in data centers and large scale computing facilities.
[0003] Data centers and other computing facilities often monitor power usage of servers to correctly manage the power capacity and distribution. The power measurement data is also used to make several manual and automated power management decisions. In fact power metering is so widespread in data centers that power distribution units (PDUs), power strips, server power supplies and in newer servers even server motherboards already contain the power metering instruments required to measure power. Another way to measure power consumption in a physical server is to connect a power metering instrument to the server.
[0004] Power consumption for a physical server can be measured by metering the electricity supplied to the server. Virtual machines, however, are created in software and there is no easy way to connect a power meter to a virtual machine. This makes it quite difficult to measure power consumption of virtual machines. Techniques used for physical servers can only report the total power draw of the server. But they cannot be used to measure the individual power consumption of each of the multiple virtual machines that reside on the single physical host server. In fact, the power consumption of a virtual machine cannot be measured using purely hardware solutions. [0005] The ability to measure the power consumption of a virtual machine is important. Users or server managers are unable to measure the power used by each virtual machine. Measuring this power is useful for several reasons. For example, if a virtual machine is using excessive power it may have to be shut down or migrated to a different power circuit with excess capacity. In addition, software developers can measure how much energy their software consumes when executed on a physical server but cannot measure it when running on a virtual machine. Not being able to measure the power usage cripples the developer's ability to optimize the software for lower power consumption. Another use of measuring a virtual machine's power consumption is to charge back users based on power use. Shared physical servers, such as used in cloud computing, allow multiple users to run virtual machines on the shared physical infrastructure. The owner of the physical servers may wish to charge the virtual machine users based on power use.
[0006] Yet another use of virtual machine power metering, and one that is discussed in more detail below, arises in virtual machine power capping. Power capping allows over- subscription of the power infrastructure and results in more efficient utilization of the power capacity. However, current power capping techniques are designed for physical servers and do not work well when applied to virtual machines. For example, capping the power of a physical server results in the capping of all virtual machines that are hosted on the server. This violates the isolation of workloads provided by virtual machines and can defeat the advantages of using virtualization.
[0007] Thus, one of the major components of power management in large scale computational facilities is the visibility into power. This capability, however, is lacking in virtualized machines that are increasingly being used in data centers. The use of virtual machines allows safe isolation of multiple co-located workloads, enabling multiple workloads to be consolidated on fewer servers, and results in improved resource utilization. However, the lack of visibility into virtual machine power usage takes away the many advantages of server power metering that were available without virtualization.
SUMMARY
[0008] This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.
[0009] Embodiments of the virtual machine power metering system and method measure the power consumption of individual virtual machines. Power metering is often used on physical servers to make several power management and operational decisions.
Embodiments of the virtual machine power metering system and method allow such decisions to be made also on virtual machines.
[0010] The general idea is that a power meter measurement, which measures the power consumption of an entire physical host server, is converted into individual virtual machine power meters that measure the power consumption of each individual virtual machine residing on the host server. Embodiments of the virtual machine power metering system and method will work with any number of virtual machines and work across multiple physical host servers.
[0011] Knowing the power consumption of an individual virtual machine is useful for several power management decisions that are made in a data center or any virtualized server. For example, the virtual machine power metering can be used for virtual machine power capping to allow power oversubscription in virtualized environments. This virtual machine power capping can be performed without violating the isolation property. The individual virtual machine power consumption measurements also can be used to track the power consumption of an activity (such as a search query) that spans multiple virtual machines.
[0012] Embodiments of the virtual machine power metering system and method do not need the hardware to supply a power model. Any information needed is learned on the fly. Embodiments of the system and method automatically learn a power model relating the hardware (or resource) usage (such as central processing unit (CPU) usage, disk usage, memory usage, network, usage, and so forth) to power consumption. Then the hardware usage of each virtual machine is tracked to infer the power consumption of the virtual machine.
[0013] Embodiments of the system and method compute the power consumption in two broad steps. First, a power model is generated and trained using the total power consumption of the physical host server and resource utilization for a specific virtual machine. Resource utilization is the determination of how much of a resource the virtual machine is using. These resource utilizations are summed to find a total resource utilization for the virtual machine. The power model is generated that relates the total resource utilization to the total power consumption.
[0014] Embodiments of the system and method also determine optimal power model coefficients during this step. This is achieved by determine the number of resources, N, being used by the virtual machine. A system of N equations having N unknowns is generated and solved to obtain the optimal power model coefficients.
[0015] Embodiments of the system and method also include a power model learning technique, where the model learned is unique to the workloads hosted within each virtual machine. This accounts for differences in power usage among workloads that can be seen even when the workloads have the same or similar appearing resource usage.
[0016] Second, embodiments of the system and method calculate the energy used by the virtual machine. Several embodiments may be used to calculate the energy. One embodiment uses the percentages of resources used by the virtual machine and then sums these percentages to obtain the power consumption for the virtual machine. In other embodiments, the resources used may be measured in units other than percentages. By way of example, resources measured may include the CPU, memory, disks, network cards, and fans, to name a few. In these embodiments the power model can be augmented by taking into account the frequency of the CPU of the host server.
[0017] Embodiments of the system and method can also be used to obtain the power consumption for a specific activity. An activity can be a service, a request (such as a web search query), an e-mail delivery, a webpage retrieval, a database operation, or a file update. The power consumption for the activity can be computed even if the activity is across multiple virtual machines and across multiple host servers. An activity period, which is the period of time that the activity is active on the computing infrastructure, is determined. Embodiments of the system and method then compute the power
consumption by summing power consumption for each virtual machine working on the activity during the respective activity periods to obtain total activity power consumption.
[0018] It should be noted that alternative embodiments are possible, and that steps and elements discussed herein may be changed, added, or eliminated, depending on the particular embodiment. These alternative embodiments include alternative steps and alternative elements that may be used, and structural changes that may be made, without departing from the scope of the invention.
DRAWINGS DESCRIPTION
[0019] Referring now to the drawings in which like reference numbers represent corresponding parts throughout:
[0020] FIG. 1 is a block diagram illustrating a general overview of embodiments of the virtual machine power metering system and method implemented on a host server computing device. [0021] FIG. 2 is a block diagram illustrating modules and data used in embodiments of the virtual machine power metering system and method shown in FIG. 1.
[0022] FIG. 3 is a flow diagram illustrating the general operation of embodiments of the virtual machine power metering system shown in FIGS. 1 and 2.
[0023] FIG. 4 is a flow diagram illustrating the operational details of embodiments of the power model training module shown in FIG. 2.
[0024] FIG. 5 is a flow diagram illustrating the operational details of a first embodiment of the energy calculation module shown in FIG. 2.
[0025] FIG. 6 is a flow diagram illustrating the operational details of a second
embodiment of the energy calculation module shown in FIG. 2.
[0026] FIG. 7 is a flow diagram illustrating the operational details of embodiments of the power capping module shown in FIG. 2.
[0027] FIG. 8 illustrates an example of a suitable computing system environment in which embodiments of the virtual machine power metering system and method shown in FIGS. 1-7 may be implemented.
DETAILED DESCRIPTION
[0028] In the following description of embodiments of the virtual machine power metering system and method, reference is made to the accompanying drawings, which form a part thereof, and show, by way of illustration, a specific example whereby embodiments of the virtual machine power metering system and method may be practiced. It is to be understood that other embodiments may be utilized and structural changes may be made without departing from the scope of the claimed subject matter.
I. Power Metering Concepts
[0029] Since no hardware power measuring device can be connected to an individual virtual machine, the virtual machine power metering system and method infers energy from resource usage. In principle, the virtual machine power metering system and method tracks the resources used by each virtual machine in software and then converts the resource usage to energy by leveraging the power models of the individual resources.
[0030] For this approach, the system and method track usage for all the hardware resources, such as the CPU cores, disk arrays, memory banks, network cards, and graphics cards. Tracking the resource usage includes determining what power state the resource was in and what virtual machine was it working for. If the system and method correctly label each resource according to which virtual machine was using it and also measure the power draw during that duration (typically a few microseconds or milliseconds), then the system and method can compute the active energy usage of each virtual machine. The active energy may be defined as the energy consumed by a resource when working on behalf of some virtual machine or the system.
[0031] In addition to the active energy, the system also uses some idle energy and shared energy. By way of example, for the CPU, the idle energy could include the energy spent in a sleep state for a small duration between active states. The shared energy includes energy used by a resource when the work performed benefits multiple virtual machines, such as the energy spent on disk spin-up before input/output operations from multiple virtual machines. Some of the idle energy is spent regardless of whether any virtual machine is running or not while part of it is affected by the presence of virtual machines.
[0032] For instance the time durations between a virtual machine's idle periods may affect whether a deeper sleep state is used by the processor or not during the idle time. The idle and shared energies can be reported separately or divided among virtual machines either equally or proportional to their active energy usage, depending on how the metering information is to be used. For the power capping application discussed in detail below, reporting separately works best.
[0033] Thus, tracking virtual machine energy boils down to two challenges. First, power measurement at microsecond or millisecond granularity: The instantaneous power should be determined every few microseconds for each resource, since the virtual machine using the resource may change every few microseconds (depending on time period used for context switching). This may cause the component usage and power state to change.
[0034] Second, label resource use for each individual virtual machine. It must be determined which virtual machine was responsible for using each resource. Ideally, this process does not require the application source code running inside a virtual machine to be instrumented, since the platform developers may not have access to it and requiring application developers to instrument their source code is unlikely to scale in practice.
[0035] In order to address the first challenge, embodiments of the system and method leverage power models that relate the software-observable state of a resource to its power usage. If the state changes can be observed at microsecond granularity then the power usage at that fine time scale can be inferred. Observing the power states is non-trivial and several approximations must be made. For instance, the resource states, such as the clock gating of sub-components within the processor or exact nature of mechanical motions within a disk array may not be completely known in some embodiments of the method and system. Moreover, there may be no visibility into power states of certain hardware resources within the platform, such as hardware-controlled device power states or graphics processor activity.
[0036] After observing the resource state, the next step is to determine the power usage in that state using its power model. However, power models are not readily available.
Embodiments of the system and method build the power models in situ. To address the second challenge, embodiments of the system and method leverage the knowledge regarding scheduling of resources for the virtual machines. Again, complete visibility into all resources is lacking and trade-offs in instrumentation overhead and accuracy must be made.
II. System Overview
[0037] FIG. 1 is a block diagram illustrating a general overview of embodiments of the virtual machine power metering system 100 and method implemented on a host server computing device 110. In general, embodiments of the virtual machine power metering system 100 and method determine the power consumption of individual virtual machines running on the host server computing device 110 without the need to special hardware.
[0038] Embodiments of the virtual machine power metering system 100 reside on the host server computing device 110. The host server computing device 110 represents a physical server. One or more virtual machines 120 are hosted by the host server computing device 110. These one or more virtual machines 120 can be represented as VMi, VM2, . . . to VMN, where N is the number of virtual machines. A power tracing module 130 measure the full system power, or the total power consumed by the host server computing device 110. In some embodiments, the power tracing module 130 represents a software driver to read the full system power consumption from an available power sensor on the
motherboard of the host server computing device 110.
[0039] A resource tracing module 140 represents hypervisor performance counters that trace the resource usage of virtual machines. A hypervisor (also known as a virtual machine manager) is management software that that allows multiple operating systems to run on a host computer concurrently. The resource tracing module 140 tracks the resources of the host server computing device 110. To keep embodiments of the virtual machine power metering system 100 usable with existing hypervisors, embodiments of the system 100 use existing hypervisor performance counters. However, in some
embodiments of the system and method additional performance counters and
instrumentation can be added to monitor resource usage where usability with existing hypervisors is not essential. Embodiments of the system 100 process this information to compute and output power consumption for each virtual machine 150.
[0040] FIG. 2 is a block diagram illustrating modules and data used in embodiments of the virtual machine power metering system 100 and method shown in FIG. 1. In general, embodiments of the virtual machine power metering system 100 include modules for power measurement and management in virtual machines. More specifically,
embodiments of the virtual machine power metering system 100 include a power model training module 200 that generates a power model based on information from the power tracing module 130 and the resource tracing module 140. In some embodiments of the module 200 the power model is further refined by taking into account the frequency of the central processing unit (CPU).
[0041] Embodiments of the system 100 also include an energy calculation module 210 that uses the power model to compute the power consumption for each of the one or more virtual machines 120. As explained below, embodiments of this module use at least two embodiments to obtain the power consumption. The result is the power consumption for each individual virtual machine 150. This information is used by a power capping module 220 to provide power limitations for each virtual machine as desired.
[0042] The power model training module 200 uses total power usage by the host server 230 and resource utilizations measurements 240 to generate a power model 250. As explained in detail below, a number of measurements are taken to obtain a system of power model equations. This system of equations is solved to find optimal power model coefficients 260.
[0043] The energy calculation module 210 uses the power tracing module 130
information, the resource tracing module 140 information, and the optimal power model coefficients 260 to calculate the power consumption for each of the one or more virtual machines 120. There are two embodiments of the energy calculation module 210, as shown by the dotted lines in FIG. 2.
[0044] A first embodiment of the energy calculation module 210 uses virtual machine resource percentages to make the calculations 270. In this embodiment the coefficient used is the same for each of the virtual machines. For example, if two virtual machines both use 10% CPU (and no other resources) then their energy use will be the same. A second embodiment of the module 210 uses virtual machine resource percentages and a virtual machine specific power model to make the energy calculations 280. In this embodiment, as an example, even with both virtual machines using 10% CPU each the energy use could be different because one virtual machine could be performing floating point operations while the other virtual machine could be performing fixed point operations (these two operations use different sub-units of the CPU and hence may use different amounts of energy).
[0045] The mathematical details of both embodiments are discussed below. Both of these embodiments are refined by adding slightly to the complexity of the power model by using a power model augmentation module 290. The power model augmentation module 290 augments the power model by taking into account the frequency of the central processing unit (CPU) of the host server computing device 110.
[0046] The result is the power consumption for each individual virtual machine 150. This result then is used by the power capping module 220 to cap the power used by each virtual machine. The details of the power capping process are discussed in detail below.
III. Operational Overview
[0047] Computational activities in a computing infrastructure (such as a data center) often consist of multiple tasks performed by multiple virtual machines or server roles hosted on different physical machines. For example, a search query may be first received by a web front end service (also referred to as web frontend server role) and may be hosted inside a virtual machine. The search query then may be processed by multiple search data retrieval servers, advertisement servers, and content-rendering servers. The results of the processing are collected and sent to the user.
[0048] As another example, an activity may consist of receiving an email for a particular user. First an incoming-email server role (possibly hosted within a virtual machine) will receive the message and determine the correct user account for the message. A virus- scanning server may check the contents of the email message. A spam-filtering server may check the contents of the message as well. Then a storage server may place the message in its storage. The activity in each case involves expenditure of energy on multiple distributed server roles some of which could be hosted on the same machine and some on different machines. It is desirable to determine the energy used by the activity as a whole.
[0049] FIG. 3 is a flow diagram illustrating the general operation of embodiments of the virtual machine power metering system 100 shown in FIGS. 1 and 2. In general, embodiments of the virtual machine power metering system 100 and method to determine the energy usage of an individual activity that may be distributed over several virtual machines. An activity is defined as a service or request. For example, a search query is an activity that has a verifiable beginning and end.
[0050] Referring to FIG. 3, the method begins by selecting an activity for which to determine the power consumption (box 300). The activity spans at least one virtual machine on each of a plurality of physical host servers. Next, a beginning event is generated when the activity is first initiated on the virtual machine (box 310). This beginning event marks the beginning of the tracking of each resource that is used by the activity (box 320). When an activity uses a particular virtual machine the energy usage (or power consumption) of the relevant virtual machine is determined over the duration of the activity's use (box 330).
[0051] It should be noted that the activity could start and end on different virtual machines at different times. For example, in the e-mail example just given a virtual machine may start virus scanning the e-mail at time Ti and end the scanning at time T2. Another virtual machine may then start analyzing the same e-mail for spam at time T3 and end the analysis at time T4.
[0052] The power consumption of each virtual machine may be determined using the virtual machine power metering methods described in detail below. This generates power consumption measurements for each virtual machine working on the selected activity. As the activity progresses, the energy used across multiple server roles is added up to generate the total energy used by the activity.
[0053] Embodiments of the system 100 then determine and mark an ending event in which the activity was terminated (box 340). This ending event may be the fulfillment of a search request, or some other event that marks the time at which the activity was completed or terminated and presented to a user. An activity period is defined as the amount of time between the beginning event and the ending event (box 350).
[0054] In addition, some energy may be spent by devices that are not part of the computing infrastructure as tracked by the virtual machine power metering method. These could include cooling equipment for the computing facility and such peripherals. The energy used in these devices is estimated as follows. Embodiments of the system 100 determine the total power consumption of devices that were used during the activity period but that were not tracked (box 360).
[0055] The total energy used by these devices is measured using hardware power metering or using known power models (based on specifications and data sheets or learned in situ). The portion of the total power consumption that was used by the selected activity during the activity period then is determined (box 370). The total energy may be apportioned equally among all activities, divided in the ratio of their execution times, or in the ratios of their previously computed energy spent within the virtual machines.
[0056] Embodiments of the system 100 then sum the power consumption used during the activity period for the selected activity to obtain the total activity power consumption (box 380). Specifically, the power consumption measurements for each virtual machine working on the activity during the activity period and the portion of the power
consumption of the untracked devices used during the activity period are summed. This yields the total activity power consumption, or the power consumption used in the execution of the activity over all virtual machines used in the activity, regardless of which physical host server the virtual machines reside. The total activity power consumption then is output (box 390).
[0057] The above discussion uses the terms "energy usage" and "power consumption." These may be reported in units of energy (such as Joules or Kilowatt Hours). Power consumption may also be reported in terms of power drawn, using units of power (such as Watts). In the case where the reporting is in terms of power, a time series representing power drawn over unit time intervals may be provided. The total activity power consumption may be reported as a single number in units of energy per virtual machine activity, or as a single number in units of energy per virtual machine over the activity period. Alternatively, the total activity power consumption may be reported as a time series with multiple values in units of power over the activity period.
IV. System and Operational Details
[0058] The system and the operational details of embodiments of the virtual machine power metering system 100 and method now will be discussed. These embodiments include embodiments of the power model training module 200, the energy calculation module 210, the power model augmentation module 290, and the power capping module 220. The system and operational details of each of these modules now will be discussed in detail.
IV.A. Power Model Training Module
[0059] The power model training module 200 generates a power model that is used to determine the power consumption for a virtual machine. FIG. 4 is a flow diagram illustrating the operational details of embodiments of the power model training module 200 shown in FIG. 2. The method begins by inputting the total power consumption (box 400). This is reported by the power tracing module 130. In other words, the host server computing device 110 reports the total power used to the power tracing module 120.
[0060] Another input is a breakdown of how much of a given resource the virtual machine is using (box 410). This information is reported by the resource tracing module 140. A "resource" can be any component that is part of the computer infrastructure of the host server computing device 110. For example, a resource may be processor (such as a central processing unit (CPU)), a disk, memory, and so forth. The resource tracing module 140 determines how much of each resource is being utilized by a virtual machine (called "resource utilization"). The resource utilization for each resource then is summed to obtain a total resource utilization (box 420).
[0061] The host server computing device 110 includes a host operating system (not shown) that allocates these resources to any virtual machines residing thereon. For example, the host operating system may say that a virtual machine can get up to 30% of the CPU deposit. When the host operating system reserves this 30% it also tracks that the virtual machine is currently using 17% of its allocated 30% of the CPU deposit. So the host operating system both enforces that it never goes above the 30% and tracks what the virtual machine is currently using. This is done for each virtual machine residing on the host server computing device 110. In this way the host server computing device 110 gives the resource tracing module information about how much of a resource each virtual machine is using.
[0062] Let M denote the number of resources and let N denote the number of observations. For each of the M number of resources in the physical server, embodiments of the module 200 determine a total resource utilization that is summed over all virtual machines (box 430).
[0063] In general, a power model is generated based on the resource utilization for each virtual machine. This is done by taking the utilization of each resource and summing over all the resources being used by the virtual machine to obtain the total resource utilization. Mathematically, assume that there are three resources being used by the virtual machine and that aj, ci2, and <¾ are optimal power model coefficients, which are constants. The power model then relates the resource utilization to total power consumption (TPC) as follows:
a I * (CPU utilization) + <¾ * (disk utilization) + <¾ * (network utilization) = TPC.
[0064] Theoretically, if three different measurements of the total power consumption are made for three different values of the resource utilization, then we have three equations and three unknowns. Thus, embodiments of the module 200 determine the M number of resources being used by the virtual machine. A system of M equations can be solved to determine the M unknowns. However, in practice, most embodiments of the system collect a N measurements, where N is greater than M. Known methods to fit a model (such as linear regression) then are used to determine the values of M variables that would best fit the N equations. The use of a N that is greater than M helps overcome
measurement noise and such variations.
[0065] In particular, embodiments of the module 200 make N number of measurements of the total power consumption of the host server computing device 1 10 at N different combinations of resource utilizations (box 440). This gives a system of N equations and M unknowns. This system of N equations then is solved to obtain M number of optimal power model coefficients (box 450).
[0066] Next, embodiments of the module 200 generate a power model that relates for each resource its utilization to power consumption (box 460). This is represented using M coefficients that relate the utilization to power consumption for each resource. This power model is used to measure the power used by a virtual machine. Conceptually, the power used by the virtual machine is measured by tracking the hardware resources (such as the CPU, disk, memory, network, and so forth) used by the virtual machine. The output of embodiments of the module 200 is the optimal power model coefficients (box 470).
[0067] The following example demonstrates how the power models can be obtained and how the resource usage on each hardware resource can be tracked. Consider the CPU as a first resource, and suppose that multiple virtual machines, denoted as VMi, VM2, to VMN, are hosted on a single physical host server. Then, suppose that two cores of the N-core CPU are used by VMi for time period ti to Suppose each of these two cores is used to 70% of capacity.
[0068] Suppose that when no virtual machine is using the CPU that its power usage is Pidie- Moreover, suppose when one core is fully used the power consumption is Pidie + PCOre and when one core is used to xi% utilization, the power consumption is Pidie +
(xi/100)*Pcore. Suppose the utilization of all cores 1 to n is denoted xl s x2, . .. , xN. Then, mathematically the power usage can be written as:
PCPU= Pidie + (xi/100)*Pcore + (x2/100)*Pcore + . .. + (xN/100)*Pcore.
[0069] The above equation is an example of a power model for the CPU. The power model relates the usage (denoted by utilization percentages, xls x2, . .. , xN) to the power usage. Note that this is one example of a power model. A more sophisticated model that includes not just the utilization of each core but also the types of instructions being executed by the processor and other more detailed information also may be used.
[0070] Continuing the above example where a virtual machine used 70% of two cores, the power model above can be used to say that between times t and t2 the virtual machine drew power in the CPU at a rate PVM:CPU = (70/100)*Pcore + (70/100)*Pcore. Similarly, using a power model for the other hardware components (such as disk, memory, network, and so forth), the power drawn by the virtual machine can be computed for each component. Then the power drawn by each component is added up to obtain the total power drawn by the virtual machine.
[0071] In the above example, Pidie for the CPU was not assigned to the virtual machine. The user may consider Pidie as wasted power regardless of whether a virtual machine is present or not and hence feel that it is not needed. Other users may consider that Pidle is spent on maintaining a server which is needed because of the virtual machines. In this case, the Pidie could be added to the power drawn by the virtual machines, either divided equally among all virtual machines using the server, or divided proportionally in the ratios of their other power drawn.
[0072] The details of some practical power models for a variety of system resources now will be discussed. Embodiments of the power model training module 200 use these models to relate the resource utilization to the total power consumption of the host server computing device 1 10 and thereby measure the power used by a virtual machine.
IV.A.1. Practical Power Model for CPU
[0073] The power usage of the CPU depends on several factors, such as the sub-units within the processor that are active, specific instructions executed, on-chip cache usage, if the processor has dynamic voltage frequency scaling (DVFS), sleep state usage over time, and the frequency used (denoted as P-state). To be precise, only DVFS frequency states are referred to as P-states and DFS states are referred to as Throttle (or T-states). In this document P-state is used to represent all frequency levels as the distinction is not relevant in this context. An accurate power estimation considering these factors can be achieved using a cycle accurate simulator. However, that requires a complete architectural model of the processor, and has high processing overheads, making it unsuitable for runtime virtual machine power metering.
[0074] A lighter weight alternative is to track processor states that capture the major power usage changes. Suppose there are m P-states, denoted as P0, PI, . . ., Pm, representing progressively lower frequencies. When the processor is not actively executing any thread it enters a low power sleep state, referred to as a C-state. By convention, CO is used to denote the active state and CI, C2, . . ., Cn represent
progressively deeper sleep states, where n is the number of sleep states (typically n ε {3, ... , 6} . Suppose the instantaneous power used by the processor at time t is Pcpu(t)-
Assuming that the power depends on only the P-states and C-states, it can be denoted by a constant in each different combination of fre uency p(t) and sleep state c(t) at time t:
Figure imgf000017_0001
where kpc represents the power used in P-state and C-state combination p, c. If it were possible to measure Pcpu(t) with every P-state and C-state change, then learning the values of kp,c would be easy. Measurement at microsecond granularity is feasible on specially instrumented motherboards but not on realistic platforms. Instead, power can be measured at coarser time granularity (on the order of a second). Hence, embodiments of the power model training module 200 integrate the above power model over a longer time duration, T, say:
Figure imgf000017_0002
Where ATpc represents the time spent in state {p, c) during the duration T and the summation is taken over all (p, c) combinations. The above equation then can be expressed in terms of observable quantities that are available on real systems, as follows.
[0075] For a given P-state, the energy spent in all sleep states CI and higher may be approximated as kP:Sieep since the differences are small relative to other approximations made. The active energy is kpco- Let Ecpu(T) denote the integral of the power over T.
Also, P-state changes are slow and hypervisor counters often allow updates at interval T such that the P-state is constant over T. Then,
assumin a single P-state p for the duration T, yields:
Figure imgf000017_0003
where ucpu(p) represents the percentage of time the processor was active during T, in fixed P-state p. Let a(p) = (kP,co - kp,sieep). Also let the sleep state energy in lowest frequency state, kpm,sieep, be denoted as ycpurfie and the increase in sleep state energy for higher frequencies be denoted y(p) = kpsieep - kpm,sieep. Then, taking T = 1 second for convenience: Ecpu = <x{p)ucpu (p) 4* lP) + />V.. < <A (5) where ucpu(p) is readily available as an hypervisor performance counter and may thus be used on a real platform to monitor the processor state. If the model parameters are known, the processor energy can be estimated. The method to learn parameters a(p), y(p), and cpu, is described in detail below together with the power models for other resources.
[0076] Assigning the CPU usage to relevant virtual machines requires accounting for the exact chip resources used by the virtual machine, including the shared caches and
processing components. Embodiments of the virtual machine power metering system 100 use a light weight approach that simply tracks when a virtual machine is active on a processor core (as is possible by tracking the context switches for every core). The energy used can be tracked using Equation (1) during the durations when the virtual machine is active on the processor. Processor context switch events are available on the many types of hypervisors. Re-writing in terms of a more readily available performance counter, the processor utilization of virtual machine A, denoted UCPU,A(P), and assuming that the P-state stays constant during interval T, the energy usage of a virtual machine A, denoted ECPU , can be written as
Ecpu . A = a (p) Ucpu , A. (p) (6) again taking T = 1 second. Note that the processor idle energy (spent in sleep states) is not included in the above discussion as embodiments of the power model tr aining module 200 report it separately.
IV.A.2. Practical Power Model for Memory
[0077] A true estimate of memory energy usage may use a cycle accurate simulation of its hardware design. However, prior memory power models have found that the key factor that affects memory energy usage is the read and write throughput. While external instrumentation has been attempted to capture memory throughput accurately, a low overhead estimate of memory throughput is the last level cache (LLC) miss counter available in most processors. Using this metric, memory power consumption may be written as: EMem i ) — mem ^LLCM ( T) -f m m (7) where EMem(T) represents the energy used by memory over time duration T, NLLCM(T) is the number of LLC misses during T, and amem and ymem are the linear model parameters. [0078] Since embodiments of the module 200 use LLC misses as an approximation for throughput, misses corresponding to each virtual machine are tracked. On a single core system, embodiments of the module 200 can assign the LLC misses during the time a virtual machine has the active context on the processor as the LLC misses for that virtual machine. Thus, the memory energy used by a virtual machine A becomes:
¾f em 5.A (^ ') = Orne m ^LL C M,A (8) where amem is the same as in Equation (7) and NLLCM.OPP represents the number of LLC misses while the virtual machine was active on the processor.
IV.A.3. Practical Power Model for Disk
[0079] While several power models have been developed for disks, this sub-system remains the hardest to model. The difficulty arises due to lack of visibility into the power states of a hard disk and the impact of disks' hardware caches. Further, in data center servers, disks are mostly used in RAID arrays and even when RAID-0 is used only the RAID controller hardware controls the physical disks while the hypervisor only sees the logical drives. Some embodiments of the power model training module 200 may use only hypervisor-observable parameters for ease of implementation. Alternate embodiments are possible, such as where the RAID controller is modified to expose the greater details about disk power states and mechanical motion.
[0080] The hypervisor can observe the number of bytes read and written as well as the service times for those reads and writes. However, for individual virtual machines, current hypervisors only track bytes read or written. These are used in the disk energy model, which can be written as follows:
EDi k(T)™ ν η 4- wbbw + ^usk (9) where EDisk(T) represents the energy consumed by the disk over time duration T, and br and bw are the number of bytes read and written respectively during the interval T. The a parameters and Ydisk are model parameters to be learned.
[0081] This model involves approximations since disk spin up/down actions, which are not visible outside of the RAID controller, are not captured. Variable spin speeds are not captured but, as multi-speed disks are not commonly used in data centers, this is not a serious concern.
[0082] As for other resources, embodiments of the power model training module 200 track the disk usage parameters in equation (9) for individual virtual machines. The time at which the disk activity occurs is usually not the same as when the virtual machine is active on the processor, since the hypervisor may batch 10 interrupts and buffer 10 operations. Thus instead of looking at storage system activity during the active context of a virtual machine, the 10 operations
need to be explicitly tracked in the hypervisor. Fortunately, the many hypervisors already do most of this tracking and can be used to report the 10 activity of each virtual machine separately. This yields the following disk energy model for a particular virtual machine A:
EoiBkfA ::::::: Ctrb * ¾\ , + *wbbn* ,A (10) where br and bW.A represent the number of bytes read and written, respectively, by virtual machine A. Further, it has been found that the difference in energies for disk read and write to be negligible and hence a common parameter, say b-0, can be used to represent the sum of bytes read and written. This changes the model to:
^ D is -l ') — (Xiohio + ^Disk ( 1 1 ) Virtual machine disk energy can, for the embodiment described above, be computed using:
Eoisk , A = to * bio, A ( 12)
IV.A.4. Other Resources
[0083] The dynamic range of power usage due to other resources was found to be small and they do not need to be modeled in all embodiments of the power model training module 200. However, the static energy use of those resources is included in the system idle energy in the model. Some of these resources may be important to model on other platforms. Some slower network cards, for example, do not show a wide variation in energy use with network activity but higher speed cards do use more energy in driving the physical medium, and this energy is likely to vary with network activity. With servers that use multiple such cards, modeling the network energy will thus be important.
[0084] Many servers do not vary their fan speeds, but if variable speed fans are used their contribution to dynamic energy may be modeled. Another effect that may be considered in some embodiments is the change in power supply efficiency as the power drawn changes.
IV.A.5. Power Model Parameter Training
[0085] The power models in Equations (5), (7), and (11) use certain coefficients, denoted by oc's, and ' s, that need to be learned in situ on the servers. Realistic platforms do not allow measuring ECPU(T), EMem(T), and EDisk(T) separately but only the full system power, denoted Esys(T). Define a quantity, Estatic(T), to represent the energy used by the non-modeled resources in the system. Then, assuming T = 1 as before, an equation can be written as:
Esy = Ecpu + EMe,n 1 ED t s k † E.tatic
~ &(p)Ucpu (p) ~†~ 1P) + cpu le. + tt iem NLL CM + mewt ~f~
Figure imgf000021_0001
[0086] The following points are worth noting regarding Equation (13) and lead to slight modifications for actual implementation. First, with the above summation, since the constants cpu die, mem, disk, and E static do not have any observable parameters that vary across observations, their individual values from cannot be learned from measurements of Esys. Their sum can hence be denoted as a single constant, . Second, note that the magnitude of ucpu(p) is a fraction between 0 and 1 while NLLCM and bio take values of the order of a hundred million. For numerical stability, NLLCM and bio can be normalized with respect to their maximum values observed on a system, such that the a parameters are scaled to similar magnitudes. The final equation used in learning thus becomes:
Esys— a {p) U pV (p) - l (p) + amem Urnem + < ioUdi sk + 7
( 14) where umem and Udisk represent the normalized value of NLLCM and bio respectively. In summary, there are m * 2 + 3 unknown model parameters: {a(p), (£>}™=1, amem, a.io, and y, where m is the number of P-states, and four observables that are available on real hardware without extra instrumentation: Esys, UCPU, umem,
and Udisk- [0087] Taking multiple observations of the observable quantities allows estimating the model parameters using learning techniques such as linear regression with ordinary least squares estimation. The output of the power model training module 200 are the optimal power model coefficients 260.
IV.B. Energy Calculation Module
[0088] The energy calculation module 210 uses the optimal power model coefficients from the power model training module 200 to obtain the power consumption for a virtual machine. In particular, The learned coefficient values are used in Equations (6), (8), and (12) and the computed resource energies are summed up to determine the energy usage of each VM. In fact the energy used by the virtual machine on each resource can be reported separately, if desired. There are two embodiments of the energy calculation module 210. Each of these embodiments now will be discussed.
IV.B.l. First Embodiment for Energy Calculation
[0089] The first embodiment of the energy calculation tracks resource percentages to obtain the power consumption of a virtual machine. FIG. 5 is a flow diagram illustrating the operational details of a first embodiment of the energy calculation module 210 shown in FIG. 2. The method begins by inputting the optimal power model coefficients (box 500). The method also inputs the percentages of each resource that is used by the virtual machine (box 510).
[0090] Each resource then is multiplied by its corresponding optimal power model coefficients to obtain intermediate resource results (box 520). Each of the intermediate resource results then are summed to obtain a power consumption for the virtual machine (box 530). For example, for each virtual machine the percentage of the CPU resource used by the virtual machine plus the percentage of the disk resource used by the virtual machine, and so forth, and is summed over all of the resources. The output of the module 210 is the power consumption of the virtual machine (box 540).
[0091] This first embodiment generates linearly independent combinations of power states in Equation (14) through controlled workloads that cause various resource power states to be used. This approach is a basic approach and has the lowest overhead since one workload is used for training. However, the controlled workloads may not use the processor and other resources in a manner representative of real workloads.
IV.B.2. Second Embodiment for Energy Calculation
[0092] The second embodiment of the energy calculation tracks more specific information than the first embodiment. This second embodiment is a variation on the first
embodiments that makes the equation a little more specific. Instead of just tracking resource percentages, we track a CPU% working for a virtual machine. This takes into account the type of work that the virtual machine is doing. This leads to a more complex equation that takes into account the nature of work being done by each of the virtual machines.
[0093] FIG. 6 is a flow diagram illustrating the operational details of a second
embodiment of the energy calculation module 210 shown in FIG. 2. The method begins by inputting the optimal power model coefficients (box 600). In addition, the method inputs the percentages of the CPU working for a particular virtual machine (box 610). Next, embodiments of the module 210 compute virtual-machine specific coefficients for each resource using the optimal power model coefficients (box 620).
[0094] For example, it could be the case where VMi is using 10% of the CPU (say 10 Watts) and VM2 is using 10% of the CPU (say 12 Watts). The second embodiment takes this into account. This means that the second embodiment also uses virtual machine- specific coefficients, such that the ( is the i value that works for VMi, ( 2 is the ( i value that works for VM2, and so forth. In other words, each of the al 's are for a first resource (such as the CPU resource), and the a2's are for a second resource (such as the disk resource), and so forth.
[0095] Each CPU percentage and resource percentage then is multiplied by its
corresponding virtual-machine specific coefficient to obtain intermediate resource results (box 630). Each of the intermediate resource results then are summed to obtain a power consumption for the virtual machine (box 640). The output of the module 210 is the power consumption of the virtual machine (box 650).
[0096] The power model augmentation module 290 augments the above power models by taking into account the fact that CPUs can operate a different frequencies. The power augmentation module 290 is used to overcome the accuracy limitations of the two embodiments set forth above. The general idea of the power augmentation module 290 is that the model parameters (or optimal power model coefficients) for each virtual machine are learned separately. Since for a given workload, the unobserved power states are highly correlated to the observed ones, the model based on a small number of observed states will capture the power usage more accurately. In fact, with the power model augmentation module 290, even if the processor counter for LLC misses is not captured due to portability limitations across processors, that simply becomes another unobserved state that for a given workload may be correlated with observed states.
V. Power Metering Application
[0097] The virtual machine power metering system 100 and method includes the virtual machine power metering techniques that were set forth above. These virtual machine power metering techniques can be used for various power management decisions. One of these power management decisions and applications will now be discussed, namely, power capping.
V.A. Power Capping Module
[0098] The power capping module 220 uses the power consumption computed for each virtual machine using the above techniques for a power capping technique for virtual machines. This virtual machine power capping allows virtual machines to be hosted in power over-subscribed circuits, leading to significant advantages in power provisioning costs.
[0099] In order to better explain the operation of the power capping module 220, the power capping concept in over-subscribed circuits as used for physical servers (without virtual machines) will first be described. Imagine that a server is rated for power Prat. Then, a conventionally provisioned power distribution circuit will have enough power capacity to supply Prat to all servers connected to that circuit. However, a server rated for Prat can only use Prat if each component of the server (CPU, disk, motherboard chipsets, memory, and so forth) is used to its full power at the same time. In reality, this does not happen. When the CPU is busy accessing the memory it is possible that the disk is not fully used, and when the CPU is busy processing data, it is possible that memory is not fully used. A realistic workload thus uses lower power than Prat. Suppose the maximum of the actual power used by any combination of software intended to be executed on the server is denoted Pact. Then, a power distribution circuit that has a capacity to supply only Pact per server could be used. Since Pact is lower than Prat, the power circuit is said to be over-subscribed since it is possible (though unlikely) that the power capacity is exceeded. For instance, capacity may be exceeded if some unexpected combination of software that uses more resources simultaneously than any previously measured software combination is executed.
[0100] If the power capacity is exceeded, the circuit breaker will trip causing multiple servers served by that circuit to stop working. This is clearly not desirable. Thus, it is desirable that if a server is provisioned for Pact power capacity, it stay within Pact. The method to ensure that each physical server stays within Pact is known as power capping. Typically, it is implemented in server motherboard's firmware. When the server exceeds Pact its resource usage is throttled (for instance the CPU frequency could be lowered) to keep power usage below Pact.
[0101] However, the above power capping method does not apply directly to virtual machines. As noted earlier in this document, if the physical server hosting multiple virtual machines is capped by throttling the physical resources (such as CPU frequency) in the server, it may cause multiple virtual machines to be throttled even when it is just one virtual machine that exceeded its power rating. Clearly it is desirable to enforce power capacity limits on each virtual machine separately. The power capping module 220 uses the virtual machine metering information described above to enable power capping for virtual machines.
[0102] FIG. 7 is a flow diagram illustrating the operational details of embodiments of the power capping module 220 shown in FIG. 2. First, the power usage of each virtual machine is monitored (box 700). The peak power drawn by that virtual machine is measured (box 710) and virtual machines are placed on physical host servers such that the sum of their peaks is less than or equal to the power capacity allocated to that server (box 720).
[0103] Next, a determination is made as to whether a virtual machine has exceeded its power budget (box 730). If not, then no action is taken (box 740). If so, then when a virtual machine has exceeded (or is about to exceed) its power budget, the module 290 checks whether this excess will cause the physical server's power budget to be exceeded (box 750). If some of the virtual machines are exceeding their budget but enough other virtual machines are operating below their budget such that the physical server as a whole is below the allocated power capacity limit, there is no danger of a circuit breaker tripping. In this case no action needs to be taken (box 740).
[0104] On the other hand, if one or more virtual machines are exceeding their power budget such that the physical server hosting those virtual machines has exceeded (or is about to exceed) its allocated power capacity, then an appropriate action is taken on individual virtual machines to reduce power usage on this server (box 760). Note that the entire server is not throttled (unlike existing techniques). The appropriate action includes one of multiple options. Note that each of the following options is shown in FIG. 7 as being connected with a dashed arrow to indicate that any of the four options may be used.
[0105] A first option is that the virtual machines that are exceeding their power budget are throttled to the power level that allows the server to be within its allocated capacity (box 765). Throttling may be implemented by reducing the amount of CPU time allocated to that VM. A second option is that the virtual machines are allowed to operate at higher power but some other lower priority virtual machines are throttled to reduce overall power usage (box 770). For example, a revenue generating virtual machine may be allowed to run above its budget and some background tasks may be throttled temporarily. A third option is that one or more virtual machines are migrated from this server to another server that has excess capacity (box 775). A fourth option is that one or more virtual machines are shut down to be executed later (box 780). Additional options, which are not shown in FIG. 7, include reducing the number of requests being submitted to the virtual machines, and increasing the number of virtual machines on other servers where excess power capacity.
[0106] In other embodiments of the power capping module 220, a variation of the above method is used. In reality, the virtual machines (or the applications hosted on the virtual machines) have a variable user load. The peak user load occurs only for a small duration of the day and at other times they are operating below peak. In effect the peak of the sum of the power usages of the virtual machine is lower than the sum of the peaks of the power usages. Also, the peak load of multiple virtual machines sharing a server may occur at different times. Hence, rather than allocating power capacity for peak load, some embodiments of the power capping module 220 allocate power capacity for a level lower than the peak. When the server power capacity is exceeded, an appropriate action similar to the above method (such as throttling individual VMs, migrating VMs to other servers, and so forth) is used.
V.B. Energy Metering of Distributed Activities
[0107] Computational activities in a computing infrastructure (such as a data center) often consist of multiple tasks performed by multiple virtual machines or server roles hosted on different physical machines. For example, a search query may be first received by a web front end service (also referred to as web frontend server role) and may be hosted inside a virtual machine. The search query then may be processed by multiple search data retrieval servers, advertisement servers, and content-rendering servers. The results of the processing are collected and sent to the user. As another example, an activity may consist of receiving an email for a particular user. First an incoming-email server role (possibly hosted within a virtual machine) will receive the message and determine the correct user account for the message. A virus-scanning server may check the contents of the email message. A spam-filtering server may check the contents of the message as well. Then a storage server may place the message in its storage. The activity in each case involves expenditure of energy on multiple distributed server roles some of which could be hosted on the same machine and some on different machines. It is desirable to determine the energy used by the activity as a whole.
[0108] The virtual machine power metering system 100 and method also includes a technique to determine the energy usage of an individual activity that may be distributed over several virtual machines. An activity is defined as a service or request. For example, a search query is an activity that has a verifiable beginning and end. [0109] In particular, when the activity is initiated an event is generated. This begins the tracking of each resource that is used by the activity. For example, when an activity uses a particular server role, the energy usage of the relevant server role is determined over the duration of the activity's use using previously described methods to monitor energy usage of virtual machines. As noted above, the activity could start and end on different virtual machines at different times. As the activity progresses, the energy used across multiple server roles is added up to generate the total energy used by the activity.
[0110] In addition, some energy may be spent by devices that are not part of the computing infrastructure as tracked by the virtual machine power metering method. These could include cooling equipment for the computing facility and such peripherals. The energy used in these devices is estimated as follows. The total energy used by those devices is measured using hardware power metering or using known power models (based on specifications and data sheets or learned in situ). Then, the total activity workload served by the computational facility is tracked using only the methods that monitor computing infrastructure that are accessible to the virtual machine power metering method. The total power consumption is then apportioned among all activities that are performed by the computing infrastructure. The total energy may be apportioned equally among all activities, divided in the ratio of their execution times, or in the ratios of their previously computed energy spent within the virtual machines.
[0111] The techniques presented herein use the term "energy usage" or "power consumption." These may be reported in units of energy (such as Joules or Kilowatt Hours). Power consumption may also be reported in terms of power drawn, using units of power (such as Watts). In the case where the reporting is in terms of power, a time series representing power drawn over unit time intervals may be provided.
VI. Exemplary Operating Environment
[0112] Embodiments of the virtual machine power metering system 100 and method are designed to operate in a computing environment. The following discussion is intended to provide a brief, general description of a suitable computing environment in which embodiments of the virtual machine power metering system 100 and method may be implemented.
[0113] FIG. 8 illustrates an example of a suitable computing system environment in which embodiments of the virtual machine power metering system 100 and method shown in FIGS. 1-7 may be implemented. The computing system environment 800 is only one example of a suitable computing environment and is not intended to suggest any limitation as to the scope of use or functionality of the invention. Neither should the computing environment 800 be interpreted as having any dependency or requirement relating to any one or combination of components illustrated in the exemplary operating environment.
[0114] Embodiments of the virtual machine power metering system 100 and method are operational with numerous other general purpose or special purpose computing system environments or configurations. Examples of well known computing systems, environments, and/or configurations that may be suitable for use with embodiments of the virtual machine power metering system 100 and method include, but are not limited to, personal computers, server computers, hand-held (including smartphones), laptop or mobile computer or communications devices such as cell phones and PDA's,
multiprocessor systems, microprocessor-based systems, set top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, distributed computing environments that include any of the above systems or devices, and the like.
[0115] Embodiments of the virtual machine power metering system 100 and method may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules include routines, programs, objects, components, data structures, etc., that perform particular tasks or implement particular abstract data types. Embodiments of the virtual machine power metering system 100 and method may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote computer storage media including memory storage devices. With reference to FIG. 8, an exemplary system for embodiments of the virtual machine power metering system 100 and method includes a general-purpose computing device in the form of a computer 810.
[0116] Components of the computer 810 may include, but are not limited to, a processing unit 820 (such as a central processing unit, CPU), a system memory 830, and a system bus 821 that couples various system components including the system memory to the processing unit 820. The system bus 821 may be any of several types of bus structures including a memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus architectures. By way of example, and not limitation, such
architectures include Industry Standard Architecture (ISA) bus, Micro Channel
Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus also known as Mezzanine bus.
[0117] The computer 810 typically includes a variety of computer readable media.
Computer readable media can be any available media that can be accessed by the computer 810 and includes both volatile and nonvolatile media, removable and nonremovable media. By way of example, and not limitation, computer readable media may comprise computer storage media and communication media. Computer storage media includes volatile and nonvolatile removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data.
[0118] Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by the computer 810. By way of example, and not limitation, communication media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared and other wireless media. Combinations of any of the above should also be included within the scope of computer readable media.
[0119] The system memory 830 includes computer storage media in the form of volatile and/or nonvolatile memory such as read only memory (ROM) 831 and random access memory (RAM) 832. A basic input/output system 833 (BIOS), containing the basic routines that help to transfer information between elements within the computer 810, such as during start-up, is typically stored in ROM 831. RAM 832 typically contains data and/or program modules that are immediately accessible to and/or presently being operated on by processing unit 820. By way of example, and not limitation, FIG. 8 illustrates operating system 834, application programs 835, other program modules 836, and program data 837.
[0120] The computer 810 may also include other removable/non-removable,
volatile/nonvolatile computer storage media. By way of example only, FIG. 8 illustrates a hard disk drive 841 that reads from or writes to non-removable, nonvolatile magnetic media, a magnetic disk drive 851 that reads from or writes to a removable, nonvolatile magnetic disk 852, and an optical disk drive 855 that reads from or writes to a removable, nonvolatile optical disk 856 such as a CD ROM or other optical media. [0121] Other removable/non-removable, volatile/nonvolatile computer storage media that can be used in the exemplary operating environment include, but are not limited to, magnetic tape cassettes, flash memory cards, digital versatile disks, digital video tape, solid state RAM, solid state ROM, and the like. The hard disk drive 841 is typically connected to the system bus 821 through a non-removable memory interface such as interface 840, and magnetic disk drive 851 and optical disk drive 855 are typically connected to the system bus 821 by a removable memory interface, such as interface 850.
[0122] The drives and their associated computer storage media discussed above and illustrated in FIG. 8, provide storage of computer readable instructions, data structures, program modules and other data for the computer 810. In FIG. 8, for example, hard disk drive 841 is illustrated as storing operating system 844, application programs 845, other program modules 846, and program data 847. Note that these components can either be the same as or different from operating system 834, application programs 835, other program modules 836, and program data 837. Operating system 844, application programs 845, other program modules 846, and program data 847 are given different numbers here to illustrate that, at a minimum, they are different copies. A user may enter commands and information (or data) into the computer 810 through input devices such as a keyboard 862, pointing device 861, commonly referred to as a mouse, trackball or touch pad, and a touch panel or touch screen (not shown).
[0123] Other input devices (not shown) may include a microphone, joystick, game pad, satellite dish, scanner, radio receiver, or a television or broadcast video receiver, or the like. These and other input devices are often connected to the processing unit 820 through a user input interface 860 that is coupled to the system bus 821, but may be connected by other interface and bus structures, such as, for example, a parallel port, game port or a universal serial bus (USB). A monitor 891 or other type of display device is also connected to the system bus 821 via an interface, such as a video interface 890. In addition to the monitor, computers may also include other peripheral output devices such as speakers 897 and printer 896, which may be connected through an output peripheral interface 895.
[0124] The computer 810 may operate in a networked environment using logical connections to one or more remote computers, such as a remote computer 880. The remote computer 880 may be a personal computer, a server, a router, a network PC, a peer device or other common network node, and typically includes many or all of the elements described above relative to the computer 810, although only a memory storage device 881 has been illustrated in FIG. 8. The logical connections depicted in FIG. 8 include a local area network (LAN) 871 and a wide area network (WAN) 873, but may also include other networks. Such networking environments are commonplace in offices, enterprise-wide computer networks, intranets and the Internet.
[0125] When used in a LAN networking environment, the computer 810 is connected to the LAN 871 through a network interface or adapter 870. When used in a WAN networking environment, the computer 810 typically includes a modem 872 or other means for establishing communications over the WAN 873, such as the Internet. The modem 872, which may be internal or external, may be connected to the system bus 821 via the user input interface 860, or other appropriate mechanism. In a networked environment, program modules depicted relative to the computer 810, or portions thereof, may be stored in the remote memory storage device. By way of example, and not limitation, FIG. 8 illustrates remote application programs 885 as residing on memory device 881. It will be appreciated that the network connections shown are exemplary and other means of establishing a communications link between the computers may be used.
[0126] The foregoing Detailed Description has been presented for the purposes of illustration and description. Many modifications and variations are possible in light of the above teaching. It is not intended to be exhaustive or to limit the subject matter described herein to the precise form disclosed. Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims appended hereto.

Claims

1. A method implemented on a computing device for computing power consumption of an activity that occurs across at least one virtual machine residing on a physical host server, comprising:
defining an activity period as an amount of time elapsed between a beginning event and an ending event;
tracking each resource on each virtual machine that is used in the activity during the activity period;
computing power consumption for each virtual machine working on the activity during the activity period to obtain power consumption measurements;
summing the power consumption measurements for each virtual machine working on the activity during the activity period to obtain the total activity power consumption; and
outputting the total activity power consumption representing the total power consumed for the activity.
2. The method of claim 1, further comprising:
computing a power model that relates a total resource utilization for a virtual machine to a total power consumption of the host server; and
computing a power consumption for the virtual machine using the power model and optimal power model coefficients.
3. The method of claim 1, further comprising:
computing total power consumption of devices used during the activity period but not tracked as a resource; and
determining a portion of the total power consumption of the untracked devices that was used in the activity during the activity period.
4. The method of claim 2, further comprising summing the power consumption measurements for each virtual machine working on the activity and the portion of the power consumption of the untracked devices using during the activity period to obtain the total activity power consumption.
5. The method of claim 4, further comprising:
generating a beginning event when the activity is initiated on the virtual machine; and
determining and marking an ending event when the activity is terminated and presented to a user.
6. The method of claim 2, further comprising:
determining percentages of each resource used by each virtual machine;
multiplying each resource percentage by its corresponding optimal power model coefficients to obtain intermediate resource results for each virtual machine; and
summing each of the intermediate resource results for each virtual machine to obtain the individual power consumption for each virtual machine.
7. The computer-implemented method of claim 2, further comprising:
determining percentages of a central processing unit (CPU) working for each virtual machine; and
using the optimal power model coefficients to compute virtual machine-specific coefficients for each resource for each virtual machine.
8. A method implemented on a computing device for individually controlling power consumption of a plurality of virtual machines residing on a physical host server, comprising:
using the computing device to perform the following:
monitoring a power consumption of each virtual machine on the host server;
measuring a peak power drawn by each virtual machine on the host server; determining whether any of the plurality of virtual machines has exceeded its power budget;
if so, then determining whether a power budget of the host server has been exceeded; and
if so, then reducing a power consumption of at least one of the virtual machines to reduced a total power consumption of the host server.
9. The method of claim 10, further comprising locating virtual machines on the host server based on measured peak power of each virtual machine.
10. The method of claim 10, further comprising:
generating a power model that relates a total resource utilization for each of the plurality of virtual machines to a total power consumption of the host server;
using the power model to obtain optimal power model coefficients; and computing a power consumption for each of the plurality of virtual machines using the power model and the optimal power model coefficients.
11. The method of claim 10, further comprising reducing power consumption of each virtual machine that is exceeding its power budget so as to not exceed the power budget of the host server.
12. The method of claim 10, further comprising reducing power consumption of each lower-priority virtual machines residing the host server so as to not exceed the power budget of the host server, where the lower-priority virtual machines have a lower priority as compared to virtual machines that have exceeded their power budget.
13. The method of claim 10, further comprising shutting down one or more of the virtual machines that have exceeded their power budget so as to not exceed the power budget of the host server.
14. The method of claim 10, further comprising migrating one or more virtual machines to another server so as to not exceed the power budget of the host server.
PCT/US2011/026353 2010-02-26 2011-02-25 Virtual machine power consumption measurement and management WO2011106725A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP18152419.0A EP3333668B1 (en) 2010-02-26 2011-02-25 Virtual machine power consumption measurement and management
CN201180011019.0A CN102770826B (en) 2010-02-26 2011-02-25 virtual machine power consumption measurement and management
EP11748196.0A EP2539791B1 (en) 2010-02-26 2011-02-25 Virtual machine power consumption measurement and management

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/714,416 2010-02-26
US12/714,416 US8862914B2 (en) 2010-02-26 2010-02-26 Virtual machine power consumption measurement and management

Publications (2)

Publication Number Publication Date
WO2011106725A2 true WO2011106725A2 (en) 2011-09-01
WO2011106725A3 WO2011106725A3 (en) 2012-01-05

Family

ID=44505945

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2011/026353 WO2011106725A2 (en) 2010-02-26 2011-02-25 Virtual machine power consumption measurement and management

Country Status (4)

Country Link
US (2) US8862914B2 (en)
EP (2) EP2539791B1 (en)
CN (2) CN104850488B (en)
WO (1) WO2011106725A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102567076A (en) * 2011-12-12 2012-07-11 华中科技大学 Selection method of server integration scheme based on energy consumption
WO2017165003A1 (en) * 2016-03-22 2017-09-28 Intel Corporation Control device for estimation of power consumption and energy efficiency of application containers

Families Citing this family (92)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4488072B2 (en) * 2008-01-18 2010-06-23 日本電気株式会社 Server system and power reduction method for server system
US8862914B2 (en) 2010-02-26 2014-10-14 Microsoft Corporation Virtual machine power consumption measurement and management
US8467823B2 (en) * 2010-03-24 2013-06-18 Fujitsu Limited Method and system for CPRI cascading in distributed radio head architectures
US8422884B2 (en) * 2010-03-24 2013-04-16 Fujitsu Limited Method and apparatus for picocell distributed radio heads providing macrocell capabilities
US9148428B1 (en) * 2011-05-25 2015-09-29 Bromium, Inc. Seamless management of untrusted data using virtual machines
US8589932B2 (en) * 2010-07-02 2013-11-19 International Business Machines Corporation Data processing workload control
WO2012004418A2 (en) * 2010-07-09 2012-01-12 Stratergia Ltd. Power profiling and auditing consumption systems and methods
US8707300B2 (en) * 2010-07-26 2014-04-22 Microsoft Corporation Workload interference estimation and performance optimization
US8756441B1 (en) * 2010-09-30 2014-06-17 Emc Corporation Data center energy manager for monitoring power usage in a data storage environment having a power monitor and a monitor module for correlating associative information associated with power consumption
US8825858B1 (en) 2010-11-04 2014-09-02 Sprint Communications Company L.P. Virtual server resource monitoring and management
EP2652685A2 (en) * 2010-12-13 2013-10-23 Fraunhofer USA, Inc. Methods and system for nonintrusive load monitoring
CN102759979B (en) * 2011-04-29 2016-03-30 国际商业机器公司 A kind of energy consumption of virtual machine method of estimation and device
US8645733B2 (en) * 2011-05-13 2014-02-04 Microsoft Corporation Virtualized application power budgeting
US20120303322A1 (en) * 2011-05-23 2012-11-29 Rego Charles W Incorporating memory and io cycle information into compute usage determinations
US20140343743A1 (en) * 2011-08-31 2014-11-20 Long Meadow Technologies, Llc System, method and apparatus for computing, monitoring, measuring, optimizing and allocating power and energy for a rod pumping system
JP5970461B2 (en) * 2011-09-07 2016-08-17 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America Virtual computer control device, virtual computer control method, virtual computer control program, and integrated circuit
US8892919B2 (en) 2011-12-14 2014-11-18 Ati Technologies Ulc Method and apparatus for power management of a processor in a virtual environment
US9154366B1 (en) 2011-12-14 2015-10-06 Sprint Communications Company L.P. Server maintenance modeling in cloud computing
US10095295B2 (en) 2011-12-14 2018-10-09 Advanced Micro Devices, Inc. Method and apparatus for power management of a graphics processing core in a virtual environment
GB2499185A (en) * 2012-01-31 2013-08-14 Qatar Foundation Gathering power consumption data for monitoring impact of virtual machines on host server resources
EP2823373B1 (en) * 2012-03-08 2019-07-17 Hewlett-Packard Development Company, L.P. Virtualizing battery across a group of personal mobile devices
CN102708000B (en) * 2012-04-19 2014-10-29 北京华胜天成科技股份有限公司 System and method for realizing energy consumption control through virtual machine migration
US8856567B2 (en) 2012-05-10 2014-10-07 International Business Machines Corporation Management of thermal condition in a data processing system by dynamic management of thermal loads
US9772669B2 (en) * 2012-05-21 2017-09-26 Lg Electronics Inc. Power device, power control device, and operating method therefor
US9292060B1 (en) * 2012-06-28 2016-03-22 Amazon Technologies, Inc. Allowing clients to limited control on power consumed by the cloud while executing the client's tasks
US9348391B1 (en) * 2012-06-28 2016-05-24 Amazon Technologies, Inc. Managing resource power states in shared environments
EP2866403B1 (en) 2012-07-20 2018-01-17 Huawei Technologies Co., Ltd. Resource management method and management server
US20140074561A1 (en) * 2012-09-12 2014-03-13 International Business Machines Corporation Configurable rating and metering
US9310864B1 (en) * 2012-09-19 2016-04-12 Amazon Technologies, Inc. Monitoring and real-time adjustment of power consumption settings
US9547353B1 (en) 2012-09-19 2017-01-17 Amazon Technologies, Inc. Processor energy monitoring and dynamic adjustment
US9311209B2 (en) 2012-11-27 2016-04-12 International Business Machines Corporation Associating energy consumption with a virtual machine
US9459685B1 (en) * 2012-12-10 2016-10-04 Arizona Board Of Regents On Behalf Of Northern Arizona University System and methods for optimizing energy efficiency in programmable devices
JP6056488B2 (en) * 2013-01-15 2017-01-11 富士通株式会社 Power consumption estimation device and power consumption estimation method
EP2772862B1 (en) * 2013-02-28 2017-12-20 BlackBerry Limited Electrical current estimation for electronic devices
CN103245829B (en) * 2013-05-24 2015-11-18 中国科学院计算技术研究所 A kind of virtual machine power consumption measurement method
KR20150001102A (en) * 2013-06-26 2015-01-06 한국전자통신연구원 Method and apparatus for server power prediction using virtual machine
US9268382B2 (en) * 2013-07-01 2016-02-23 Oracle International Corporation Method and system for monitoring resource usage of logical domains
US9563724B2 (en) 2013-09-28 2017-02-07 International Business Machines Corporation Virtual power management multiprocessor system simulation
CN104572412A (en) * 2013-10-29 2015-04-29 鸿富锦精密工业(深圳)有限公司 Method and system for making startup time statistics of virtual machines
US10289437B2 (en) 2014-01-07 2019-05-14 Red Hat Israel, Ltd. Idle processor management in virtualized systems via paravirtualization
US10365936B2 (en) 2014-02-27 2019-07-30 Red Hat Israel, Ltd. Idle processor management by guest in virtualized systems
US20150261648A1 (en) 2014-03-13 2015-09-17 Parth Malani Power monitoring system for virtual platform simulation
US9405569B2 (en) * 2014-03-17 2016-08-02 Ca, Inc. Determining virtual machine utilization of distributed computed system infrastructure
CN103914119B (en) * 2014-04-17 2017-01-18 江苏省电力公司盐城供电公司 Electric power measurement method and electric power measurement system for energy consumption of virtual machines
JP6303857B2 (en) * 2014-06-20 2018-04-04 富士通株式会社 Output program, output device, and output method
US20160077571A1 (en) * 2014-09-12 2016-03-17 Microsoft Corporation Heuristic Processor Power Management in Operating Systems
US10896432B1 (en) * 2014-09-22 2021-01-19 Amazon Technologies, Inc. Bandwidth cost assignment for multi-tenant networks
US20160131688A1 (en) * 2014-11-11 2016-05-12 Solarcity Corporation Determining an orientation of a metering device in an energy generation system
US10169104B2 (en) 2014-11-19 2019-01-01 International Business Machines Corporation Virtual computing power management
CN106170744B (en) * 2014-12-31 2019-07-19 华为技术有限公司 Energy consumption of virtual machine determines method, physical machine and network system
US9632553B2 (en) * 2015-03-12 2017-04-25 Vmware, Inc. Computing a unit rate of power consumed in a data center
US10437621B2 (en) * 2015-04-22 2019-10-08 Cisco Technology, Inc. Monitoring and managing applications on virtual machines using a proxy agent
KR102366778B1 (en) 2015-10-07 2022-02-24 한국전자통신연구원 Device for distributing the load and manage the power of virtualization server cluster and method thereof
US10234921B2 (en) * 2016-03-04 2019-03-19 Schneider Electric USA, Inc. Systems and methods for operating a virtual power environment
US10545560B2 (en) 2016-10-10 2020-01-28 International Business Machines Corporation Power management and utilization detection of computing components
US10838482B2 (en) 2016-10-10 2020-11-17 International Business Machines Corporation SLA-based power management in disaggregated computing systems
US10360077B2 (en) * 2016-10-10 2019-07-23 International Business Machines Corporation Measuring utilization of resources in datacenters
US11169592B2 (en) 2016-10-10 2021-11-09 International Business Machines Corporation SLA-based backup power management during utility power interruption in disaggregated datacenters
US10819599B2 (en) 2016-10-10 2020-10-27 International Business Machines Corporation Energy consumption as a measure of utilization and work characterization in a system
CN106412124B (en) * 2016-12-01 2019-10-29 广州高能计算机科技有限公司 A kind of and sequence cloud service platform task distribution system and method for allocating tasks
US10423209B2 (en) 2017-02-13 2019-09-24 Apple Inc. Systems and methods for coherent power management
US11669426B2 (en) * 2017-06-30 2023-06-06 International Business Machines Corporation Kernel-based power consumption and isolation and defense against emerging power attacks
US10496150B2 (en) 2017-07-13 2019-12-03 Red Hat, Inc. Power consumption optimization on the cloud
US10445209B2 (en) 2017-09-08 2019-10-15 Accenture Global Solutions Limited Prescriptive analytics based activation timetable stack for cloud computing resource scheduling
US10514956B2 (en) * 2017-09-20 2019-12-24 Qualcomm Incorporated Performance hint driven resource allocation
US10817041B2 (en) * 2017-11-17 2020-10-27 Philip Vaccaro Energy efficient computer process
US10922141B2 (en) 2017-12-11 2021-02-16 Accenture Global Solutions Limited Prescriptive analytics based committed compute reservation stack for cloud computing resource scheduling
US10719344B2 (en) 2018-01-03 2020-07-21 Acceture Global Solutions Limited Prescriptive analytics based compute sizing correction stack for cloud computing resource scheduling
US10708142B2 (en) 2018-01-24 2020-07-07 Keysight Technologies, Inc. Methods, systems, and computer readable media for providing cloud visibility
CN110362398B (en) 2018-04-09 2023-09-12 阿里巴巴集团控股有限公司 Scheduling method and system of virtual machine
US11188439B2 (en) * 2018-04-13 2021-11-30 Vmware, Inc. Usage pattern virtual machine idle detection
CN109101316B (en) * 2018-07-11 2022-10-04 南京邮电大学 Energy efficiency related cloud computing platform resource scheduling method
US10976801B2 (en) * 2018-09-20 2021-04-13 Intel Corporation System, apparatus and method for power budget distribution for a plurality of virtual machines to execute on a processor
CN109582531A (en) * 2018-10-15 2019-04-05 平安科技(深圳)有限公司 Monitoring method, device and the electronic equipment of power distribution unit
WO2020102930A1 (en) * 2018-11-19 2020-05-28 Alibaba Group Holding Limited Power management method
CN113056717B (en) * 2018-11-19 2023-09-19 阿里云计算有限公司 Method and apparatus for providing power management and computer readable storage medium
US10884484B2 (en) * 2018-12-20 2021-01-05 Dell Products, L.P. Extending a battery life of a battery-powered computing device
US10459757B1 (en) 2019-05-13 2019-10-29 Accenture Global Solutions Limited Prescriptive cloud computing resource sizing based on multi-stream data sources
US11269677B2 (en) * 2019-08-08 2022-03-08 Vigyanlabs Innovations Private Limited System and method to analyze and optimize application level resource and energy consumption by data center servers
US10817328B1 (en) * 2020-01-30 2020-10-27 Coupang Corp. Systems and methods for virtual server resource usage metric evaluation and performance tracking
US10817329B1 (en) * 2020-02-07 2020-10-27 Coupang Corp. Systems and methods for centralization and diagnostics for live virtual server performance data
US11429176B2 (en) * 2020-05-14 2022-08-30 Dell Products L.P. Intelligent and predictive optimization of power needs across virtualized environments
CN114237808A (en) * 2020-09-07 2022-03-25 华为云计算技术有限公司 Virtual machine management method and device based on server cabinet
WO2022048674A1 (en) * 2020-09-07 2022-03-10 华为云计算技术有限公司 Server cabinet-based virtual machine management method and apparatus
US11947436B2 (en) 2020-12-21 2024-04-02 International Business Machines Corporation Automatic evaluation of virtual machine computing power
US12001256B2 (en) * 2021-03-09 2024-06-04 Hewlett Packard Enterprise Development Lp Job agent-based power-capping of systems
CN113010273B (en) * 2021-03-23 2022-07-19 河北冀联人力资源服务集团有限公司 Human resource data distributed task processing method and system
US11669141B2 (en) * 2021-05-28 2023-06-06 Microsoft Technology Licensing, Llc Computing system including power nodes
CN113934533A (en) * 2021-09-26 2022-01-14 度小满科技(北京)有限公司 Service deployment method and device, storage medium and electronic equipment
US12117885B2 (en) 2021-12-28 2024-10-15 Seagate Technology Llc Electronic device power consumption grading
US20240061494A1 (en) * 2022-08-17 2024-02-22 Red Hat, Inc. Monitoring energy consumption associated with users of a distributed computing system using tracing
US20240201767A1 (en) * 2022-12-20 2024-06-20 Microsoft Technology Licensing, Llc Utilizing device signals to improve the recovery of virtual machine host devices

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050268078A1 (en) 2004-05-12 2005-12-01 Zimmer Vincent J Distributed advanced power management
US20060155415A1 (en) 2005-01-13 2006-07-13 Lefurgy Charles R Attributing energy consumption to individual code threads in a data processing system
US20090007108A1 (en) 2007-06-29 2009-01-01 Hanebutte Ulf R Arrangements for hardware and software resource monitoring
US20100011227A1 (en) 2008-07-11 2010-01-14 International Business Machines Corporation System and method for measurement-based power and energy accounting for virtual machines

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6408393B1 (en) * 1998-01-09 2002-06-18 Hitachi, Ltd. CPU power adjustment method
KR100613201B1 (en) * 2000-08-28 2006-08-18 마이크로코넥트 엘엘씨 Measuring method for cpu usage
US7543161B2 (en) * 2004-09-30 2009-06-02 International Business Machines Corporation Method and apparatus for tracking variable speed microprocessor performance caused by power management in a logically partitioned data processing system
US7721292B2 (en) * 2004-12-16 2010-05-18 International Business Machines Corporation System for adjusting resource allocation to a logical partition based on rate of page swaps and utilization by changing a boot configuration file
US8909946B2 (en) * 2005-11-15 2014-12-09 Microsoft Corporation Efficient power management of a system with virtual machines
US7856549B2 (en) * 2007-01-24 2010-12-21 Hewlett-Packard Development Company, L.P. Regulating power consumption
US8140885B2 (en) * 2007-02-14 2012-03-20 International Business Machines Corporation Accounting for microprocessor resource consumption
US7925901B2 (en) * 2007-03-15 2011-04-12 International Business Machines Corporation Method and system for estimating processor utilization from power measurements
KR101116615B1 (en) 2007-03-28 2012-03-07 삼성전자주식회사 Resource management system and method for applications and threads in JAVA Virtual Machine
US7861102B1 (en) * 2007-04-30 2010-12-28 Hewlett-Packard Development Company, L.P. Unified power management architecture
US20080301473A1 (en) * 2007-05-29 2008-12-04 International Business Machines Corporation Method and system for hypervisor based power management
US8046600B2 (en) * 2007-10-29 2011-10-25 Microsoft Corporation Collaborative power sharing between computing devices
US7904287B2 (en) * 2007-11-13 2011-03-08 International Business Machines Corporation Method and system for real-time prediction of power usage for a change to another performance state
US9395786B2 (en) * 2007-11-20 2016-07-19 Hewlett Packard Enterprise Development Lp Cross-layer power management in a multi-layer system
JP4488072B2 (en) * 2008-01-18 2010-06-23 日本電気株式会社 Server system and power reduction method for server system
WO2009110884A1 (en) 2008-03-03 2009-09-11 Hewlett-Packard Development Company, L.P. Software-based power-capping
JP5157717B2 (en) * 2008-07-28 2013-03-06 富士通株式会社 Virtual machine system with virtual battery and program for virtual machine system with virtual battery
US20100138829A1 (en) * 2008-12-01 2010-06-03 Vincent Hanquez Systems and Methods for Optimizing Configuration of a Virtual Machine Running At Least One Process
US20110004500A1 (en) * 2009-07-06 2011-01-06 Microsoft Corporation Allocating a resource based on quality-of-service considerations
US8751844B2 (en) * 2009-09-24 2014-06-10 Citrix Systems, Inc. Systems and methods for attributing an amount of power consumption to a workload
US8862914B2 (en) 2010-02-26 2014-10-14 Microsoft Corporation Virtual machine power consumption measurement and management

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050268078A1 (en) 2004-05-12 2005-12-01 Zimmer Vincent J Distributed advanced power management
US20060155415A1 (en) 2005-01-13 2006-07-13 Lefurgy Charles R Attributing energy consumption to individual code threads in a data processing system
US20090007108A1 (en) 2007-06-29 2009-01-01 Hanebutte Ulf R Arrangements for hardware and software resource monitoring
US20100011227A1 (en) 2008-07-11 2010-01-14 International Business Machines Corporation System and method for measurement-based power and energy accounting for virtual machines

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2539791A4

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102567076A (en) * 2011-12-12 2012-07-11 华中科技大学 Selection method of server integration scheme based on energy consumption
CN102567076B (en) * 2011-12-12 2014-12-17 华中科技大学 Selection method of server integration scheme based on energy consumption
WO2017165003A1 (en) * 2016-03-22 2017-09-28 Intel Corporation Control device for estimation of power consumption and energy efficiency of application containers
US10432491B2 (en) 2016-03-22 2019-10-01 Intel Corporation Control device for estimation of power consumption and energy efficiency of application containers

Also Published As

Publication number Publication date
US20110213997A1 (en) 2011-09-01
EP2539791B1 (en) 2018-03-28
EP3333668B1 (en) 2020-04-15
EP2539791A4 (en) 2017-05-17
WO2011106725A3 (en) 2012-01-05
EP3333668A3 (en) 2018-07-18
US8862914B2 (en) 2014-10-14
EP2539791A2 (en) 2013-01-02
CN102770826B (en) 2016-03-23
CN104850488A (en) 2015-08-19
CN104850488B (en) 2018-04-06
US20140351613A1 (en) 2014-11-27
EP3333668A2 (en) 2018-06-13
CN102770826A (en) 2012-11-07
US9575539B2 (en) 2017-02-21

Similar Documents

Publication Publication Date Title
US9575539B2 (en) Virtual machine power consumption measurement and management
Bohra et al. VMeter: Power modelling for virtualized clouds
Jin et al. Energy efficiency and server virtualization in data centers: An empirical investigation
Krishnan et al. VM power metering: feasibility and challenges
Shen et al. Cloudscale: elastic resource scaling for multi-tenant cloud systems
Iyer et al. VM3: Measuring, modeling and managing VM shared resources
Dhiman et al. vgreen: A system for energy-efficient management of virtual machines
Feller et al. Energy management in IaaS clouds: a holistic approach
Lin et al. Experimental and quantitative analysis of server power model for cloud data centers
Xiao et al. Virtual machine power measuring technique with bounded error in cloud environments
Hwang et al. A study of the effectiveness of CPU consolidation in a virtualized multi-core server system
Lim et al. Padd: Power aware domain distribution
Islam et al. Online energy budgeting for cost minimization in virtualized data center
De Maio et al. An improved model for live migration in data centre simulators
Piga et al. Empirical and analytical approaches for web server power modeling
El-Refaey et al. Virtual systems workload characterization: An overview
Peng et al. A low-cost power measuring technique for virtual machine in cloud environments
El Motaki et al. Modeling the correlation between the workload and the power consumed by a server using stochastic and non‐parametric approaches
Chen et al. Using CPU gradients for performance-aware energy conservation in multitier systems
Krzywda et al. Modeling and simulation of qos-aware power budgeting in cloud data centers
Kothari et al. Joulemeter: Virtual machine power measurement and management
Amin et al. Profiling-based energy-aware recommendation system for cloud platforms
Okonor et al. Intelligent agent-based technique for virtual machine resource allocation for energy-efficient cloud data centre
Diouani et al. Energy Consumption Modeling and Prediction in the Cloud Data Centers.
Subramaniam Metrics, models and methodologies for energy-proportional computing

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201180011019.0

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11748196

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 6953/CHENP/2012

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2011748196

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE