US20200112520A1 - Resource allocation using restore credits - Google Patents

Resource allocation using restore credits Download PDF

Info

Publication number
US20200112520A1
US20200112520A1 US16/154,518 US201816154518A US2020112520A1 US 20200112520 A1 US20200112520 A1 US 20200112520A1 US 201816154518 A US201816154518 A US 201816154518A US 2020112520 A1 US2020112520 A1 US 2020112520A1
Authority
US
United States
Prior art keywords
credits
client
restore
server
issuing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US16/154,518
Other versions
US10630602B1 (en
Inventor
Keyur B. Desai
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
EMC Corp
Original Assignee
EMC IP Holding Co LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by EMC IP Holding Co LLC filed Critical EMC IP Holding Co LLC
Assigned to EMC IP Holding Company LLC reassignment EMC IP Holding Company LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: DESAI, KEYUR B.
Priority to US16/154,518 priority Critical patent/US10630602B1/en
Assigned to THE BANK OF NEW YORK MELLON TRUST COMPANY, N.A. reassignment THE BANK OF NEW YORK MELLON TRUST COMPANY, N.A. SECURITY AGREEMENT Assignors: CREDANT TECHNOLOGIES, INC., DELL INTERNATIONAL L.L.C., DELL MARKETING L.P., DELL PRODUCTS L.P., DELL USA L.P., EMC CORPORATION, EMC IP Holding Company LLC, FORCE10 NETWORKS, INC., WYSE TECHNOLOGY L.L.C.
Priority to DE112019005042.7T priority patent/DE112019005042T5/en
Priority to GB2104643.8A priority patent/GB2591928B/en
Priority to CN201980066453.5A priority patent/CN112805684A/en
Priority to PCT/US2019/043976 priority patent/WO2020076394A1/en
Priority to US16/836,350 priority patent/US11005776B2/en
Publication of US20200112520A1 publication Critical patent/US20200112520A1/en
Publication of US10630602B1 publication Critical patent/US10630602B1/en
Application granted granted Critical
Assigned to THE BANK OF NEW YORK MELLON TRUST COMPANY, N.A. reassignment THE BANK OF NEW YORK MELLON TRUST COMPANY, N.A. SECURITY AGREEMENT Assignors: CREDANT TECHNOLOGIES INC., DELL INTERNATIONAL L.L.C., DELL MARKETING L.P., DELL PRODUCTS L.P., DELL USA L.P., EMC CORPORATION, EMC IP Holding Company LLC, FORCE10 NETWORKS, INC., WYSE TECHNOLOGY L.L.C.
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/70Admission control; Resource allocation
    • H04L47/78Architectures of resource allocation
    • H04L47/783Distributed allocation of resources, e.g. bandwidth brokers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1464Management of the backup or restore process for networked environments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1469Backup restoration techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/70Admission control; Resource allocation
    • H04L47/80Actions related to the user profile or the type of traffic
    • H04L47/805QOS or priority aware
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/70Admission control; Resource allocation
    • H04L47/82Miscellaneous aspects
    • H04L47/822Collecting or measuring resource availability data
    • H04L67/2842
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • H04L67/568Storing data temporarily at an intermediate stage, e.g. caching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/42

Definitions

  • Embodiments of the present invention relate to systems and methods for allocating resources. More particularly, embodiments of the invention relate to systems and methods for stream or resource allocation when performing data protection operations such as restore operations. Appendix A forms part of the present disclosure and is incorporated herein in its entirety by this reference.
  • data protection operations e.g., backup, restore
  • QOS quality of service
  • these issues arise when some clients are using too many resources and other clients are therefore neglected or unable to acquire the necessary resources.
  • the QOS often suffers when the demand for resources is more than the node or cluster can bear. To avoid this circumstance or to correct this circumstance, there is a need to throttle requests from any particular client at any particular time. Consequently, systems and methods are needed to fairly allocate resources while, at the same time, ensuring or meeting quality of service requirements.
  • FIG. 1 illustrates an example of a server configured to allocate resources to clients
  • FIG. 2 further illustrates resource allocation including stream allocation in the context of cluster or server resources
  • FIG. 3A illustrates an example of a method for performing resource allocation and in particular for allocating streams in a computing environment
  • FIG. 3B illustrates an example of a method for evaluating a stream allocation state of a node or a server or a cluster
  • FIG. 3C further illustrates the method for evaluating the stream allocation state of FIG. 3B .
  • FIG. 4 illustrates an example of a system in which restore credits are allocated to requesting clients.
  • FIG. 5 is a flow diagram illustrating an example of a method for performing resource allocation during a data protection operation such as a restore operation.
  • Embodiments of the invention relate to systems and methods for performing data protection operations.
  • data protection operations include, but are not limited to, resource allocation operations including stream allocations, read allocations, segment processing allocations, or the like.
  • Data protection operations may also include backup operations, restore operations, deduplication operations, mirroring operations, data replication operations, and the like or combination thereof.
  • Embodiments of the invention relate to systems and methods for allocating resources in a computing environment. Embodiments of the invention further relate to systems and methods for measuring and improving quality of service and for throttling clients in the context of resource allocation. Embodiments of the invention further relate to systems and methods for allocating streams to clients, allocating restore credits, for example when performing restore operations.
  • a cluster of servers may have resources that can be allocated to clients. These resources include streams, reads, writes, processing, deduplication, or the like.
  • a particular server for example, may be able to provide x number of streams, or a certain number of reads/writes. As a whole, the cluster can also provide a larger number of streams, reads/writes, and processing. Embodiments of the invention relate to systems and methods for allocating these resources.
  • FIG. 1 illustrates an example of a computing environment in which clients communicate with a server (or a cluster).
  • the resources allocated to the client include streams.
  • a client may be able to establish multiple streams with multiple servers.
  • a server can establish multiple streams with multiple clients.
  • a safe allocation state is one in which all of the resource requests can be granted and serviced until completion. This is achieved using a credit system.
  • the credit system can be used to allocate different types of resources and/or to allocate multiple resources at the same time.
  • the number of credits granted by the server or cluster may be equal to the number of credits requested, less than the number of credits requested, greater than the number of credits requested, zero, or negative. Issuing zero or negative credits allows the server to fully use resources but also throttle when necessary. This also allows the server or cluster to recover from an unsafe state and return to a safe allocation state.
  • the credits may be described as follows:
  • FIG. 1 illustrates a server (e.g., a data protection or backup server) 110 that provides resources to clients, represented by clients 102 , 104 , 106 and 108 .
  • the server 110 may also represent a cluster of nodes or servers.
  • the clients 102 , 104 , 106 and 108 are streaming data (e.g., backup data or streams, restore streams, streams that include data for processing such as deduplication, etc.) to/from the server 110 .
  • the client 102 may be backing up a plurality of virtual machines, a database, a file system, or other data type using streams 112 .
  • the client 104 is associated with streams 114
  • the client 106 is associated with streams 116
  • the client 108 is associated with streams 118 .
  • the server 110 is configured to allocate streams to the clients 102 , 104 , 106 and 108 .
  • the server 102 is configured to perform stream allocation using, in one example, stream credits.
  • the stream credits can be managed using a resource allocation table 120 that allows the state of allocation (e.g., safe, unsafe) to be determined. Whenever credits are issued (regardless of type), the allocation table 120 is updated so that subsequent requests can be evaluated.
  • a request for stream credits is evaluated to determine whether granting the request results in a safe allocation state. Generally, the request is granted if the resulting allocation state is safe. If the request results in an unsafe allocation state, then the request is denied, for example by issuing zero credits or by issuing negative credits.
  • a server may grant x number of streams per credit, for example.
  • the server 110 may grant a stream credit to a requesting client if it is possible for all streams associated with all clients to finish executing.
  • the server 110 may not know when a particular client stream will terminate or how may more stream credits different clients will have requested by the time that the particular client stream finishes, the server 110 may assume that all clients will eventually attempt to acquire their maximum allowed stream credits, use the stream credits, and then release the stream credits.
  • the server may determine if the stream allocation state is safe by finding a hypothetical set of stream credit requests by the clients that would allow each client to acquire its maximum requested stream credits and use the stream credits. If there is a state where no such set exists, this may result in the server 110 granting zero stream credits or negative stream credits. This may cause clients that receive these grants or requests to return any stream credits being held. Stated differently, the ability to grant or issue zero credits or negative credits allows the clients to be throttled. In one example, the client may self-throttle because they may not have sufficient credits or because they may need to return credits to the server 110 . In this manner, the server then attempts to get back to a safe stream allocation state in order to grant the requested credits.
  • Embodiments of the invention may allocate resources when the allocation state of the system resulting from a particular allocation is safe. If the proposed allocation results in an unsafe state, then the allocation may be made to return the system to a safe allocation state (e.g., by issuing negative or zero credits).
  • a safe allocation state e.g., by issuing negative or zero credits.
  • C be the number of clients in the system and N be the number nodes or servers in the system.
  • Total (Maximum Streams) Availability Matrix A matrix of length N indicating a maximum number of available stream resources for each node.
  • TAM[j] k, there are k instances of stream resource Rj available.
  • C ⁇ N Current Allocation Matrix
  • CALM[i,j] k, then client Ci is currently allocated k instances of stream resource Rj.
  • CAM Current Availability Matrix
  • CAM[j] TAM[j] ⁇ (CALM[C 0 ]+CALM[C 1 ]+ . . . +CALM[CN]);
  • CDM Current Demand Matrix
  • client Ci may request at most k instances of stream resource Rj.
  • CCM Current Need Matrix
  • the server determines if it is safe to allocate stream credits in response to the client credits requested.
  • the system is in safe state, if at a given point in time, all client credit requests can be satisfied, i.e. for all clients, their stream resource needs are less that the current streams availability for all the nodes in a system.
  • FIG. 2 illustrates a cluster that includes nodes or servers 202 and clients 204 . More specifically, FIG. 2 illustrates four nodes or servers: N 1 , N 2 , N 3 and N 4 . FIG. 2 also illustrates clients C 1 , C 2 and C 3 (clients 204 ) that use resources of the servers 202 . In this example, the resources of the servers 202 allocated to the clients 204 includes streams 206 . The streams 206 may include backup streams, restore streams, or other data streams.
  • the TAM or total maximum streams available on each of the nodes is represented as follows:
  • N 1 has 60 streams for allocation to clients.
  • N 2 , N 3 and N 4 have 50, 70 and 60 streams, respectively, for allocation to clients.
  • the total maximum streams can be determined by considering the number of processors and cores on a server and by determining how much processing power a stream consumes.
  • the total maximum streams can be determined in other ways, such as by testing or by user input.
  • the CALM matrix below indicates the stream credits that have already been allocated to the client C 1 -C 3 .
  • clients C 1 , C 2 and C 3 have the following stream credits already allocated to them.
  • the CAM or the current streams available (or streams that have not been allocated) can be calculated from the TAM and CALM above. For example: Node N 1 has 60 maximum streams that it can allocate from the TAM matrix above. Node N 1 has already allocated 10 streams to C 1 , C 2 and C 3 respectively. So total streams currently available on N 1 is
  • the CAM identifies which nodes or servers are providing the streams allocated to the clients 204 .
  • the clients 204 can connect to any of the servers 202 and can therefore request credits from any of the servers 202 in the cluster.
  • the following CDM defines the maximum client stream credit request at a given point in time.
  • the following matrix defines how many streams each client can request from each of the servers at a given point in time.
  • These numbers or maximums can be predetermined and set by an administrator. Further, these numbers may be dynamic and may be based on the number of clients and/or the number of servers. As the numbers of servers and clients changed, the point in time stream credit request numbers may change.
  • N ⁇ ⁇ 1 N ⁇ ⁇ 2 N ⁇ ⁇ 3 N ⁇ ⁇ 4 C ⁇ ⁇ 1 30 30 20 20 C ⁇ ⁇ 2 10 20 30 40 C ⁇ ⁇ 3 10 30 50 00 CDM - N ⁇ ⁇ 1 N ⁇ ⁇ 2 N ⁇ ⁇ 3 N ⁇ ⁇ 4 C ⁇ ⁇ 1 10 20 20 10 C ⁇ ⁇ 2 10 00 30 30 C ⁇ ⁇ 3 10 20 10 00 CALM N ⁇ ⁇ 1 N ⁇ ⁇ 2 N ⁇ ⁇ 3 N ⁇ ⁇ 4 C ⁇ ⁇ 1 20 10 00 10 C ⁇ ⁇ 2 00 20 00 10 C ⁇ ⁇ 3 00 10 40 00 CNM
  • C 1 requests and acquires 20 N 1 stream credits, 10 N 2 stream credits and 10 N 4 stream credits to achieve is maximum requested credits.
  • the server may perform this determination prior to actually granting the request.
  • the cluster still has 10 N 1 streams, 00 N 2 streams, 10 N 3 streams and 10 N 4 streams available.
  • C 1 terminates the processes associated with the streams and returns 30 N 1 , 30 N 2 , 20 N 3 and 20 N 4 stream credits back to the system.
  • the cluster now has 40 N 1 , 30 N 2 , 30 N 3 , and 30 N 4 total streams available.
  • C 2 now acquires 20 N 1 streams and 10 N 4 streams. C 2 then terminates and returns all of its stream credits.
  • the available streams are or equals:
  • C 3 acquires 10 N 2 and 40 N 3 streams, terminates and returns all streams (returns stream credits). This results in the following:
  • a stream allocation safe state indicates that stream credits can be granted or issued.
  • Embodiments of the invention contemplate several different kinds of credits that can be requested and granted.
  • a server grants “Equal” credits.
  • the CALM streams currently allocated to the clients 204 is now as follows (this assumes that C 3 's request for 10 N 3 credits is granted):
  • N ⁇ ⁇ 1 N ⁇ ⁇ 2 N ⁇ ⁇ 3 N ⁇ ⁇ 4 C ⁇ ⁇ 1 30 30 20 20 C ⁇ ⁇ 2 10 20 30 40 C ⁇ ⁇ 3 10 30 50 00 CDM - N ⁇ ⁇ 1 N ⁇ ⁇ 2 N ⁇ ⁇ 3 N ⁇ ⁇ 4 C ⁇ ⁇ 1 10 20 20 10 C ⁇ ⁇ 2 10 00 30 30 C ⁇ ⁇ 3 10 20 20 00 CALM N ⁇ ⁇ 1 N ⁇ ⁇ 2 N ⁇ ⁇ 3 N ⁇ ⁇ 4 C ⁇ ⁇ 1 20 10 00 10 C ⁇ ⁇ 2 00 20 00 10 C ⁇ ⁇ 3 00 10 30 00 CNM
  • C 1 can acquire 20 N 1 , 10 N 2 and 10 N 4 streams, use them and release them. Then, C 2 can acquire 20 N 2 and 10 N 4 streams, use them and release them. Finally, C 3 can acquire 10 N 2 and 30 N 3 streams, use them and then release them. Therefore, this new allocation state is safe.
  • the request from C 3 for 10 streams credits on node N 3 is granted.
  • This is an example of a server granting stream credits equal to the number of stream credits requested by the client.
  • the server may decide to grant 10 stream credits (which is a partial grant because 20 stream credits were requested). As previously stated with respect to the previous example, granting 10 stream credits to C 3 from N 3 results in a safe allocation state. This illustrates an example of a partial grant of stream credits.
  • the CALM or currently allocated streams according to the initial state are the CALM or currently allocated streams according to the initial state:
  • the CDM or the point in time maximum requested streams is determined as follows:
  • N ⁇ ⁇ 1 N ⁇ ⁇ 2 N ⁇ ⁇ 3 N ⁇ ⁇ 4 C ⁇ ⁇ 1 30 30 20 20 C ⁇ ⁇ 2 10 20 30 40 C ⁇ ⁇ 3 10 30 50 00 CDM - N ⁇ ⁇ 1 N ⁇ ⁇ 2 N ⁇ ⁇ 3 N ⁇ ⁇ 4 C ⁇ ⁇ 1 10 20 20 10 C ⁇ ⁇ 2 10 10 30 30 C ⁇ ⁇ 3 10 20 10 00 CALM N ⁇ ⁇ 1 N ⁇ ⁇ 2 N ⁇ ⁇ 3 N ⁇ ⁇ 4 C ⁇ ⁇ 1 20 10 00 10 C ⁇ ⁇ 2 00 10 00 10 C ⁇ ⁇ 3 00 10 40 00 CNM
  • C 1 is unable to acquire enough streams from N 2 i.e. from the CNM above, it needs 10 streams from N 2 .
  • the number of streams available for N 2 is 0.
  • C 2 is unable to acquire enough streams from N 2
  • C 3 is unable to acquire enough streams from N 2 .
  • None of the clients in this example can acquire enough stream credits to achieve their maximum allowed stream credits. As a result, this state is not safe and the server 202 may throttle one or more of the clients 204 and recover from the unsafe allocation state by issuing negative credits. In other words, the servers 202 recover from this unsafe state by throttling and issuing negative credits.
  • the server N 2 may grant negative 20 stream credits to C 1 .
  • N 2 grants zero credits to clients C 2 and C 3 (i.e., clients C 2 and C 3 throttle and retry their requests after some time).
  • Client C 1 returns the 20 stream credits it holds to N 2 and the safe allocation state check is performed to determine if the state is safe.
  • Stream credits are used to perform resource allocation.
  • the stream allocation method can be applied to many types of streams.
  • the stream allocation method may maintain stable stream allocation states by granting negative/zero credits to various clients. Further, embodiments of the invention allow for different types of credit grants as previously described.
  • stream credits may be prefetched. If a client holds no stream credits (or even if the client holds some stream credits) and if there are enough free streams on the server, the server can grant the client more credits then requested.
  • Prefetching credits may be requested, for example based on anticipated workloads. This may apply, for example, during a restore operation where the stream credits are used in anticipation of restoring a stream by reading a backup.
  • Granted credits can also be used to make decisions related to the sizing of the client size cache. This relates, for example, to reading ahead with stream credits used for the restore operation, performing an intelligent read ahead, or using credits to manage the cost of a solution.
  • a partial grant of credits can allow operations to be partially completed. Further, stream credits can be retrieved from the clients by issuing negative credits and flushing the number of negative credits from a client's cache. In other words, a client may be throttled if the number of granted credits is zero or negative. Further different credit allocation methods may be implemented based on the type of credits requested.
  • FIG. 3A illustrates an example of a method for performing resource allocation.
  • various parameters associated with the resource allocation may be defined 302 or determined. For example, a determination may be made regarding how many streams each node or server can safely support. This may be based on number of processors/cores, memory, write/read parameters or the like. For example, a relationship between writes, processor or core consumption may be determined. If a predetermined number of writes or a data transmission rate consumes 1% of a CPU, then a stream at that transmission rate may correspond to 1 credit. Also, the maximum number of streams allowed per client may be determined.
  • This aspect of the method 300 may be performed at a single time. However, this aspect of the method 300 can be reevaluated as nodes are added/removed or as clients are added/removed from the system. These values may also account for other functions performed by the servers 202 that may not involve streams or that may not involve the particular resource being allocated. Further, these values may be able to vary based on other factors such as time of day. For example, when the processor is not required for other tasks such as during a slower period, it may be possible to temporarily increase the number of available streams.
  • the method 300 enforces or performs the allocation method. For example, a request for stream credits may be received 304 . This request is evaluated as discussed previously to determine whether the requested allocation results in a safe allocation state. Thus, the server may evaluate 306 the stream state or the allocation state by hypothetically granting the request. This involves considering whether the other clients could still be allocated their maximum credits. As previously stated, in one embodiment, it is assumed that clients may ultimately request, use and release their maximum credits allowed. The evaluation thus determines what the allocation state would be if the request were granted.
  • the server then issues credits 308 according to the result (the determined allocation state) to the requesting client (and/or to other clients). If the allocation state is safe, the server may issue credits equal to the request or greater than equal to the request. If the allocation state is not safe, a partial grant may occur that still results in a safe allocation state. If the allocation state is not safe, the server may issue zero or negative credits. In one example, the zero and/or negative credits could be issued to any of the clients.
  • FIG. 3B illustrates an example of evaluating the stream state in more detail. More specifically, FIG. 3B illustrates an example of evaluating the server stream state 306 shown in FIG. 3A .
  • the method 320 illustrates an example of evaluating the server stream state 306 .
  • the server may calculate the TAM 322 , which determines the total streams available. The server may then lookup the CALM 324 . The CALM identifies the streams that are currently allocated to the clients.
  • a determination 332 is made as to whether the stream allocation state is safe or unsafe. If the stream allocation state is not safe, then zero or negative credits are granted 340 . If the stream allocation state is safe, then credits are granted. For example, partial credits may be granted 334 , equal credits may be granted 336 , or prefetch credits may be granted 338 . The credits are then issued 308 . In one example, issuing credits 308 may be part of the method 320 and is incorporated into the granting of credits 334 , 336 , 338 or 340 .
  • FIG. 3C illustrates an example of determining a stream allocation state. More specifically, FIG. 3C illustrates an example of determining if the stream allocation state is safe 332 in FIG. 3B .
  • the method 348 may be performed for each client 350 . Staring with a first client 350 , a determination is made to determine 352 if CNM is greater than CDM. Thus, if the current need is not greater than the current demand, then the state is unsafe 354 and negative or zero credits are issued or granted as shown in FIG. 3B .
  • the stream availability after granting the maximum stream requests for the client is determined 356 . This computation may be performed as if the requested credits were granted to determine whether the resulting state is safe. Further, all clients, in one embodiment, are evaluated as a whole to determine whether the stream allocation state is safe.
  • the stream availability ( 356 ) is determined by subtracting the streams acquired by the client to reach the client's maximum demand 360 from the number of streams currently available 358 (this may be done as a whole or on a per server or node basis). This result is then added to the streams returned by the client after the demand is processed 362 . In other words, the system evaluates the state assuming, in one example, that the clients requested and are granted their maximum possible streams.
  • FIGS. 3A-3C thus illustrate an example of a method for allocating resources such that the allocation state of the system is safe.
  • a proposed allocation of resources e.g., a request from a client
  • the allocation may be zero or negative, which allows the system to either avoid an unsafe allocation state or return to a safe allocation state.
  • restore credits are another example of credits.
  • a restore operation may involve reading data from a backup maintained by a backup server and transmitting or sending the data read from the backup to a restore location or device.
  • Restore credits improve the operation of a client or server by helping a client with their read ahead cache allocation/sizing, helping a client perform an intelligent read ahead, and improving the performance of the restore operation.
  • embodiments of the invention help in restoring only data that is needed. This may avoid costs associated with data that is not needed for a restore operation.
  • Embodiments of the invention allow the read ahead cache or buffer of a client to be sized or tuned based on the size of the data being restored. This is useful because, in one example, the client may not know the size of the data to be restored.
  • the server can aid in the allocation and size of the client read ahead cache by providing prefetch restore credits.
  • FIG. 4 illustrates an example of a client performing a restore operation using restore credits and/or stream credits.
  • the server (or a cluster) may be able to support multiple clients using restore and/or stream credits.
  • a client 402 is restoring restored data 406 to storage 404 (or other device/machine).
  • the restored data 406 may be a database, a virtual machine, a file system, or the like.
  • the restored data 4006 is restored from the backups 412 (e.g., a particular backup). Data read from the backups 412 is thus read by the server 408 and transmitted to the client 402 .
  • the client then writes the data to the restored data 406 .
  • the client 402 may request restore credits from the server 408 (e.g., a backup server).
  • the server 408 may maintain an allocation table 414 that allows restore credits and/or stream credits to be tracked.
  • the client 402 does not know the size of the dataset to be restored.
  • the server 408 may know the size of the dataset because this context is present on the server 408
  • the client 402 desire to read 1 GB of data and may therefore request 4 restore credits from the server 408 .
  • the client may also set a “prefetch” flag, which indicates to the server 408 that this is a sequential restore and that the server could grand more credits than requested. If the dataset to be restored is 4 GB, the server 408 may grant 16 restore credits to the client 402 even though 4 restore credits were requested.
  • the number of restore credits granted to the client 402 can be used to adjust the size of the client's read ahead cache buffers 416 .
  • a large number of credits may cause the client 402 to increase the size of the read ahead buffer or cache to a size that can accommodate the amount of data associated with the granted credits.
  • the buffers 416 may be sized in a manner that accounts for a rate at which the cached data is restored to the restored data 406 .
  • the size of the buffers 416 may dynamically adapt to the number of credits held by the client 402 .
  • the server 408 may ignore the prefetch flag and choose to grant the number restore credits in the amount requested by the client. This may be done because reading ahead and restoring or reading more data than needed can be expensive in a cloud environment.
  • the number of restore credits granted by the server 408 can be granted in a manner that is similar to the manner in which stream credits are granted.
  • the granted amount of restore credits can be equal to the number of restore credits requested, less than the number of restore credits requested, greater than the number of credits requested by the client, zero, or negative.
  • the client 402 would use the restore credits to perform restore operations.
  • the restore credits are used to read data and, as the data is read, the restore credits are accordingly used or returned. Use of each chunk of reads would result in using of each Restore credits.
  • the restore request cannot be performed (or may only be partially performed) and restore credits are returned by the client to the server. This allows the server to achieve a safe allocation state.
  • the client 402 can choose to unilaterally release the those restore credits.
  • the server 408 may update its internal credit accounting database, the allocation table 414 , to account for release of the restore credits from a particular client.
  • the server 408 may perform a credit allocation method.
  • Embodiments of the invention contemplate that many metrics can be used in determining the credit allocation or the credit allocation state. Examples include, machine capabilities (connections, processors, cores, memory size, memory types, client connections, existing streams, available resources, and the like or combination thereof.
  • FIG. 5 illustrates an example of a method for allocating restore credits.
  • the method 500 may include steps or acts that are not performed each time the method is performed.
  • the amount or number of reads that consume 1% of a processor or core e.g., a CPU or central processing unit
  • a processor or core e.g., a CPU or central processing unit
  • gathering statistical data by doing empirical restore of data can be used to qualify this number.
  • the percentage of CPU utilization during various restore runs or operations of different sizes and/or data types can be obtained or measured.
  • the average, for example, of these observations can be used to calculate number of reads that consume 1% of data.
  • the average number of per core reads allowed is determined 504 . In one example, this is determined by multiplying the number of reads that consume 1% of the CPU with the average percentage of free CPU per core. If the average percentage of free CPU per core is less than a threshold (e.g., 2%), then the credits granted to all clients is zero or negative.
  • a threshold e.g., 2%
  • the maximum credits per client are determined 506 . This may be determined by multiplying the average per core reads allowed with the number of CPU cores and then dividing by the number of client connections.
  • the maximum credits per client represents the maximum number of credits that a client may acquire.
  • the allocation table accounts for credits that have already been issued to the client. For example, if a client's maximum credits is 100 and 60 have already been granted, a request for 50 restore credits may result in a grant of partial credits or zero credits or negative credits.
  • the allocation table is updated as credits are granted, released, etc.
  • the number of credits per client are determined 508 . This is distinct from the maximum credits because this act or step may account for a tuning factor that can be adjusted or is configurable.
  • the tuning factor allows embodiments of the invention to factor in a reserve value into the resources being allocated.
  • the tuning factor may be 50-70% of the maximum restore credits.
  • credits may be issued to requesting clients 510 .
  • the number of credits issued may be determined, by way of example, only by using the minimum of the restore credits requested and the calculated credits per client. If the client has requested prefetch, then the number of restore credits issued may be a maximum of the requested restore credits and the calculated credits per client.
  • embodiments of the invention may ensure that the grant does not result in an unsafe allocation state. For example, requesting credits that exceeds a client's maximum credits may result in an unsafe allocation state. Further, the credits already used by the client and other clients may also be considered when granting credits. Also, when determining the allocation state, the average percentage of free CPU per core may be determined. If the grant drops the average percentage of free CPU below a threshold, then the grant may be for zero credits or negative credits.
  • the restore credits can be managed in a manner similar to the stream credits such that each request for restore credits is considered in the context of all available restore credits rather than each client's maximum allowed clients or calculated restore credits based on the tuning factor.
  • restore credits are an example of stream credits at least because the data being restored is also streamed from the server to the clients.
  • these credit types can also be used together.
  • the stream credits can be used to manage the number of streams and the restore credits may determine how much data a particular client can read for all of the client's streams.
  • the present invention can be implemented in numerous ways, including as a process, an apparatus, a system, a device, a method, or a computer readable medium such as a computer readable storage medium or a computer network wherein computer program instructions are sent over optical or electronic communication links.
  • Applications may take the form of software executing on a general purpose computer or be hardwired or hard coded in hardware.
  • these implementations, or any other form that the invention may take, may be referred to as techniques.
  • the order of the steps of disclosed processes may be altered within the scope of the invention.
  • a computer may include a processor and computer storage media carrying instructions that, when executed by the processor and/or caused to be executed by the processor, perform any one or more of the methods disclosed herein.
  • embodiments within the scope of the present invention also include computer storage media, which are physical media for carrying or having computer-executable instructions or data structures stored thereon.
  • Such computer storage media can be any available physical media that can be accessed by a general purpose or special purpose computer.
  • such computer storage media can comprise hardware such as solid state disk (SSD), RAM, ROM, EEPROM, CD-ROM, flash memory, phase-change memory (“PCM”), or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other hardware storage devices which can be used to store program code in the form of computer-executable instructions or data structures, which can be accessed and executed by a general-purpose or special-purpose computer system to implement the disclosed functionality of the invention. Combinations of the above should also be included within the scope of computer storage media.
  • Such media are also examples of non-transitory storage media, and non-transitory storage media also embraces cloud-based storage systems and structures, although the scope of the invention is not limited to these examples of non-transitory storage media.
  • Computer-executable instructions comprise, for example, instructions and data which cause a general purpose computer, special purpose computer, or special purpose processing device to perform a certain function or group of functions.
  • module or ‘component’ can refer to software objects or routines that execute on the computing system.
  • the different components, modules, engines, and services described herein may be implemented as objects or processes that execute on the computing system, for example, as separate threads. While the system and methods described herein can be implemented in software, implementations in hardware or a combination of software and hardware are also possible and contemplated.
  • a ‘computing entity’ may be any computing system as previously defined herein, or any module or combination of modules running on a computing system.
  • a hardware processor is provided that is operable to carry out executable instructions for performing a method or process, such as the methods and processes disclosed herein.
  • the hardware processor may or may not comprise an element of other hardware, such as the computing devices and systems disclosed herein.
  • embodiments of the invention can be performed in client-server environments, whether network or local environments, or in any other suitable environment.
  • Suitable operating environments for at least some embodiments of the invention include cloud computing environments where one or more of a client, server, or target virtual machine may reside and operate in a cloud environment.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Computer And Data Communications (AREA)
  • Hardware Redundancy (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

Systems and methods for allocating resources are disclosed. Resources such as streams are allocated using restore credits. Credits are issued to the clients in a manner that ensure the system is operating in a safe allocation state. The credits can be used not only to allocate resources but also to throttle clients where necessary. Credits can be granted fully, partially, and in a number greater than requested. Zero or negative credits can also be issued to throttle clients. Restore credits are associated with reads and may be allocated by determining how many credits a CPU/cores can support. This maximum number may be divided amongst clients connected with the server.

Description

    FIELD OF THE INVENTION
  • Embodiments of the present invention relate to systems and methods for allocating resources. More particularly, embodiments of the invention relate to systems and methods for stream or resource allocation when performing data protection operations such as restore operations. Appendix A forms part of the present disclosure and is incorporated herein in its entirety by this reference.
  • BACKGROUND
  • In a single node or a distributed/scaleout cluster environment, allocating resources can be a challenging task. The task is further complicated when attempting to ensure that the resources are allocated fairly to all of the clients using the available resources. For example, any one client should not be able to have an unfairly large share of the available resources. At the same time, there is a need to satisfy quality of service (QOS) requirements.
  • More specifically, data protection operations (e.g., backup, restore) are often associated with resource allocation issues and quality of service (QOS) issues. These issues arise when some clients are using too many resources and other clients are therefore neglected or unable to acquire the necessary resources. In addition, the QOS often suffers when the demand for resources is more than the node or cluster can bear. To avoid this circumstance or to correct this circumstance, there is a need to throttle requests from any particular client at any particular time. Consequently, systems and methods are needed to fairly allocate resources while, at the same time, ensuring or meeting quality of service requirements.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • In order to describe the manner in which at least some aspects of this disclosure can be obtained, a more particular description will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings. Understanding that these drawings depict only example embodiments of the invention and are not therefore to be considered to be limiting of its scope, embodiments of the invention will be described and explained with additional specificity and detail through the use of the accompanying drawings, in which:
  • FIG. 1 illustrates an example of a server configured to allocate resources to clients;
  • FIG. 2 further illustrates resource allocation including stream allocation in the context of cluster or server resources;
  • FIG. 3A illustrates an example of a method for performing resource allocation and in particular for allocating streams in a computing environment;
  • FIG. 3B illustrates an example of a method for evaluating a stream allocation state of a node or a server or a cluster; and
  • FIG. 3C further illustrates the method for evaluating the stream allocation state of FIG. 3B.
  • FIG. 4 illustrates an example of a system in which restore credits are allocated to requesting clients; and
  • FIG. 5 is a flow diagram illustrating an example of a method for performing resource allocation during a data protection operation such as a restore operation.
  • DETAILED DESCRIPTION OF SOME EXAMPLE EMBODIMENTS
  • Embodiments of the invention relate to systems and methods for performing data protection operations. Examples of data protection operations include, but are not limited to, resource allocation operations including stream allocations, read allocations, segment processing allocations, or the like. Data protection operations may also include backup operations, restore operations, deduplication operations, mirroring operations, data replication operations, and the like or combination thereof.
  • Embodiments of the invention relate to systems and methods for allocating resources in a computing environment. Embodiments of the invention further relate to systems and methods for measuring and improving quality of service and for throttling clients in the context of resource allocation. Embodiments of the invention further relate to systems and methods for allocating streams to clients, allocating restore credits, for example when performing restore operations.
  • In one example, a cluster of servers (or a single server or node) may have resources that can be allocated to clients. These resources include streams, reads, writes, processing, deduplication, or the like. A particular server, for example, may be able to provide x number of streams, or a certain number of reads/writes. As a whole, the cluster can also provide a larger number of streams, reads/writes, and processing. Embodiments of the invention relate to systems and methods for allocating these resources.
  • FIG. 1 illustrates an example of a computing environment in which clients communicate with a server (or a cluster). In this example, the resources allocated to the client include streams. A client may be able to establish multiple streams with multiple servers. Similarly, a server can establish multiple streams with multiple clients.
  • These resources (and/or other resources including read resources, write resources, processing resources, etc.) are allocated such that the server of cluster operates in a safe allocation state. A safe allocation state is one in which all of the resource requests can be granted and serviced until completion. This is achieved using a credit system. In order to account for multiple scenarios, there are different types of credits that can be granted. Each type, however, may relate to the resources being allocated. The different types of credits effectively represent a different response to credit requests. The credit system can be used to allocate different types of resources and/or to allocate multiple resources at the same time.
  • For example, the number of credits granted by the server or cluster may be equal to the number of credits requested, less than the number of credits requested, greater than the number of credits requested, zero, or negative. Issuing zero or negative credits allows the server to fully use resources but also throttle when necessary. This also allows the server or cluster to recover from an unsafe state and return to a safe allocation state. By way of example, the credits may be described as follows:
      • Prefetch credits: More than the number of credits requested by clients.
      • Partial credits: Less than (but greater than 0) number of credits requested by clients.
      • Equal credits: Equal to the number of credits requested by clients.
      • Zero credits: Equal to zero, indicating, current client request cannot be processed. The client needs to wait and retry obtaining credits.
      • Negative credits: A negative number, indicating to the client to release the number of cached credits.
      • The zero and negative credits allow the server to throttle a request from a client.
  • FIG. 1 illustrates a server (e.g., a data protection or backup server) 110 that provides resources to clients, represented by clients 102, 104, 106 and 108. The server 110 may also represent a cluster of nodes or servers. In one example, the clients 102, 104, 106 and 108 are streaming data (e.g., backup data or streams, restore streams, streams that include data for processing such as deduplication, etc.) to/from the server 110. The client 102, for example, may be backing up a plurality of virtual machines, a database, a file system, or other data type using streams 112. Similarly, the client 104 is associated with streams 114, the client 106 is associated with streams 116, and the client 108 is associated with streams 118.
  • In this example, the server 110 is configured to allocate streams to the clients 102, 104, 106 and 108. The server 102 is configured to perform stream allocation using, in one example, stream credits. The stream credits can be managed using a resource allocation table 120 that allows the state of allocation (e.g., safe, unsafe) to be determined. Whenever credits are issued (regardless of type), the allocation table 120 is updated so that subsequent requests can be evaluated.
  • In one example, a request for stream credits is evaluated to determine whether granting the request results in a safe allocation state. Generally, the request is granted if the resulting allocation state is safe. If the request results in an unsafe allocation state, then the request is denied, for example by issuing zero credits or by issuing negative credits.
  • In the following disclosure and by way of example only, it is assumed that 1 stream available is associated with 1 stream credit granted. In other words and by way of example only, 1 credit represents 1 stream. Other credit per resource allocation schemes could be different. A server may grant x number of streams per credit, for example. The server 110 may grant a stream credit to a requesting client if it is possible for all streams associated with all clients to finish executing.
  • Because the server 110 may not know when a particular client stream will terminate or how may more stream credits different clients will have requested by the time that the particular client stream finishes, the server 110 may assume that all clients will eventually attempt to acquire their maximum allowed stream credits, use the stream credits, and then release the stream credits.
  • On these assumptions, the server may determine if the stream allocation state is safe by finding a hypothetical set of stream credit requests by the clients that would allow each client to acquire its maximum requested stream credits and use the stream credits. If there is a state where no such set exists, this may result in the server 110 granting zero stream credits or negative stream credits. This may cause clients that receive these grants or requests to return any stream credits being held. Stated differently, the ability to grant or issue zero credits or negative credits allows the clients to be throttled. In one example, the client may self-throttle because they may not have sufficient credits or because they may need to return credits to the server 110. In this manner, the server then attempts to get back to a safe stream allocation state in order to grant the requested credits.
  • Embodiments of the invention may allocate resources when the allocation state of the system resulting from a particular allocation is safe. If the proposed allocation results in an unsafe state, then the allocation may be made to return the system to a safe allocation state (e.g., by issuing negative or zero credits). The following discussion, with regard to stream credits, includes the following. This allocation method is described in more detail with regard to FIGS. 3B and 3C described below.
  • In one example, let C be the number of clients in the system and N be the number nodes or servers in the system.
  • Total (Maximum Streams) Availability Matrix (TAM): A matrix of length N indicating a maximum number of available stream resources for each node.
  • TAM[j]=k, there are k instances of stream resource Rj available.
  • Current Allocation Matrix (CALM): A C×N matrix that defines the number of stream resources currently allocated to each client.
  • CALM[i,j]=k, then client Ci is currently allocated k instances of stream resource Rj.
  • Current Availability Matrix (CAM): A matrix of length N indicating the current number of streams available for each node type. This is determined by adding currently allocated streams for all the clients on each individual nodes and subtracting the result from the total maximum streams for that node.
  • CAM[j]=TAM[j]−(CALM[C0]+CALM[C1]+ . . . +CALM[CN]);
  • Current Demand Matrix (CDM): An C×N matrix that defines the current demand or the point in time maximum requested streams.
  • If CDM[i,j]=k, then client Ci may request at most k instances of stream resource Rj.
  • Current Need Matrix (CNM): A C×N matrix indicates the stream credit needs for each clients. (Need=Demand−Allocated).
  • CNM[i,j]=CDM[i,j]−CALM[i,j].
  • At any point of time, the server determines if it is safe to allocate stream credits in response to the client credits requested. The system is in safe state, if at a given point in time, all client credit requests can be satisfied, i.e. for all clients, their stream resource needs are less that the current streams availability for all the nodes in a system.
  • CNM[i, j]<CAM[j]
  • If stream needs of a client is greater than the streams available (CNM[i, j]>CAM[j]), the system is considered unsafe (unsafe allocation state) and negative or zero credits are granted to clients and an effort is made to bring the system to safe/stable stream allocation state.
  • The following examples illustrate this process in more detail. FIG. 2 illustrates a cluster that includes nodes or servers 202 and clients 204. More specifically, FIG. 2 illustrates four nodes or servers: N1, N2, N3 and N4. FIG. 2 also illustrates clients C1, C2 and C3 (clients 204) that use resources of the servers 202. In this example, the resources of the servers 202 allocated to the clients 204 includes streams 206. The streams 206 may include backup streams, restore streams, or other data streams.
  • As an example, let us assume that in FIG. 2, the TAM or total maximum streams available on each of the nodes is represented as follows:
  • N1 N2 N3 N4
    60 50 70 60
  • Thus, N1 has 60 streams for allocation to clients. Similarly, N2, N3 and N4 have 50, 70 and 60 streams, respectively, for allocation to clients.
  • The total maximum streams can be determined by considering the number of processors and cores on a server and by determining how much processing power a stream consumes. The total maximum streams can be determined in other ways, such as by testing or by user input.
  • The CALM matrix below indicates the stream credits that have already been allocated to the client C1-C3. In this example, assume that clients C1, C2 and C3 have the following stream credits already allocated to them.
  • CALM
    N1 N2 N3 N4
    C1 10 20 20 10
    C2 10 00 30 30
    C3 10 20 10 00
  • The CAM or the current streams available (or streams that have not been allocated) can be calculated from the TAM and CALM above. For example: Node N1 has 60 maximum streams that it can allocate from the TAM matrix above. Node N1 has already allocated 10 streams to C1, C2 and C3 respectively. So total streams currently available on N1 is
    • CAM[N1]=TAM[N1]−(CALM[0, C1]+CALM[0, C2]+CALM[0, C3]) i.e.
    • CAM[N1]=60−(10+10+10)=30.
    • Similarly,
    • CAM[N2]=50−(20+0+20)=10.
    • CAM[N3]=70−(20+30+10)=10.
    • CAM[N4]=60−(10+30+0)=20
  • N 1 N 2 N 3 N 4 60 50 70 60 TAM - N 1 N 2 N 3 N 4 C 1 10 20 20 10 C 2 10 00 30 30 C 3 10 20 10 00 CALM = N 1 N 2 N 3 N 4 30 10 10 20 CAM
  • More generally, the CAM identifies which nodes or servers are providing the streams allocated to the clients 204. As previously stated, The clients 204 can connect to any of the servers 202 and can therefore request credits from any of the servers 202 in the cluster.
  • The following CDM defines the maximum client stream credit request at a given point in time. In other words, the following matrix defines how many streams each client can request from each of the servers at a given point in time. These numbers or maximums can be predetermined and set by an administrator. Further, these numbers may be dynamic and may be based on the number of clients and/or the number of servers. As the numbers of servers and clients changed, the point in time stream credit request numbers may change.
  • CDM
    N1 N2 N3 N4
    C1 30 30 20 20
    C2 10 20 30 40
    C3 10 30 50 00
  • By subtracting Current Allocated streams Matric (CALM) from Current Demand Matrix (CDM), the total stream credit needed or the CNM for C1, C2 and C3 can be determined as follows:
  • N 1 N 2 N 3 N 4 C 1 30 30 20 20 C 2 10 20 30 40 C 3 10 30 50 00 CDM - N 1 N 2 N 3 N 4 C 1 10 20 20 10 C 2 10 00 30 30 C 3 10 20 10 00 CALM = N 1 N 2 N 3 N 4 C 1 20 10 00 10 C 2 00 20 00 10 C 3 00 10 40 00 CNM
  • Using the above information, it is possible to determine whether each client can acquire and use its maximum requested stream credits. The following format is used in the following discussion <xx xx xx xx> represents streams associated with, respectively, nodes N1, N2, N3 and N4.
  • For example, from the CNM, C1 requests and acquires 20 N1 stream credits, 10 N2 stream credits and 10 N4 stream credits to achieve is maximum requested credits. The server may perform this determination prior to actually granting the request.
  • After C1 requests and acquires the available streams are now determined as follows:
  • <30 10 10 20> (CAM or available streams)−
  • <20 10 00 10> (streams acquired by C1 to reach C1's max)=
  • <10 00 10 10> (Streams still available)
  • Thus, the cluster still has 10 N1 streams, 00 N2 streams, 10 N3 streams and 10 N4 streams available.
  • Next, C1 terminates the processes associated with the streams and returns 30 N1, 30 N2, 20 N3 and 20 N4 stream credits back to the system. These are the streams associated with the C1 row in the CDM. Adding it to the streams currently available <10 00 10 10>+<30 30 20 20>=<40 30 30 30> As a result, the cluster now has 40 N1, 30 N2, 30 N3, and 30 N4 total streams available. This <40 30 30 30> is less than or equal to the TAM <60 50 70 60> or the total maximum stream for each node of the cluster i.e. <40 30 30 30><=<60 50 70 60> so the system state is safe to allocate and to process next client request.
  • C2 now acquires 20 N1 streams and 10 N4 streams. C2 then terminates and returns all of its stream credits. In this example and after these steps, the available streams are or equals:
  • <40 30 30 30> (streams currently available prior to C2's request)−
  • <00 20 00 10> (streams acquired by C2 to reach C2's max)=
  • <40 30 30 30>−<00 20 00 10>=<40 10 30 20> (streams still available)+
  • <10 20 30 40> (streams associated with the C2 row in the CDM)<10 20 30 40>+<40 10 30 20>=<50 30 60 60> (streams available after C2 returns stream credits). This <50 30 60 60> is less than or equal to the TAM <60 50 70 60> or the total maximum stream for each node of the cluster i.e. <50 30 60 60><=<60 50 70 60> so the system state is safe to allocate and process to process next client request.
  • Next, C3 acquires 10 N2 and 40 N3 streams, terminates and returns all streams (returns stream credits). This results in the following:
  • <50 30 60 60> (currently available streams prior to C3's)−
  • <00 10 40 00> (streams acquired by C3 to reach C3's max)+
  • <10 30 50 00> (streams returned by C3)=
  • <60 50 70 60> (stream credits available). This <60 50 70 60> is less than or equal to the TAM <60 50 70 60> or the total maximum stream for each node of the cluster i.e. <60 50 70 60><=<60 50 70 60> so the system state is safe to allocate and process to process next client request.
  • This demonstrates that because it is possible for each client to acquire its maximum requested stream credits and use the stream credits, the stream allocation states are safe and stream credits can be granted to all clients as described above.
  • A stream allocation safe state indicates that stream credits can be granted or issued. Embodiments of the invention contemplate several different kinds of credits that can be requested and granted.
  • The following examples illustrate these types of credits and illustrates whether the credits are granted.
  • EXAMPLE 1
  • A server grants “Equal” credits.
  • Starting in the same state as the previous example started in, assume C3 requests 10 streams credits on node N3. In this example, there are enough available streams such that the credit request can be granted. After the grant, the new stream allocation state is as follows:
  • CAM or the Available streams on nodes:
  • N1 N2 N3 N4
    Available 30 10 00 20
    Streams
  • The CALM streams currently allocated to the clients 204 is now as follows (this assumes that C3's request for 10 N3 credits is granted):
  • CALM
    N1 N2 N3 N4
    C1 10 20 20 10
    C2 10 00 30 30
    C3 10 20 20 00
  • Now, the clients maximum requested streams is as follows:
  • CDM
    N1 N2 N3 N4
    C1 30 30 20 20
    C2 10 20 30 40
    C3 10 30 50 00
  • With this information, a determination can be made as to whether the new stream allocation state is safe.
  • N 1 N 2 N 3 N 4 C 1 30 30 20 20 C 2 10 20 30 40 C 3 10 30 50 00 CDM - N 1 N 2 N 3 N 4 C 1 10 20 20 10 C 2 10 00 30 30 C 3 10 20 20 00 CALM = N 1 N 2 N 3 N 4 C 1 20 10 00 10 C 2 00 20 00 10 C 3 00 10 30 00 CNM
  • In the above example, C1 can acquire 20 N1, 10 N2 and 10 N4 streams, use them and release them. Then, C2 can acquire 20 N2 and 10 N4 streams, use them and release them. Finally, C3 can acquire 10 N2 and 30 N3 streams, use them and then release them. Therefore, this new allocation state is safe.
  • Because the new state is safe, the request from C3 for 10 streams credits on node N3 is granted. This is an example of a server granting stream credits equal to the number of stream credits requested by the client.
  • EXAMPLE 2
  • Server grants “Partial” credits
  • Starting in the same state that the previous example started in, assume C3 requests 20 streams credits on N3. In this example, the streams available before granting the requested stream credits is as follows:
  • N1 N2 N3 N4
    30 10 10 20
  • The streams available after granting the stream credits is as follows:
  • N1 N2 N3 N4
    30 10 −10 20
  • Because the number of total streams available after the grant is less than zero, the server may decide to grant 10 stream credits (which is a partial grant because 20 stream credits were requested). As previously stated with respect to the previous example, granting 10 stream credits to C3 from N3 results in a safe allocation state. This illustrates an example of a partial grant of stream credits.
  • EXAMPLE 3
  • “Zero” or “Negative” stream credit allocation
  • From the previous starting state, assume that client C2 requests 10 stream credits from node N2. In this example, there are enough streams to grant stream credits. Assuming that the request is granted, the new state would be:
  • CAM or the Available streams on nodes:
  • N1 N2 N3 N4
    Available 30 00 10 20
    Streams
  • The CALM or currently allocated streams according to the initial state:
  • CALM
    N1 N2 N3 N4
    C1 10 20 20 10
    C2 10 10 30 30
    C3 10 20 10 00
  • The CDM or the point in time maximum requested streams is determined as follows:
  • CDM
    N1 N2 N3 N4
    C1 30 30 20 20
    C2 10 20 30 40
    C3 10 30 50 00
  • Now a determination is made to determine if the new allocation state is safe. Assuming that clients C1, C2 and C3 request more stream credits from N2 and N3.
  • N 1 N 2 N 3 N 4 C 1 30 30 20 20 C 2 10 20 30 40 C 3 10 30 50 00 CDM - N 1 N 2 N 3 N 4 C 1 10 20 20 10 C 2 10 10 30 30 C 3 10 20 10 00 CALM = N 1 N 2 N 3 N 4 C 1 20 10 00 10 C 2 00 10 00 10 C 3 00 10 40 00 CNM
  • In this case, C1 is unable to acquire enough streams from N2 i.e. from the CNM above, it needs 10 streams from N2. However, according to the CAM above, the number of streams available for N2 is 0. Also, C2 is unable to acquire enough streams from N2, and C3 is unable to acquire enough streams from N2.
  • None of the clients in this example can acquire enough stream credits to achieve their maximum allowed stream credits. As a result, this state is not safe and the server 202 may throttle one or more of the clients 204 and recover from the unsafe allocation state by issuing negative credits. In other words, the servers 202 recover from this unsafe state by throttling and issuing negative credits.
  • For example, the server N2 may grant negative 20 stream credits to C1. Optionally, N2 grants zero credits to clients C2 and C3 (i.e., clients C2 and C3 throttle and retry their requests after some time). Client C1 returns the 20 stream credits it holds to N2 and the safe allocation state check is performed to determine if the state is safe.
  • Stream credits are used to perform resource allocation. The stream allocation method can be applied to many types of streams. The stream allocation method may maintain stable stream allocation states by granting negative/zero credits to various clients. Further, embodiments of the invention allow for different types of credit grants as previously described.
  • More specifically, stream credits may be prefetched. If a client holds no stream credits (or even if the client holds some stream credits) and if there are enough free streams on the server, the server can grant the client more credits then requested.
  • Prefetching credits may be requested, for example based on anticipated workloads. This may apply, for example, during a restore operation where the stream credits are used in anticipation of restoring a stream by reading a backup.
  • Granted credits can also be used to make decisions related to the sizing of the client size cache. This relates, for example, to reading ahead with stream credits used for the restore operation, performing an intelligent read ahead, or using credits to manage the cost of a solution.
  • A partial grant of credits can allow operations to be partially completed. Further, stream credits can be retrieved from the clients by issuing negative credits and flushing the number of negative credits from a client's cache. In other words, a client may be throttled if the number of granted credits is zero or negative. Further different credit allocation methods may be implemented based on the type of credits requested.
  • FIG. 3A illustrates an example of a method for performing resource allocation. In one example, various parameters associated with the resource allocation may be defined 302 or determined. For example, a determination may be made regarding how many streams each node or server can safely support. This may be based on number of processors/cores, memory, write/read parameters or the like. For example, a relationship between writes, processor or core consumption may be determined. If a predetermined number of writes or a data transmission rate consumes 1% of a CPU, then a stream at that transmission rate may correspond to 1 credit. Also, the maximum number of streams allowed per client may be determined.
  • This aspect of the method 300 may be performed at a single time. However, this aspect of the method 300 can be reevaluated as nodes are added/removed or as clients are added/removed from the system. These values may also account for other functions performed by the servers 202 that may not involve streams or that may not involve the particular resource being allocated. Further, these values may be able to vary based on other factors such as time of day. For example, when the processor is not required for other tasks such as during a slower period, it may be possible to temporarily increase the number of available streams.
  • Once the resource allocations have been defined and the server is allocating resources to the clients, the method 300 enforces or performs the allocation method. For example, a request for stream credits may be received 304. This request is evaluated as discussed previously to determine whether the requested allocation results in a safe allocation state. Thus, the server may evaluate 306 the stream state or the allocation state by hypothetically granting the request. This involves considering whether the other clients could still be allocated their maximum credits. As previously stated, in one embodiment, it is assumed that clients may ultimately request, use and release their maximum credits allowed. The evaluation thus determines what the allocation state would be if the request were granted.
  • The server then issues credits 308 according to the result (the determined allocation state) to the requesting client (and/or to other clients). If the allocation state is safe, the server may issue credits equal to the request or greater than equal to the request. If the allocation state is not safe, a partial grant may occur that still results in a safe allocation state. If the allocation state is not safe, the server may issue zero or negative credits. In one example, the zero and/or negative credits could be issued to any of the clients.
  • FIG. 3B illustrates an example of evaluating the stream state in more detail. More specifically, FIG. 3B illustrates an example of evaluating the server stream state 306 shown in FIG. 3A. Thus, the method 320 illustrates an example of evaluating the server stream state 306. In an example of the method 320, the server may calculate the TAM 322, which determines the total streams available. The server may then lookup the CALM 324. The CALM identifies the streams that are currently allocated to the clients.
  • Next, the point in time CAM is determined 326. This is determined by subtracting the CALM from the TAM (CAM=TAM−CALM). This allows the server to determine how many streams are available for allocation. This can be determined from the perspective of the system as whole and/or on an per node or per server basis. As discussed above, the number of available streams may be determined on a per server basis. In one example, this ensures that the resources of a particular server are not overtaxed. Plus, in one embodiment, this may give the server or cluster flexibility in determining which servers provide or allocate resources. For example, it may be possible for a server to redirect a request to a different server if the redirection would result in a safe allocation state.
  • Next, the CDM is determined 328 and the CNMs determined 330 by subtracting the CALM from the CDM (CNM=CDM−CALM).
  • After this information has been determined, a determination 332 is made as to whether the stream allocation state is safe or unsafe. If the stream allocation state is not safe, then zero or negative credits are granted 340. If the stream allocation state is safe, then credits are granted. For example, partial credits may be granted 334, equal credits may be granted 336, or prefetch credits may be granted 338. The credits are then issued 308. In one example, issuing credits 308 may be part of the method 320 and is incorporated into the granting of credits 334, 336, 338 or 340.
  • FIG. 3C illustrates an example of determining a stream allocation state. More specifically, FIG. 3C illustrates an example of determining if the stream allocation state is safe 332 in FIG. 3B. The method 348 may be performed for each client 350. Staring with a first client 350, a determination is made to determine 352 if CNM is greater than CDM. Thus, if the current need is not greater than the current demand, then the state is unsafe 354 and negative or zero credits are issued or granted as shown in FIG. 3B.
  • When the CNM is greater than the CDM, then the stream availability after granting the maximum stream requests for the client is determined 356. This computation may be performed as if the requested credits were granted to determine whether the resulting state is safe. Further, all clients, in one embodiment, are evaluated as a whole to determine whether the stream allocation state is safe.
  • In one example, the stream availability (356) is determined by subtracting the streams acquired by the client to reach the client's maximum demand 360 from the number of streams currently available 358 (this may be done as a whole or on a per server or node basis). This result is then added to the streams returned by the client after the demand is processed 362. In other words, the system evaluates the state assuming, in one example, that the clients requested and are granted their maximum possible streams.
  • Based on this determination 356, a determination is made as to whether the available streams is less than the total available matrix 364. If not, the state is unsafe 366. If so and all clients have been processed 368, the state is safe 372 and the credits can be granted as shown in FIG. 3B. If all clients are not processed, the next client is processed 370.
  • FIGS. 3A-3C thus illustrate an example of a method for allocating resources such that the allocation state of the system is safe. When a proposed allocation of resources (e.g., a request from a client) results in an unsafe allocation state, then the allocation may be zero or negative, which allows the system to either avoid an unsafe allocation state or return to a safe allocation state.
  • In addition to stream credits, embodiments of the invention relate to restore credits, which are another example of credits. A restore operation may involve reading data from a backup maintained by a backup server and transmitting or sending the data read from the backup to a restore location or device. In one example, the resource allocation systems and methods may rely solely on the restore credits, which may be defined in terms of data (e.g., 1 credit=256 MG or 500 MB, or 1 GB, etc., of data read), and/or on stream credits as previously described.
  • Restore credits improve the operation of a client or server by helping a client with their read ahead cache allocation/sizing, helping a client perform an intelligent read ahead, and improving the performance of the restore operation. In addition, embodiments of the invention help in restoring only data that is needed. This may avoid costs associated with data that is not needed for a restore operation.
  • More specifically, clients often implement read ahead caching and may read more data that is required. This may be a concern, for example, in a cloud environment, where the costs are determined per restore byte. In other words, reading more data than required can be expensive. Embodiments of the invention allow the read ahead cache or buffer of a client to be sized or tuned based on the size of the data being restored. This is useful because, in one example, the client may not know the size of the data to be restored. The server can aid in the allocation and size of the client read ahead cache by providing prefetch restore credits.
  • FIG. 4, for example, illustrates an example of a client performing a restore operation using restore credits and/or stream credits. The server (or a cluster) may be able to support multiple clients using restore and/or stream credits.
  • In FIG. 4, a client 402 is restoring restored data 406 to storage 404 (or other device/machine). The restored data 406 may be a database, a virtual machine, a file system, or the like. When performing the restore operation, the restored data 4006 is restored from the backups 412 (e.g., a particular backup). Data read from the backups 412 is thus read by the server 408 and transmitted to the client 402. The client then writes the data to the restored data 406.
  • In this example, the client 402 may request restore credits from the server 408 (e.g., a backup server). The server 408 may maintain an allocation table 414 that allows restore credits and/or stream credits to be tracked. In this example, the client 402 does not know the size of the dataset to be restored. The server 408, however, may know the size of the dataset because this context is present on the server 408
  • Assume, by way of example only, that 256 MB of data is associated with one credit of restore, the following scenarios may occur. The client 402 desire to read 1 GB of data and may therefore request 4 restore credits from the server 408. The client may also set a “prefetch” flag, which indicates to the server 408 that this is a sequential restore and that the server could grand more credits than requested. If the dataset to be restored is 4 GB, the server 408 may grant 16 restore credits to the client 402 even though 4 restore credits were requested.
  • The number of restore credits granted to the client 402 can be used to adjust the size of the client's read ahead cache buffers 416. For example, a large number of credits may cause the client 402 to increase the size of the read ahead buffer or cache to a size that can accommodate the amount of data associated with the granted credits. Alternatively, the buffers 416 may be sized in a manner that accounts for a rate at which the cached data is restored to the restored data 406. In one example, the size of the buffers 416 may dynamically adapt to the number of credits held by the client 402.
  • If the restore operation were to occur in a cloud environment, the server 408 may ignore the prefetch flag and choose to grant the number restore credits in the amount requested by the client. This may be done because reading ahead and restoring or reading more data than needed can be expensive in a cloud environment.
  • The number of restore credits granted by the server 408 can be granted in a manner that is similar to the manner in which stream credits are granted. The granted amount of restore credits can be equal to the number of restore credits requested, less than the number of restore credits requested, greater than the number of credits requested by the client, zero, or negative.
  • Depending on the number of restore credits received by the client 402, the client 402 would use the restore credits to perform restore operations. The restore credits are used to read data and, as the data is read, the restore credits are accordingly used or returned. Use of each chunk of reads would result in using of each Restore credits.
  • If the number of restore credits granted is zero or a negative value, it is an indication to the client to throttle. Consequently, the restore request cannot be performed (or may only be partially performed) and restore credits are returned by the client to the server. This allows the server to achieve a safe allocation state.
  • If the restore operation has been completed and if the client 402 has additional restore credits cached in its connection structure, the client 402 can choose to unilaterally release the those restore credits. The server 408 may update its internal credit accounting database, the allocation table 414, to account for release of the restore credits from a particular client.
  • When granting credits (regardless of type), the server 408 may perform a credit allocation method. Embodiments of the invention contemplate that many metrics can be used in determining the credit allocation or the credit allocation state. Examples include, machine capabilities (connections, processors, cores, memory size, memory types, client connections, existing streams, available resources, and the like or combination thereof.
  • FIG. 5 illustrates an example of a method for allocating restore credits. The method 500 may include steps or acts that are not performed each time the method is performed. In FIG. 5, the amount or number of reads that consume 1% of a processor or core (e.g., a CPU or central processing unit) on average is determined or defined 502. While this number is typically an approximation, gathering statistical data by doing empirical restore of data can be used to qualify this number. The percentage of CPU utilization during various restore runs or operations of different sizes and/or data types can be obtained or measured. The average, for example, of these observations can be used to calculate number of reads that consume 1% of data. For example, if it is observed observe that restoring 1 GB of data consumes 10% CPU and results in 10,000 read requests on average, it can be approximated that 1000 read requests to the server, consume 1% of CPU. This result may be used to determine the number of restore credits to be allocated to requesting clients.
  • Next, the average number of per core reads allowed is determined 504. In one example, this is determined by multiplying the number of reads that consume 1% of the CPU with the average percentage of free CPU per core. If the average percentage of free CPU per core is less than a threshold (e.g., 2%), then the credits granted to all clients is zero or negative.
  • Next, the maximum credits per client are determined 506. This may be determined by multiplying the average per core reads allowed with the number of CPU cores and then dividing by the number of client connections. The maximum credits per client represents the maximum number of credits that a client may acquire.
  • The allocation table accounts for credits that have already been issued to the client. For example, if a client's maximum credits is 100 and 60 have already been granted, a request for 50 restore credits may result in a grant of partial credits or zero credits or negative credits. The allocation table is updated as credits are granted, released, etc.
  • In one example, the number of credits per client are determined 508. This is distinct from the maximum credits because this act or step may account for a tuning factor that can be adjusted or is configurable. The tuning factor allows embodiments of the invention to factor in a reserve value into the resources being allocated. The tuning factor may be 50-70% of the maximum restore credits.
  • Next, credits may be issued to requesting clients 510. The number of credits issued may be determined, by way of example, only by using the minimum of the restore credits requested and the calculated credits per client. If the client has requested prefetch, then the number of restore credits issued may be a maximum of the requested restore credits and the calculated credits per client.
  • Consider the following example. If the number of reads that consume 1% of the CPU on average is 1000 and the average percentage of free CPU per core is 50%, then the average per core reads allowed is ((1000*0.5)=500). If the number of CPU cores is 4 and the number of clients is 10, then the maximum credits per client is ((500*4)/10=200). If the tuning factor is 50%, then the calculated credits per client is (200*0.5=100). Thus, there is a distinction between the maximum credits per client and the tuned or calculated credits per client.
  • If a client then requests 40 restore credits, the granted restore credits is MIN(40,100)=40. Thus 40 credits are granted. If the client requests prefetch, then the granted credits is MAX(40,100)=100. Thus 100 credits are granted. If restoring from the cloud, the prefetch may be ignored, in which case the granted credits may be 40 in this example.
  • Each time restore credits are requested, embodiments of the invention may ensure that the grant does not result in an unsafe allocation state. For example, requesting credits that exceeds a client's maximum credits may result in an unsafe allocation state. Further, the credits already used by the client and other clients may also be considered when granting credits. Also, when determining the allocation state, the average percentage of free CPU per core may be determined. If the grant drops the average percentage of free CPU below a threshold, then the grant may be for zero credits or negative credits.
  • In another example, the restore credits can be managed in a manner similar to the stream credits such that each request for restore credits is considered in the context of all available restore credits rather than each client's maximum allowed clients or calculated restore credits based on the tuning factor.
  • In one example, restore credits are an example of stream credits at least because the data being restored is also streamed from the server to the clients. However, these credit types can also be used together. For example, the stream credits can be used to manage the number of streams and the restore credits may determine how much data a particular client can read for all of the client's streams.
  • It should be appreciated that the present invention can be implemented in numerous ways, including as a process, an apparatus, a system, a device, a method, or a computer readable medium such as a computer readable storage medium or a computer network wherein computer program instructions are sent over optical or electronic communication links. Applications may take the form of software executing on a general purpose computer or be hardwired or hard coded in hardware. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention.
  • The embodiments disclosed herein may include the use of a special purpose or general-purpose computer including various computer hardware or software modules, as discussed in greater detail below. A computer may include a processor and computer storage media carrying instructions that, when executed by the processor and/or caused to be executed by the processor, perform any one or more of the methods disclosed herein.
  • As indicated above, embodiments within the scope of the present invention also include computer storage media, which are physical media for carrying or having computer-executable instructions or data structures stored thereon. Such computer storage media can be any available physical media that can be accessed by a general purpose or special purpose computer.
  • By way of example, and not limitation, such computer storage media can comprise hardware such as solid state disk (SSD), RAM, ROM, EEPROM, CD-ROM, flash memory, phase-change memory (“PCM”), or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other hardware storage devices which can be used to store program code in the form of computer-executable instructions or data structures, which can be accessed and executed by a general-purpose or special-purpose computer system to implement the disclosed functionality of the invention. Combinations of the above should also be included within the scope of computer storage media. Such media are also examples of non-transitory storage media, and non-transitory storage media also embraces cloud-based storage systems and structures, although the scope of the invention is not limited to these examples of non-transitory storage media.
  • Computer-executable instructions comprise, for example, instructions and data which cause a general purpose computer, special purpose computer, or special purpose processing device to perform a certain function or group of functions. Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts disclosed herein are disclosed as example forms of implementing the claims.
  • As used herein, the term ‘module’ or ‘component’ can refer to software objects or routines that execute on the computing system. The different components, modules, engines, and services described herein may be implemented as objects or processes that execute on the computing system, for example, as separate threads. While the system and methods described herein can be implemented in software, implementations in hardware or a combination of software and hardware are also possible and contemplated. In the present disclosure, a ‘computing entity’ may be any computing system as previously defined herein, or any module or combination of modules running on a computing system.
  • In at least some instances, a hardware processor is provided that is operable to carry out executable instructions for performing a method or process, such as the methods and processes disclosed herein. The hardware processor may or may not comprise an element of other hardware, such as the computing devices and systems disclosed herein.
  • In terms of computing environments, embodiments of the invention can be performed in client-server environments, whether network or local environments, or in any other suitable environment. Suitable operating environments for at least some embodiments of the invention include cloud computing environments where one or more of a client, server, or target virtual machine may reside and operate in a cloud environment.
  • The present invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.

Claims (20)

What is claimed is:
1. A method for allocating resources of a server to clients connected to the server and restoring data from the server, the method comprising:
receiving a request for restore credits from a client, wherein the restore credits each correspond to an amount of data to read from the server;
determining a number of credits available to the client, wherein the number of credits available to the client accounts for restore credits already issued to the client;
issuing credits based on the request and the number of credits available to the client.
2. The method of claim 1, further comprising at least one of:
issuing restore credits equal to a number of restore credits requested by the client;
issuing restore credits greater than the number of restore credits requested by the client;
issuing restore credits less than the number of restore credits requested by the client;
issuing zero restore credits to the client; or
issuing negative restore credits to the client.
3. The method of claim 1, further comprising determining a size of a client cache based on the number of restore credits issued to the client.
4. The method of claim 1, further comprising including a prefetch flag in the request, wherein the prefetch flag indicates that the data at the server is accessed sequentially by the client.
5. The method of claim 1, further comprising determining a number of reads that consumes 1% of a CPU on average.
6. The method of claim 5, further comprising determining an average per core reads allowed based on the number of reads that consume 1% of the CPU and the average free percentage of the CUP and determining a maximum credits per client based on the number of client connections.
7. The method of claim 6, further comprising determining a calculated number of credits per client based on a tuning factor that is applied to the maximum credits per client.
8. The method of claim 7, further comprising issuing the restore credits in a number equal to a minimum of the request or the calculated number of credits.
9. The method of claim 7, further comprising issuing the restore credits in a number equal to a maximum of the request or the calculated number of credits.
10. The method of claim 1, further comprising throttling all clients when an average free percentage per core of a processor is less than a predetermined threshold.
11. A non-transitory computer readable medium including computer executable instructions for implementing a method, when executed, for allocating resources of a server to clients connected to the server and restoring data from the server, the method comprising:
receiving a request for restore credits from a client, wherein the restore credits each correspond to an amount of data to read from the server;
determining a number of credits available to the client, wherein the number of credits available to the client accounts for restore credits already issued to the client;
issuing credits based on the request and the number of credits available to the client.
12. The non-transitory computer readable medium of claim 11, further comprising at least one of:
issuing restore credits equal to a number of restore credits requested by the client;
issuing restore credits greater than the number of restore credits requested by the client;
issuing restore credits less than the number of restore credits requested by the client;
issuing zero restore credits to the client; or
issuing negative restore credits to the client.
13. The non-transitory computer readable medium of claim 11, further comprising determining a size of a client cache based on the number of restore credits issued to the client.
14. The non-transitory computer readable medium of claim 11, further comprising including a prefetch flag in the request, wherein the prefetch flag indicates that the data at the server is accessed sequentially by the client.
15. The non-transitory computer readable medium of claim 11, further comprising determining a number of reads that consume 1% of a CPU on average.
16. The non-transitory computer readable medium of claim 15, further comprising determining an average per core reads allowed based on the number of reads that consume 1% of the CPU and the average free percentage of the CUP and determining a maximum credits per client based on the number of client connections.
17. The non-transitory computer readable medium of claim 16, further comprising determining a calculated number of credits per client based on a tuning factor that is applied to the maximum credits per client.
18. The non-transitory computer readable medium of claim 16, further comprising issuing the restore credits in a number equal to a minimum of the request or the calculated number of credits.
19. The non-transitory computer readable medium of claim 16, further comprising issuing the restore credits in a number equal to a maximum of the request or the calculated number of credits.
20. A method for performing resource allocation when a client requests restore credits from a server, the method comprising:
determining a number of reads that consume 1% of a processor on average;
determining an average number of reads allowed per core by multiplying the number of reads that consume 1% of the processor with an average free percentage of each core of each processor;
determining a maximum number of credits per client by multiplying the average number of reads allowed per core with the number of cores and dividing by a number of client connections;
tuning the maximum number of credits per client to obtain a calculated number of credits per client; and
granting the request in an amount equal to a minimum between the requested restore credits and the calculated number of credits.
US16/154,518 2018-10-08 2018-10-08 Resource allocation using restore credits Active 2038-11-02 US10630602B1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
US16/154,518 US10630602B1 (en) 2018-10-08 2018-10-08 Resource allocation using restore credits
PCT/US2019/043976 WO2020076394A1 (en) 2018-10-08 2019-07-29 Resource allocation using restore credits
CN201980066453.5A CN112805684A (en) 2018-10-08 2019-07-29 Resource allocation using recovery borrowing
GB2104643.8A GB2591928B (en) 2018-10-08 2019-07-29 Resource allocation using restore credits
DE112019005042.7T DE112019005042T5 (en) 2018-10-08 2019-07-29 RESOURCE ALLOCATION USING RECOVERY CREDIT
US16/836,350 US11005776B2 (en) 2018-10-08 2020-03-31 Resource allocation using restore credits

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US16/154,518 US10630602B1 (en) 2018-10-08 2018-10-08 Resource allocation using restore credits

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US16/836,350 Continuation US11005776B2 (en) 2018-10-08 2020-03-31 Resource allocation using restore credits

Publications (2)

Publication Number Publication Date
US20200112520A1 true US20200112520A1 (en) 2020-04-09
US10630602B1 US10630602B1 (en) 2020-04-21

Family

ID=67551451

Family Applications (2)

Application Number Title Priority Date Filing Date
US16/154,518 Active 2038-11-02 US10630602B1 (en) 2018-10-08 2018-10-08 Resource allocation using restore credits
US16/836,350 Active US11005776B2 (en) 2018-10-08 2020-03-31 Resource allocation using restore credits

Family Applications After (1)

Application Number Title Priority Date Filing Date
US16/836,350 Active US11005776B2 (en) 2018-10-08 2020-03-31 Resource allocation using restore credits

Country Status (5)

Country Link
US (2) US10630602B1 (en)
CN (1) CN112805684A (en)
DE (1) DE112019005042T5 (en)
GB (1) GB2591928B (en)
WO (1) WO2020076394A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112260955A (en) * 2020-09-18 2021-01-22 苏州浪潮智能科技有限公司 Hybrid read-write flow control method and device
US10990447B1 (en) * 2018-07-12 2021-04-27 Lightbits Labs Ltd. System and method for controlling a flow of storage access requests
CN114911514A (en) * 2021-02-10 2022-08-16 北京字跳网络技术有限公司 Method and device for configuring algorithm resources, electronic equipment and storage medium

Family Cites Families (59)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5453982A (en) 1994-08-29 1995-09-26 Hewlett-Packard Company Packet control procedure between a host processor and a peripheral unit
US5956321A (en) 1995-03-16 1999-09-21 Kabushiki Kaisha Toshiba Stream scheduling system for real time stream server
US5586121A (en) * 1995-04-21 1996-12-17 Hybrid Networks, Inc. Asymmetric hybrid access system and method
US5812545A (en) 1996-01-04 1998-09-22 Orion Atlantic, L.P. Full mesh satellite-based multimedia networking system
US5778320A (en) 1996-10-04 1998-07-07 Motorola, Inc. Method for allocating communication resources among groups of communication units
US6438141B1 (en) 1998-04-20 2002-08-20 Sun Microsystems, Inc. Method and management of communications over media of finite bandwidth
US6459901B1 (en) 1999-07-01 2002-10-01 At&T Corp. Wireless network resource allocation
US6467024B1 (en) * 1999-09-07 2002-10-15 International Business Machines Corporation Accessing data volumes from data storage libraries in a redundant copy synchronization token tracking system
US6502165B1 (en) * 1999-12-03 2002-12-31 International Business Machines Corporation Balanced access to data volumes with redundant copies stored in data storage libraries
US6625709B2 (en) 2000-10-30 2003-09-23 Microsoft Corporation Fair share dynamic resource allocation scheme with a safety buffer
WO2003019412A2 (en) 2001-08-20 2003-03-06 Datacentertechnologies N.V. File backup system and method
US7539735B2 (en) * 2002-03-06 2009-05-26 International Business Machines Corporation Multi-session no query restore
US7398557B2 (en) * 2002-09-13 2008-07-08 Sun Microsystems, Inc. Accessing in a rights locker system for digital content access control
US7539199B2 (en) 2003-02-21 2009-05-26 Gireesh Shrimali Switch fabric scheduling with fairness and priority consideration
US7269697B1 (en) * 2003-05-07 2007-09-11 Avago Technologies General Ip (Singapore) Pte. Ltd. Apparatus and methodology for an input port scheduler
US7519725B2 (en) 2003-05-23 2009-04-14 International Business Machines Corporation System and method for utilizing informed throttling to guarantee quality of service to I/O streams
US7698115B2 (en) * 2003-06-30 2010-04-13 Microsoft Corporation System and method for dynamically allocating resources in a client/server environment
DE60335373D1 (en) * 2003-10-06 2011-01-27 Ericsson Telefon Ab L M Coordinated data flow control and data buffer allocation in UMTS
WO2005079001A1 (en) 2004-02-16 2005-08-25 Christopher Michael Davies Network architecture
US7478158B1 (en) * 2004-03-01 2009-01-13 Adobe Systems Incorporated Bandwidth management system
US7583658B1 (en) 2004-06-17 2009-09-01 Cisco Technology, Inc. Signal processing allocation using credit prediction
US7493426B2 (en) * 2005-01-31 2009-02-17 International Business Machines Corporation Data communication method and apparatus utilizing programmable channels for allocation of buffer space and transaction control
US7853774B1 (en) * 2005-03-25 2010-12-14 Tilera Corporation Managing buffer storage in a parallel processing environment
EP1762935B1 (en) 2005-09-12 2010-02-17 Siemens Aktiengesellschaft Method for controlling a request for resources in a computer system and control program
US7698478B2 (en) * 2006-09-19 2010-04-13 Apple Inc. Managed credit update
US8127099B2 (en) * 2006-12-26 2012-02-28 International Business Machines Corporation Resource recovery using borrowed blocks of memory
US7872975B2 (en) 2007-03-26 2011-01-18 Microsoft Corporation File server pipelining with denial of service mitigation
US20080307094A1 (en) * 2007-06-11 2008-12-11 Olli Karonen Association of peer-to-peer contribution credits with multiple devices
US7707248B2 (en) 2007-06-25 2010-04-27 Microsoft Corporation Credit-based peer-to-peer storage
US20090171812A1 (en) 2007-12-31 2009-07-02 Apple Inc. Media streams and media store
US8306036B1 (en) 2008-06-20 2012-11-06 F5 Networks, Inc. Methods and systems for hierarchical resource allocation through bookmark allocation
US20100031157A1 (en) 2008-07-30 2010-02-04 Robert Neer System that enables a user to adjust resources allocated to a group
US8374576B2 (en) 2008-12-04 2013-02-12 At&T Intellectual Property I, L.P. Methods, systems, and computer program products for generating resource utilization alerts through communication terminals
US8045472B2 (en) * 2008-12-29 2011-10-25 Apple Inc. Credit management when resource granularity is larger than credit granularity
US20120327779A1 (en) 2009-06-12 2012-12-27 Cygnus Broadband, Inc. Systems and methods for congestion detection for use in prioritizing and scheduling packets in a communication network
US8085801B2 (en) 2009-08-08 2011-12-27 Hewlett-Packard Development Company, L.P. Resource arbitration
US20110184998A1 (en) * 2010-01-22 2011-07-28 Palahnuk Samuel L Universally accessible encrypted internet file system for wired and wireless computing devices supplanting synchronization, backup and email file attachment
US8381217B1 (en) 2010-04-30 2013-02-19 Netapp, Inc. System and method for preventing resource over-commitment due to remote management in a clustered network storage system
US10200493B2 (en) * 2011-10-17 2019-02-05 Microsoft Technology Licensing, Llc High-density multi-tenant distributed cache as a service
US9838269B2 (en) 2011-12-27 2017-12-05 Netapp, Inc. Proportional quality of service based on client usage and system metrics
US8763154B2 (en) * 2012-01-23 2014-06-24 Verizon Patent And Licensing Inc. Federated authentication
US9619127B2 (en) 2012-04-17 2017-04-11 Netzero Wireless, Inc. User controlled data speed selector systems and methods
US9507639B2 (en) * 2012-05-06 2016-11-29 Sandisk Technologies Llc Parallel computation with multiple storage devices
US9495379B2 (en) 2012-10-08 2016-11-15 Veritas Technologies Llc Locality aware, two-level fingerprint caching
US9055078B2 (en) * 2013-01-10 2015-06-09 International Business Machines Corporation Token-based flow control of messages in a parallel computer
WO2014209407A1 (en) 2013-06-29 2014-12-31 Intel Corporation Service rate redistribution for credit-based arbitration
US20160005007A1 (en) 2014-07-04 2016-01-07 Flashback Survey, Inc. Methods and systems for using scanable codes to obtain maintenance and reminder services
US10419621B2 (en) 2014-11-14 2019-09-17 Tracfone Wireless, Inc. Methods, systems and applications for managing wireless services on a wireless device
CN111050007A (en) 2015-05-11 2020-04-21 华为技术有限公司 Policy and charging execution function device, online charging device and online charging method
US10652796B2 (en) 2015-10-30 2020-05-12 Investel Capital Corporation Data network access selection, migration and quality management systems and methods
US10115214B2 (en) 2015-11-03 2018-10-30 Verizon Patent And Licensing Inc. Shared data splitting interface
US10007457B2 (en) 2015-12-22 2018-06-26 Pure Storage, Inc. Distributed transactions with token-associated execution
US20170208120A1 (en) 2016-01-15 2017-07-20 Google Inc. Probabilistic throttling
US10146665B2 (en) 2016-03-24 2018-12-04 Oracle International Corporation Systems and methods for providing dynamic and real time simulations of matching resources to requests
US10536482B2 (en) 2017-03-26 2020-01-14 Microsoft Technology Licensing, Llc Computer security attack detection using distribution departure
US11500681B2 (en) * 2017-06-29 2022-11-15 Intel Corporation Technologies for managing quality of service platform interconnects
US10469395B2 (en) * 2017-08-31 2019-11-05 Hewlett Packard Enterprise Development Lp Packet transmission credit allocation
US20190348158A1 (en) 2018-05-11 2019-11-14 Michigan Health Information Network Shared Services Systems and methods for managing data privacy
US11201828B2 (en) * 2018-10-08 2021-12-14 EMC IP Holding Company LLC Stream allocation using stream credits

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10990447B1 (en) * 2018-07-12 2021-04-27 Lightbits Labs Ltd. System and method for controlling a flow of storage access requests
CN112260955A (en) * 2020-09-18 2021-01-22 苏州浪潮智能科技有限公司 Hybrid read-write flow control method and device
CN114911514A (en) * 2021-02-10 2022-08-16 北京字跳网络技术有限公司 Method and device for configuring algorithm resources, electronic equipment and storage medium

Also Published As

Publication number Publication date
GB202104643D0 (en) 2021-05-12
US11005776B2 (en) 2021-05-11
DE112019005042T5 (en) 2021-09-16
GB2591928A (en) 2021-08-11
US10630602B1 (en) 2020-04-21
WO2020076394A1 (en) 2020-04-16
CN112805684A (en) 2021-05-14
GB2591928B (en) 2023-05-17
US20200228461A1 (en) 2020-07-16

Similar Documents

Publication Publication Date Title
US20230283681A1 (en) System and method for throttling service requests having non-uniform workloads
US10185592B2 (en) Network storage device using dynamic weights based on resource utilization
US9419904B2 (en) System and method for throttling service requests using work-based tokens
US11005776B2 (en) Resource allocation using restore credits
US11936568B2 (en) Stream allocation using stream credits
US10534542B2 (en) Dynamic core allocation for consistent performance in a non-preemptive scheduling environment
US11765099B2 (en) Resource allocation using distributed segment processing credits
US8818989B2 (en) Memory usage query governor
US20170193416A1 (en) Reducing costs related to use of networks based on pricing heterogeneity
US10359945B2 (en) System and method for managing a non-volatile storage resource as a shared resource in a distributed system
WO2018196459A1 (en) Download request processing method and apparatus, processing device and medium
US10135750B1 (en) Satisfaction-ratio based server congestion control mechanism
US12126502B1 (en) Configurable quality of service provider pipeline

Legal Events

Date Code Title Description
AS Assignment

Owner name: EMC IP HOLDING COMPANY LLC, MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:DESAI, KEYUR B.;REEL/FRAME:047096/0779

Effective date: 20181003

FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

AS Assignment

Owner name: THE BANK OF NEW YORK MELLON TRUST COMPANY, N.A., TEXAS

Free format text: SECURITY AGREEMENT;ASSIGNORS:CREDANT TECHNOLOGIES, INC.;DELL INTERNATIONAL L.L.C.;DELL MARKETING L.P.;AND OTHERS;REEL/FRAME:049452/0223

Effective date: 20190320

STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: THE BANK OF NEW YORK MELLON TRUST COMPANY, N.A., TEXAS

Free format text: SECURITY AGREEMENT;ASSIGNORS:CREDANT TECHNOLOGIES INC.;DELL INTERNATIONAL L.L.C.;DELL MARKETING L.P.;AND OTHERS;REEL/FRAME:053546/0001

Effective date: 20200409

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4