WO2004004249A1 - Method and apparatus for load estimation and call admission control in a call processing environment - Google Patents

Method and apparatus for load estimation and call admission control in a call processing environment Download PDF

Info

Publication number
WO2004004249A1
WO2004004249A1 PCT/IB2003/002535 IB0302535W WO2004004249A1 WO 2004004249 A1 WO2004004249 A1 WO 2004004249A1 IB 0302535 W IB0302535 W IB 0302535W WO 2004004249 A1 WO2004004249 A1 WO 2004004249A1
Authority
WO
WIPO (PCT)
Prior art keywords
call
load
processing unit
class
eagerness
Prior art date
Application number
PCT/IB2003/002535
Other languages
French (fr)
Inventor
Seyed Bahram Zahir Azami
Mendel Elliott Spencer
Original Assignee
Nortel Networks Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/187,089 external-priority patent/US20040005041A1/en
Priority claimed from US10/186,877 external-priority patent/US7369490B2/en
Application filed by Nortel Networks Limited filed Critical Nortel Networks Limited
Priority to AU2003244911A priority Critical patent/AU2003244911A1/en
Publication of WO2004004249A1 publication Critical patent/WO2004004249A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1036Load balancing of requests to servers for services different from user content provisioning, e.g. load balancing across domain name servers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/11Identifying congestion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/12Avoiding congestion; Recovering from congestion
    • H04L47/125Avoiding congestion; Recovering from congestion by balancing the load, e.g. traffic engineering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/15Flow control; Congestion control in relation to multipoint traffic
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/24Traffic characterised by specific attributes, e.g. priority or QoS
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/70Admission control; Resource allocation
    • H04L47/76Admission control; Resource allocation using dynamic resource allocation, e.g. in-call renegotiation requested by the user or requested by the network in response to changing network conditions
    • H04L47/765Admission control; Resource allocation using dynamic resource allocation, e.g. in-call renegotiation requested by the user or requested by the network in response to changing network conditions triggered by the end-points
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/70Admission control; Resource allocation
    • H04L47/80Actions related to the user profile or the type of traffic
    • H04L47/805QOS or priority aware
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/70Admission control; Resource allocation
    • H04L47/82Miscellaneous aspects
    • H04L47/822Collecting or measuring resource availability data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/70Admission control; Resource allocation
    • H04L47/82Miscellaneous aspects
    • H04L47/826Involving periods of time
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/70Admission control; Resource allocation
    • H04L47/83Admission control; Resource allocation based on usage prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q3/00Selecting arrangements
    • H04Q3/0016Arrangements providing connection between exchanges
    • H04Q3/0062Provisions for network management
    • H04Q3/0091Congestion or overload control
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1008Server selection for load balancing based on parameters of servers, e.g. available memory or workload
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1012Server selection for load balancing based on compliance of requirements or conditions with available server resources
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q2213/00Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13164Traffic (registration, measurement,...)
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q2213/00Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13166Fault prevention
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q2213/00Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13335Simulation, emulation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q2213/00Indexing scheme relating to selecting arrangements in general and for multiplex systems
    • H04Q2213/13343Neural networks

Definitions

  • This invention generally relates to communications inf astructure and techniques, and particularly concerned with load estimation, load balancing and call admission control in call processing systems including plural call processing units.
  • Background In light of recent tightening of the credit markets and general drop-off in demand, telecommunications carriers are increasingly pressured to maximize return on their existing infrastructure and/or make wise choices when purchasing and deploying new telecommunications gear. Due in part to their cost and potential performance bottlenecks, especially when stretched beyond nominal loads, call processing systems within the carrier's network have drawn heightened scrutiny.
  • a representative multiple processor call processing system 2500 is shown in FIG. 25.
  • PMCis plural individual call processing units or PMCis, including PMC1 2515, PMC2 2520 and PMCK 2525, collectively handle a number of call processing activities and call events.
  • the managing call processing unit, or PMC-M 2510 performs other call processing activities and events.
  • the PMCis 2515, 2520 and 2525 handle the bulk of the "individual call" oriented activities and tasks based on the subset of system calls they are assigned to handle, whereas the PMC-M 2510 is primarily responsible for coordinating operation of the PMCis as needed for e.g. system-wide call admission control and load balancing.
  • coordination information such as control, status, query and reporting messages occurs between these PMCis 2515, 2520, 2525 and the PMC-M 2510 on a periodic and/or event-driven basis.
  • a well-known goal that multiple processor call processing systems (such as the system 2500 shown in FIG. 25) attempt to achieve with respect to call admission control (CAC) and load balancing is to maintain the QoS commitment for the existing calls (failing to introduce delays or packet loss in packet voice environments) while maximizing call capacity, and ultimately maximize efficient use of the call processing system.
  • CAC call admission control
  • load balancing is to maintain the QoS commitment for the existing calls (failing to introduce delays or packet loss in packet voice environments) while maximizing call capacity, and ultimately maximize efficient use of the call processing system.
  • the goal is to provide an optimally efficient call handling situation in which all individual call processing units in a multiple processor call processing system are evenly and homogeneously (with respect to call type) loaded.
  • call admission control and new call blocking will thus only occur where all these call processing units are already fully or nominally loaded.
  • each box within each PMC1..PMC6 represents a call, with a shaded box 110 representing a voice call, a diagonally hatched box 112 representing a streaming call, and a transparent double- sized box 114 representing an interactive call.
  • Each container 107 represents the capacity of a given PMCi.
  • the PMC1..PMC6 of the multiple processor call processing system 105 are evenly and homogeneously loaded; in FIG. IB, the load is even but not homogeneous; and in FIG. IC the load is neither homogeneous nor even.
  • the risk presented in FIG. IB is that if there is congestion in a given PMCi, most calls within such PMCi might be of the same nature thus making any call processing remedy more difficult.
  • the risk posed in FIG. IC is that while there is unused capacity on some PMCis (e.g. PMC2, PMC4, PMC6), other PMCis (e.g. PMCI, PMC3, PMC5) are very close to their nominal load and may not be able to provide sufficient service level guarantees or quality of service to the calls they are handling.
  • the goal should be to maintain even and homogenous loads, as in FIG. 1 A.
  • the call processing system 105 may reach a loading condition where all the PMC1...PMC6 are nominally loaded, and that the system cannot admit further new calls, but this situation happens less often, if the load is even.
  • each PMCi's loading can be estimated by determining how many of each type of calls are being serviced and multiplying each by this estimated resource utilization. Resources can be reserved within each of the PMCi consistent with the estimated resource utilization and additional capacity can be determined based on remaining unreserved resources.
  • this estimated resource utilization technique does not take into account the number of calls of a given call type being handled by a given call processing unit. It is well-known that the greater the number of calls of a given type are being serviced by a given PMCi, the less is the chance that these behave all the same way or utilize the same amount of resources. For example, it is highly improbable that, in the case of packet voice with silence suppression, that all handled calls will be in the more resource demanding active, vs. silent state.
  • a provisionable reservation coefficient may be used to augment the above to permit the PMCis and the call processing system generally to accept more calls than the above-described conservative estimation technique calls for, and in the typical and aggressive cases, more than the PMCis and the call processing system as a whole can actually handle.
  • This overbooking is justified, especially as the number of calls increases and the probabilistic nature of the calls with respect to actual resource utilization (the probabilistic nature of calls, as used herein, means that the resource utilization for each call varies by time and may have any of a finite or infinite number of values within known limits).
  • the reservation coefficient although provisionable, is constant within different hours of a day or days of a week, unless a system operator manually alters or updates it.
  • the present invention is directed to improved techniques and apparatus for estimating the load of a call processing unit within a call processing system.
  • This load estimation may occur at a given time, on a periodic or event-driven basis, including upon perception of a call event such as a new call admission request, an admitted call modification request, or an admitted call termination event.
  • the load estimation here is dependent upon at least one of load mean and variance estimates.
  • a method and apparatus which includes determination of call classes based on admitted calls handled by the call processing unit at the selected time, calculation of an estimated load mean and an estimated load variance for this call processing unit based on this distribution, along with the class load mean and variance for at least one of the call classes specified by the distribution, and derivation of an estimated load measure from at least one of the estimated load mean and the estimated load variance.
  • a method and apparatus is provided to estimate the load a call processing unit which includes detection of a call event which specifies a call of a given call class, calculation of a new estimated load mean based on a current estimated load mean and a class load mean for the given call class, calculation of a new estimated load variance based on a current estimated load variance and a class load variance for the given call class, and derivation of an estimated load measure from at least one of the new estimated load mean and load variance.
  • the estimated load measure may represent a utilization value of a call processing unit resource such as bandwidth or processing load. Further this estimated load measure may relate to the call processing unit's probability of exceeding it's nominal load.
  • class load mean and variance may be derived from a probabilistic approximation of the associated call class, such as a Gaussian distribution.
  • a probabilistic approximation of the associated call class such as a Gaussian distribution.
  • the use of estimated load mean and variance consistent with these embodiments permits a more accurate assessment or prediction of a call processing units loading than heretofore available, especially where the load mean and variance are related to resource utilization within the call processing unit. Building such assessment or prediction using probabilistic approximation of call class mean and variance parameters is believed to yield even more accurate results. Such results can be useful in call processing management functions, including systemwide load balancing and call admission control in multiple processor call processing environments.
  • the present invention is directed in part to techniques for determining the eagerness of a call processing unit within a multiple processor call processing system to accept a new call or call upgrade.
  • the invention is also directed to call admission control and load balancing through query of such eagerness.
  • fuzzy logic is used to determine the eagerness.
  • a call processing unit capable of supporting a plurality of call classes
  • method and apparatus which includes attainment of first and second load parameters for the call processing unit, fuzzification of the first and second load parameters according to respective first and second fuzzification functions, comparison of the fuzzified first and second load parameters against a defined set of eagerness rules, defuzzification the result of the comparing, and generation of an eagerness to admit a call based on this defuzzification result.
  • the first and second load parameters may include actual and estimated load parameters for the call processing unit.
  • the estimated load may be approximated using a probabilistic distribution function. Homogeneous may also be consulted in determining this eagerness.
  • a call admission apparatus and method which includes monitoring a call admission eagerness for each of the call processing units, perception of a call admission request, selection of a target call processing unit having a relative maximum call admission eagerness, confirmation of the call admission eagerness by this target call processing unit with respect to the call admission request, and admission of the call if the target call processing unit eagerness is confirmed.
  • Predetermined or call class malleable thresholding may be utilized to help determine if a target call processing unit is available.
  • another target may be selected if the eagerness cannot be confirmed by the initial target, and may be recursively performed until either no target call processing units are available, the admission request is withdrawn, or the call is successfully admitted.
  • FIGs. 1A - IC illustrate the concepts of load balancing and homogeneousness consistent with the present invention.
  • FIG. 2 illustrates conventional load estimation.
  • FIG. 3 illustrates resource utilization modes and probabilities.
  • FIGs. 4A - 4C illustrate Gaussian characteristics of load estimation consistent with the present invention.
  • FIG. 5 is a flowchart illustrating resource knowledge base processing according to an embodiment of the invention.
  • FIG. 6 is a flowchart illustrating load estimation processing according to an embodiment of the invention.
  • FIG. 7 is a flowchart illustrating load estimation processing according to an alternative embodiment of the invention.
  • FIGs. 8, 9 and 10 are flowcharts illustrating eagerness determination processing according to respective alternative embodiments of the invention.
  • FIG. 11 is a flowchart illustrating PMCi call event processing according to an embodiment of the invention.
  • FIG. 12 is a flowchart illustrating PMC-M call admission processing according to an embodiment of the invention.
  • FIG. 13 is a simplified block diagram of a PMCi 1300 consistent with eagerness determination processing shown in FIGs. 8 and 9.
  • FIG. 14 is a simplified block diagram of a PMCi 1400 consistent with eagerness determination processing shown in FIG. 10.
  • FIG. 15 is a simplified block diagram of a PMC-M 1500 consistent with the PMCi arrangements shown in FIGs. 13 and 14.
  • FIG. 16 is a simplified block diagram of a PMC-M 1600 consistent with the PMCi arrangement shown in FIG. 13.
  • FIG. 17 is a simplified block diagram of a PMCi consistent with the PMC-M arrangement shown in FIG 18.
  • FIG. 18 is a simplified block diagram of a PMC-M consistent with the PMCi arrangement shown in FIG. 17.
  • FIG. 19 diagrammatically illustrates fuzzification of the actual load parameters according to an embodiment of the invention.
  • FIG. 20 diagrammatically illustrates fuzzification of the estimate load parameters according to an embodiment of the invention.
  • FIG. 21 diagrammatically illustrates fuzzification of the homogeneousness parameters according to an embodiment of the invention.
  • FIG. 22 illustrates threshold malleability by cost of service according to an embodiment of the invention
  • FIG. 23 illustrates eagerness reporting conservation according to an embodiment of the invention.
  • FIG. 24 illustrates a two variable fuzzy logic rules base according to an embodiment of the invention.
  • FIG. 25 is a simplified block diagram of a call processing system including plural individual call processing units coordinated by a single managing call processing unit.
  • the disclosed embodiments presumes a single master call processing unit (PMC-M) coordinating several individual call processing units or PMCis (e.g. on the order of 50-60). See, e.g. FIG. 25. Further, each call is presumed follows a probabilistic behavior in accordance with a finite number of predefined states and that that resource utilization associated with such states can be determined.
  • PMC-M master call processing unit
  • determining whether to admit a new call one should see if the necessary resource such as processing load or bandwidth for the new call is available. If the resources are available, the new call can be admitted.
  • An important action in packet switching systems with QoS requirement is resource reservation. It means that by admitting each new call, the call processing system or constituent call processing unit should restraint its eagerness to accept more new calls.
  • the disclosed embodiments of the present invention utilize a probabilistic estimation of the load, based on the mean and variance of the resource utilization for a given call type and experienced call distribution.
  • the call processing system generates a probability distribution function (pdf) of the total resource usage. Then based on this pdf, this call processing system calculates the probability of exceeding a nominal load value on a per call processing unit basis. This probability should be kept lower than a given threshold, by rejecting some calls if they are coming at the time of heavy load. So, in CAC and load balancing consistent with the present invention, this probability gives this call processing system a measure to accept or reject the new call or, if sufficient capacity exists at the system level, to determine which PMCi should handle the call.
  • pdf probability distribution function
  • one or more of the disclosed embodiments selectively perform "static" load balancing of the call processing system based at least in part on the aforementioned estimated load results.
  • load balancing seeks to make load balancing decisions (which call processing unit, if any?) with respect to a call only during admission of the call.
  • teachings of the present invention are not intended to be so limiting, and, as will be appreciated by those ordinarily skilled in the art, can be easily applied to dynamic load balancing of calls where, in addition to balancing at admission, calls can be transferred from one call processing unit to another on fly. See, e.g.
  • the probabilistic behavior of the calls is usually reflected in the active/silence mode of voice or the burstiness (of data).
  • the call types or classes to be conversation mainly voice
  • streaming mainly web data
  • interactive web data
  • background email
  • other call types such as fax, telecommands, etc may be considered as will be appreciated by those ordinarily skilled in the art.
  • FIG. 3 depicts a discrete probability mass function (pmf) of one call within a type/class (say, the voice class, here).
  • the horizontal axis represents the resource requirement, here bandwidth expressed as a bit rate.
  • the probability that the resource requirement 300 will be in discreet state 1 i.e. silence
  • the probability that the resource requirement 300 will be in discrete state 2 is 60%, with a characteristically higher 12.2Kbps bandwidth requirement 304 (m 2 ).
  • ⁇ , and ⁇ are respectively the mean and variance of one call.
  • ⁇ , and ⁇ are respectively the mean and variance of one call.
  • the mean value calculated here for the total distribution is simply the sum of all the average values.
  • the variance is also the sum of the variances attributed to all of the calls.
  • the mean value gets 100 times bigger; so does the variance; however, the standard deviation, ⁇ , gets only 10 times bigger. This means that the more calls are added up in a PMCi, the bigger is the mean and the standard deviation, but the smaller would be their ratio ⁇ / ⁇ 2 .
  • a generalized plot of the probability distribution function in accordance with this algorithm is shown in FIG. 4B, with the ⁇ 406 and the ⁇ 408 denoted.
  • the exact form of the distribution can be obtained by applying N times convolution of the original pdf with itself.
  • the probability at each point is C N -P2 1 P ⁇ N" ⁇ as shown in FIG. 4A.
  • Some calls may have different resource usage than others, although they may be in the same class. For instance, a voice call may have "A" times processing power requirement than others, because of the special conditioning or additional processing to be applied to the call, such as silence suppression or ciphering. Therefore, provision may made to count such a call A times more than "an ordinary call”.
  • the next step is to add the variables obtained in each class, to obtain the pdf of the whole resource usage in all of the classes for a given PMCi.
  • the central limit Theorem implies that "the sum of any number of variables with Gaussian distribution, is a new variable with a Gaussian distribution.”
  • the mean and variance of the new variable can be simply obtained by summing up the mean and variances of the elements.
  • the total pdf has a mean and a variance that can be obtained from simple arithmetic.
  • the pair of values are the only parameters we need when using the Gaussian distribution's error function.
  • ⁇ 412 maximum allowable resource utilization or nominal load
  • ⁇ 414 probability of exceeding the nominal load - the hatched area 412 under the curve410
  • each PMCi based on the current number of the admitted calls in each class, we can calculate the probability of having the total resource usage exceeding the maximum allowable value set for the resource (say for example, 2 Gbps for bit rate or 99% of CPU usage). In other words, we can determine the probability of having the total resource utilization for a PMCi exceeding the nominal load specified for that PMCi. Alternatively, having a provisioned tolerable probability for exceeding the nominal value, we can determine for each PMCi, whether the value corresponding to this probability is smaller or greater than the maximum allowable resource utilization.
  • a load estimation algorithm according to an embodiment of the invention, based on the statistical analysis and Gaussian distribution discussed above will now be disclosed. Unlike the above, however, the scarce resource is now processing load (in the unit of instruction cycles/sec or any other measure of processor utilization within a call processing unit).
  • N there are two modes: active and non- active.
  • the probability of being in each state is also considered to be known (p ⁇ , p 2 , respectively).
  • the ratio p ⁇ /p 2 is a main characteristic of each class. It should be understood that this is a simplifying assumption to have only two modes (active and non-active) and that in reality, more than two modes might be possible. However, this does not affect the disclosed algorithm as long as one can translate these numbers to form a Gaussian distribution with a known ⁇ , ⁇ 2 .
  • the first action is to measure the number of instruction cycles/sec (processing resource utilization) corresponding to each of the two modes: active and non-active for given call class or type. These two parameters are known as mi and m 2 . Note here that there are however a number of other parameters that affect the number of instruction cycles being used to support the call mode. These parameters may include some or all of the following parameters in an e.g.
  • PS Packet Switching
  • CS Circuit
  • the Boolean PS/CS indicates whether a UMTS call is either packet or circuit standard respectively.
  • the CRC Boolean indicates whether Cyclic Redundancy Coding is used with a given UMTS Call.
  • RLC_mode refers to the UMTS radio link mode associated with a call (TM: transparent mode; AM: acknowledged mode; and UM: unacknowledged mode).
  • TTI refers to the Transmission Time interval.
  • the No_of_TB refers to the number of traffic bearers.
  • No_of_RL refers to the number of UMTS radio links available for the UMTS call.
  • mi is obtained for the active mode and m 2 for the non-active mode.
  • m ⁇ , m i a different set of parameters is assumed, which results in a new (m ⁇ , m i) pair.
  • ⁇ , ⁇ 2 may be provided which represent other parameters having some influence on the values of ⁇ , ⁇ 2 , such as the bit rate, or the number of radio links in a wireless system (e.g. 1-6 links in a UMTS system).
  • ⁇ , ⁇ 2 e.g. 1-6 links in a UMTS system.
  • other real world factors may be considered and the resource requirement algorithm made more complex to accommodate such factors.
  • the number of radio links which has also a probabilistic character may be investigated. In such a case, the variances obtained will be substantially higher for conversation calls, and not necessarily zero for streaming calls.
  • a neural network be may employed to make this function fitting, when more combinations are considered. See e.g.
  • each PMCi manages a table of its admitted calls including call type or class, ⁇ , ⁇ 2 either within the admitted call table or as part of a separate resource knowledge base.
  • the call type is simply the indicator of one of the N classes.
  • conversat ⁇ onal
  • ⁇ (streaming) ⁇ (interactive)
  • ⁇ (background) sum of ⁇ 's for all conversational calls in this PMC.
  • the variance ⁇ 2 of each class is obtained in the same way.
  • total ⁇ t , ⁇ 2 1 for the PMC are obtained by summing the ⁇ , ⁇ 2 for all the classes.
  • the values of ⁇ t , ⁇ 2 t can be updated once a new call is admitted or an existing call is terminated (or modified) by simply adding or subtracting the corresponding ⁇ c , ⁇ 2 c .
  • new mean and variance are calculated and put in the table but before that the old values are subtracted from the sums.
  • the following rules may be established: 1) the initial values of ⁇ , ⁇ 2 (for all classes) are zero. 2) Upon the admission of a new call, the values of ⁇ , ⁇ 2 are increased. 3) Upon termination of each call, the values of ⁇ , ⁇ 2 are decreased. And, 4) upon alteration of each call, the values of ⁇ , ⁇ 2 are first decreased by the old value from the table and then increased by its new value.
  • Either value can serve as a guideline to decide either to accept or to reject a new call.
  • the use of this estimate, along with some other estimates is the subject of the next section where we explain a fuzzy logic controller for this effect and we show how to use this parameter along with a couple of other ones to determine the eagerness of each PMC to receive a new call. It is also noteworthy that since ⁇ is an input to a fuzzifier, we can simply directly feed ( ⁇ - ⁇ )/ ⁇ to the fuzzifier.
  • the intervening parameters are multiple: first the number of calls in each class and second the characteristic that we consider for each class. For instance the more calls of streaming type we have, the narrower is the Gaussian distribution. Below we examine a few examples.
  • Example 3 interactive class. Data, however, has usually a more probabilistic manner, as we may experience bigger ⁇ 2 values for data.
  • Example 4 aggregated traffic on a PMCi.
  • CAC decision and the load balancing are explained.
  • the CAC decision is made in two steps: the initial step where the PMC-M make a guess on the best PMCi able to handle the call based on reported eagerness.
  • An early CAC algorithm can be included here for assisting the managing call processing unit make the decision of either to accept or to reject a call, if none of the PMCis are eager enough to accept the new call.
  • the chosen PMCi itself reviews if it is really in the state of accepting the call or not.
  • fuzzy logic is used in parts of both processing to arrive at CAC and load balancing decisions.
  • the disclosed embodiments all try to even the load on the individual PMCis to reduce risk of overloading and underloading PMCi capacity, as well as enhance efficient use of scarce and expensive processing resources.
  • several of the disclosed embodiments attempt to homogenize the load among the PMCi's. Homogenizing herein means that the ratio between the effect of different types and service classes be approximately the same in all PMCi's. In other words, if half of the total load (in all PMCi's) in the call processing system is voice, the same ratio is maintained in all PMCi's. By doing so one equalize the chances of exceeding the nominal load in all PMCis.
  • the disclosed models supporting these embodiments are based on a partial knowledge of the system; modeling for some classes is better than for others. By having a homogenous load, one avoids that most of the inexactitude because of the inaccurate modeling fails to gather in a single PMCi and increase estimate inaccuracies disproportionately.
  • a call processing system may end up in a situation where for some of the PMCi's the probability of exceeding the nominal value is small but for some others, it is not. This would be the case, for instance when most voice calls go to one PMCi; most data to one and most streaming and background each to another PMCi.
  • streaming calls have insignificant variance, they show a very narrow pdf, while interactive calls show very large variances.
  • time span of a call a voice call has an average duration of about 90 seconds while some interactive and streaming calls may have much longer durations. This fact may also be considered as an input to the CAC and load balancing decisions.
  • an initial guess on CAC and the optimum PMCi is selected based on: 1) the ensemble of calls which yield the pdf ( 2 , ⁇ 2 ⁇ J in each PMC; 2) the periodically updated measurement of the PMC loads; and 3) the QoS type and CoS (requirements of the new call).
  • a resource knowledge base (e.g. knowledge base 1310 in PMCi 1300 shown in FIG. 13 or knowledge base 1310 in PMC-M 1800 shown in FIG. 18) is used to retain call class mean and variance information, either for a specific PMCi or the PMC-M.
  • Resource knowledge base development according to an embodiment of the invention will now be detailed with reference to FIG. 5. Assembly of this knowledge base may be performed upon call processing unit installation, upgrade or other change, or on an as-needed basis when the types of supported call classes change.
  • step 514 the mean and variance for each of the call classes of interest will be calculated using the obtained resource utilization in operational mode probabilities obtained in step 512.
  • step 516 the knowledge base of supported call classes is updated with the new mean variance information. Knowledge base creation or update processing according to the present embodiment then terminates naturally.
  • the estimated load is recalculated every time a call event such as a new call admission, an admitted call termination or an admitted call classification modification event has occurred with respect to an individual call processing unit, whether or not responsive action is taken by that individual call processing unit or the master call processing unit.
  • a call event such as a new call admission, an admitted call termination or an admitted call classification modification event has occurred with respect to an individual call processing unit, whether or not responsive action is taken by that individual call processing unit or the master call processing unit.
  • the current collective mean ( ⁇ t ) and variance ( ⁇ 2 t ) values which are used to generate the estimated load measure for a particular PMCi are selectively updated upon detection of a call event involving such PMCp, by either the PMC-M or the PMCp as well as the particular type of call event.
  • step 602 a determination is made whether the detected call event involves a new call admission request. If so, control passes to step 608 in which a "new" estimated load mean and variance for the involved individual call processing unit, e.g. PMCp, is determined by adding the requested "new" call class ( ⁇ x ) load mean ⁇ and variance ⁇ 2 values to the current estimated load mean ( ⁇ t ) and variance ( ⁇ 2 t ) values for the entire PMCp as specified in the resource knowledge base corresponding to the PMCp. It should be noted that this is made possible through application of the central limit Theorem discussed above. Thereafter, control passes to step 610 in which the load estimator determines the estimated load measure (e.g.
  • step 602 a determination is made whether the detected call event specifies an admitted call termination request. If so, control passes to step 606, in which the new mean and variance values for the estimated load are calculated by subtracting the class load mean and variance values associated with the terminated call, as contained in the corresponding resource knowledge base, from current estimated load mean and variance values. Thereafter, control passes to step 610, where again the estimated load measure is determined based on the new mean and variance values for the estimated load as calculated in step 606.
  • step 604 a determination is made whether the detected call event specifies an admitted call class modification request.
  • Call class modifications such as in-process call upgrades or downgrades, can be specified for an admitted call. Examples include requesting a call type modification (such as from voice to data) for an existing call, a call class modification within a common type (such as from interactive data type call to a streaming data call or requesting a transition in class- specific call parameter (such as the toggling the presence or absence of ciphering in voice calls or changing the bit rate for data calls).
  • the estimated load measure may comprise any number of values suitable to quantatively represent the estimated load, including the aforementioned probability of exceeding a nominal load ⁇ (p)— or a ratio between the nominal load and the estimated load mean, equation x 2 (p) mentioned above.
  • other expressions of the estimated load measure may be used as would be understood by those ordinarily skilled in the art.
  • the estimated load parameter is used by the PMCp and/or by the managing call processing unit PMC-M to help undertake eagerness determination and ultimately load balancing and call admission control according to the disclosed embodiments. Thereafter event driven load estimation updating terminates naturally.
  • Load estimation is calculated based on the call information contained in the admitted call table at a particular time (e.g. time X in the figure).
  • control initiates at step 702, in which a distribution of the call classes in the admitted call table (such as table at a given time X is made). This distribution identifies how many calls within a given call class are contained in the admitted call table. This distribution is used to help develop an estimated load mean and load variance value for the entire call processing unit based on the class distribution.
  • step 704 the estimated load mean ( ⁇ t ) and variance ( ⁇ 2 t ) for the entire call processing unit PMCp is calculated using the class distribution obtained in step 702 along with the class load mean and variance values contained in the resource knowledge base. In particular, this is done by multiplying the number of calls in the class as recorded in the class distribution by its associated class load mean and variance values contained in the resource knowledge base corresponding to the call processing unit of interest. This is done for each class represented in the class distribution calculated in step 702. Thereafter, in step 706, the estimated load measure (e.g. ⁇ (p) or x 2 (p)) based on the new estimated load mean and new estimated load variance for the call processing unit PMCp is determined.
  • the estimated load measure e.g. ⁇ (p) or x 2 (p)
  • step 708 an estimated load parameter based on the aforementioned load measure is then issued for further processing, including eagerness determination for the PMCp of interest. Thereafter load estimation calculation according to this alternative embodiment terminates.
  • the eagerness ⁇ (p) of a given individual call processing unit PMCp is generated with respect to fuzzy logic analysis of the actual load X ⁇ (p) and estimated load ( ⁇ (p) or x 2 (p)) parameters for the PMCp.
  • homogeneousness of the PMCp is not taken into consideration.
  • Such eagerness processing may be conveniently undertaken within the PMCp itself, such as within the resource utilization and fuzzy logic units 1320, 1325 of the PMCi 1300 shown in Fig. 13.
  • processing may be carried out on the behalf of the PMCp by the master call processing unit, such as through the resource utilization and fuzzy logic units 1850, 1860 of the PMC-M 1800 shown in FIG. 18, although in such case input from the weighting engine 1640 would not be utilized, nor would x 3 (p) be realized by the resource utilization unit 1850.
  • eagerness processing within this embodiment begins at steps 812 and 810 in parallel, in which the actual load parameter x ⁇ (p) is generated by the e.g. the load monitor 1325 shown in FIG. 13 (step 812) and the estimated load parameter ( ⁇ (p) or x 2 (p)) is generated by e.g. the load estimator 1330 shown in FIG. 13 pursuant to load estimation described above with reference to FIGs. 6 or 7.
  • Control thereafter passes to steps 814 and 816 in parallel, where these actual and estimated load parameters are each fiizzified into respective fuzzy logic states representative of the conditions they quantify, such as through fuzzifier 1 1342 and fuzzifier 2 1344 respectively.
  • ⁇ (p) 1960 450 MIPS
  • five discrete membership states are specified: vl (very low) 1910 associated with a range between 0 and 150 MIPS, lo (low) 1920 ranging from 50 to 250 MIPS, me (medium) 1930 ranging from 150 to 350 MIPS, hi (high) 1940 ranging from 250 to 450 MIPS, and vh (very high) 1950 ranging from 350 to ⁇ (p).
  • vl (very low) 1910 associated with a range between 0 and 150 MIPS
  • lo (low) 1920 ranging from 50 to 250 MIPS
  • me (medium) 1930 ranging from 150 to 350 MIPS
  • hi (high) 1940 ranging from 250 to 450 MIPS
  • vh (very high) 1950 ranging from 350 to ⁇ (p).
  • first fuzzifier 1342 along with the second fuzzifier 1344 and the third fuzzifier 1448 (FIG. 14) are designed such that: 1) at most two fuzzy variables will have non zero values for the same input; and that the sum of the membership functions of all fuzzy variables for each input value would be 1.
  • other configurations can be used without departing from the teachings of the present invention.
  • the second fuzzifier 1344 acts on the load estimation parameter x 2 ( ⁇ ), or, alternatively, the probability of exceeding the nominal load ( ⁇ ) directly, where ⁇ itself is obtained from the Gaussian distribution error function explained above.
  • FIG. 20 illustrates the membership response curve for the second fuzzifier 1344, which again defines five discrete membership states or fuzzy result values vh 2010, hi 2020, me 2030, lo 2040, and vl 2050.
  • step 820 an inference mechanism such as a rules base, part of the PMCi load balancing fuzzy logic 1346 in FIG. 13 or the fuzzy logic unit 1860 of the PMC-M 1800 of FIG. 18, applies a finite series of rules against the possible combinations of the fi(p) and f 2 (p) in order to arrive at a rules result.
  • an inference mechanism such as a rules base, part of the PMCi load balancing fuzzy logic 1346 in FIG. 13 or the fuzzy logic unit 1860 of the PMC-M 1800 of FIG. 18, applies a finite series of rules against the possible combinations of the fi(p) and f 2 (p) in order to arrive at a rules result.
  • an individual rules result approaching 1 indicates that the PMCi is very eager to accept a new call or call upgrade
  • an individual rules result approaching 0 indicates that the PMCi of interest is not eager at all to accept a new call or call upgrade.
  • Dashed line 2410 represents the threshold to the right of which the individual PMCi( ⁇ ) decides to reject the call admission or upgrade. The rules are determined with a "common sense" logic, for each different case.
  • step 820 these rules results are "deffuzified” to obtain the eagerness of accepting a new call or a call modification upgrade ⁇ (p) , which again may be conveniently carried out by the fuzzy logic units 1346, 1860.
  • an averaging algorithm called “centroid defuzzification” may be utilized (also known as “center of gravity” defuzzification).
  • centroid defuzzification also known as “center of gravity” defuzzification. The formula for the defuzzification is as follows:
  • the final output is also a fuzzy variable, as a result, the eagerness has always a value between 0 and 1.
  • steps 810 and 812, and steps 814 and 816 are shown executing respectively in parallel.
  • teachings of the invention are not intended to be so limited, and in fact nonparallel execution of these steps can occur as long as fl(p) and f2(p) can be obtained such that rules can be applied to their combination as described above without either becoming stale.
  • Eagerness determination processing according to an alternative embodiment of the invention will now be detailed with reference to FIG. 9.
  • the homogeneousness of the given individual call processing unit PMCp load is ascertained along with fuzzy logic analysis of the actual load and estimated load parameters for the PMCp as described above with reference to FIG. 8.
  • Eagerness determination according to this embodiment may be conveniently performed by the resource utilization 1320 and fuzzy logic unit 1340 of the PMCi 1300 shown in FIG. 13 in combination with the homogeneousness realization 1650 and eagerness determination unit 1660 of the PMC-M 1600 shown in FIG. 16, although other configurations may be utilized, as will be recognized by those of ordinary skill in the art.
  • step 918 of FIG. 9 is calculated, here in parallel with fuzzification of the actual and estimated loads (steps 814 and
  • step 816 Sequencing this after step 810 is important since homogeneousness or x 3 (p) is dependent in part on the new estimated load mean ⁇ t calculated in step 810 as a precursor to obtaining either x 2 ( ⁇ ) or ⁇ (p).
  • step 918 need not occur in parallel with either step 814 or 816 as shown in FIG. 9, and can occur at any point in time after the new estimated load mean is determined (such as by the load estimator 1330 shown in FIG.13) and before the new eagerness value for the PMCp is determined (step 924).
  • the homogeneousness parameter, x (p) is determined in accordance with the following equation: balance target for the new or upgraded call class minus the (new or upgraded call class mean/new estimated load mean for the PMCp), or ⁇ ( ⁇ x )- ⁇ ( ⁇ x )/ ⁇ t .
  • the balance target is the ratio of the number of calls in class ⁇ x to the total number of calls in the call processing system managed by the master call processing unit, such as PMC-M 1600 shown in FIG. 16.
  • the PMC-M in order to obtain a valid balance target ⁇ ( ⁇ x ), the PMC-M should have access to the admitted call tables for every PMCi it services (including the PMCp), if not a copy locally accessible to it, such as admitted call tables 1634 contained in local memory 1630.
  • a weighting engine 1640 forming part of the PMC-M may be utilized to maintain and update balance targets ⁇ for all supported call classes and current state of all PMCl ...PMCk admitted call tables.
  • particular balance target information for a call class of interest such as one for a new call or an upgraded existing call, may be issued by the weighting engine to a homogeneousness realization unit (such as unit 1650 within PMC-M 1600 or unit 1435 forming part of the resource utilization unit 1420 for the PMCi 1400 shown in FIG. 14.
  • a homogeneousness realization unit such as unit 1650 within PMC-M 1600 or unit 1435 forming part of the resource utilization unit 1420 for the PMCi 1400 shown in FIG. 14.
  • x 3 (p) should approach 0, so that the ratio of calls of class ⁇ x to the total number of calls being handled by the PMCp matches the overall ratios experienced by the entire call processing system, from which ⁇ ( ⁇ x ) is derived. If x 3 (p) ⁇ 0, this means that the PMCp has more than the average number of admitted calls of class ⁇ x, and admission of the new or upgraded call of class ⁇ x should be rejected or at least disadvantaged. Conversely, if x (p) > 0, this means that the PMCp has less than the average number of admitted calls of class ⁇ x , and the admission of the new or upgraded call of class ⁇ x should be encouraged.
  • Determination of homogeneousness as specified in step 918 may be conveniently implemented by a homogeneousness realization unit adapted to calculate x3(p) as discussed above.
  • This homogeneousness realization unit may be situated onboard the PMCp for which eagerness is being determined, as is best shown in FIG. 14 through homogeneousness realization unit 1435 accepting the new estimated load mean and new or upgraded call class mean (contained in the resource knowledge base 1310) from the load estimator 1430 local to such PMCp).
  • the ⁇ ( ⁇ x ) is sent from the PMC-M including the aforementioned weighting engine, such as PMC- M 1800, to the PMCp.
  • the homogeneous realization unit may be situated locally within the PMC-M, such as that shown in FIG. 16, where the PMCp sends the new estimated load mean and new or upgraded call class mean to the PMC-M to permit such realization to occur.
  • eagerness determination processing also includes obtaining an intermediate eagerness ⁇ (p) in step 922, followed by final ⁇ (p) which takes into account ⁇ (p) and the aforementioned x (p) calculated in step 918.
  • Eagerness determination for individual call processing unit will now be detailed with reference to FIG. 10.
  • Processing according to this embodiment differs from processes previously described with reference to FIGs. 8 and 9 in that homogeneousness is also fiizzified (step 1010) with respect to a 3 membership state (H 2110 "high” to indicate that the ratio of calls of class ⁇ x being handled by the PMCp exceeds the ⁇ x balance target average for the entire call processing system, M 2120 “medium” to indicate that this ratio within the PMCp is approaching the balance target, L 2130 “low” means that the PMCp is handling fewer calls of class ⁇ x than average for the entire call processing system) fuzzification response curve shown in FIG. 21.
  • the rules base is applied to all three fuzzified parameters in step 1020.
  • the following table illustrates such a rules base, if fl(p) and f2(p) are simplified to 3 member states each as well:
  • step 1022 the rules result to fuzzify to directly obtain the new eagerness value ⁇ (p) for the individual call processing unit (step 1022).
  • eagerness determination processing according to the embodiment of FIG. 10 ends.
  • the individual call processing unit arrangement shown in FIG. 14 may conveniently implement the processing described above with reference to FIG. 10.
  • such processing may occur within the managing call processing unit such as that shown in FIG. 18 on behalf of the given individual call processing unit, assuming that the actual load for that call processing unit is made accessible to the PMC-M, such as through realization and transmission of the actual load x ⁇ (p) parameter by the individual PMCp to the PMC-M by the load monitor 1325 depicted in the PMCi 1700 shown in FIG. 17.
  • FIG. 11 illustrates call event, including call admission processing undertaken by a given one of the individual call processing units , including when the PMC-M identifies a potential PMCi for call admission based on early CAC processing.
  • step 1210 early CAC processing within the PMC-M begins at step 1210, in which upon detection of a call admission request, the PMC-M control logic, such the PMC-M management unit 1510 (FIG. 15), 1610 (FIG. 16) or 1810 (FIG. 18) queries the current eagerness values ⁇ ... ⁇ k corresponding to each of the individual call processing units PMCi .
  • the PMC-M may prompt each PMCi for this eagerness information as needed or periodically as will become apparent to those ordinarily skilled in the art. Once these eagerness values are obtained, control thereafter passes to step 1210, in which upon detection of a call admission request, the PMC-M control logic, such the PMC-M management unit 1510 (FIG. 15), 1610 (FIG. 16) or 1810 (FIG. 18) queries the current eagerness values ⁇ ... ⁇ k corresponding to each of the individual call processing units PMCi .
  • the PMC-M may prompt each PMCi for this eagerness information as needed or periodically as will become apparent to
  • step 1212 in which the maximum relative eagerness value is determined based on the all the eagerness values (a.k.a. eagerness vector ⁇ ) obtained in step 1210. Thereafter in step 1214, a determination is made whether the maximum eagerness value exceeds a threshold.
  • This threshold can be a uniform threshold for any call or can be based on the type of call, its associated cost of service such as bronze, silver or gold, and or based on other factors such as originator status, intended recipient status, etc. For example, consider the threshold malleability chart of FIG. 22. Here, a bronze CoS call will not be admitted if the reported eagerness for any of the PMCis fails to exceed ⁇ , thus in eagerness situations 2222, 2226 and 2228 the call is rejected, but accepted in situation 2220.
  • a silver CoS call will not be admitted if the reported eagerness for any of the PMCis fails to exceed ⁇ s , so in situations 2236 and 2238 the silver class call is rejected, and in situations 2230 and 2232 it is accepted. And, a gold class call will only fail to be accepted where the reported eagerness for each of the PMCis fails to exceed the gold threshold ⁇ g , such as in situation 2248.
  • step 1218 the call admission request under scrutiny is transferred to the PMCi exhibiting the maximum eagerness value.
  • the eagerness value for such call processing unit is recalculated taking into account the call class specified by the call in the new call admission request. Eagerness and load estimate determination as described herein may conveniently be used to confirm the new eagerness value.
  • the decision to ultimately admit the call pursuant to the call request rests not with the PMC-M here, but by the PMCi exhibiting the maximum eagerness to accept a new call. Control thereafter passes to step 1224.
  • the corresponding individual call processing unit's reported eagerness is disadvantaged such as by scaling its corresponding ⁇ by a factor of 0.8 - 0.9 for at least one iteration. Control thereafter ends. It should be noted that this processing will restart with the disadvantaged ⁇ replacing the previous maximum ⁇ .
  • step 1110 upon receipt of a call event directed to a given one of the PMCis either internally from calls already admitted or by the managing call processing unit responding to a call admission request, as outlined above with reference to PMC-M processing described with reference to FIG. 12. If at step 1110, a determination is made that the call event includes a call admission request, control passes to step 1112. At step 1112, the PMCi undertakes determination of the estimated load which includes the new call, such through processing described herein with reference to FIGs. 6 and 7. Thereafter, in step 1114, an intermediate or final eagerness value for the PMCi based on this newly obtained estimated load in step 1112 is determined using e.g.
  • step 1116 the intermediate or final eagerness determined in step 1114 is compared against a threshold (similar if not the same as the threshold used by the PMC-M in step 1214 of FIGs. 12 and 22). If the new eagerness(which as noted above takes into account the new call) fails to exceed this threshold, control passes to step 1118 in which the new call is rejected by the current PMCi and previous estimated load values and eagerness values are restored to reflect the situation prior to consideration of the new call. Control thereafter terminates.
  • step 1116 If however, in step 1116, it is determined that the new eagerness does in fact exceed the threshold, control instead passes to step 1122.
  • step 1122 the PMCi admits the call.
  • step 1124 a determination is made whether the PMC-M should be apprised of the newly calculated eagerness value.
  • a content- based messaging conservation algorithm may be used as depicted in FIG. 23.
  • the new and previous eagerness both fall within region 2310(i.e. new and old ⁇ (p) ⁇ 0.25), the eagerness is determined not to be sensitive since the PMC-M will not consider the current PMCi for call admission anyway, as it fails to exceed the minimum threshold discussed above.
  • the new eagerness and old eagerness both fall within region 2330 (i.e. new and old ⁇ (p) >0.75), the current PMCi is deemed eager to admit calls anyway and so is not sensitive to the change.
  • the new eagerness or old eagerness falls in region 2320 (i.e.
  • the eagerness is deemed sensitive to the change and so the new eagerness is reported to the PMC-M.
  • other techniques may be used alternatively or in combination, such as thresholding the change in eagerness between old and new, reporting only after so many determinations, and the like.
  • the new estimated load is calculated with respect to a difference between the new and prior class characteristics for the call in which modification is requested. If, however, in step 1132, it is determined that the call modification request specifies a call downgrade, control instead passes to step 1134 in which the new estimated load is determined based on the downgrade. Then, in step 1136, the new intermediate or final (depending on PMCi resource utilization capabilities) eagerness is re-determined taking into account the new estimated load obtained in step 1134. Thereafter, processing continues with the conditional publication or reporting steps 1124 and 1126 detailed above. If, in step 1130, a determination is made that the intercepted call event does not comprise either a call admission request or an admitted call modification request, control passes to step 1140.
  • step 1140 a determination is made whether the call event includes an admitted call termination event. If so, control passes to step 1134 through 1126 detailed above, with the exception that the estimated load is recalculated without consideration of the terminated call. If, however, in step 1140, it is determined that the call event is not one of a call admission request, a call modification request, or a call termination request, the call event falls through to conventional call management processing (not shown in the FIG.), or in the alternative, is not recognized or acted upon at all by the PMCi of interest.
  • FIG. 13 depicts an individual call processing unit PMCi 1300 arrangement which includes a resource utilization unit 1320 and fuzzy logic unit 1340 capable of determining intermediate or final eagerness values based on actual and estimated load parameters.
  • the PMCi 1300 may conveniently implement estimated load processing discussed above with reference to FIGs. 6 and 7, and eagerness determination according to embodiments shown in FIGs. 8 and 9.
  • the PMCi 1300 coordinates with a PMC-M capable of determining a final eagerness including homogeneousness realization, such as PMC-M 1600 shown in FIG. 16.
  • the PMCi 1300 self determines a final eagerness which may then be reported to a PMC-M such as PMC-M 1500 shown in FIG. 15.
  • FIG. 14 illustrates an alternative PMCi 1400 arrangement which includes onboard homogeneousness realization and fuzzification consistent with the present invention.
  • the PMCi 1400 may conveniently implement eagerness determination consistent with the embodiment shown in FIG. 10, and may coordinate results with any PMC-M including a mechanism for generating and managing balance targets, such as PMC-M 1600 (FIG. 16) or PMC-M 1800 (FIG. 18).
  • FIG. 17 illustrates yet another alternative PMCi 1700 arrangement in which the PMCi 1700 does not include any fuzzy logic for load balancing or call admission control but does include a load monitor 1325 capable of obtaining xl(p) as described above. It is contemplated that in this embodiment, such load balancing and call admission control functionality will be undertaken by the managing call processing unit, such as the PMC-M 1800 shown in FIG. 18.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

Improved techniques and apparatus are disclosed and determining the eagerness of a call processing unit (1400) within a multiple processor call processing system to accept a new call or call upgrade, as well as call admission control and other call event processing, as well as systemwide load balancing through query of such eagerness for estimating the load of a call processing unit (1400) within a call processing system. Load estimation may occur at a given time, on a periodic or event-driven basis, including upon perception of a call event such as a new call admission request, call modification request, or a call termination event. The load estimation here is dependent upon at least one of load mean and variance estimates and may be approximated using a probabilistic distribution function. Fuzzy logic and associated analysis is used to determine eagerness. Eagerness is determined in part with reference to the actual load of a call as well as an estimated load.

Description

METHOD AND APPARATUS FOR LOAD ESTIMATION AND CALL ADMISSION CONTROL IN A CALL PROCESSING ENVIRONMENT
Technical Field
This invention generally relates to communications inf astructure and techniques, and particularly concerned with load estimation, load balancing and call admission control in call processing systems including plural call processing units. Background In light of recent tightening of the credit markets and general drop-off in demand, telecommunications carriers are increasingly pressured to maximize return on their existing infrastructure and/or make wise choices when purchasing and deploying new telecommunications gear. Due in part to their cost and potential performance bottlenecks, especially when stretched beyond nominal loads, call processing systems within the carrier's network have drawn heightened scrutiny.
Traditional time-domain switched, voice-oriented communications typically included a single call processing unit tightly coupled to and servicing a given switch fabric or a dedicated portion thereof. Smaller, enterprise class telecom solutions such as a digital key system or private branch exchange included one call processing unit, whereas carrier grade central office solutions involved a dedicated call processing unit servicing a dedicated portion of the central office switching fabric. TDM switching techniques assured an actual or virtual connection be established and maintained for the duration of a call, at the expense of resource efficiency and dynamism.
With the advent of packet voice and Softswitch solutions, the call processing system has been decoupled from the actual or virtual switching fabric, and a number of multi call processor designs have been developed, since each call processing unit is now capable of handling almost any call originating from or terminating to the larger system.
A representative multiple processor call processing system 2500 is shown in FIG. 25. In this system, plural individual call processing units or PMCis, including PMC1 2515, PMC2 2520 and PMCK 2525, collectively handle a number of call processing activities and call events. The managing call processing unit, or PMC-M 2510, performs other call processing activities and events. Typically, in known multiple call processing system architectures, the PMCis 2515, 2520 and 2525 handle the bulk of the "individual call" oriented activities and tasks based on the subset of system calls they are assigned to handle, whereas the PMC-M 2510 is primarily responsible for coordinating operation of the PMCis as needed for e.g. system-wide call admission control and load balancing. As such, communication of coordination information (not shown) such as control, status, query and reporting messages occurs between these PMCis 2515, 2520, 2525 and the PMC-M 2510 on a periodic and/or event-driven basis.
A well-known goal that multiple processor call processing systems (such as the system 2500 shown in FIG. 25) attempt to achieve with respect to call admission control (CAC) and load balancing is to maintain the QoS commitment for the existing calls (failing to introduce delays or packet loss in packet voice environments) while maximizing call capacity, and ultimately maximize efficient use of the call processing system. In particular, the goal is to provide an optimally efficient call handling situation in which all individual call processing units in a multiple processor call processing system are evenly and homogeneously (with respect to call type) loaded. Thus, ideally, call admission control and new call blocking will thus only occur where all these call processing units are already fully or nominally loaded.
To illustrate, refer to a simplified capacity diagram of a 6 PMCi call processing system 105 shown in FIGs. 1A - IC, generally consistent with the system 2500 shown in FIG. 25. As shown in each of FIGs. 1A - IC, each box within each PMC1..PMC6 represents a call, with a shaded box 110 representing a voice call, a diagonally hatched box 112 representing a streaming call, and a transparent double- sized box 114 representing an interactive call. Each container 107 represents the capacity of a given PMCi. In FIG. 1A, the PMC1..PMC6 of the multiple processor call processing system 105 are evenly and homogeneously loaded; in FIG. IB, the load is even but not homogeneous; and in FIG. IC the load is neither homogeneous nor even.
The risk presented in FIG. IB is that if there is congestion in a given PMCi, most calls within such PMCi might be of the same nature thus making any call processing remedy more difficult. The risk posed in FIG. IC is that while there is unused capacity on some PMCis (e.g. PMC2, PMC4, PMC6), other PMCis (e.g. PMCI, PMC3, PMC5) are very close to their nominal load and may not be able to provide sufficient service level guarantees or quality of service to the calls they are handling. As a result, the goal should be to maintain even and homogenous loads, as in FIG. 1 A. It should also be noted that even this situation, the call processing system 105 may reach a loading condition where all the PMC1...PMC6 are nominally loaded, and that the system cannot admit further new calls, but this situation happens less often, if the load is even.
In order to load balance a multiple processor call processing system, it is important to be able to estimate the resource utilization (in terms of e.g. bandwidth or processor loading) on each call processing unit at any given time. Once the per PMCi estimated loading is determined, one can prospectively load balance by admitting new calls based on which PMCi has free resources to handle the call. A simple brute force approach is to assume that each call is taking up its peak resource requirement, and that the estimated load is based on this peak resource requirement multiplied by the number of calls being handled. This obviously results in inefficient resource under- utilization, and cuts against some of the perceived variable bandwidth advantages afforded by packet voice transmission.
Another known technique for estimating resource utilization is based on a relationship between peak and average resource requirements for a given call type, e.g.: estimated resource utilization = 2 * peak * average (1) for a given call type peak + average
Once the estimated resource utilization for the different call types being or expected to be handled within the call processing system is known, each PMCi's loading can be estimated by determining how many of each type of calls are being serviced and multiplying each by this estimated resource utilization. Resources can be reserved within each of the PMCi consistent with the estimated resource utilization and additional capacity can be determined based on remaining unreserved resources.
FIG. 2 illustrates a plot of the estimated resource utilization curve 202 corresponding to the above equation (1) versus the average resource utilization. It is obvious that in this method, when the average is significantly smaller than the peak resource requirement, the estimated resource utilization approaches twice the average resource requirement (e.g. approaches line 204 having a slope of 2 in FIG. 2). Otherwise, the estimated resource utilization is always a 1 to 2 multiple of the average (e.g. always greater than, but approaching line 206 having a slope of 1). Ergo: average « peak = estimate = 2 x average (2)
average ≡ peak =» estimate = average = peak (3)
Thus, using this estimated resource utilization technique results in a overly- conservative loading estimation for the PMCis (i.e. more resources are estimated being used than is actually the case) and can potentially result in a false determination that a given PMCi is at nominal loading or overloaded when it still in fact has capacity. The call processing system can refuse to provide additional calls to the so- overloaded PMCi which still in fact has capacity, and worse, may prematurely refuse to admit further calls to the system (assuming the remaining PMCis are estimated at nominal or greater loading). On the other hand, it is unlikely that any one PMCi will be overbooked or oversubscribed, thus this technique is perceived at least being QoS friendly.
Further, this estimated resource utilization technique does not take into account the number of calls of a given call type being handled by a given call processing unit. It is well-known that the greater the number of calls of a given type are being serviced by a given PMCi, the less is the chance that these behave all the same way or utilize the same amount of resources. For example, it is highly improbable that, in the case of packet voice with silence suppression, that all handled calls will be in the more resource demanding active, vs. silent state. To account for this, it is generally known that a provisionable reservation coefficient may be used to augment the above to permit the PMCis and the call processing system generally to accept more calls than the above-described conservative estimation technique calls for, and in the typical and aggressive cases, more than the PMCis and the call processing system as a whole can actually handle. This overbooking is justified, especially as the number of calls increases and the probabilistic nature of the calls with respect to actual resource utilization (the probabilistic nature of calls, as used herein, means that the resource utilization for each call varies by time and may have any of a finite or infinite number of values within known limits). However, in known systems, the reservation coefficient, although provisionable, is constant within different hours of a day or days of a week, unless a system operator manually alters or updates it.
Therefore, it would be advantageous in a multiple processor call processing system if critical resource utilization could be better predicted, and consequently more accurate call admission and load balancing decisions could be made without detracting from overall system performance.
Summary of the Invention The present invention is directed to improved techniques and apparatus for estimating the load of a call processing unit within a call processing system. This load estimation may occur at a given time, on a periodic or event-driven basis, including upon perception of a call event such as a new call admission request, an admitted call modification request, or an admitted call termination event. The load estimation here is dependent upon at least one of load mean and variance estimates.
In accordance with an embodiment of the invention, in which the call processing unit is capable of handling a plurality of call classes, with each call class defining a class load mean and load variance, a method and apparatus is provided which includes determination of call classes based on admitted calls handled by the call processing unit at the selected time, calculation of an estimated load mean and an estimated load variance for this call processing unit based on this distribution, along with the class load mean and variance for at least one of the call classes specified by the distribution, and derivation of an estimated load measure from at least one of the estimated load mean and the estimated load variance.
In accordance with another embodiment of the invention, a method and apparatus is provided to estimate the load a call processing unit which includes detection of a call event which specifies a call of a given call class, calculation of a new estimated load mean based on a current estimated load mean and a class load mean for the given call class, calculation of a new estimated load variance based on a current estimated load variance and a class load variance for the given call class, and derivation of an estimated load measure from at least one of the new estimated load mean and load variance. Consistent with these and other disclosed embodiments, the estimated load measure may represent a utilization value of a call processing unit resource such as bandwidth or processing load. Further this estimated load measure may relate to the call processing unit's probability of exceeding it's nominal load.
Furthermore, consistent with these and other disclosed embodiments, class load mean and variance may be derived from a probabilistic approximation of the associated call class, such as a Gaussian distribution. The use of estimated load mean and variance consistent with these embodiments permits a more accurate assessment or prediction of a call processing units loading than heretofore available, especially where the load mean and variance are related to resource utilization within the call processing unit. Building such assessment or prediction using probabilistic approximation of call class mean and variance parameters is believed to yield even more accurate results. Such results can be useful in call processing management functions, including systemwide load balancing and call admission control in multiple processor call processing environments. The present invention is directed in part to techniques for determining the eagerness of a call processing unit within a multiple processor call processing system to accept a new call or call upgrade. The invention is also directed to call admission control and load balancing through query of such eagerness. In particular, fuzzy logic is used to determine the eagerness. Accordingly, consistent with one embodiment of the invention, in a call processing unit capable of supporting a plurality of call classes, method and apparatus are provided which includes attainment of first and second load parameters for the call processing unit, fuzzification of the first and second load parameters according to respective first and second fuzzification functions, comparison of the fuzzified first and second load parameters against a defined set of eagerness rules, defuzzification the result of the comparing, and generation of an eagerness to admit a call based on this defuzzification result. The first and second load parameters may include actual and estimated load parameters for the call processing unit. The estimated load may be approximated using a probabilistic distribution function. Homogeneous may also be consulted in determining this eagerness.
Consistent with another embodiment of the invention, in system comprising plural call processing units, a call admission apparatus and method are provided which includes monitoring a call admission eagerness for each of the call processing units, perception of a call admission request, selection of a target call processing unit having a relative maximum call admission eagerness, confirmation of the call admission eagerness by this target call processing unit with respect to the call admission request, and admission of the call if the target call processing unit eagerness is confirmed. Predetermined or call class malleable thresholding may be utilized to help determine if a target call processing unit is available. Also, another target may be selected if the eagerness cannot be confirmed by the initial target, and may be recursively performed until either no target call processing units are available, the admission request is withdrawn, or the call is successfully admitted.
Additional aspects and advantages of this invention will be apparent from the following detailed description of embodiments thereof, which proceeds with reference to the accompanying drawings.
Brief Description of the Drawings FIGs. 1A - IC illustrate the concepts of load balancing and homogeneousness consistent with the present invention.
FIG. 2 illustrates conventional load estimation. FIG. 3 illustrates resource utilization modes and probabilities.
FIGs. 4A - 4C illustrate Gaussian characteristics of load estimation consistent with the present invention.
FIG. 5 is a flowchart illustrating resource knowledge base processing according to an embodiment of the invention. FIG. 6 is a flowchart illustrating load estimation processing according to an embodiment of the invention.
FIG. 7 is a flowchart illustrating load estimation processing according to an alternative embodiment of the invention.
FIGs. 8, 9 and 10 are flowcharts illustrating eagerness determination processing according to respective alternative embodiments of the invention.
FIG. 11 is a flowchart illustrating PMCi call event processing according to an embodiment of the invention.
FIG. 12 is a flowchart illustrating PMC-M call admission processing according to an embodiment of the invention. FIG. 13 is a simplified block diagram of a PMCi 1300 consistent with eagerness determination processing shown in FIGs. 8 and 9.
FIG. 14 is a simplified block diagram of a PMCi 1400 consistent with eagerness determination processing shown in FIG. 10.
FIG. 15 is a simplified block diagram of a PMC-M 1500 consistent with the PMCi arrangements shown in FIGs. 13 and 14.
FIG. 16 is a simplified block diagram of a PMC-M 1600 consistent with the PMCi arrangement shown in FIG. 13.
FIG. 17 is a simplified block diagram of a PMCi consistent with the PMC-M arrangement shown in FIG 18. FIG. 18 is a simplified block diagram of a PMC-M consistent with the PMCi arrangement shown in FIG. 17. FIG. 19 diagrammatically illustrates fuzzification of the actual load parameters according to an embodiment of the invention.
FIG. 20 diagrammatically illustrates fuzzification of the estimate load parameters according to an embodiment of the invention. FIG. 21 diagrammatically illustrates fuzzification of the homogeneousness parameters according to an embodiment of the invention.
FIG. 22 illustrates threshold malleability by cost of service according to an embodiment of the invention
FIG. 23 illustrates eagerness reporting conservation according to an embodiment of the invention.
FIG. 24 illustrates a two variable fuzzy logic rules base according to an embodiment of the invention.
FIG. 25 is a simplified block diagram of a call processing system including plural individual call processing units coordinated by a single managing call processing unit.
Detailed Description of the Embodiments Unless otherwise noted, the listed terms below, including abbreviations and symbols, will have the following meaning ascribed to them: term meaning
Kbps kilo bits per second
MIPS Mega instructions Per Second msec milliseconds pdf Probability Distribution Function pmf Probability Mass Function
PMCi individual call processing unit
PMC-M master or managing call processing unit
CoS Class of Service or allocation-retention priority
UMTS
QoS Quality of Service
UMTS Universal Mobile Telecommunications System
K number of PMCis
L number of inputs to a fuzzy logic
N number of call classes
R number of rules in a fuzzy logic fi i-th fuzzified value mi load in state i
Pi probability of state i β CoS coefficient ε CAC threshold γ activation of a rule θ class type μ mean σ standard deviation σ2 variance
Θ nominal load ζ probability of exceeding the nominal load Θ φ eagerness Φ eagerness vector ω balance target
Ω balance target vector
For simplification purposes only, and not meant to limit the teachings of the present invention in any fashion, the disclosed embodiments presumes a single master call processing unit (PMC-M) coordinating several individual call processing units or PMCis (e.g. on the order of 50-60). See, e.g. FIG. 25. Further, each call is presumed follows a probabilistic behavior in accordance with a finite number of predefined states and that that resource utilization associated with such states can be determined.
In determining whether to admit a new call, one should see if the necessary resource such as processing load or bandwidth for the new call is available. If the resources are available, the new call can be admitted. An important action in packet switching systems with QoS requirement is resource reservation. It means that by admitting each new call, the call processing system or constituent call processing unit should restraint its eagerness to accept more new calls.
LOAD ESTIMATION THEORY
As previously discussed, if the reservation is made based on the maximum possible resource utilization or a relationship that disregards or oversimplifies the number of actual calls and their probabilistic nature, the efficiency of the call processing system will be less than optimal. Hence, consistent with one aspect of the present invention, the disclosed embodiments of the present invention utilize a probabilistic estimation of the load, based on the mean and variance of the resource utilization for a given call type and experienced call distribution.
In particular, the call processing system according to a specific embodiment generates a probability distribution function (pdf) of the total resource usage. Then based on this pdf, this call processing system calculates the probability of exceeding a nominal load value on a per call processing unit basis. This probability should be kept lower than a given threshold, by rejecting some calls if they are coming at the time of heavy load. So, in CAC and load balancing consistent with the present invention, this probability gives this call processing system a measure to accept or reject the new call or, if sufficient capacity exists at the system level, to determine which PMCi should handle the call.
To ease understanding, one or more of the disclosed embodiments selectively perform "static" load balancing of the call processing system based at least in part on the aforementioned estimated load results. Such load balancing seeks to make load balancing decisions (which call processing unit, if any?) with respect to a call only during admission of the call. However, the teachings of the present invention are not intended to be so limiting, and, as will be appreciated by those ordinarily skilled in the art, can be easily applied to dynamic load balancing of calls where, in addition to balancing at admission, calls can be transferred from one call processing unit to another on fly. See, e.g. Willebeek-LeMair et al., "Strategies for Dynamic Load Balancing on Highly Parallel Computers", IEEE Transactions on Parallel and Distributed Systems, Vol. 4, No. 9, September 1993 incorporated herein fully by reference. In the disclosed embodiments, assume that an individual call processing unit or PMCi can handle a limited number of call types or classes of service: θi ... ΘN- Each class has a different set of QoS requirements.
For each call type or class, there is basically a different probabilistic pattern. The probabilistic pattern identifies the probability (or percentage) of being in a set of discrete states or modes (active/non active or silence/speech, for example). Assuming there is usually 1 to 3 discrete states and that the resource usage in each state individually is known through empirical data, predictive analysis, or otherwise. For example, in packet voice conversation, there are two basic states: silence and speech. The probabilities of being in each one of these two states is pi and p2 , where p +p = 1. The resource utilization for each state is
Figure imgf000015_0001
and m , respectively. The goal is to estimate the load (resource utilization) based on the a priori knowledge about these classes and the number of existing calls in each type or class. The probabilistic behavior of the calls is usually reflected in the active/silence mode of voice or the burstiness (of data). For discussion purposes only, assume the call types or classes to be conversation (mainly voice), streaming, interactive (web data), and background (email), as listed below. In other embodiments, other call types such as fax, telecommands, etc may be considered as will be appreciated by those ordinarily skilled in the art.
Figure imgf000016_0001
For illustration purposes, bandwidth is presented in the above chart as the required resource. However, as will be discussed in more detail below, the processing load or CPU consumption of a PMCi will be also be used, as it can be the more the scarce type of call processing resource, especially in larger call processing environments. As an example, FIG. 3 depicts a discrete probability mass function (pmf) of one call within a type/class (say, the voice class, here). The horizontal axis represents the resource requirement, here bandwidth expressed as a bit rate. As shown in the FIG., the probability that the resource requirement 300 will be in discreet state 1 (i.e. silence) is 40%, with a characteristically low resource requirement 302 (mi) of 1.95Kbps. Likewise, the probability that the resource requirement 300 will be in discrete state 2 (active conversation) is 60%, with a characteristically higher 12.2Kbps bandwidth requirement 304 (m2). Likewise, the probability.
In the following subsections, we first show how to obtain the pdf of a number of calls in each class and then we obtain the pdf of the total load within a PMC. The mean and variance are calculated as follows: μ= p1.mI+p2.m2 = 8.1 kbps. (4)
And the variance is:
σ = pι.(mι -μ)2 +P2-(m2 -μ)2 25.21. (5) Which yields a standard deviation of σ = 5.02.
The well-known central limit Theorem states that "by summing up a sufficient number of independent stochastic variables, we get a new variable, which has a Gaussian distribution". This approach has been already used for connection admission for satellites as described in Yeong Min Jang's "Central limit approximation approach for connection admission control in broadband satellite systems", IEE Electronics Letters Vol. 36, No. 3, 3rd February 2000 incorporated herein fully by reference. The mean and variance of this Gaussian distribution are easily calculated as follows:
Figure imgf000017_0001
Where μ, and σ , are respectively the mean and variance of one call. As a result, if we keep the example of the previous subsection, by adding N calls, the mean and variance will be: μ= 8.1*N Kbps and σ2 = 25.21*N. For instance, N = 100 gives μ= 8100 Kbps and σ2= 50.21 Kbps.
The mean value calculated here for the total distribution is simply the sum of all the average values. The variance is also the sum of the variances attributed to all of the calls. When, for example, 100 calls of one type together, the mean value gets 100 times bigger; so does the variance; however, the standard deviation, σ, gets only 10 times bigger. This means that the more calls are added up in a PMCi, the bigger is the mean and the standard deviation, but the smaller would be their ratio μ/σ2. A generalized plot of the probability distribution function in accordance with this algorithm is shown in FIG. 4B, with the μ 406 and the σ 408 denoted.
It should be noted that this algorithm is an approximation that becomes more accurate for larger values of N. However, this approximation is believed good enough in providing an estimated load commensurate with the goals of the invention. In fact, a well known method to construct a random variable with Gaussian distribution is to add N (usually 12) random variables with uniform distribution (such as using rand( ) in the "C" programming language).
Alternatively, the exact form of the distribution can be obtained by applying N times convolution of the original pdf with itself. The more accurate calculation is that for this case, the distribution of the sum is a series of impulses at i*m + (N-i) mi , where 0<=i<=N. The probability at each point is C N -P21N"\ as shown in FIG. 4A. The absolute maximum 402 and minimum 404 possible values for x are respectively N*mι, N*m or 1220Kbps to 195 Kbps in the example where N = 100 as described above. Though not implemented in the embodiments disclosed below, there is another point here which applies to the classification of the calls in different classes. Some calls, for some reasons may have different resource usage than others, although they may be in the same class. For instance, a voice call may have "A" times processing power requirement than others, because of the special conditioning or additional processing to be applied to the call, such as silence suppression or ciphering. Therefore, provision may made to count such a call A times more than "an ordinary call".
The next step is to add the variables obtained in each class, to obtain the pdf of the whole resource usage in all of the classes for a given PMCi. Again, the central limit Theorem implies that "the sum of any number of variables with Gaussian distribution, is a new variable with a Gaussian distribution." The mean and variance of the new variable can be simply obtained by summing up the mean and variances of the elements.
As discussed above, the total pdf has a mean and a variance that can be obtained from simple arithmetic. The pair of values (mean, standard deviation) are the only parameters we need when using the Gaussian distribution's error function. In FIG. 4C, which again shows a Gaussian approximation of the probability distribution function, Θ 412 (maximum allowable resource utilization or nominal load) and ζ 414(probability of exceeding the nominal load - the hatched area 412 under the curve410) defined such that: p(x >Θ) = ζ.
For each PMCi, based on the current number of the admitted calls in each class, we can calculate the probability of having the total resource usage exceeding the maximum allowable value set for the resource (say for example, 2 Gbps for bit rate or 99% of CPU usage). In other words, we can determine the probability of having the total resource utilization for a PMCi exceeding the nominal load specified for that PMCi. Alternatively, having a provisioned tolerable probability for exceeding the nominal value, we can determine for each PMCi, whether the value corresponding to this probability is smaller or greater than the maximum allowable resource utilization. A load estimation algorithm according to an embodiment of the invention, based on the statistical analysis and Gaussian distribution discussed above will now be disclosed. Unlike the above, however, the scarce resource is now processing load (in the unit of instruction cycles/sec or any other measure of processor utilization within a call processing unit).
In each one of the N classes (here N=4 for voice conversation, streaming, web data interactive and streaming), assume that there are two modes: active and non- active. The probability of being in each state is also considered to be known (p\, p2, respectively). The ratio pι/p2 is a main characteristic of each class. It should be understood that this is a simplifying assumption to have only two modes (active and non-active) and that in reality, more than two modes might be possible. However, this does not affect the disclosed algorithm as long as one can translate these numbers to form a Gaussian distribution with a known μ, σ2.
The first action is to measure the number of instruction cycles/sec (processing resource utilization) corresponding to each of the two modes: active and non-active for given call class or type. These two parameters are known as mi and m2. Note here that there are however a number of other parameters that affect the number of instruction cycles being used to support the call mode. These parameters may include some or all of the following parameters in an e.g. UMTS call processing environment, including: enum TTI (10, 20, 40, 80 msec.) boolean CRC boolean Ciphering (on or off) enum RLC_mode (TM, AM, UM) boolean PS/CS int No_of_RL ( 1 <= No_of_RL <= 6 but usually 1 , 2, or 3) int No_of_TB int Rate (up to 2 Mbps).
The term PS here refers to Packet Switching and CS refers to Circuit
Switching. The Boolean PS/CS indicates whether a UMTS call is either packet or circuit standard respectively. The CRC Boolean indicates whether Cyclic Redundancy Coding is used with a given UMTS Call. RLC_mode refers to the UMTS radio link mode associated with a call (TM: transparent mode; AM: acknowledged mode; and UM: unacknowledged mode). TTI refers to the Transmission Time interval. The No_of_TB refers to the number of traffic bearers. Finally No_of_RL refers to the number of UMTS radio links available for the UMTS call.
A fast look at these parameters may suggest that the actual number of classes is more than what was assumed above (here N=4). However, despite the fact that based on the actual values of these parameters some of the statistical characteristics of the call varies, some others remain intact. The ratio pι/p2, for instance does not change and, that in most cases, both mi, m2 are assumed to change proportionally. Once more, as will be appreciated by those ordinarily skilled in the art, this is a simplifying hypothesis which may inject some inaccuracy in the modeled probability distribution function. However, as will be discussed below, fuzzy logic may be conveniently implemented to interpret this statistical model such that simplification inaccuracies will not deter from arriving as a correct result.
First, for each class of call (say voice conversation for example) consider a set of given call parameters such as:
Figure imgf000020_0001
Then, based on these parameters, mi is obtained for the active mode and m2 for the non-active mode. Next, a different set of parameters is assumed, which results in a new (m \, m i) pair. By making the assumption that mι/m2 is close enough to m
/«j 2 for load estimation purposes consistent with the present embodiment, classification is greatly simplified. The rational behind this assumption is twofold: first, it appears that most of these parameters influence the complexity in the same way. And second although some inexactitude may be encountered with this assumption, in average it will not be a big problem. Finally, the resulted simplification is worth this small inexactitude. As a result, in accordance with the present embodiment, one can characterize a class with 2 states or modes, each with associated probabilities (pi, p2) and corresponding resource requirements (mi, m ) which can be obtained directly through empirical measurements. Several combinations of the call parameters discussed above are considered and for each set the load is measured. Then an algorithm may be used to fit a function to those parameters. The derived mode probability and resource requirements in turn give the representative pair of μ, σ as described above.
Consider the following table showing a restrained number of the combinations of the call parameters (ciphering Y/N, CRC enabled Y/N) in an UMTS multiple processor call processing system:
Figure imgf000021_0001
Although not presented herein, more columns in this table should be provided which represent other parameters having some influence on the values of μ, σ2, such as the bit rate, or the number of radio links in a wireless system (e.g. 1-6 links in a UMTS system). Additionally, other real world factors may be considered and the resource requirement algorithm made more complex to accommodate such factors. For example, in a wireless call processing system implementation, the number of radio links which has also a probabilistic character may be investigated. In such a case, the variances obtained will be substantially higher for conversation calls, and not necessarily zero for streaming calls. When more extensive parameter combinations are considered, a neural network be may employed to make this function fitting, when more combinations are considered. See e.g. Bernard Widrow et al., "30 Years of Adaptive Neural Networks: Perceptron, Madaline, and Backpropagation" , Proceedings of the IEEE Vol. 78, No. 9, September 1990 which is incoφorated herein fully by reference. The well known error backpropagation algorithm can be used to train the neural network and the resulted neural network will be optimized to have the smallest error for the set of empirical measurements. The generalization characteristic of the neural network acts so that it will have good performance for the sets of parameters out of its training set, as well. In case that one uses a neural network with no hidden layer, this solution simplifies to a linear combination but by having at least one hidden layer and nonlinearity in it, very complex nonlinear functions can also be approximated. A table outlining load estimation based on capacity measurement & Gaussian distribution with ciphering:
Figure imgf000022_0001
Though not considered above, it should be noted that a class of service
("CoS") dependent parameter β can be multiplied to mean and variance for each call, which will reflect the importance of each class, or alternatively, call allocation or retention priority as contemplated within a UMTS communications system. For example, in "gold", "silver" and "bronze" classes of service, we can have β=1.4, 1.2, 1 respectively. This will mean that there will be higher estimated resource requirements and consequently more resource reservation performed for calls in higher classes compared to the calls in lower classes.
In this embodiment, it is envisioned that each PMCi manages a table of its admitted calls including call type or class, μ, σ2 either within the admitted call table or as part of a separate resource knowledge base. Consider the following admitted table with class mean and variance individually specified:
Figure imgf000023_0001
The call type is simply the indicator of one of the N classes. For each class we make a summation over the μ, σ values, namely μ(conversatιonal), μ(streaming), μ(interactive), μ(background). For instance, μ(conversational) = sum of μ's for all conversational calls in this PMC. The variance σ2 of each class is obtained in the same way. Finally, total μ t, σ2 1 for the PMC are obtained by summing the μ, σ2 for all the classes. The values of μ t, σ2 t can be updated once a new call is admitted or an existing call is terminated (or modified) by simply adding or subtracting the corresponding μc, σ2 c. When the call's parameters are changed, new mean and variance are calculated and put in the table but before that the old values are subtracted from the sums. There is no need to rebuild the μ t, σ t values from scratch each time, though such processing is contemplated below with reference to FIG. 7. Thus, the following rules may be established: 1) the initial values of μ, σ2 (for all classes) are zero. 2) Upon the admission of a new call, the values of μ, σ2 are increased. 3) Upon termination of each call, the values of μ, σ2 are decreased. And, 4) upon alteration of each call, the values of μ, σ2 are first decreased by the old value from the table and then increased by its new value.
In this stage, for each class, c, we have the corresponding μ(c), σ2(c). We add these values over all the classes to obtain one μt, σ2 t pair representing the whole load on a PMC.
By calculating μt, σ t, we know all that we need to know about the distribution. Usually we want to calculate the probability, ζ, of exceeding a nominal value of the resource usage, Θ.
Figure imgf000024_0001
where erfc(.) is the complementary error function of a normal distribution (a built-in function call in "C" and also implementable as a look-up table). #include "math.h" double y = erfc((Theta-mean)/(sqrt(2)*stdev))/2;
Where Theta, mean, stdev respectively stand for Θ, μ, σ. This function can be kept in a table, for simplicity. The following table lists the value of error probabilities ζ for the various (Θ-μ)/σ. For example, we see that for having a ζ = 1 %, Θ should be equal to μ+2.33*σ. At μ+5*σ, ζ is practically zero.
Figure imgf000024_0002
Either value can serve as a guideline to decide either to accept or to reject a new call. The use of this estimate, along with some other estimates is the subject of the next section where we explain a fuzzy logic controller for this effect and we show how to use this parameter along with a couple of other ones to determine the eagerness of each PMC to receive a new call. It is also noteworthy that since ζ is an input to a fuzzifier, we can simply directly feed (Θ-μ)/σ to the fuzzifier.
What's our a priori estimation of the total σtt? The intervening parameters are multiple: first the number of calls in each class and second the characteristic that we consider for each class. For instance the more calls of streaming type we have, the narrower is the Gaussian distribution. Below we examine a few examples.
In the following examples, we give some numerical values to the different classes of calls and at the end verify the aggregated traffic.
Example 1 : conversation class. Let's assume that we have only 150 voice calls. μ=pι.mι+p2.m2=2.496MIPS (9)
σ2=p,.(m,-μ)2+p2(m2-μ)2=864.36M, so σ=20.4KIPS (10)
Then μt = 374.4 MIPS and σ2t = 1.29654*1011 and σt = 360KIPS. In other terms, in this scenario, we have a narrow Gaussian distribution, in part because of the big number of voice calls that we consider.
Example 2: streaming class. For streaming, the situation is simpler, because basically we have pi = 1 and p2 = 0. So σ2 = 0 and the problem is not probabilistic anymore. Considering 5 streaming calls with mi = 4.11MIPS, we have μt = 20.55 MIPS and σt = 0.
Example 3: interactive class. Data, however, has usually a more probabilistic manner, as we may experience bigger σ2 values for data. Here, we just create an example for data: pι = 0.1, p2 = 0.9 (11)
mι = 4.11MIPS, m2 = 1.07MIPS (12) μ = pι.mι+p2.m2 = 1.374MIPS (13)
σ = pι.(mι-μ) + p2.( 2-μ)2' _ = 831744MIPS, so σ = 0.912MIPS (14)
If we consider 20 data calls, it comes to μt = 27.48 MIPS and σ2 t = 16.6312. and σt = 4.078 MIPS.
Example 4: aggregated traffic on a PMCi. By adding all the voice, streaming and data calls in the three previous examples, we calculate μt, σt as in the following table. In the table, we give also the mean and standard deviation values for some other numbers of different types of calls.
Figure imgf000026_0001
Consider that the total permitted instruction cycles on the board is Θ= 450 MIPS. This means that the probability of exceeding Θ from the above-mentioned formula is practically zero ζ = 0. In fact, ζ can be reasonably assumed to be zero for μ, = 5σt = 442.9 MIPS. Call Admission Control ("CAC") and Load Balancing
In this section, CAC decision and the load balancing according to several distinct embodiments are explained. The CAC decision is made in two steps: the initial step where the PMC-M make a guess on the best PMCi able to handle the call based on reported eagerness. An early CAC algorithm can be included here for assisting the managing call processing unit make the decision of either to accept or to reject a call, if none of the PMCis are eager enough to accept the new call. In the second step, the chosen PMCi itself reviews if it is really in the state of accepting the call or not. Consistent with these embodiments, fuzzy logic is used in parts of both processing to arrive at CAC and load balancing decisions. The disclosed embodiments all try to even the load on the individual PMCis to reduce risk of overloading and underloading PMCi capacity, as well as enhance efficient use of scarce and expensive processing resources. In addition, several of the disclosed embodiments attempt to homogenize the load among the PMCi's. Homogenizing herein means that the ratio between the effect of different types and service classes be approximately the same in all PMCi's. In other words, if half of the total load (in all PMCi's) in the call processing system is voice, the same ratio is maintained in all PMCi's. By doing so one equalize the chances of exceeding the nominal load in all PMCis. The following arguments explain why a homogenous load is advantageous:
(1) various classes have different types of QoS requirement. For instance some are more sensitive to delay and some are more sensitive to discard. By having a homogenous load, that is assurance that those natural differences can be handled by the PMCi's. In contrast to this, if all delay sensitive calls are in one PMCi, and all non delay sensitive calls in another PMC, there will be little if any flexibility for any special case handling technique to operate successfully ~ for instance, the PMCi having all delay sensitive calls can not handle the situation by delaying some of the calls.
The disclosed models supporting these embodiments are based on a partial knowledge of the system; modeling for some classes is better than for others. By having a homogenous load, one avoids that most of the inexactitude because of the inaccurate modeling fails to gather in a single PMCi and increase estimate inaccuracies disproportionately.
Finally, as different classes of calls have different variances for their pdf s, if one forces an evenly distributed load is enforced but not necessarily load homogeneity, a call processing system may end up in a situation where for some of the PMCi's the probability of exceeding the nominal value is small but for some others, it is not. This would be the case, for instance when most voice calls go to one PMCi; most data to one and most streaming and background each to another PMCi. As streaming calls have insignificant variance, they show a very narrow pdf, while interactive calls show very large variances. Though not disclosed as part of the specific embodiments, other criteria may be also important and should be considered as will become apparent to those ordinarily skilled in the art. For instance, consider the ability to maintain the QoS of existing calls (delay, BER, dropping). This means that there would be a feedback from the overload module to the CAC (and load balancing) functions. If a given PMCi experiences packet discard, or excessive delay (for delay sensitive calls), the PMCi should be marked as to be less eager to accept new calls. This concept introduces some kind of adaptability to the system.
Moreover, consider time of the day/week in which the call is or intended to take place. Also, the time span of a call: a voice call has an average duration of about 90 seconds while some interactive and streaming calls may have much longer durations. This fact may also be considered as an input to the CAC and load balancing decisions.
Consistent with one or more embodiments of the invention and based on the following parameters that are available about each PMCi, an initial guess on CAC and the optimum PMCi is selected based on: 1) the ensemble of calls which yield the pdf ( 22ι J in each PMC; 2) the periodically updated measurement of the PMC loads; and 3) the QoS type and CoS (requirements of the new call).
In the disclosed embodiments, a resource knowledge base (e.g. knowledge base 1310 in PMCi 1300 shown in FIG. 13 or knowledge base 1310 in PMC-M 1800 shown in FIG. 18) is used to retain call class mean and variance information, either for a specific PMCi or the PMC-M. Resource knowledge base development according to an embodiment of the invention will now be detailed with reference to FIG. 5. Assembly of this knowledge base may be performed upon call processing unit installation, upgrade or other change, or on an as-needed basis when the types of supported call classes change.
Referring to FIG. 5, control first passes to step 510, in which the operational modes for the supported call classes of interest are determined. This may constitute all of the supported classes if the knowledge base is being assembled from scratch, or a subset thereof when only certain call class information needs to be corrected or updated. As mentioned above, each supported call class most likely have at least two discrete states of operation, as in the example of "active" and "silence" for the voice call class. Control thereafter passes to step 512, in which resource utilization or each of the operational modes as well as the probability of being in such operational mode is then obtained. Such resource utilization and probabilities information may be obtained through empirical measurement of actual call conditions or predicted using appropriate predictive techniques, as described herein or as is well known in the art. Thereafter, in step 514, the mean and variance for each of the call classes of interest will be calculated using the obtained resource utilization in operational mode probabilities obtained in step 512. Thereafter, in step 516, the knowledge base of supported call classes is updated with the new mean variance information. Knowledge base creation or update processing according to the present embodiment then terminates naturally.
Load estimation performed by either the interested individual PMCi call processing unit (see e.g. load estimator 1330 in PMCi 1300 of FIG. 13 or the load estimator 1430 in PMCi 1400 of FIG. 14) or the master PMC-M call processing unit (e.g. see the load estimator in the resource utilization unit 1850 in the PMC-M 1800 of FIG. 18) according to an embodiment of the invention will now be described with reference to FIG. 6. Here, the estimated load is recalculated every time a call event such as a new call admission, an admitted call termination or an admitted call classification modification event has occurred with respect to an individual call processing unit, whether or not responsive action is taken by that individual call processing unit or the master call processing unit. More particularly, the current collective mean (μt) and variance (σ2 t) values which are used to generate the estimated load measure for a particular PMCi (e.g. PMCp) are selectively updated upon detection of a call event involving such PMCp, by either the PMC-M or the PMCp as well as the particular type of call event.
Turning now to FIG. 6, processing begins at step 602, in which a determination is made whether the detected call event involves a new call admission request. If so, control passes to step 608 in which a "new" estimated load mean and variance for the involved individual call processing unit, e.g. PMCp, is determined by adding the requested "new" call class (θx) load mean μ and variance σ2 values to the current estimated load mean (μt) and variance (σ2 t) values for the entire PMCp as specified in the resource knowledge base corresponding to the PMCp. It should be noted that this is made possible through application of the central limit Theorem discussed above. Thereafter, control passes to step 610 in which the load estimator determines the estimated load measure (e.g. ς(p) or x2(p)) based on the new estimated load mean and variance values calculated in step 608. If, however, in step 602, a determination is made that the detected call event does not specify a new call admission request, control instead passes to 604. In step 604, a determination is made whether the detected call event specifies an admitted call termination request. If so, control passes to step 606, in which the new mean and variance values for the estimated load are calculated by subtracting the class load mean and variance values associated with the terminated call, as contained in the corresponding resource knowledge base, from current estimated load mean and variance values. Thereafter, control passes to step 610, where again the estimated load measure is determined based on the new mean and variance values for the estimated load as calculated in step 606. If, however, in step 604, a determination is made that the detected call event in step 604 does not specify an admitted call termination event, control instead passes to step 612. In step 612, a determination is made whether the detected call event specifies an admitted call class modification request. Call class modifications, such as in-process call upgrades or downgrades, can be specified for an admitted call. Examples include requesting a call type modification (such as from voice to data) for an existing call, a call class modification within a common type (such as from interactive data type call to a streaming data call or requesting a transition in class- specific call parameter (such as the toggling the presence or absence of ciphering in voice calls or changing the bit rate for data calls). If the detected event specifies a call class modification event , control passes to step 614, in which the new estimated load mean and variance values are calculated by first subtracting the existing call class load mean and load variance values as specified in the resource knowledge base for the call currently undergoing modification and subsequently adding the new class mean and load variance values in the resource knowledge base specified for the call class the call of interest is evolving into. Control thereafter passes to step 610 in which again the estimated load measure based on the new estimated load mean and load variance for the entire processing unit is determined. It should be noted that the estimated load measure may comprise any number of values suitable to quantatively represent the estimated load, including the aforementioned probability of exceeding a nominal load ~ς(p)— or a ratio between the nominal load and the estimated load mean, equation x2(p) mentioned above. Alternatively, other expressions of the estimated load measure may be used as would be understood by those ordinarily skilled in the art.
Once the estimated load measure has been determined, control thereafter passes to step 620 in which an estimated load parameter corresponding to, or at least based on, the estimated load measure is issued by the load estimator of interest. The estimated load parameter is used by the PMCp and/or by the managing call processing unit PMC-M to help undertake eagerness determination and ultimately load balancing and call admission control according to the disclosed embodiments. Thereafter event driven load estimation updating terminates naturally.
Load estimation according to another embodiment of the invention will now be detailed with reference to the flowchart of FIG. 7. In this embodiment, current load estimation is calculated based on the call information contained in the admitted call table at a particular time (e.g. time X in the figure). Referring to FIG. 7, control initiates at step 702, in which a distribution of the call classes in the admitted call table (such as table at a given time X is made). This distribution identifies how many calls within a given call class are contained in the admitted call table. This distribution is used to help develop an estimated load mean and load variance value for the entire call processing unit based on the class distribution. Control thereafter proceeds to step 704, in which the estimated load mean (μt) and variance (σ2 t) for the entire call processing unit PMCp is calculated using the class distribution obtained in step 702 along with the class load mean and variance values contained in the resource knowledge base. In particular, this is done by multiplying the number of calls in the class as recorded in the class distribution by its associated class load mean and variance values contained in the resource knowledge base corresponding to the call processing unit of interest. This is done for each class represented in the class distribution calculated in step 702. Thereafter, in step 706, the estimated load measure (e.g. ς(p) or x2(p)) based on the new estimated load mean and new estimated load variance for the call processing unit PMCp is determined. As such, information may be used to generate the eagerness or willingness of the PMCp to accept new calls or call upgrades. Thereafter in step 708 in this embodiment, an estimated load parameter based on the aforementioned load measure is then issued for further processing, including eagerness determination for the PMCp of interest. Thereafter load estimation calculation according to this alternative embodiment terminates.
Eagerness determination processing according to an embodiment of the invention will now be detailed with reference to FIG. 8. In this embodiment, the eagerness φ(p) of a given individual call processing unit PMCp is generated with respect to fuzzy logic analysis of the actual load Xι(p) and estimated load (ς(p) or x2(p)) parameters for the PMCp. In this embodiment, homogeneousness of the PMCp is not taken into consideration. Such eagerness processing may be conveniently undertaken within the PMCp itself, such as within the resource utilization and fuzzy logic units 1320, 1325 of the PMCi 1300 shown in Fig. 13. Alternatively, such processing may be carried out on the behalf of the PMCp by the master call processing unit, such as through the resource utilization and fuzzy logic units 1850, 1860 of the PMC-M 1800 shown in FIG. 18, although in such case input from the weighting engine 1640 would not be utilized, nor would x3(p) be realized by the resource utilization unit 1850.
Turning to FIG. 8, eagerness processing within this embodiment begins at steps 812 and 810 in parallel, in which the actual load parameter xι(p) is generated by the e.g. the load monitor 1325 shown in FIG. 13 (step 812) and the estimated load parameter (ς(p) or x2(p)) is generated by e.g. the load estimator 1330 shown in FIG. 13 pursuant to load estimation described above with reference to FIGs. 6 or 7. Control thereafter passes to steps 814 and 816 in parallel, where these actual and estimated load parameters are each fiizzified into respective fuzzy logic states representative of the conditions they quantify, such as through fuzzifier 1 1342 and fuzzifier 2 1344 respectively.
In particular, the first fuzzifier 1342 acts on the actual load parameter x\(p). For each PMCi(p), xι(p) is compared to Θ (p) which is the nominal load for the PMCp in terms of its information processing resources or CPU MIPS. A value of 0 for Xi means absolutely no load; xi = Θ means fully loaded. Referring to the membership response curve employed by the first fuzzifier 1342 shown in FIG. 19, Θ(p) 1960 = 450 MIPS, and five discrete membership states are specified: vl (very low) 1910 associated with a range between 0 and 150 MIPS, lo (low) 1920 ranging from 50 to 250 MIPS, me (medium) 1930 ranging from 150 to 350 MIPS, hi (high) 1940 ranging from 250 to 450 MIPS, and vh (very high) 1950 ranging from 350 to Θ(p). Note that utilizing measured load values normalized to the nominal load Θ(p) of the PMCp, where xl(p)/Θ(p) ranges from 0 to 1 may be alternatively used with equal results, and may be advantageously implemented in a call processing environment including PMCis of mixed resource capacities (such as differing CPU resources). An efficient use of the PMCp resources dictates the need to keep f\(p) in "hi" region. To simplify fuzzifier design in this embodiment, first fuzzifier 1342, along with the second fuzzifier 1344 and the third fuzzifier 1448 (FIG. 14) are designed such that: 1) at most two fuzzy variables will have non zero values for the same input; and that the sum of the membership functions of all fuzzy variables for each input value would be 1. However, other configurations can be used without departing from the teachings of the present invention.
Thus, referring to FIG. 19, assuming that xι(p) = 400 MIPS , the output fι(p)of the first fuzzifier 1342 would be fi(ρ) = {(vl = 0), (lo = 0), (me = 0), (hi=0.5), (vh=0.5)}.
As shown in FIGs. 13 and 20, the second fuzzifier 1344 acts on the load estimation parameter x2(ρ), or, alternatively, the probability of exceeding the nominal load (ζ) directly, where ζ itself is obtained from the Gaussian distribution error function explained above. To ease design of the second fuzzifier 1344, however, x2(p) may be preferred, wherein x2(p) = (Θ - μt)/σt, where σt = standard deviation of μt, or |σ2 t)|1/2. Similar to FIG. 19, FIG. 20 illustrates the membership response curve for the second fuzzifier 1344, which again defines five discrete membership states or fuzzy result values vh 2010, hi 2020, me 2030, lo 2040, and vl 2050. Here, if assuming that μt = 300 MIPS, σt = 100 MIPS and Θ = 450 MIPS, x2(p) = 1.5, and f2(p)= {(vh=0.5), (h=0.5), (me=0), (lo=0), (vl=0)}. Alternatively, the probability ζ in this instance = 6.68%. In another example, if ζ(p) is very small (approaching 0), f2(p)is "vl" . In yet another example, if x2(p) = 4.5, ζ(p) =3.39767*10"° and then f2(p) is both "lo" and "vl" with a membership function of 0.5. Returning to FIG. 8, once fuzzification steps 814 and 816 have been carried out, control passes to step 820, wherein an inference mechanism such as a rules base, part of the PMCi load balancing fuzzy logic 1346 in FIG. 13 or the fuzzy logic unit 1860 of the PMC-M 1800 of FIG. 18, applies a finite series of rules against the possible combinations of the fi(p) and f2(p) in order to arrive at a rules result. E.g., if rule #0: if (f, = vl) & (f2 = vl) then eagerness0 = l ...rule #R-1: if (f, = vh) & (f = vh) then eagernessR-i = 0, where eagerness0, ..., eagernessR.i are the rules results from the application of each individual rule. The final eagerness φ (p) is a weighted average of all rules. In this embodiment, though not required, for each combination of fι(p) and f2(p), there is one rule specified. FIG. 24 illustrates, in the form of a lookup table, rules results based on the possible combinations of fι(p) and f2(p). Here, an individual rules result approaching 1 indicates that the PMCi is very eager to accept a new call or call upgrade, and an individual rules result approaching 0 indicates that the PMCi of interest is not eager at all to accept a new call or call upgrade. Dashed line 2410 represents the threshold to the right of which the individual PMCi(ρ) decides to reject the call admission or upgrade. The rules are determined with a "common sense" logic, for each different case.
Still referring to the flowchart of FIG. 8, once the rules results are obtained in step 820, these rules results are "deffuzified" to obtain the eagerness of accepting a new call or a call modification upgrade φ(p) , which again may be conveniently carried out by the fuzzy logic units 1346, 1860. In particular, an averaging algorithm called "centroid defuzzification" may be utilized (also known as "center of gravity" defuzzification). The formula for the defuzzification is as follows:
Figure imgf000034_0001
where b, is the center of the membership function recommended by the consequent of rule i; γ(i) is the certainty of rule i; R is number of rules.
γ( =ή 7=1 /ω (16) where f,(j) is the membership (fiizzified value) of the input j to the condition of rule i and L is the number of conditions to each rule (the same as the number of inputs of the fuzzy logic; 2 in our case). Because of our assumption that the sum of memberships for any given input is equal to one, this formula simplifies as:
Figure imgf000035_0001
As a result, we can simply calculate the center of gravity of all the rules:
R-l J activation(i) x eagerness,
Φ(P) = i≡Q (18)
R-l
Σ activation(i) i=0 Where activation(i) denotes the product of the membership functions of the inputs of the rule i. For instance, activation(O) = (f, = vl)*(f2 = vl)
activation(R-l) = (fi = vh)*(f2 = vh)
In either case, the final output is also a fuzzy variable, as a result, the eagerness has always a value between 0 and 1.
Once φ(p) is obtained, eagerness determination processing according to the embodiment of FIG. 8 terminates. To illustrate further, consider the following example:
PMCp, where Θ = 450MIPS, μ, = 434.85 MIPS, σt 2 = 56.11 MIPS measured PMCp CPU load at time t = xl(p) = 270 MIPS x2(p) at time t = (Θ-μt)/σt = 2.02 referring to FIG. 19, vector fι(p) = {0,0,0.6,0.4,0} referring to FIG. 20, vector f2(p) = {0,0, 0.022, 0.978, 0} applying rule combinations of FIG. 24, the following activation results are implicated (i.e.nonzero result): (fι=rne)*(f2=me) = 0.6* 0.022 =0.013 (fι=me)*(f2=hi) = 0.6*0.978 = 0.587 (fι=hi)*(f2=me) = 0.4*0.022 = 0.009
(fι=hi)*(f2=hi) = 0.4*0.978 = 0.391 applying equation (18), φ(p) = 0.013*(0.5) + 0.587*(0.3)+ 0.009*(0.3) + 0.391*(0.2) = 0.26 (19) 0.013 + 0.587 + 0.009 + 0.391
It should be noted that in the embodiment of FIG. 8, steps 810 and 812, and steps 814 and 816 are shown executing respectively in parallel. However, the teachings of the invention are not intended to be so limited, and in fact nonparallel execution of these steps can occur as long as fl(p) and f2(p) can be obtained such that rules can be applied to their combination as described above without either becoming stale.
Eagerness determination processing according to an alternative embodiment of the invention will now be detailed with reference to FIG. 9. In this embodiment, the homogeneousness of the given individual call processing unit PMCp load is ascertained along with fuzzy logic analysis of the actual load and estimated load parameters for the PMCp as described above with reference to FIG. 8. Eagerness determination according to this embodiment may be conveniently performed by the resource utilization 1320 and fuzzy logic unit 1340 of the PMCi 1300 shown in FIG. 13 in combination with the homogeneousness realization 1650 and eagerness determination unit 1660 of the PMC-M 1600 shown in FIG. 16, although other configurations may be utilized, as will be recognized by those of ordinary skill in the art.
In comparing the embodiment shown in FIG. 9 to the embodiment described above with reference to FIG. 8, the following differences are noted. First, homogeneousness or the related parameter x3(p) in step 918 of FIG. 9 is calculated, here in parallel with fuzzification of the actual and estimated loads (steps 814 and
816). Sequencing this after step 810 is important since homogeneousness or x3(p) is dependent in part on the new estimated load mean μt calculated in step 810 as a precursor to obtaining either x2(ρ) or ς(p). However, step 918 need not occur in parallel with either step 814 or 816 as shown in FIG. 9, and can occur at any point in time after the new estimated load mean is determined (such as by the load estimator 1330 shown in FIG.13) and before the new eagerness value for the PMCp is determined (step 924).
The homogeneousness parameter, x (p), is determined in accordance with the following equation: balance target for the new or upgraded call class minus the (new or upgraded call class mean/new estimated load mean for the PMCp), or ω(θx)- μ(θx)/μt. The balance target is the ratio of the number of calls in class θx to the total number of calls in the call processing system managed by the master call processing unit, such as PMC-M 1600 shown in FIG. 16. It should be noted that in order to obtain a valid balance target ω(θx), the PMC-M should have access to the admitted call tables for every PMCi it services (including the PMCp), if not a copy locally accessible to it, such as admitted call tables 1634 contained in local memory 1630. As shown in FIG. 16, a weighting engine 1640 forming part of the PMC-M may be utilized to maintain and update balance targets ω for all supported call classes and current state of all PMCl ...PMCk admitted call tables. When needed for homogeneousness parameter determination or otherwise, particular balance target information for a call class of interest, such as one for a new call or an upgraded existing call, may be issued by the weighting engine to a homogeneousness realization unit (such as unit 1650 within PMC-M 1600 or unit 1435 forming part of the resource utilization unit 1420 for the PMCi 1400 shown in FIG. 14.
Ideally, x3(p) should approach 0, so that the ratio of calls of class θx to the total number of calls being handled by the PMCp matches the overall ratios experienced by the entire call processing system, from which ω(θx) is derived. If x3(p) < 0, this means that the PMCp has more than the average number of admitted calls of class θx, and admission of the new or upgraded call of class θx should be rejected or at least disadvantaged. Conversely, if x (p) > 0, this means that the PMCp has less than the average number of admitted calls of class θx, and the admission of the new or upgraded call of class θx should be encouraged. Determination of homogeneousness as specified in step 918 may be conveniently implemented by a homogeneousness realization unit adapted to calculate x3(p) as discussed above. This homogeneousness realization unit may be situated onboard the PMCp for which eagerness is being determined, as is best shown in FIG. 14 through homogeneousness realization unit 1435 accepting the new estimated load mean and new or upgraded call class mean (contained in the resource knowledge base 1310) from the load estimator 1430 local to such PMCp). In such case, the ω(θx) is sent from the PMC-M including the aforementioned weighting engine, such as PMC- M 1800, to the PMCp. In the alternative, as shown in FIG. 16, the homogeneous realization unit may be situated locally within the PMC-M, such as that shown in FIG. 16, where the PMCp sends the new estimated load mean and new or upgraded call class mean to the PMC-M to permit such realization to occur.
As for further differences with the embodiment of FIG. 8, eagerness determination processing according to the embodiment of FIG. 9 also includes obtaining an intermediate eagerness φ (p) in step 922, followed by final φ(p) which takes into account φ (p) and the aforementioned x (p) calculated in step 918. In this embodiment, a simple linear function of φ (p) and x3(p) is used to obtain φ(p), such as: φ(p) = A* φ (p) + (l-A)*x3(p) (20) where A = positive constant between 0 and 1 , e.g. A= 0.9
Eagerness determination for individual call processing unit according to yet a further alternative embodiment will now be detailed with reference to FIG. 10. Processing according to this embodiment differs from processes previously described with reference to FIGs. 8 and 9 in that homogeneousness is also fiizzified (step 1010) with respect to a 3 membership state (H 2110 "high" to indicate that the ratio of calls of class θx being handled by the PMCp exceeds the θx balance target average for the entire call processing system, M 2120 "medium" to indicate that this ratio within the PMCp is approaching the balance target, L 2130 "low" means that the PMCp is handling fewer calls of class θx than average for the entire call processing system) fuzzification response curve shown in FIG. 21. Once fuzzification of all three parameters has been performed, including the actual load parameter xι(p) (step 814), the estimated load parameter x2(p) (step 816) and the homogeneousness parameter x3(p) (step 1010), the rules base is applied to all three fuzzified parameters in step 1020. The following table illustrates such a rules base, if fl(p) and f2(p) are simplified to 3 member states each as well:
Figure imgf000039_0001
Thereafter in step 1022 the rules result to fuzzify to directly obtain the new eagerness value φ (p) for the individual call processing unit (step 1022). Thereafter, eagerness determination processing according to the embodiment of FIG. 10 ends. It should be noted that the individual call processing unit arrangement shown in FIG. 14 may conveniently implement the processing described above with reference to FIG. 10. Alternatively, such processing may occur within the managing call processing unit such as that shown in FIG. 18 on behalf of the given individual call processing unit, assuming that the actual load for that call processing unit is made accessible to the PMC-M, such as through realization and transmission of the actual load xι(p) parameter by the individual PMCp to the PMC-M by the load monitor 1325 depicted in the PMCi 1700 shown in FIG. 17.
Call admission processing according to an embodiment of the invention will now be detailed with respect to the flowchart of FIGs. 11 and 12. In particular, early call admission control or early CAC is handled by the managing call processing unit PMC-M for the entire call processing system based on current eagerness φ values for each of the individual call processing units PMCi constituting the call processing system and is shown and described with reference to FIG. 12. FIG. 11 illustrates call event, including call admission processing undertaken by a given one of the individual call processing units , including when the PMC-M identifies a potential PMCi for call admission based on early CAC processing. Turning first to the flowchart of Fig. 12, early CAC processing within the PMC-M begins at step 1210, in which upon detection of a call admission request, the PMC-M control logic, such the PMC-M management unit 1510 (FIG. 15), 1610 (FIG. 16) or 1810 (FIG. 18) queries the current eagerness values φι...φk corresponding to each of the individual call processing units PMCi . It should be noted that in this embodiment, it is the responsibility of each PMCi to update it's eagerness value as is appropriate to adequately apprise the PMC-M of it's willingness to take on and manage a new call. However, in other embodiments consistent with the present invention, the PMC-M may prompt each PMCi for this eagerness information as needed or periodically as will become apparent to those ordinarily skilled in the art. Once these eagerness values are obtained, control thereafter passes to step
1212, in which the maximum relative eagerness value is determined based on the all the eagerness values (a.k.a. eagerness vector Φ) obtained in step 1210. Thereafter in step 1214, a determination is made whether the maximum eagerness value exceeds a threshold. This threshold can be a uniform threshold for any call or can be based on the type of call, its associated cost of service such as bronze, silver or gold, and or based on other factors such as originator status, intended recipient status, etc. For example, consider the threshold malleability chart of FIG. 22. Here, a bronze CoS call will not be admitted if the reported eagerness for any of the PMCis fails to exceed εβ, thus in eagerness situations 2222, 2226 and 2228 the call is rejected, but accepted in situation 2220. Looking at the chart in another way, if situations 2220, 2222, 2226, and 2228 graphically represent the reported eagerness vector Φ:φl ...φ4 at a given time, the only PMCi corresponding to reported eagerness 2220 will be available to confirm acceptance of the bronze class call.
Likewise, a silver CoS call will not be admitted if the reported eagerness for any of the PMCis fails to exceed εs, so in situations 2236 and 2238 the silver class call is rejected, and in situations 2230 and 2232 it is accepted. And, a gold class call will only fail to be accepted where the reported eagerness for each of the PMCis fails to exceed the gold threshold εg, such as in situation 2248.
Turning back to FIG. 12, if it is determined in step 1214 that this maximum eagerness value does exceed the specified threshold, control passes to step 1218. At step 1218, the call admission request under scrutiny is transferred to the PMCi exhibiting the maximum eagerness value. In particular, the eagerness value for such call processing unit is recalculated taking into account the call class specified by the call in the new call admission request. Eagerness and load estimate determination as described herein may conveniently be used to confirm the new eagerness value. Thus, the decision to ultimately admit the call pursuant to the call request rests not with the PMC-M here, but by the PMCi exhibiting the maximum eagerness to accept a new call. Control thereafter passes to step 1224.
At step 1224, the corresponding individual call processing unit's reported eagerness is disadvantaged such as by scaling its corresponding φ by a factor of 0.8 - 0.9 for at least one iteration. Control thereafter ends. It should be noted that this processing will restart with the disadvantaged φ replacing the previous maximum φ.
In such way, a high probability that another PMCi will be selected, and it's eagerness will be self-confirmed, assuming that it's reported eagerness continues to exceed the threshold as performed in step 1214. As one can see, this process continues iteratively until the first of: (1) the maximum queried eagerness value fails to exceed a specified threshold; (2) eagerness to accept the call cannot be confirmed by any
PMCi; or the CA request is withdrawn.
Though not shown in FIG. 12, alternative ways may be used to disadvantage unconfirmed PMCis, such as through removing them from further consideration for the current CA request, or for a longer duration as will be understood by those ordinarily skilled in the art.
Call event processing according to an embodiment of the invention undertaken by a PMCi such as the PMCi 1300 shown in FIG. 13 or the PMCi 1400 shown in FIG.
14 is now detailed with reference to the flowchart of FIG. 11. Processing begins at step 1110 upon receipt of a call event directed to a given one of the PMCis either internally from calls already admitted or by the managing call processing unit responding to a call admission request, as outlined above with reference to PMC-M processing described with reference to FIG. 12. If at step 1110, a determination is made that the call event includes a call admission request, control passes to step 1112. At step 1112, the PMCi undertakes determination of the estimated load which includes the new call, such through processing described herein with reference to FIGs. 6 and 7. Thereafter, in step 1114, an intermediate or final eagerness value for the PMCi based on this newly obtained estimated load in step 1112 is determined using e.g. eagerness determination processing described herein with reference to FIGs. 8, 9 or 10, based on the configuration and capabilities of the PMCi. At step 1116, the intermediate or final eagerness determined in step 1114 is compared against a threshold (similar if not the same as the threshold used by the PMC-M in step 1214 of FIGs. 12 and 22). If the new eagerness(which as noted above takes into account the new call) fails to exceed this threshold, control passes to step 1118 in which the new call is rejected by the current PMCi and previous estimated load values and eagerness values are restored to reflect the situation prior to consideration of the new call. Control thereafter terminates.
If however, in step 1116, it is determined that the new eagerness does in fact exceed the threshold, control instead passes to step 1122. At step 1122, the PMCi admits the call. Control then passes to step 1124, in which a determination is made whether the PMC-M should be apprised of the newly calculated eagerness value. There is a design goal in the present embodiment to reduce extraneous traffic between the PMC-M and the PMCis it manages, and one way to help achieve this goal is to reduce notification of reported eagerness changes when such changes are relatively small, particularly where the overall system is relatively insensitive to slight alterations in PMCi eagerness. Thus, in this embodiment, a determination is made in step 1124 whether the new eagerness value constitutes a big change from the previous eagerness value reported to the PMC-M. If so, control passes to step 1126 and the PMCi reports the new eagerness to the PMC-M and call admission event processing ends. However, if in step 1124 it is determined that the new eagerness does not represent a significant change from the previously reported eagerness for the current PMCi, call event processing ends without further reporting.
In determining whether the new eagerness constitutes a big change, a content- based messaging conservation algorithm may be used as depicted in FIG. 23. In this figure, if the new and previous eagerness both fall within region 2310(i.e. new and old φ(p) <0.25), the eagerness is determined not to be sensitive since the PMC-M will not consider the current PMCi for call admission anyway, as it fails to exceed the minimum threshold discussed above. Likewise, if the new eagerness and old eagerness both fall within region 2330 (i.e. new and old φ(p) >0.75), the current PMCi is deemed eager to admit calls anyway and so is not sensitive to the change. However, if the new eagerness or old eagerness falls in region 2320 (i.e. 0.25<new φ(p)or old φ(p) < 0.75) , the eagerness is deemed sensitive to the change and so the new eagerness is reported to the PMC-M. In other embodiments, other techniques may be used alternatively or in combination, such as thresholding the change in eagerness between old and new, reporting only after so many determinations, and the like.
Returning to FIG. 11, if the call event is not in fact a call admission request from the PMC-M, in accordance with FIG. 11 control passes to step 1130, in which a determination is made whether the call event for the current PMCi includes an admitted call class modification request. If so, control passes to step 1132, in which a further determination is made whether the call modification request specifies an upgrade or downgrade from a critical resource utilization perspective. If the modification request constitutes such an upgrade, control passes to step 1112 so that the current PMCi can self-determine whether it has sufficient resources to handle call upgrade using the aforementioned eagerness analysis detailed in steps 1112 to 1126 previously discussed. In this instance, however, the new estimated load is calculated with respect to a difference between the new and prior class characteristics for the call in which modification is requested. If, however, in step 1132, it is determined that the call modification request specifies a call downgrade, control instead passes to step 1134 in which the new estimated load is determined based on the downgrade. Then, in step 1136, the new intermediate or final (depending on PMCi resource utilization capabilities) eagerness is re-determined taking into account the new estimated load obtained in step 1134. Thereafter, processing continues with the conditional publication or reporting steps 1124 and 1126 detailed above. If, in step 1130, a determination is made that the intercepted call event does not comprise either a call admission request or an admitted call modification request, control passes to step 1140. At step 1140, a determination is made whether the call event includes an admitted call termination event. If so, control passes to step 1134 through 1126 detailed above, with the exception that the estimated load is recalculated without consideration of the terminated call. If, however, in step 1140, it is determined that the call event is not one of a call admission request, a call modification request, or a call termination request, the call event falls through to conventional call management processing (not shown in the FIG.), or in the alternative, is not recognized or acted upon at all by the PMCi of interest.
Turning briefly to FIG. 13, FIG. 13 depicts an individual call processing unit PMCi 1300 arrangement which includes a resource utilization unit 1320 and fuzzy logic unit 1340 capable of determining intermediate or final eagerness values based on actual and estimated load parameters. As such, the PMCi 1300 may conveniently implement estimated load processing discussed above with reference to FIGs. 6 and 7, and eagerness determination according to embodiments shown in FIGs. 8 and 9. In the case of eagerness determination consistent with FIG. 9, the PMCi 1300 coordinates with a PMC-M capable of determining a final eagerness including homogeneousness realization, such as PMC-M 1600 shown in FIG. 16. In the case of eagerness determination consistent with the embodiment of FIG. 8, the PMCi 1300 self determines a final eagerness which may then be reported to a PMC-M such as PMC-M 1500 shown in FIG. 15.
FIG. 14 illustrates an alternative PMCi 1400 arrangement which includes onboard homogeneousness realization and fuzzification consistent with the present invention. As such, the PMCi 1400 may conveniently implement eagerness determination consistent with the embodiment shown in FIG. 10, and may coordinate results with any PMC-M including a mechanism for generating and managing balance targets, such as PMC-M 1600 (FIG. 16) or PMC-M 1800 (FIG. 18).
FIG. 17 illustrates yet another alternative PMCi 1700 arrangement in which the PMCi 1700 does not include any fuzzy logic for load balancing or call admission control but does include a load monitor 1325 capable of obtaining xl(p) as described above. It is contemplated that in this embodiment, such load balancing and call admission control functionality will be undertaken by the managing call processing unit, such as the PMC-M 1800 shown in FIG. 18.
It will be obvious to those having skill in the art that many changes may be made to the details of the above-described embodiments of this invention without departing from the underlying principles thereof. For example, the processing described above may be conveniently implemented by one or more general (e.g. a microprocessor or microcontroller) or specific purpose information processors (e.g. a network processor or DSP) programmed to undertake one or more of the involved processing steps. In the alternative, or in combination such information processing resources, one or more steps of the above-described processing may be undertaken by a discrete logic and/or circuitry, including application specific integrated circuits and/or analogous devices. The scope of the present invention should, therefore, be determined only by the following claims.

Claims

What is claimed is:
1. A method for estimating a load of a call processing unit at a selected time, the call processing unit capable of handling a plurality of call classes, each of the call classes defining a class load mean and a class load variance, the method comprising: determining a distribution of call classes based on admitted calls handled by the call processing unit at the selected time; calculating an estimated load mean and an estimated load variance for the call processing unit based on the distribution and the class load mean and variance for at least one of the call classes specified in the distribution; and deriving an estimated load measure from at least one of the estimated load mean and the estimated load variance.
2. The method of Claim 1, wherein the estimated load measure relates to a probability of exceeding a nominal load for the call processing unit.
3. The method of Claim 2, wherein the estimated load measure comprises one of the probability of exceeding the nominal load, and a ratio of the estimated load variance to a difference between the nominal load and the estimated load mean.
4. The method of Claim 1, wherein the estimated load measure represents a utilization value for a resource of the call processing unit.
5. The method of Claim 4, wherein the resource comprises one of bandwidth and processing load for the call processing unit.
6. The method of Claim 1, wherein the class load mean and class load variance for a given one of the call classes presented in the distribution are derived from a probabilistic approximation for the given call class.
7. The method of Claim 6, wherein the probabilistic approximation comprises a Gaussian distribution.
8. The method of Claim 1 , further comprising: detecting a call event; and selectively updating the estimated load measure based on the call event.
9. The method of Claim 8, wherein the call event is selected from the group consisting essentially of a call termination, a call modification, and a new call admission.
10. A computer program product, comprising computer readable program code causing an information processor within a call processing unit to perform at least one of the following, comprising: determining a distribution of call classes capable of being supported by the call processing unit based on admitted calls handled by the call processing unit; calculating an estimated load mean and a load variance based on the distribution, and a class load mean and a class load variance for at least one of the call classes specified in the distribution; and deriving an estimated load measure from at least one of the estimated load mean and the load variance.
11. A call processing unit, comprising: a memory defining an admitted call table and a knowledge base of supported call classes, each supported call class defining a class load mean and a class load variance; and an estimated load determination unit, comprising: first logic coupled to said memory, said first logic to determine a distribution of the supported call classes based on said admitted call table; second logic coupled to said memory and said first logic to calculate an estimated load mean and a load variance based on the distribution and the class load mean and the class load variance for at least one of the supported call classes specified in the distribution; and third logic coupled to said second logic to derive an estimated load measure from at least one of the estimated load mean and the load variance.
12. A call processing unit, comprising: memory means, including means for defining an admitted call table and a knowledge base of supported call classes; and estimated load determination means comprising: means for determining a distribution of the supported classes of calls based on said admitted call table; means for calculating an estimated load mean and a load variance based on the distribution, and a class load mean and a class load variance specified in said knowledge base; and means for deriving an estimated load measure from at least one of the estimated load mean and the load variance.
13. A method for estimating a load of a call processing unit upon call event detection, the call processing unit defining a current estimated load mean and variance and being capable of handling a plurality of call classes, each of the call classes defining a class load mean and a class load variance, the method comprising: detecting a call event, the call event specifying a call of a given call class; calculating a new estimated load mean based on the current estimated load mean and the class load mean for the given call class; calculating a new estimated load variance based on the current estimated load variance and the class load variance for the given call class; and deriving an estimated load measure from at least one of the new estimated load mean and the new estimated load variance.
14. The method of Claim 13, wherein the call event is selected from the group consisting essentially of a call termination, a call modification, and a new call admission event.
15. The method of Claim 13, wherein the estimated load measure relates to a probability of exceeding a nominal load for the call processing unit.
16. The method of Claim 15, wherein the estimated load measure comprises the probability of exceeding the nominal load.
17. The method of Claim 15, wherein the estimated load measure comprises a ratio of the new estimated load variance to a difference between the nominal load and the new estimated load mean.
18. The method of Claim 13, wherein the estimated load measure represents a utilization value for a resource of the call processing unit.
19. The method of Claim 18, wherein the resource comprises one of bandwidth and processing load for the call processing unit.
20. The method of Claim 13, wherein the class load mean and class load variance for a given one of the call classes presented in the distribution are derived from a probabilistic approximation for the given call class.
21. The method of Claim 20, wherein the probabilistic approximation comprises a Gaussian distribution.
22. A computer program product comprising computer readable program code causing an information processor within a call processing unit to perform at least one of the following, comprising: detecting a call event, the call event specifying a call of a given call class, the given call class defining a class load mean and a class load variance; calculating a new estimated load mean based on a current estimated load mean for the call processing unit and the class load mean for the given call class; calculating a new estimated load variance based on a current estimated load variance for the call processing unit and the class load variance for the given call class; and deriving an estimated load measure from at least one of the new estimated load mean and the new estimated load variance.
23. A call processing unit, comprising: a memory defining a current estimated load mean, a current estimated load variance and a knowledge base of supported call classes, each supported call class defining a class load mean and a class load variance; a detector to detect a call event, the call event specifying a call of a given one of the supported call classes; and an estimated load determination unit, comprising: first logic coupled to said memory and said detector to calculate a new estimated load mean based on the current estimated load mean and the class load mean for the given one of the supported call classes; second logic coupled to said memory and said detector to calculate a new estimated load variance based on the cuπent estimated load variance and the class load variance for the given one of the supported call classes; and third logic coupled to said first and second logic to derive an estimated load measure from at least one of the new estimated load mean and the new load variance.
24. A call processing unit, comprising: memory means defining a cuπent estimated load mean, a cuπent estimated load variance and a knowledge base of supported call classes, each supported call class defining a class load mean and a class load variance; means for detecting a call event, the call event specifying a call of a given one of the supported call classes; and means for determining an estimated load, comprising: means for calculating a new estimated load mean based on the cuπent estimated load mean and the class load mean for the given one of the supported call classes; means for calculating a new estimated load variance based on the cuπent estimated load variance and the class load variance for the given one of the supported call classes; and means for deriving an estimated load measure from at least one of the new estimated load mean and the new load variance.
25. In a system comprising plural call processing units for handling a plurality of call classes, a method for admitting calls to the system, comprising:
(A) monitoring a call admission eagerness for each of the call processing units; (B) perceiving a call admission request; and
(C) when the call admission request has been perceived, performing the following:
(1) selecting one of the call processing units having a relative maximum call admission eagerness as a target call processing unit; (2) having the target call processing unit confirm the call admission eagerness with respect to the call admission request; and
(3) having the target call processing unit admit a call specified by the call admission request if the call admission eagerness is confirmed.
26. The method of Claim 25, wherein said selecting (CI) comprises selecting one of the call processing units having a relative maximum call admission eagerness as the target call processing unit if the relative maximum call admission eagerness exceeds a predetermined threshold.
27. The method of Claim 26, wherein the predetermined threshold is selected in accordance with the call specified by the call admission request.
28. The method of Claim 25, further comprising:
(C4) having the target call processing unit reject the call admission request if the call admission eagerness cannot be confirmed;
(C5) selecting another one of the call processing units having a relative maximum call admission eagerness other than the target call processing unit as a second target call processing unit;
(C6) having the second target call processing unit confirm the call admission eagerness with respect to the call admission request; and (C7) having the second target call processing unit admit the call specified by the call admission request if the call admission eagerness for the second target call processing unit is confirmed.
29. The method of Claim 25, further comprising:
(C8) recursively performing C4 through C7 in sequence until one of: call admission eagerness for a given target call processing unit is confirmed with respect to the call admission request; and the relative maximum call admission eagerness fails to exceed a predetermined threshold.
30. A computer program product, comprising computer readable program code causing an information processor within a call processing unit to perform at least one of the following, comprising: (A) monitoring a call admission eagerness for each of the call processing units
(B) perceiving a call admission request; and
(C) when the call admission request has been perceived, performing the following: (1) selecting one of the call processing units having a relative maximum call admission eagemess as a target call processing unit;
(2) having the target call processing unit confirm the call admission eagerness with respect to the call admission request; and
(3) the target call processing unit admitting a call specified by the call admission request if the call admission eagerness is confirmed.
31. A call processing system, comprising: plural call processing units for handling a plurality of call classes; an interface communicatively coupled to said plural call processing units; first logic communicatively coupled to said interface to monitor a call admission eagerness for each of the call processing units; second logic to perceive a call admission request; and third logic coupled to said interface and said first and second logic to select one of said call processing units having a relative maximum call admission eagerness as a target call processing unit, have the target call processing unit confirm the call admission eagerness with respect to the call admission request, and have the target call processing unit admit a call specified by the call admission request if the call admission eagerness is confirmed.
32. A call processing system, comprising: means for handling a plurality of call classes; means for interfacing to said handling means; means for monitoring a call admission eagerness for each of said handling means; means for perceiving a call admission request; and means for selecting one of said call processing units having a relative maximum call admission eagerness as a target call processing unit, said selecting means including means for having the target call processing unit confirm the call admission eagerness with respect to the call admission request, said selecting means further including means for having the target call processing unit admit a call specified by the call admission request if the call admission eagerness is confirmed
PCT/IB2003/002535 2002-06-28 2003-06-27 Method and apparatus for load estimation and call admission control in a call processing environment WO2004004249A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2003244911A AU2003244911A1 (en) 2002-06-28 2003-06-27 Method and apparatus for load estimation and call admission control in a call processing environment

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US10/186,877 2002-06-28
US10/187,089 US20040005041A1 (en) 2002-06-28 2002-06-28 Method and apparatus for load estimation in a call processing environment
US10/186,877 US7369490B2 (en) 2002-06-28 2002-06-28 Method and apparatus for call event processing in a multiple processor call processing system
US10/187,089 2002-06-28

Publications (1)

Publication Number Publication Date
WO2004004249A1 true WO2004004249A1 (en) 2004-01-08

Family

ID=30002687

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2003/002535 WO2004004249A1 (en) 2002-06-28 2003-06-27 Method and apparatus for load estimation and call admission control in a call processing environment

Country Status (2)

Country Link
AU (1) AU2003244911A1 (en)
WO (1) WO2004004249A1 (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007076253A2 (en) * 2005-12-23 2007-07-05 Avaya Technology Llc Call admission control for mobility-capable telecommunications terminals
US7848238B1 (en) 2007-05-09 2010-12-07 Sprint Spectrum L.P. Using VoIP-quality metrics to dynamically adjust the EV-DO reverse activity bit
US8040803B1 (en) 2009-01-08 2011-10-18 Sprint Spectrum L.P. Using packet-transport metrics for call-admission control
US8107438B1 (en) 2008-06-18 2012-01-31 Sprint Spectrum L.P. Method for initiating handoff of a wireless access terminal based on the reverse activity bit
US8204000B1 (en) 2009-07-23 2012-06-19 Sprint Spectrum L.P. Achieving quality of service (QoS) by using the reverse activity bit (RAB) in creation of neighbor lists for selected access terminals
US8245088B1 (en) 2009-06-30 2012-08-14 Sprint Spectrum L.P. Implementing quality of service (QoS) by using hybrid ARQ (HARQ) response for triggering the EV-DO reverse activity bit (RAB)
US8254930B1 (en) 2009-02-18 2012-08-28 Sprint Spectrum L.P. Method and system for changing a media session codec before handoff in a wireless network
US8310929B1 (en) 2009-06-04 2012-11-13 Sprint Spectrum L.P. Method and system for controlling data rates based on backhaul capacity
US8363564B1 (en) 2010-03-25 2013-01-29 Sprint Spectrum L.P. EVDO coverage modification based on backhaul capacity
US8472952B1 (en) 2010-11-30 2013-06-25 Sprint Spectrum L.P. Discovering a frequency of a wireless access point
US8619674B1 (en) 2010-11-30 2013-12-31 Sprint Spectrum L.P. Delivery of wireless access point information
US8644176B1 (en) 2010-03-11 2014-02-04 Sprint Spectrum L.P. Methods and systems for supporting enhanced non-real-time services for real-time applications
US8693499B2 (en) 2010-08-17 2014-04-08 Microsoft Corporation Dynamic adjustment of bandwidth allocation for an in-progress media session
US9374306B1 (en) 2009-03-04 2016-06-21 Sprint Spectrum L.P. Using packet-transport metrics for setting DRCLocks
US9467938B1 (en) 2009-04-29 2016-10-11 Sprint Spectrum L.P. Using DRCLocks for conducting call admission control

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6084955A (en) * 1994-04-13 2000-07-04 British Telecommunications Plc Communication network control method and apparatus

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6084955A (en) * 1994-04-13 2000-07-04 British Telecommunications Plc Communication network control method and apparatus

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
HELLENDOORN ET AL.: "Fuzzy Traffic Management for Modern Telecommunications", INT. J. UNCERTAIN. FUZZINESS KNOWL.-BASED SYST., vol. 6, no. 2, April 1998 (1998-04-01), Singapore, pages 189 - 199, XP009021248 *
HON-WAI CHU ET AL: "CALL ADMISSION CONTROL OF TELECONFERENCE VBR VIDEO TRAFFIC IN ATM NETWORKS", COMMUNICATIONS - GATEWAY TO GLOBALIZATION. PROCEEDINGS OF THE CONFERENCE ON COMMUNICATIONS. SEATTLE, JUNE 18 - 22, 1995, PROCEEDINGS OF THE CONFERENCE ON COMMUNICATIONS (ICC), NEW YORK, IEEE, US, vol. 2, 18 June 1995 (1995-06-18), pages 847 - 851, XP000533122, ISBN: 0-7803-2487-0 *
QIANG REN: "A real-time dynamic connection admission controller based on traffic modeling, measurement, and fuzzy logic control", IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, FEB. 2000, vol. 18, no. 2, - February 2000 (2000-02-01), USA, XP002261449 *
RAHIN ET AL.: "Call Admission Control Algorithms in ATM Networks: A Performance Comparison and Research Directions", INTERIM RESEARCH REPORT DRAFT 0.8, 29 September 1998 (1998-09-29), Leeds, XP002261448 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007076253A3 (en) * 2005-12-23 2007-10-04 Avaya Tech Llc Call admission control for mobility-capable telecommunications terminals
US7688724B2 (en) 2005-12-23 2010-03-30 Avaya Inc. Call admission control for mobility-capable telecommunications terminals
US8004979B2 (en) 2005-12-23 2011-08-23 Avaya Inc. Call admission control for mobility-capable telecommunications terminals
WO2007076253A2 (en) * 2005-12-23 2007-07-05 Avaya Technology Llc Call admission control for mobility-capable telecommunications terminals
US7848238B1 (en) 2007-05-09 2010-12-07 Sprint Spectrum L.P. Using VoIP-quality metrics to dynamically adjust the EV-DO reverse activity bit
US8107438B1 (en) 2008-06-18 2012-01-31 Sprint Spectrum L.P. Method for initiating handoff of a wireless access terminal based on the reverse activity bit
US8040803B1 (en) 2009-01-08 2011-10-18 Sprint Spectrum L.P. Using packet-transport metrics for call-admission control
US8254930B1 (en) 2009-02-18 2012-08-28 Sprint Spectrum L.P. Method and system for changing a media session codec before handoff in a wireless network
US9374306B1 (en) 2009-03-04 2016-06-21 Sprint Spectrum L.P. Using packet-transport metrics for setting DRCLocks
US9467938B1 (en) 2009-04-29 2016-10-11 Sprint Spectrum L.P. Using DRCLocks for conducting call admission control
US8310929B1 (en) 2009-06-04 2012-11-13 Sprint Spectrum L.P. Method and system for controlling data rates based on backhaul capacity
US8245088B1 (en) 2009-06-30 2012-08-14 Sprint Spectrum L.P. Implementing quality of service (QoS) by using hybrid ARQ (HARQ) response for triggering the EV-DO reverse activity bit (RAB)
US8204000B1 (en) 2009-07-23 2012-06-19 Sprint Spectrum L.P. Achieving quality of service (QoS) by using the reverse activity bit (RAB) in creation of neighbor lists for selected access terminals
US8644176B1 (en) 2010-03-11 2014-02-04 Sprint Spectrum L.P. Methods and systems for supporting enhanced non-real-time services for real-time applications
US8363564B1 (en) 2010-03-25 2013-01-29 Sprint Spectrum L.P. EVDO coverage modification based on backhaul capacity
US8693499B2 (en) 2010-08-17 2014-04-08 Microsoft Corporation Dynamic adjustment of bandwidth allocation for an in-progress media session
US8472952B1 (en) 2010-11-30 2013-06-25 Sprint Spectrum L.P. Discovering a frequency of a wireless access point
US8619674B1 (en) 2010-11-30 2013-12-31 Sprint Spectrum L.P. Delivery of wireless access point information

Also Published As

Publication number Publication date
AU2003244911A1 (en) 2004-01-19

Similar Documents

Publication Publication Date Title
US7369490B2 (en) Method and apparatus for call event processing in a multiple processor call processing system
WO2004004249A1 (en) Method and apparatus for load estimation and call admission control in a call processing environment
US6665264B1 (en) Connection admission control for connection orientated networks
Tong et al. Adaptive call admission control under quality of service constraints: a reinforcement learning solution
CA2683501C (en) An automatic policy change management scheme for diffserv-enabled mpls networks
JP4606481B2 (en) A method for determining the optimal alternative node for a customer environment and a method for determining whether rehoming a customer is cost effective
CA2471594C (en) Method and apparatus for web farm traffic control
US6842428B2 (en) Method for allocating communication network resources using adaptive demand prediction
CN109861920A (en) A kind of method and device of elasticity current limliting
WO2006042410A1 (en) System and method for managing use and access of a communication network
WO1995028787A1 (en) A communication network control method and apparatus
US20040005041A1 (en) Method and apparatus for load estimation in a call processing environment
US12020070B2 (en) Managing computer workloads across distributed computing clusters
CN113472659B (en) Method and device for determining forwarding path and SDN controller
CN113132490A (en) MQTT protocol QoS mechanism selection scheme based on reinforcement learning
US9178826B2 (en) Method and apparatus for scheduling communication traffic in ATCA-based equipment
US10057857B2 (en) System power management and optimization in telecommunication systems
CN116074260A (en) Service slice scheduling method in power network
Tong et al. Reinforcement learning for call admission control and routing under quality of service constraints in multimedia networks
Fendick et al. Asymptotic analysis of adaptive rate control for diverse sources with delayed feedback
US6738758B1 (en) Adaptive bucket indexing mechanism to effectively manage service activation requests
Roy et al. GoPro: A Low Complexity Task Allocation Algorithm for a Mobile Edge Computing System
Courcoubetis et al. Fair background data transfers of minimal delay impact
Lozhkovskyi et al. Estimating the service waiting probability in a single-channel system with self-similar traffic
CN115955407B (en) Instance management method, device, equipment and storage medium

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP