CN113467939A - Capacity management method, device, platform and storage medium - Google Patents

Capacity management method, device, platform and storage medium Download PDF

Info

Publication number
CN113467939A
CN113467939A CN202110706473.3A CN202110706473A CN113467939A CN 113467939 A CN113467939 A CN 113467939A CN 202110706473 A CN202110706473 A CN 202110706473A CN 113467939 A CN113467939 A CN 113467939A
Authority
CN
China
Prior art keywords
resource
service
amount
exclusive
resources
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110706473.3A
Other languages
Chinese (zh)
Inventor
杨正华
江旻
王大飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WeBank Co Ltd
Original Assignee
WeBank Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WeBank Co Ltd filed Critical WeBank Co Ltd
Priority to CN202110706473.3A priority Critical patent/CN113467939A/en
Publication of CN113467939A publication Critical patent/CN113467939A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application provides a capacity management method, a capacity management device, a capacity management platform and a storage medium, wherein the capacity management method comprises the following steps: acquiring the service request quantity of service access requests initiated by a service i of application equipment; wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application equipment; acquiring the times of using shared resources of shared resources in application resources of application equipment, and the amount of the shared resources and the maximum available resource amount of the shared resources distributed for a service i; the application resources comprise independent resources distributed for the service i and shared resources distributed for all the services; acquiring the rejection times of the application equipment rejecting the service request when the exclusive resource is occupied and the shared resource is occupied; the exclusive resource allocated to the service i is adjusted based on the amount of exclusive resource, the maximum available resource amount, the number of service requests, the number of times of using the shared resource, and the number of times of refusing.

Description

Capacity management method, device, platform and storage medium
Technical Field
The embodiment of the application relates to the technical field of data processing of financial technology (Fintech), and relates to but is not limited to a capacity management method, a capacity management device, a capacity management platform and a storage medium.
Background
With the development of computer computing, more and more technologies are applied in the financial field, and the traditional financial industry is gradually changing to financial technology (Fintech), however, the financial technology also puts higher demands on the technologies due to the requirements of the financial industry on security and real-time performance.
In the field of financial science and technology, a pressure measurement model is adopted for system capacity management at present, and in a scheme of managing the pressure measurement model, the pressure measurement model is generated approximately, so that the application resources of the system are subjected to pressure test, and the resources required by each service and the maximum capacity supportable by the hardware resources of the system are estimated. It should be noted that, in the management scheme of the pressure measurement model, the system reserves resources for the service in advance, that is, the maximum processing capacity of the service is preset, and the processing of the system is limited by the current limiting algorithm to avoid exceeding the expected flow. Therefore, when capacity management is performed at least, management of resources can be performed only according to preset parameters, and the problem of poor resource utilization rate due to inflexible management mode exists in the above scheme.
Disclosure of Invention
The embodiment of the application provides a capacity management method, a capacity management device, a capacity management platform and a storage medium, which are used for solving the problem that in the related art, at least when capacity management exists, management aiming at resources can only be performed according to preset parameters, and the management mode is inflexible, so that the resource utilization rate is poor.
The technical scheme of the embodiment of the application is realized as follows:
an embodiment of the present application provides a capacity management method, including:
acquiring the service request quantity of service access requests initiated by a service i of application equipment; wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application device;
acquiring the times of using shared resources of shared resources in application resources of the application equipment, and the amount of the exclusive resources and the maximum available resource amount of the exclusive resources distributed for the service i; the application resources comprise exclusive resources distributed for a service i and shared resources distributed for all the services;
obtaining the rejection times of the application equipment rejecting the service request when the exclusive resource is occupied and the shared resource is occupied;
and adjusting the exclusive resource allocated to the service i based on the exclusive resource amount, the maximum available resource amount, the service request amount, the shared resource using times and the rejecting times.
An embodiment of the present application provides a capacity management apparatus, including:
the acquisition module is used for acquiring the service request quantity of service access requests initiated by a service i of the application equipment; wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application device;
the obtaining module is further configured to obtain times of using shared resources of shared resources in application resources of the application device, and an amount of the shared resources and a maximum available resource amount of the shared resources allocated to the service i; the application resources comprise exclusive resources distributed for a service i and shared resources distributed for all the services;
the obtaining module is further configured to obtain the rejection times of the application device rejecting the service request when the exclusive resource is occupied and the shared resource is occupied;
a processing module, configured to adjust the exclusive resource allocated to the service i based on the amount of the exclusive resource, the maximum available resource amount, the number of service requests, the number of times of using the shared resource, and the number of times of rejecting.
An embodiment of the present application provides a capacity management platform, including:
a memory for storing executable instructions; a processor, when executing executable instructions stored in the memory, implements the method described above.
The embodiment of the application provides a storage medium, which stores executable instructions and is used for causing a processor to implement the method when executed.
The embodiment of the application has the following beneficial effects:
the method comprises the steps of initiating service access requests by acquiring the number of service requests aiming at a service i of the application equipment; wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application equipment; acquiring the times of using shared resources of shared resources in application resources of application equipment, and the amount of the shared resources and the maximum available resource amount of the shared resources distributed for a service i; the application resources comprise independent resources distributed for the service i and shared resources distributed for all the services; acquiring the rejection times of the application equipment rejecting the service request when the exclusive resource is occupied and the shared resource is occupied; adjusting the exclusive resource allocated to the service i based on the amount of the exclusive resource, the maximum available resource amount, the number of service requests, the number of times of using the shared resource and the number of times of refusing; that is to say, the capacity index of the service i for the application device is counted in real time, and the capacity index comprises the amount of the exclusive resource, the maximum available amount of the resource, the number of service requests, the number of times of using the shared resource and the number of times of refusing, so that the exclusive resource of the service i is automatically adjusted based on the capacity index, flexible management of the exclusive resource of the service i is at least achieved, and the utilization rate of the application resource of the service i is further improved.
Drawings
FIG. 1 is an alternative schematic diagram of a capacity management platform provided in an embodiment of the present application;
fig. 2 is an alternative flow chart of a capacity management method provided in the embodiment of the present application;
FIG. 3 is an alternative architectural diagram of a capacity management platform provided by embodiments of the present application;
FIG. 4 is an alternative architecture diagram of a capacity broker as provided by an embodiment of the present application;
FIG. 5 is a schematic flow chart of an alternative capacity management method according to an embodiment of the present application;
FIG. 6 is an alternative architecture diagram of a performance level collector provided by an embodiment of the present application;
FIG. 7 is a schematic flow chart of an alternative capacity management method according to an embodiment of the present application;
FIG. 8 is a schematic flow chart of an alternative capacity management method according to an embodiment of the present application;
FIG. 9 is a schematic flow chart of an alternative capacity management method according to an embodiment of the present application;
fig. 10 is a schematic diagram of an alternative architecture of a capacity management controller according to an embodiment of the present application.
Detailed Description
In order to make the objectives, technical solutions and advantages of the present application clearer, the present application will be described in further detail with reference to the attached drawings, the described embodiments should not be considered as limiting the present application, and all other embodiments obtained by a person of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.
In the following description, reference is made to "some embodiments" which describe a subset of all possible embodiments, but it is understood that "some embodiments" may be the same subset or different subsets of all possible embodiments, and may be combined with each other without conflict. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the embodiments of the present application belong. The terminology used in the embodiments of the present application is for the purpose of describing the embodiments of the present application only and is not intended to be limiting of the present application.
An exemplary application of the capacity management platform provided in the embodiment of the present application is described below, and the capacity management platform provided in the embodiment of the present application may be implemented as a server. In the following, an exemplary application will be explained when the capacity management platform is implemented as a capacity management platform.
Referring to fig. 1, fig. 1 is a schematic structural diagram of a capacity management platform provided in an embodiment of the present application, and a capacity management platform 100 shown in fig. 1 includes: at least one processor 110, at least one network interface 120, a user interface 130, and memory 150. The various components in the capacity management platform are coupled together by a bus system 140. It is understood that the bus system 140 is used to enable connected communication between these components. The bus system 140 includes a power bus, a control bus, and a status signal bus in addition to a data bus. For clarity of illustration, however, the various buses are labeled as bus system 140 in fig. 1.
The Processor 110 may be an integrated circuit chip having Signal processing capabilities, such as a general purpose Processor, a Digital Signal Processor (DSP), or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, or the like, wherein the general purpose Processor may be a microprocessor or any conventional Processor, or the like.
The user interface 130 includes one or more output devices 131, including one or more speakers and/or one or more visual display screens, that enable the presentation of media content. The user interface 130 also includes one or more input devices 132 including user interface components that facilitate user input, such as a keyboard, mouse, microphone, touch screen display, camera, other input buttons and controls.
The memory 150 may be removable, non-removable, or a combination thereof. Exemplary hardware devices include solid state memory, hard disk drives, optical disk drives, and the like. Memory 150 optionally includes one or more storage devices physically located remotely from processor 110. The memory 150 includes volatile memory or nonvolatile memory, and may include both volatile and nonvolatile memory. The nonvolatile Memory may be a Read Only Memory (ROM), and the volatile Memory may be a Random Access Memory (RAM). The memory 150 described in embodiments herein is intended to comprise any suitable type of memory. In some embodiments, memory 150 is capable of storing data to support various operations, examples of which include programs, modules, and data structures, or subsets or supersets thereof, as exemplified below.
An operating system 151 including system programs for processing various basic system services and performing hardware-related tasks, such as a framework layer, a core library layer, a driver layer, etc., for implementing various basic services and processing hardware-based tasks;
a network communication module 152 for communicating to other computing devices via one or more (wired or wireless) network interfaces 120, exemplary network interfaces 120 including: bluetooth, wireless compatibility authentication (WiFi), and Universal Serial Bus (USB), etc.;
an input processing module 153 for detecting one or more user inputs or interactions from one of the one or more input devices 132 and translating the detected inputs or interactions.
In some embodiments, the apparatus provided by the embodiments of the present application may be implemented in software, and fig. 1 illustrates a capacity management apparatus 154 stored in the memory 150, where the capacity management apparatus 154 may be a capacity management apparatus in a capacity management platform, and may be software in the form of programs and plug-ins, and the like, and includes the following software modules: an acquiring module 1541, a processing module 1542, these modules being logical and thus arbitrarily combined or further split according to the implemented functions. The functions of the respective modules will be explained below.
In other embodiments, the apparatus provided in the embodiments of the present Application may be implemented in hardware, and for example, the apparatus provided in the embodiments of the present Application may be a processor in the form of a hardware decoding processor, which is programmed to execute the capacity management method provided in the embodiments of the present Application, for example, the processor in the form of the hardware decoding processor may be implemented by one or more Application Specific Integrated Circuits (ASICs), DSPs, Programmable Logic Devices (PLDs), Complex Programmable Logic Devices (CPLDs), Field-Programmable Gate arrays (FPGAs), or other electronic components.
The capacity management platform implementing the embodiment of the present invention is connected to a developer in various ways, and provides various graphical ways to deploy and manage application devices, which will be described below.
The capacity management method provided by the embodiment of the present application will be described below with reference to an exemplary application and implementation of the capacity management platform provided by the embodiment of the present application. Referring to fig. 2, fig. 2 is an alternative flow chart of a capacity management method provided in an embodiment of the present application, which will be described in conjunction with the steps shown in fig. 2,
step 201, obtaining service request quantity R of service access requests initiated by service i of application equipmenti
Wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application device.
In the embodiment of the application, a plurality of application devices are deployed in a capacity management platform, and each application device supports a plurality of services. Here, a service is understood to be capable of independently performing a certain functional application.
Illustratively, a user terminal accesses a capacity management platform through a network, an interface of the user terminal is connected with a service interface in the capacity management platform, a capacity management system supporting a specific service is supported to be deployed at the user terminal, and a management page of various functions of the capacity management platform is displayed in a graphical interface of the user terminal, including configuration of application resources of application equipment, expansion of the application equipment, configuration of capacity parameters of the application equipment, and the like. The user terminal may also initiate a service access request to a service i of the application device in the capacity management platform through the service interface.
Here, referring to fig. 3, fig. 3 is an alternative architecture schematic diagram of a capacity management platform provided in an embodiment of the present application, where the capacity management platform 100 includes a first application device 31, a second application device 32, a third application device 33, and a functional module for managing each application device; illustratively, the functional modules include a performance metric collector 34, a capacity analyzer 35, and a capacity management controller 36. It should be noted that the number of the application devices included in the capacity management platform 100 may be flexibly set according to actual requirements, and this is not specifically limited in this application.
The performance indicator collector 34 is configured to collect the capacity indicators of all the service interfaces in the first application device 31, the second application device 32, and the third application device 33, and further collect the capacity indicators of all the service interfaces in the first application device 31, the second application device 32, and the third application device 33And carrying out preliminary statistics and storage on the capacity index. It should be noted that, for example, the capacity index includes a service request number R for initiating a service access request for a service i of the application devicei
The capacity analyzer 35 is configured to perform capacity model modeling on the capacity indexes collected by the performance index collector 34, give an optimized configuration and a capacity evaluation, and push the optimized configuration and the capacity evaluation to the capacity management controller 36.
The capacity management controller 36 is used to distribute capacity parameters and manage system resources, and pull up new applications for capacity expansion as needed.
Step 202, obtaining the times S of using shared resources in the application resources of the application deviceiAnd amount of exclusive resource allocated for service i
Figure BDA0003132145190000071
And maximum amount of available resources
Figure BDA0003132145190000072
The application resources comprise independent resources distributed for the service i and shared resources distributed for all the services, and the maximum available resource amount comprises the amount of the independent resources
Figure BDA0003132145190000073
In the embodiment of the present application, the application resource is a resource that can be used by the application device. The application resources include exclusive resources and shared resources. It should be noted that the exclusive resource can only be used by a specific service, and other services cannot be used; shared resources all services can be used. And the application equipment in the capacity management platform configures certain exclusive resources for each service, and the rest resources except the exclusive resources in the application resources of the application equipment are used as shared resources.
In the embodiment of the application, the capacity management platform sets capacity management parameters for the service i according to a two-layer management model, wherein the capacity management parameters comprise the amount of the exclusive resources
Figure BDA0003132145190000074
And the maximum amount of available resources allocated for service i
Figure BDA0003132145190000075
The capacity index further includes the number of times S of using the shared resource among the application resources of the application deviceiAmount of exclusive resource
Figure BDA0003132145190000076
And maximum amount of available resources
Figure BDA0003132145190000077
In the embodiment of the application, a capacity management platform obtains the service request number R of service access requests initiated by a service i of an application deviceiThen, obtaining the times S of using shared resources of the shared resources in the application resources of the application equipmentiAnd amount of exclusive resource allocated for service i
Figure BDA0003132145190000078
And maximum amount of available resources
Figure BDA0003132145190000079
Step 203, obtaining the rejection times K of the application device rejecting the service request when the exclusive resource is occupied and the shared resource is occupiedi
In an achievable application scenario, if the exclusive resource corresponding to the current service i in the application device is sufficient, the service i may use the exclusive resource; if the exclusive resource corresponding to the current service i is in short supply, namely the exclusive resource is occupied, trying to acquire the shared resource; if the shared resource has a margin, the service i can use the shared resource, otherwise, when the shared resource has no margin, namely the shared resource is occupied, the application device refuses the service request of the service i.
It should be noted that the capacity index further includes the rejection number K of the application device rejecting the service requesti
In other embodiments of the present application, referring to fig. 3, the capacity management platform may embed a first capacity agent 37 in the first application device 31, a second capacity agent 38 in the second application device 32, and a third capacity agent 39 in the third application device 33. Here, the capacity agent is a functional module that provides a capacity management service for the application device. Specifically, the capacity agent is responsible for collecting capacity indexes of all service interfaces in the application system and reporting the capacity indexes to the performance index collector 34, and meanwhile, the capacity agent is also used for receiving relevant operation and maintenance instructions controlled by the capacity management controller 36.
In an implementation application scenario, referring to fig. 3 and fig. 4, fig. 4 is an alternative architecture diagram of a capacity proxy provided in an embodiment of the present application, specifically, a capacity management platform determines an entry point of the capacity proxy, embeds the capacity proxy into an application device by using an Aspect Oriented Programming (AOP), and the embedded capacity proxy is non-intrusive to an application system of the application device. The capacity broker intercepts the service request and then controls the frequency of service invocation and registers the service time consumption. On the other hand, the capacity agent may also obtain Central Processing Unit (CPU) utilization data of the application device. Here, the capacity agent puts all the collected capacity index data into a local data queue, and a reporting module in the capacity agent periodically reports the capacity index data to the performance index collector 34 in an asynchronous process.
Here, the cut-plane-oriented programming is a technique for realizing the uniform maintenance of the program functions by a pre-compiling manner and a dynamic agent during running. By using AOP, each part of the business logic can be isolated, so that the coupling degree between each part of the business logic is reduced.
Here, a Point of entry (Point) represents a Point to be enhanced in a set of business software, and some incremental logic is to be placed to complete a specific operation.
204, based on the amount of the exclusive resource
Figure BDA0003132145190000081
Maximum amount of available resources
Figure BDA0003132145190000082
Number of service requests RiNumber of times of using shared resource SiAnd the number of rejects KiThe exclusive resource allocated to service i is adjusted.
In the embodiment of the application, a capacity management platform obtains the service request number R of service access requests initiated by a service i of an application deviceiNumber of times S shared resource is used by shared resource among application resources of application deviceiAmount of exclusive resource allocated to service i
Figure BDA0003132145190000091
Maximum amount of available resources
Figure BDA0003132145190000092
And the rejection number K of the application device rejecting the service requestiThen, the current amount of the exclusive resource is served
Figure BDA0003132145190000093
For the reference resource amount, passing the maximum available resource amount
Figure BDA0003132145190000094
Number of service requests RiNumber of times of using shared resource SiAnd the number of rejects KiAnd determining the adjustment variable quantity of the reference resource quantity, and further realizing dynamic adjustment of the independent resource allocated to the service i based on the adjustment variable quantity.
According to the capacity management method, the number of service requests for initiating service access requests aiming at the service i of the application equipment is obtained; wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application equipment; acquiring the times of using shared resources of shared resources in application resources of application equipment, and the amount of the shared resources and the maximum available resource amount of the shared resources distributed for a service i; the application resources comprise independent resources distributed for the service i and shared resources distributed for all the services; acquiring the rejection times of the application equipment rejecting the service request when the exclusive resource is occupied and the shared resource is occupied; adjusting the exclusive resource allocated to the service i based on the amount of the exclusive resource, the maximum available resource amount, the number of service requests, the number of times of using the shared resource and the number of times of refusing; that is to say, the capacity index of the service i for the application device is counted in real time, and the capacity index comprises the amount of the exclusive resource, the maximum available amount of the resource, the number of service requests, the number of times of using the shared resource and the number of times of refusing, so that the exclusive resource of the service i is automatically adjusted based on the capacity index, flexible management of the exclusive resource of the service i is at least achieved, and the utilization rate of the application resource of the service i is further improved.
In some embodiments, step 204 is based on the amount of exclusive resources
Figure BDA0003132145190000095
Maximum amount of available resources
Figure BDA0003132145190000096
Number of service requests RiNumber of times of using shared resource SiAnd the number of rejects KiThe adjustment of the exclusive resource allocated to the service i can be implemented by the steps shown in fig. 5:
step 301, removing the service request number RiNumber of times of using shared resources SiAnd the number of rejects KiAnd obtaining the use times of the exclusive resource of the service i.
In the embodiment of the application, the number of times of using the exclusive resource of the service i is the number R of service requestsiIn (1), the number of times of using the shared resource S is removediAnd the number of rejects KiThe remaining number of times, i.e. the number of times of use of the exclusive resource of the service i, is Ri-Ki-Si
Step 302, obtaining the amount of the exclusive resource under the using times of the exclusive resource of the service i
Figure BDA0003132145190000101
Exclusive resource utilization rate gamma ofi
In the embodiment of the application, the capacity management platform removes the service request quantity RiNumber of times of using shared resources SiAnd the number of rejects KiAfter obtaining the use times of the exclusive resource of the service i, obtaining the use times R of the exclusive resource of the service ii-Ki-SiNext, for the amount of the exclusive resource
Figure BDA0003132145190000102
Exclusive resource utilization rate gamma ofi. In particular, the exclusive resource utilization γ of service iiThis can be obtained by the following formula:
Figure BDA0003132145190000103
wherein, γiExclusive resource utilization, R, for service iiNumber of service requests, K, initiating service access requests for a service i of an application deviceiNumber of refusals of service request for application device, SiThe number of times the shared resource is used for the shared resource among the application resources of the application device,
Figure BDA0003132145190000104
is the amount of exclusive resources serving i.
Step 303, remove the service request quantity RiNumber of rejections in (K)iAnd obtaining the use times of the application resources of the service i.
In the embodiment of the application, the number of times of using the application resource of the service i is the number R of service requestsiNumber of rejects KiThe remaining number of times, i.e. the number of times of use of the application resource of service i, is Ri-Ki
Step 304, calculating the number of times S the shared resource is usediIn proportion to the number of uses of the application resource, to obtain service iShared resource utilization factor lambdai
In the embodiment of the application, the capacity management platform removes the service request quantity RiNumber of rejections in (K)iObtaining the number of times R of using the application resource of the service ii-KiAnd determining the number of times S of using the shared resourceiNumber of uses R of application resourcei-KiRatio of (a) to shared resource utilization λ of service ii. In particular, shared resource utilization λ for service iiThis can be obtained by the following formula:
Figure BDA0003132145190000105
wherein λ isiShared resource utilization, R, for service iiNumber of service requests, K, initiating service access requests for a service i of an application deviceiNumber of refusals of service request for application device, SiThe number of times the resource is shared for use of a shared resource among application resources of the application device.
Step 305, calculating the rejection number KiAnd the number of service requests RiTo obtain rejection rate xi of service ii
In the embodiment of the application, the capacity management platform determines the rejection number KiAnd the number of service requests RiIs the rejection rate xi of service ii. Specifically, rejection rate ξ for service iiThis can be obtained by the following formula:
Figure BDA0003132145190000111
wherein ξiRejection rate for service i, RiNumber of service requests, K, initiating service access requests for a service i of an application deviceiThe number of denials of the service request for the application device.
In an implementation scenario, referring to fig. 3, fig. 4 and fig. 6, fig. 6 is an alternative architecture diagram of a performance index collector provided in an embodiment of the present application, where the performance index collector 34 includes a receiving module 61, a performance data queue 62, an allocating module 63, a first statistical module 64, a second statistical module 65, a third statistical module 66 and a capacity index database 67.
The receiving module 61 is connected to each capacity agent, and the receiving module 61 is configured to receive a capacity index sent by each capacity agent and store the performance index in the performance data queue 62; the distribution module 63 obtains the capacity index from the performance data queue 62, and periodically initiates a statistical task of distributing the capacity index to the statistical module according to a distribution principle; here, the statistical module includes a first statistical module 64, a second statistical module 65, and a third statistical module 66. Statistical module for capacity index such as amount of exclusive resource
Figure BDA0003132145190000118
Number of service requests RiNumber of times of using shared resource SiAnd the number of rejects KiCalculating to obtain the utilization rate gamma of the exclusive resourceiShared resource utilization factor lambdaiAnd rejection rate xiiAnd the utilization rate of the exclusive resource gammaiShared resource utilization factor lambdaiAnd rejection rate xiiTo the capacity index database 67. It should be noted that the distribution principle may be that the data amount according to the capacity index is averagely distributed to each statistical module, and the calculation amount of each statistical module is as equal as possible.
Step 306, based on the amount of the exclusive resource
Figure BDA0003132145190000112
Maximum amount of available resources
Figure BDA0003132145190000113
Exclusive resource utilization rate gammaiShared resource utilization factor lambdaiAnd rejection rate xiiThe exclusive resource allocated to service i is adjusted.
In the embodiment of the application, the capacity management platform acquires the amount of the exclusive resource
Figure BDA0003132145190000114
Maximum amount of available resources
Figure BDA0003132145190000115
Exclusive resource utilization gammaiShared resource utilization factor lambdaiAnd rejection rate xiiThen, the current amount of the exclusive resource is served
Figure BDA0003132145190000116
For the reference resource amount, passing the maximum available resource amount
Figure BDA0003132145190000117
Exclusive resource utilization rate gammaiShared resource utilization factor lambdaiAnd rejection rate xiiAnd determining the adjustment variable quantity of the reference resource quantity, and further realizing dynamic adjustment of the independent resource allocated to the service i based on the adjustment variable quantity.
In some embodiments, step 306 is based on the amount of exclusive resources
Figure BDA0003132145190000121
Maximum amount of available resources
Figure BDA0003132145190000122
Exclusive resource utilization rate gammaiShared resource utilization factor lambdaiAnd rejection rate xiiThe adjustment of the exclusive resource allocated to the service i can be implemented by the steps shown in fig. 7:
step 401, calculate 1 minus shared resource utilization λiThe reference exclusive resource utilization rate is obtained.
In the embodiment of the present application, the reference exclusive resource utilization rate of the service i is 1 minus the shared resource utilization rate λ of the service iiThe remaining utilization, i.e. the reference exclusive resource utilization of service i, is 1- λi
Step 402, calculating shared resource utilization ratio lambdaiRatio of reference exclusive resource utilizationAnd obtaining the actual utilization ratio of the shared resources and the exclusive resources.
In the embodiment of the present application, the actual utilization ratio of the shared resource and the exclusive resource of the service i is the shared resource utilization ratio λiAnd reference exclusive resource utilization rate 1-lambdaiI.e. the actual utilization ratio of shared resources and exclusive resources of service i is
Figure BDA0003132145190000123
Step 403, calculating the maximum available resource amount
Figure BDA0003132145190000124
Subtracting the amount of exclusive resources
Figure BDA0003132145190000125
The difference value of (2) to obtain the usage amount of the shared resource.
In the embodiment of the application, the shared resource usage amount of the service i is the maximum available resource amount of the service i
Figure BDA0003132145190000126
Subtracting the amount of exclusive resources
Figure BDA0003132145190000127
I.e. the shared resource usage of service i is
Figure BDA0003132145190000128
Step 404, calculating the usage amount of the shared resources and the amount of the exclusive resources
Figure BDA0003132145190000129
The estimated utilization ratio of the shared resource and the exclusive resource is obtained.
In the embodiment of the application, the estimated utilization ratio of the shared resource and the exclusive resource of the service i is the usage amount of the shared resource
Figure BDA00031321451900001210
And amount of exclusive resources
Figure BDA00031321451900001211
The ratio of (a) to (b), i.e. the estimated utilization ratio of the shared resources and the exclusive resources of service i, is
Figure BDA00031321451900001212
Step 405, calculating the difference value of the actual utilization ratio minus the estimated utilization ratio to obtain the share ratio increment delta theta.
In the embodiment of the application, the share ratio increment is used for measuring the condition of occupying the shared resource beyond expectation. Illustratively, if the share ratio is larger than the increment, it means that the independent resource allocation is too small and the shared resource allocation is too large.
In the embodiment of the application, the capacity management platform determines the increment of the share ratio of the service i as the actual utilization ratio
Figure BDA0003132145190000131
Subtracting the estimated utilization ratio
Figure BDA0003132145190000132
Is the share ratio increment Δ θ of service i. Specifically, the share ratio increment Δ θ of the service i can be obtained by the following formula:
Figure BDA0003132145190000133
where Δ θ is the share ratio increment for service i,
Figure BDA0003132145190000134
for the actual utilization ratio of the shared resource and the exclusive resource of the service i,
Figure BDA0003132145190000135
the estimated utilization ratio of the shared resource and the exclusive resource of the service i is obtained.
And step 406, adjusting the exclusive resource allocated to the service i based on the share ratio increment delta theta.
In some embodiments, the step 406 of adjusting the exclusive resource allocated to the service i based on the share ratio increment Δ θ may be implemented as follows:
and if the sharing ratio increment delta theta meets the sharing ratio increment reduction condition, reducing the sharing ratio increment delta theta to increase the exclusive resource allocated to the service i.
In the embodiment of the application, after the capacity management platform obtains the share ratio increment delta theta, the capacity management platform determines that the share ratio increment delta theta meets the share ratio increment reduction condition, namely that the share ratio increment delta theta is too large, that independent resources are distributed too little, and that shared resources are distributed too much, so that the service i contends for the shared resources under the condition that the independent shared resources distributed by the service i are insufficient; at this time, the capacity management platform increases the exclusive resource allocated to the service i by reducing the share ratio increment Δ θ, and reduces the share ratio of the shared resource.
In some embodiments, the step 406 adjusts the exclusive resource allocated to the service i based on the share ratio increment Δ θ, which may be implemented by steps 501 to 505, or steps 501 to 504 and 506 shown in fig. 8:
step 501, obtaining the utilization rate gamma of the exclusive resource in the adjustment time intervaliChange information of (Δ γ)i
In this embodiment, the adjustment time interval may be the last adjustment of the utilization rate γ of the exclusive resourceiTo the current time.
In the embodiment of the application, the change information can represent the utilization rate gamma of the exclusive resourceiThe change information can also represent the utilization rate gamma of the exclusive resourceiThe change information can represent the utilization rate gamma of the exclusive resourceiConstant over the adjustment period.
In an achievable application scenario, if the change information represents the exclusive resource utilization γiIn the adjustment period, the resource is in an increasing trend, that is, the utilization rate of the exclusive resource is improved after the last adjustment, namely, the resource is delta gammaiIs greater than 0; if the change information represents the utilization rate gamma of the exclusive resourceiIn the adjustment period, the resource is in a decreasing trend, that is, the utilization rate of the exclusive resource is reduced after the last adjustment, that is, the resource is delta gammaiLess than 0; if the change information represents the utilization rate gamma of the exclusive resourceiConstant during the adjustment period, i.e. the utilization of the exclusive resource is not changed after the last adjustment, i.e. Δ γi=0。
Step 502, obtaining the utilization rate gamma of the exclusive resourceiCorresponding standard deviation deltai
In the examples of the present application, the standard deviation δiGamma for representing service iiIs detected.
Step 503, change information delta gammaiStandard deviation deltaiReference exclusive resource utilization ratio 1-lambdaiAnd rejection rate xiiAnd the result is input into the reward punishment adjustment model to obtain the reward punishment factor output by the reward punishment adjustment model.
In the embodiment of the application, the reward and punishment adjustment model is,
Figure BDA0003132145190000141
wherein the content of the first and second substances,
Figure BDA0003132145190000142
reward and penalty factor, Δ γ, for service iiInformation on the change in the utilization of the exclusive resource for service i, S (Δ γ)i) Information Δ γ on change in utilization rate of shared resource for service iiCorresponding adjustment direction (1-lambda)i) Reference exclusive resource utilization, ξ, for service iiA denial rate for service i;
wherein the content of the first and second substances,
Figure BDA0003132145190000143
wherein, Delta gammaiThe characteristic that the utilization rate of the exclusive resource is in an increasing change trend within the adjusting period is more than 0, and S (delta gamma)i) Information Δ γ on change in utilization rate of shared resource for service iiCorresponding first adjustment direction, i.e. S (Δ γ)i)=1;Δγi< 0 indicates that the utilization rate of the exclusive resource is in a decreasing trend S (delta gamma) in the adjustment periodi) Information Δ γ on change in utilization rate of shared resource for service iiA corresponding second adjustment direction opposite to the first adjustment direction, i.e. S (Δ γ)i)=-1,ΔγiCharacterization of the exclusive resource utilization as 0 is constant over the adjustment period, S (Δ γ)i) Information Δ γ on change in utilization rate of shared resource for service iiCorresponding first adjustment direction, i.e. S (Δ γ)i) 1. Here, the first adjustment direction is a forward adjustment direction.
In some embodiments, step 503 will change the information Δ γiStandard deviation deltaiReference to exclusive resource utilization 1-lambdaiAnd rejection rate xiiInputting the result into the reward and punishment adjustment model to obtain the reward and punishment factor output by the reward and punishment adjustment model
Figure BDA0003132145190000144
The method can be realized by the following steps:
a1, based on variation information and standard deviation deltaiAnd generating a first reference factor.
In the embodiment of the application, if the change information represents the utilization rate gamma of the exclusive resourceiIn an increasing change trend in the adjusting period, the capacity management platform is based on the change information delta gammaiAnd exclusive resource utilization gammaiCorresponding standard deviation deltaiThe value of the generated first reference factor is the standard deviation deltaiThe value of itself; if the change information represents the utilization rate gamma of the exclusive resourceiIn the decreasing change trend in the adjusting period, the capacity management platform is based on the change information delta gammaiAnd exclusive resource utilization gammaiCorresponding standard deviation deltaiThe value of the generated first reference factor is the standard deviation deltaiThe opposite number of (c).
It should be noted that if Δ γiIf the service is more than 0, a certain amount of reward is given at this time, and the specific amount is based on the exclusive resource utilization rate gamma of the serviceiHistorical fluctuation deltaiA situation decision; on the contrary, the utilization rate of the exclusive resource is reduced after the last adjustment, namely delta gammaiLess than 0, this time giving a certain amount of penalty δi. On the other hand, if the service rejection rate is relatively high, it means that all resources of the service are insufficient, and a certain reward needs to be given.
A2, usage rate and rejection rate xi based on reference exclusive resourceiAnd generating a second reference factor.
In the embodiment of the application, the capacity management platform determines the utilization rate of the reference exclusive resource 1-lambdaiAnd rejection rate xiiIs the second reference factor.
A3, obtaining the sum of the first reference factor and the second reference factor to obtain a reward punishment factor
Figure BDA0003132145190000151
In this embodiment, the capacity management platform determines the sum of the first reference factor and the second reference factor as a reward and punishment factor
Figure BDA0003132145190000152
Step 504, the utilization rate gamma of the exclusive resourceiShared ratio increment delta theta, reward and punishment factor and exclusive resource amount
Figure BDA0003132145190000153
Inputting the target adjustment quantity into a target adjustment model to obtain a target adjustment quantity output by the target adjustment model
Figure BDA0003132145190000154
In the embodiment of the present application, the target adjustment model is,
Figure BDA0003132145190000155
wherein the content of the first and second substances,
Figure BDA0003132145190000156
target adjustment amount, gamma, for exclusive resource of service iiIn order to serve the exclusive resource utilization of i,
Figure BDA0003132145190000157
the amount of exclusive resources for service i, delta theta is the share ratio increment for service i,
Figure BDA0003132145190000158
reward and penalty factors for service i; lambda [ alpha ]iIn order to serve the shared resource utilization of i,
Figure BDA0003132145190000159
maximum amount of available resources, ξ, for service iiRejection rate, Δ γ, for service iiExclusive resource utilization gamma for service iiChange information of δiExclusive resource utilization gamma for service iiStandard deviation of (2).
Step 505, if the target adjustment amount is determined
Figure BDA0003132145190000161
Less than the target adjustment threshold, determining a target adjustment amount
Figure BDA0003132145190000162
And according with the target adjustment amount reducing condition, reducing the amount of the exclusive resource distributed to the service i.
Wherein the reduced amount of exclusive resources equals the target adjustment amount
Figure BDA0003132145190000163
The absolute value of (a) corresponds to the amount of resources.
In this embodiment of the application, the target adjustment threshold is used to determine whether to adjust the exclusive resource of the service i, and for example, the target adjustment threshold may be 0.
In the embodiment of the application, the capacity management platform utilizes the exclusive resource utilization rate gammaiDelta theta of share ratio, and reward penalty factor
Figure BDA0003132145190000164
And amount of exclusive resources
Figure BDA0003132145190000165
Inputting the target adjustment quantity into a target adjustment model to obtain a target adjustment quantity output by the target adjustment model
Figure BDA0003132145190000166
Then, the target adjustment amount is judged
Figure BDA0003132145190000167
If it is less than the target adjustment threshold, e.g., 0, if the target adjustment amount is
Figure BDA0003132145190000168
If less than 0, determining the target adjustment amount
Figure BDA0003132145190000169
Meet the target adjustment quantity reducing condition to serve the current exclusive resource quantity of the i
Figure BDA00031321451900001610
As a reference resource amount, a target adjustment amount is used
Figure BDA00031321451900001611
Is the reduced amount of exclusive resources, reducing the amount of exclusive resources allocated to the exclusive resources of the service i.
Step 506, if the target adjustment amount
Figure BDA00031321451900001612
If the target adjustment threshold is larger than the target adjustment threshold, determining a target adjustment amount
Figure BDA00031321451900001613
And according with the target adjustment amount increasing condition, increasing the amount of the exclusive resource allocated to the exclusive resource of the service i.
Wherein the increased amount of exclusive resources equals the target adjustment amount
Figure BDA00031321451900001614
The corresponding amount of resources.
In the embodiment of the application, the capacity management platformWill share resource utilization rate gammaiDelta theta of share ratio, and reward penalty factor
Figure BDA00031321451900001615
And amount of exclusive resources
Figure BDA00031321451900001616
Inputting the target adjustment quantity into a target adjustment model to obtain a target adjustment quantity output by the target adjustment model
Figure BDA00031321451900001617
Then, the target adjustment amount is judged
Figure BDA00031321451900001618
If it is greater than the target adjustment threshold, e.g., 0, if the target adjustment amount is greater than the target adjustment threshold
Figure BDA00031321451900001619
If the target adjustment amount is greater than 0, the target adjustment amount is determined
Figure BDA00031321451900001620
According with the target adjustment quantity increasing condition to serve the current exclusive resource quantity of the i
Figure BDA00031321451900001621
As a reference resource amount, a target adjustment amount is used
Figure BDA00031321451900001622
And increasing the amount of the exclusive resource allocated to the service i for the increased amount of the exclusive resource.
From the above, the capacity analyzer in the capacity management platform targets the amount of the exclusive resource of the service i according to the actual operation condition of the application device
Figure BDA00031321451900001623
And adjusting to at least realize flexible management of the exclusive resource of the service i, thereby improving the utilization rate of the application resource of the service i.
Other embodiments of the present applicationIn an embodiment, to ensure high availability of a service, the maximum amount of available resources for service i
Figure BDA00031321451900001624
Should be the maximum value max (R) of the number of resources actually used by the current servicei) And
Figure BDA00031321451900001625
the maximum value of the two is the maximum available resource amount of the service i
Figure BDA00031321451900001626
The adjustment amount of (b) can be obtained by the following formula:
Figure BDA0003132145190000171
wherein the content of the first and second substances,
Figure BDA0003132145190000172
maximum amount of available resources for service i
Figure BDA0003132145190000173
Amount of adjustment of (2), RiThe number of service requests to initiate a service access request for service i of an application device,
Figure BDA0003132145190000174
the maximum amount of available resources to serve i.
From the above, the capacity management platform is based on the maximum available resource amount of the service i
Figure BDA0003132145190000175
And the number of service requests for initiating service access requests for a service i of the application device, the maximum available resource amount for the service i
Figure BDA0003132145190000176
Making adjustments to achieve at least the maximum amount of resources available to service i
Figure BDA0003132145190000177
The flexible management of the service i is realized, and the utilization rate of the application resources of the service i is further improved.
In some embodiments, step 204 is based on the amount of exclusive resources
Figure BDA0003132145190000178
Maximum amount of available resources
Figure BDA0003132145190000179
Number of service requests RiNumber of times of using shared resource SiAnd the number of rejects KiAfter the exclusive resource allocated to the service i is adjusted, the steps shown in fig. 9 may be further executed to achieve the purpose of capacity expansion:
step 601, acquiring the resource amount of all resources required by all services.
In the embodiment of the present application, one application device supports a plurality of services, and the resource amount of all resources required by the plurality of services, that is, all services, includes the resource amount of the required resources when the maximum service request may occur simultaneously in the application device.
In this embodiment of the present application, the step 601 of obtaining the resource amount of all resources required by all services includes the following steps:
step1, obtain the first number of services in the access state of all services.
In the embodiment of the application, the service in the access state can be the service with the maximum available resource amount
Figure BDA00031321451900001710
And (4) corresponding service. Illustratively, an application device supports 100 services, and the capacity management platform obtains the maximum available resource amount from the 100 services
Figure BDA00031321451900001711
The number of corresponding services is a first number.
And Step2, acquiring a second number of associated services with preset association coefficients with the service in the access state from all the services.
In the embodiment of the present application, the association service may be understood as a service in which a correlation degree determined by calculating a linear correlation degree between a service in an access state and each service not in the access state meets a correlation threshold condition. Here, determining the associated service may be performed by calculating a correlation coefficient between each of the services in the accessed state and the services not in the accessed state, the correlation coefficient being a quantity of a degree of linear correlation between the service in the accessed state and each of the services not in the accessed state. For example, determining a correlation coefficient between the service X in the access state and each service Y in the services not in the access state may be obtained as follows:
Figure BDA0003132145190000181
where ρ (X, Y) is a correlation coefficient between the service X in the accessed state and each service Y in the services not in the accessed state, cov (X, Y) is a covariance between the service X in the accessed state and each service Y in the services not in the accessed state, σXVariance, σ, for service X in access stateYThe variance for each service Y in services that are not in the access state.
In the embodiment of the application, after the capacity management platform obtains the first number of the services in the access state in all the services, the number of the associated services which have the preset associated coefficient with the services in the access state in all the services is obtained to be the second number.
And Step3, calculating the sum of the first quantity and the second quantity to obtain the quantity of concurrent access.
In the embodiment of the application, after the capacity management platform obtains a first number of services in an access state in all the services and a second number of associated services having a preset association coefficient with the services in the access state in all the services, a sum of the first number and the second number is determined, and further, a total number of services with concurrent access amount in the capacity management platform is determined.
Step4, calculating the concurrent access quantity and the maximum available resource quantity
Figure BDA0003132145190000182
The first resource amount is obtained.
In the embodiment of the application, the capacity management platform obtains the sum of the first quantity and the second quantity, and after the concurrent access quantity is obtained, the concurrent access quantity and the maximum available resource quantity are determined
Figure BDA0003132145190000183
The product of (a) is the first amount of resources.
And Step5, calculating the difference value of the total number of all services minus the concurrent access amount to obtain the residual access amount.
In the embodiment of the application, the capacity management platform determines that the difference value obtained by subtracting the concurrent access amount from the total number of all the services is the remaining access amount.
Step6, calculating the residual access amount and the exclusive resource amount
Figure BDA0003132145190000184
The second resource amount is obtained.
In the embodiment of the application, after the capacity management platform obtains the remaining access amount by subtracting the difference value of the concurrent access amount from the total number of all the services, the remaining access amount and the amount of the exclusive resource are determined
Figure BDA0003132145190000185
The product of (a) is the second amount of resources.
And Step7, calculating the sum of the first resource amount and the second resource amount to obtain the resource amount of all the resources.
In this embodiment, the capacity management platform determines the sum of the first resource amount and the second resource amount as the resource amount of all the resources.
In one achievable application scenario, the capacity management platform setting is accessing the set vision [, ]]The accessing set is used for storing the service in the accessing state and the service to be accessed, and will have the maximum available resource amount
Figure BDA0003132145190000191
The corresponding service i joins the accessing set visiting [ argmax ]i{ti P}]. Meanwhile, the accessed set visited is set, the accessing set is used for storing the accessed service, and the accessed set visited is initialized to be empty, namely the visited is]. Fetching a first service X from the accessing set visiting [0 ]]For each service Y that is not in the accessing set visiting and not in the accessed set visited, a correlation coefficient ρ (X, Y) between the service X and the service Y is calculated. If the correlation coefficient ρ (X, Y) exceeds 0.7, the service X is removed from the accessing set visiting, added to the accessed set visited, and the service Y is added to the accessing set visiting, and further, the above operations are repeated to determine other services having high correlation with the service Y, and the above operations are stopped until the elements in the accessing set visiting are empty.
Further, the capacity management platform has access to the maximum available resource amount of the service in the set of visited
Figure BDA0003132145190000192
Summing to obtain a first resource amount
Figure BDA0003132145190000193
Amount of exclusive resource to remaining access, i.e. other services not in the accessed set visited
Figure BDA0003132145190000194
Summing to obtain a second resource amount
Figure BDA0003132145190000195
Figure BDA0003132145190000196
The capacity management platform determines the amount of resources T ═ T of all resources needed for all services1+T2
As can be seen from the above description, in the embodiment of the present application, all services in the accessed set visited in the capacity management system are services that occupy the most resources and are strongly associated, and the capacity management platform determines the resource amount of all resources required by all services based on consideration of the situation that the services peak or are concurrent at the same time.
Step 602, obtaining the resource amount of the application resource provided by the application device.
In this embodiment of the present application, the resource amount of the application resource provided by the application device may be understood as a resource amount supportable by a CPU resource in the application device.
In other embodiments of the present application, the capacity management platform randomly obtains a service k from the accessed set, and obtains a peak value t of a service k request from the capacity index databasekSum and peak tkCorresponding CPU utilization c based on the peak tkAnd a CPU utilization rate c, determining a resource amount M of the application resource provided by the single application device. Specifically, the resource amount M of the application resource provided by a single application device can be obtained by the following formula,
M=tk/c
wherein M is the resource amount of the application resource provided by a single application device, tkPeak requested for service k, c is the sum peak tkCorresponding CPU utilization.
Step 603, obtaining reference resource quantity of application resources provided by all application devices in a capacity management platform governed by the application devices.
In this embodiment of the present application, a plurality of application devices n exist in a capacity management platform, and the capacity management platform determines a reference resource amount P of an application resource provided by the plurality of application devices. Where P is a reference resource amount of the application resources provided by the multiple application devices, n is the number of application devices that have been enabled in the capacity management platform, and M is a resource amount of the application resources provided by a single application device.
Step 604, calculating the resource amount difference value obtained by subtracting the reference resource amount from the resource amount of all the resources.
Step 605, if the ratio of the resource amount difference to the resource amount of the application resource satisfies the capacity expansion condition, based on the ratio, generating the capacity expansion amount of the application device.
In this embodiment of the application, the capacity expansion condition may be understood that a ratio of the resource amount difference to the resource amount of the application resource is greater than a ratio threshold, and for example, the ratio threshold may be 0.
In the embodiment of the application, the capacity management platform obtains a resource quantity difference value obtained by subtracting the reference resource quantity from the resource quantity of all the resources, and if the ratio of the resource quantity difference value to the resource quantity of the application resource meets the capacity expansion condition, the capacity expansion quantity of the application equipment is generated based on the ratio. Specifically, the expansion number N of the application device may be obtained by the following formula:
Figure BDA0003132145190000201
wherein N is the capacity expansion quantity of the application devices, T is the resource quantity of all resources required by all services, N is the number of the application devices which have been started in the capacity management platform, M is the resource quantity of the application resources provided by a single application device, P is the reference resource quantity of the application resources provided by a plurality of application devices,
Figure BDA0003132145190000202
this notation means rounding up.
Step 606, deploying the application resources of the candidate application devices with the expanded number in the capacity management platform to a resource layer of the capacity management platform.
In the embodiment of the application, if the ratio of the resource amount difference to the resource amount of the application resource satisfies the capacity expansion condition, the capacity management platform generates the capacity expansion number of the application device based on the ratio, and deploys the capacity expansion number of the application device candidate in the capacity management platform, that is, the application resource of the application device not enabled in the capacity management platform, to the resource layer of the capacity management platform.
As can be seen from the above, in the embodiment of the present application, first, the capacity management platform obtains the resource amounts of all resources required by all services and the resource amounts of application resources provided by the application devices, and determines the reference resource amounts of the application resources provided by all application devices in the capacity management platform; then, the capacity management platform determines that the resource quantity difference value of the reference resource quantity subtracted by the resource quantity of all the resources meets the capacity expansion condition with the ratio of the resource quantity of the application resources, and generates the capacity expansion quantity of the application equipment based on the ratio; therefore, the capacity management platform automatically judges whether the system needs capacity expansion or not and the quantity of the application equipment needing capacity expansion according to the actual running condition of the system.
In an implementation application scenario, referring to fig. 3 and fig. 10, fig. 10 is an alternative architecture diagram of a capacity management controller provided in an embodiment of the present application, where the capacity management controller 36 includes a general control module 71, a configuration management module 72, a resource management module 73, and a monitoring alarm module 74.
Wherein, the total control module 71 receives the analysis result of the capacity analyzer 35. If the capacity management parameters need to be changed, the configuration management module 72 is notified to change the parameters, and the alarm module 74 is notified to send a corresponding alarm notification message to the operation and maintenance personnel. If the capacity needs to be expanded, the resource management module 73 is notified to expand the capacity.
In one implementation, the configuration management module 72 logs in the current capacity parameters of all services, and after receiving the change of the capacity parameters, broadcasts a change notification of the capacity parameters to each capacity broker, and the capacity broker actively pulls the latest capacity parameters to the configuration management module 72. The configuration management module 72 receives the current parameters reported by the capacity agent, and if it is found that the parameters of the capacity agent are not updated, the configuration management module sends a notification to start updating.
In one implementation, the resource management module manages 73 the backup available resource and the application version package, needs the corresponding available resource after receiving the capacity expansion command, and pulls the application system after issuing the version package, where the backup available resource may be understood as an application resource of a candidate application device, that is, an application resource of an application device that is not enabled.
Continuing with the exemplary structure of the capacity management device 154 implemented as a software module provided in the embodiments of the present application, in some embodiments, as shown in fig. 1, the software module stored in the capacity management device 154 of the memory 150 may be a capacity management device in a capacity management platform, including:
an obtaining module 1541, configured to obtain a service request number of a service i initiating a service access request for an application device; wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application equipment;
the obtaining module 1541 is further configured to obtain times of using the shared resource of the shared resource in the application resources of the application device, and an amount of the shared resource and a maximum available resource amount of the shared resource allocated to the service i; the application resources comprise independent resources distributed for the service i and shared resources distributed for all the services;
the obtaining module 1541 is further configured to obtain a rejection count of the application device rejecting the service request when the exclusive resource is occupied and the shared resource is occupied;
the processing module 1542 is configured to adjust the sharable resource allocated to the service i based on the amount of the sharable resource, the maximum available resource amount, the number of service requests, the number of times of using the shared resource, and the number of times of rejecting the shared resource.
In other embodiments of the present application, the processing module 1542 is further configured to remove the times of using the shared resource and the times of rejecting the shared resource in the service request number, so as to obtain the times of using the exclusive resource of the service i; the obtaining module 1541 is further configured to obtain an exclusive resource utilization rate of the amount of the exclusive resource when the exclusive resource of the service i is used for a number of times; the processing module 1542 is further configured to remove the rejection times in the service request quantity, so as to obtain the use times of the application resource of the service i; calculating the proportion of the times of using the shared resources to the times of using the application resources to obtain the utilization rate of the shared resources of the service i; calculating the ratio of the rejection times to the service request quantity to obtain the rejection rate of the service i; the exclusive resource allocated to the service i is adjusted based on the amount of exclusive resource, the maximum available resource amount, the utilization rate of exclusive resource, the utilization rate of shared resource, and the rejection rate.
In other embodiments of the present application, the processing module 1542 is further configured to calculate a reference exclusive resource utilization rate corresponding to the shared resource utilization rate; calculating the ratio of the utilization rate of the shared resources to the utilization rate of the reference unshared resources to obtain the actual utilization ratio of the shared resources to the unshared resources; calculating the difference value of subtracting the exclusive resource amount from the maximum available resource amount to obtain the shared resource usage amount; calculating the ratio of the usage amount of the shared resources to the amount of the exclusive resources to obtain the estimated utilization ratio of the shared resources to the exclusive resources; calculating the difference value of subtracting the estimated utilization ratio from the actual utilization ratio to obtain the share ratio increment; the exclusive resource allocated to service i is adjusted based on the share ratio increment.
In other embodiments of the present application, the processing module 1542 is further configured to reduce the share ratio increment to increase the exclusive resource allocated to the service i if the share ratio increment meets the share ratio increment reduction condition.
In other embodiments of the present application, the obtaining module 1541 is further configured to obtain change information of the utilization rate of the exclusive resource in an adjustment time period; acquiring a standard deviation corresponding to the utilization rate of the exclusive resource; the processing module 1542 is further configured to input the change information, the standard deviation, the reference exclusive resource utilization rate, and the rejection rate into the reward and punishment adjustment model to obtain a reward and punishment factor output by the reward and punishment adjustment model; inputting the utilization rate of the exclusive resource, the increment of the sharing ratio, the reward and punishment factors and the amount of the exclusive resource into a target adjustment model to obtain a target adjustment amount output by the target adjustment model; if the target adjustment quantity is smaller than the target adjustment threshold, determining that the target adjustment quantity meets a target adjustment quantity reduction condition, and reducing the amount of the exclusive resource allocated to the service i; wherein the reduced amount of the exclusive resource is equal to the amount of the resource corresponding to the absolute value of the target adjustment amount; if the target adjustment quantity is larger than the target adjustment threshold value, determining that the target adjustment quantity meets a target adjustment quantity increasing condition, and increasing the amount of the exclusive resource distributed to the exclusive resource of the service i; and the increased amount of the exclusive resources is equal to the amount of the resources corresponding to the target adjustment amount.
In other embodiments of the present application, the reward and punishment adjustment model is:
Figure BDA0003132145190000231
wherein the content of the first and second substances,
Figure BDA0003132145190000232
reward and penalty factor, Δ γ, for service iiInformation on the change in the utilization of the exclusive resource for service i, S (Δ γ)i) Adjustment direction corresponding to the change information of the utilization rate of the exclusive resource for the service i, (1-lambda)i) Reference exclusive resource utilization, ξ, for service iiA denial rate for service i;
wherein the content of the first and second substances,
Figure BDA0003132145190000233
wherein, Delta gammaiThe expression that the utilization rate of the exclusive resource is in an increasing change trend within the adjusting period is more than 0, and the expression is delta gammaiLess than 0 represents that the utilization rate of the exclusive resource is in a decreasing change trend within the adjustment period, namely delta gammaiThe utilization rate of the shared resource is characterized to be constant in the adjusting time interval as 0;
the target adjustment model is:
Figure BDA0003132145190000234
wherein the content of the first and second substances,
Figure BDA0003132145190000235
target adjustment amount, gamma, for exclusive resource of service iiIn order to serve the exclusive resource utilization of i,
Figure BDA0003132145190000236
the amount of exclusive resources for service i, delta theta is the share ratio increment for service i,
Figure BDA0003132145190000237
is the reward and penalty factor for service i.
In other embodiments of the present application, the obtaining module 1541 is further configured to obtain resource amounts of all resources required by all services; acquiring the resource amount of application resources provided by application equipment; acquiring reference resource quantity of application resources provided by all application equipment in a capacity management platform governed by the application equipment; the processing module 1542 is further configured to calculate a resource amount difference obtained by subtracting the reference resource amount from the resource amount of all the resources; if the ratio of the resource quantity difference value to the resource quantity of the application resource meets the capacity expansion condition, generating the capacity expansion quantity of the application equipment based on the ratio; and deploying the application resources of the candidate application devices with the capacity expansion quantity in the capacity management platform to a resource layer of the capacity management platform.
In other embodiments of the present application, the obtaining module 1541 is further configured to obtain a first number of services in an access state from among all the services; acquiring a second number of associated services with a preset associated coefficient with the service in the access state from all the services; the processing module 1542 is further configured to calculate a sum of the first number and the second number, so as to obtain a concurrent access amount; calculating the product of the concurrent access quantity and the maximum available resource quantity to obtain a first resource quantity; calculating the difference value of the total number of all services minus the concurrent access amount to obtain the residual access amount; calculating the product of the residual access amount and the amount of the exclusive resource to obtain a second resource amount; and calculating the sum of the first resource amount and the second resource amount to obtain the resource amount of all the resources.
Embodiments of the present application provide a storage medium having stored therein executable instructions, which when executed by a processor, will cause the processor to perform the methods provided by embodiments of the present application, for example, the methods as illustrated in fig. 2, 5, 7-9.
The storage medium provided by the application initiates the service access request number aiming at the service i of the application equipment by acquiring the service request number; wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application equipment; acquiring the times of using shared resources of shared resources in application resources of application equipment, and the amount of the shared resources and the maximum available resource amount of the shared resources distributed for a service i; the application resources comprise independent resources distributed for the service i and shared resources distributed for all the services; acquiring the rejection times of the application equipment rejecting the service request when the exclusive resource is occupied and the shared resource is occupied; adjusting the exclusive resource allocated to the service i based on the amount of the exclusive resource, the maximum available resource amount, the number of service requests, the number of times of using the shared resource and the number of times of refusing; that is to say, the capacity index of the service i for the application device is counted in real time, and the capacity index comprises the amount of the exclusive resource, the maximum available amount of the resource, the number of service requests, the number of times of using the shared resource and the number of times of refusing, so that the exclusive resource of the service i is automatically adjusted based on the capacity index, flexible management of the exclusive resource of the service i is at least achieved, and the utilization rate of the application resource of the service i is further improved.
In some embodiments, the storage medium may be a computer-readable storage medium, such as a Ferroelectric Random Access Memory (FRAM), a Read Only Memory (ROM), a Programmable Read Only Memory (PROM), an Erasable Programmable Read Only Memory (EPROM), a charged Erasable Programmable Read Only Memory (EEPROM), a flash Memory, a magnetic surface Memory, an optical disc, or a Compact disc Read Only Memory (CD-ROM), among other memories; or may be various devices including one or any combination of the above memories.
In some embodiments, executable instructions may be written in any form of programming language (including compiled or interpreted languages), in the form of programs, software modules, scripts or code, and may be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment.
By way of example, executable instructions may correspond, but do not necessarily have to correspond, to files in a file system, and may be stored in a portion of a file that holds other programs or data, such as in one or more scripts in a hypertext Markup Language (hypertext Markup Language) document, in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub-programs, or portions of code). By way of example, executable instructions may be deployed to be executed on one computing device or on multiple computing devices at one site or distributed across multiple sites and interconnected by a communication network.
The above description is only an example of the present application, and is not intended to limit the scope of the present application. Any modification, equivalent replacement, and improvement made within the spirit and scope of the present application are included in the protection scope of the present application.

Claims (11)

1. A capacity management method, comprising:
acquiring the service request quantity of service access requests initiated by a service i of application equipment; wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application device;
acquiring the times of using shared resources of shared resources in application resources of the application equipment, and the amount of the exclusive resources and the maximum available resource amount of the exclusive resources distributed for the service i; the application resources comprise exclusive resources distributed for a service i and shared resources distributed for all the services;
obtaining the rejection times of the application equipment rejecting the service request when the exclusive resource is occupied and the shared resource is occupied;
and adjusting the exclusive resource allocated to the service i based on the exclusive resource amount, the maximum available resource amount, the service request amount, the shared resource using times and the rejecting times.
2. The method of claim 1, wherein the adjusting the exclusive resource allocated to the service i based on the amount of exclusive resource, the maximum available resource, the number of service requests, the number of times of using the shared resource, and the number of times of rejecting comprises:
removing the times of using the shared resources and the times of refusing in the service request quantity to obtain the times of using the exclusive resources of the service i;
acquiring the utilization rate of the exclusive resource amount under the use times of the exclusive resource of the service i;
removing the rejection times in the service request quantity to obtain the use times of the application resources of the service i;
calculating the proportion of the times of using the shared resources to the times of using the application resources to obtain the utilization rate of the shared resources of the service i;
calculating the ratio of the rejection times to the service request quantity to obtain the rejection rate of the service i;
adjusting the exclusive resource allocated to the service i based on the amount of exclusive resource, the maximum available resource amount, the utilization of exclusive resource, the utilization of shared resource, and the rejection rate.
3. The method of claim 2, wherein the adjusting the exclusive resource allocated to the service i based on the amount of exclusive resource, the maximum available resource, the exclusive resource utilization, the shared resource utilization, and the rejection rate comprises:
calculating the difference value obtained by subtracting the shared resource utilization rate from 1 to obtain the reference exclusive resource utilization rate;
calculating the ratio of the utilization rate of the shared resource to the utilization rate of the reference exclusive resource to obtain the actual utilization ratio of the shared resource to the exclusive resource;
calculating the difference value of subtracting the exclusive resource amount from the maximum available resource amount to obtain the shared resource usage amount;
calculating the ratio of the usage amount of the shared resources to the amount of the exclusive resources to obtain the estimated utilization ratio of the shared resources to the exclusive resources;
calculating the difference value of subtracting the estimated utilization ratio from the actual utilization ratio to obtain the share ratio increment;
and adjusting the exclusive resource allocated to the service i based on the share ratio increment.
4. The method of claim 3, wherein the adjusting the exclusive resource allocated to the service i based on the share ratio increment comprises:
and if the sharing ratio increment meets the sharing ratio increment reduction condition, reducing the sharing ratio increment to increase the exclusive resource allocated to the service i.
5. The method of claim 3, wherein the adjusting the exclusive resource allocated to the service i based on the share ratio increment comprises:
acquiring the change information of the utilization rate of the exclusive resource in an adjusting time period;
acquiring a standard deviation corresponding to the utilization rate of the exclusive resource;
inputting the change information, the standard deviation, the reference exclusive resource utilization rate and the rejection rate into a reward punishment adjustment model to obtain a reward punishment factor output by the reward punishment adjustment model;
inputting the utilization rate of the exclusive resource, the sharing ratio increment, the reward and punishment factor and the amount of the exclusive resource into a target adjustment model to obtain a target adjustment amount output by the target adjustment model;
if the target adjustment quantity is smaller than a target adjustment threshold value, determining that the target adjustment quantity meets a target adjustment quantity reduction condition, and reducing the amount of the exclusive resource allocated to the service i; wherein the reduced amount of the exclusive resource is equal to the amount of the resource corresponding to the absolute value of the target adjustment amount;
if the target adjustment quantity is larger than the target adjustment threshold, determining that the target adjustment quantity meets a target adjustment quantity increasing condition, and increasing the amount of the exclusive resource allocated to the service i; and the increased amount of the exclusive resources is equal to the amount of the resources corresponding to the target adjustment amount.
6. The method of claim 5, wherein the reward punishment adjustment model is:
Figure FDA0003132145180000031
wherein the content of the first and second substances,
Figure FDA0003132145180000032
a reward penalty factor, Δ γ, for said service iiInformation on the change of the utilization of the exclusive resource for said service i, S (Delta Gamma)i) Adjusting direction (1-lambda) corresponding to the change information of the utilization rate of the exclusive resource of the service ii) Reference exclusive resource utilization, ξ, for said service iiA rejection rate for the service i;
wherein the content of the first and second substances,
Figure FDA0003132145180000033
wherein, Delta gammaiThe characteristic that the exclusive resource utilization rate is in an increasing change trend within the adjusting period is more than 0, and the value is delta gammai< 0 indicates that the utilization rate of the exclusive resource is in a decreasing trend of change, delta gamma, in the adjustment periodiCharacterizing that the exclusive resource utilization is constant for the adjustment period as 0;
the target adjustment model is as follows:
Figure FDA0003132145180000034
wherein the content of the first and second substances,
Figure FDA0003132145180000035
target adjustment amount, gamma, for the exclusive resource of the service iiFor the exclusive resource utilization of the service i,
Figure FDA0003132145180000036
for exclusive sharing of the service iThe resource amount, Delta theta is the share ratio increment of the service i,
Figure FDA0003132145180000037
is the reward penalty factor for service i.
7. The method according to any of claims 1 to 6, wherein after the adjusting the exclusive resource allocated to the service i based on the amount of exclusive resource, the maximum available resource amount, the number of service requests, the number of times shared resource is used, and the number of times of refusal, the method further comprises:
acquiring the resource quantity of all resources required by all services;
acquiring the resource amount of the application resource provided by the application equipment;
acquiring reference resource quantity of application resources provided by all application equipment in a capacity management platform governed by the application equipment;
calculating the resource quantity difference value of subtracting the reference resource quantity from the resource quantity of all the resources;
if the ratio of the resource quantity difference value to the resource quantity of the application resource meets the capacity expansion condition, generating the capacity expansion quantity of the application equipment based on the ratio;
and deploying the application resources of the candidate application devices with the capacity expansion quantity in the capacity management platform to a resource layer of the capacity management platform.
8. The method of claim 7, wherein the obtaining the resource amount of all resources required for all services comprises:
acquiring a first number of services in an access state in all the services;
acquiring a second number of associated services which have a preset associated coefficient with the service in the access state from all the services;
calculating the sum of the first quantity and the second quantity to obtain a concurrent access quantity;
calculating the product of the concurrent access quantity and the maximum available resource quantity to obtain a first resource quantity;
calculating the difference value of the total number of all services minus the concurrent access amount to obtain the residual access amount;
calculating the product of the residual access amount and the exclusive resource amount to obtain a second resource amount;
and calculating the sum of the first resource amount and the second resource amount to obtain the resource amounts of all the resources.
9. A capacity management apparatus, characterized in that the apparatus comprises:
the acquisition module is used for acquiring the service request quantity of service access requests initiated by a service i of the application equipment; wherein I is a positive integer greater than or equal to 1 and less than or equal to I, and I is the total number of all services of the application device;
the acquiring module is used for acquiring the times of using shared resources in the application resources of the application equipment; the application resources comprise exclusive resources distributed for a service i and shared resources distributed for all the services;
the obtaining module is configured to obtain the rejection times of the application device rejecting the service request when the exclusive resource is occupied and the shared resource is occupied;
and the processing module is used for adjusting the exclusive resource allocated to the service i based on the service request number, the shared resource using times and the rejection times.
10. A capacity management platform, comprising:
a memory for storing executable instructions; a processor for implementing the method of any one of claims 1 to 8 when executing executable instructions stored in the memory.
11. A storage medium having stored thereon executable instructions for causing a processor to perform the method of any one of claims 1 to 8 when executed.
CN202110706473.3A 2021-06-24 2021-06-24 Capacity management method, device, platform and storage medium Pending CN113467939A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110706473.3A CN113467939A (en) 2021-06-24 2021-06-24 Capacity management method, device, platform and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110706473.3A CN113467939A (en) 2021-06-24 2021-06-24 Capacity management method, device, platform and storage medium

Publications (1)

Publication Number Publication Date
CN113467939A true CN113467939A (en) 2021-10-01

Family

ID=77872787

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110706473.3A Pending CN113467939A (en) 2021-06-24 2021-06-24 Capacity management method, device, platform and storage medium

Country Status (1)

Country Link
CN (1) CN113467939A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114219377A (en) * 2022-02-22 2022-03-22 云智慧(北京)科技有限公司 Service resource allocation method, device and equipment
CN114968744A (en) * 2022-06-07 2022-08-30 北京同创永益科技发展有限公司 Implementation method and system for capacity management prediction analysis AI algorithm based on financial industry
CN115934364A (en) * 2023-03-09 2023-04-07 摩尔线程智能科技(北京)有限责任公司 Memory management method and device and electronic equipment
WO2024113847A1 (en) * 2022-11-29 2024-06-06 华为云计算技术有限公司 Shared resource allocation method and apparatus, and computing device cluster

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114219377A (en) * 2022-02-22 2022-03-22 云智慧(北京)科技有限公司 Service resource allocation method, device and equipment
CN114219377B (en) * 2022-02-22 2022-06-07 云智慧(北京)科技有限公司 Service resource allocation method, device and equipment
CN114968744A (en) * 2022-06-07 2022-08-30 北京同创永益科技发展有限公司 Implementation method and system for capacity management prediction analysis AI algorithm based on financial industry
WO2024113847A1 (en) * 2022-11-29 2024-06-06 华为云计算技术有限公司 Shared resource allocation method and apparatus, and computing device cluster
CN115934364A (en) * 2023-03-09 2023-04-07 摩尔线程智能科技(北京)有限责任公司 Memory management method and device and electronic equipment
CN115934364B (en) * 2023-03-09 2023-10-31 摩尔线程智能科技(北京)有限责任公司 Memory management method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN113467939A (en) Capacity management method, device, platform and storage medium
EP2646890B1 (en) Dynamic power balancing among blade servers in a chassis
US8745249B2 (en) Intelligence virtualization system and method to support social media cloud service
KR101564481B1 (en) System and method for integrating capacity planning and workload management
CN106959894B (en) Resource allocation method and device
US20120166649A1 (en) Management computer, resource management method, resource management computer program, recording medium, and information processing system
JP2005196601A (en) Policy simulator for autonomous management system
US11496413B2 (en) Allocating cloud computing resources in a cloud computing environment based on user predictability
CN110597639B (en) CPU distribution control method, device, server and storage medium
CN111240838B (en) Pressure testing method and device
CN113010260A (en) Elastic expansion method and system for container quantity
US20200034057A1 (en) Elastic storage volume type selection and optimization engine for public cloud environments
CN113760180A (en) Storage resource management method, device, equipment and computer readable storage medium
JP2003323224A (en) System and method for license management, apparatus and method for information processing, and program
CN112162891A (en) Performance test method in server cluster and related equipment
KR101630125B1 (en) Method for resource provisioning in cloud computing resource management system
JP2018181062A (en) Management apparatus, control method for information processing system, and management program for management apparatus
CN116467082A (en) Big data-based resource allocation method and system
CN107203256B (en) Energy-saving distribution method and device under network function virtualization scene
CN110347546B (en) Dynamic adjustment method, device, medium and electronic equipment for monitoring task
US9367439B2 (en) Physical memory usage prediction
US11429420B2 (en) Method for controlling performance in virtualized environment and information processing device for the same
CN115658287A (en) Method, apparatus, medium, and program product for scheduling execution units
TWI739235B (en) A gpu computing resource assignment system and method thereof
EP3599547B1 (en) Elastic storage volume type selection and optimization engine for public cloud environments

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination