CN118034912A - Method, device, equipment and storage medium for server capacity management - Google Patents
Method, device, equipment and storage medium for server capacity management Download PDFInfo
- Publication number
- CN118034912A CN118034912A CN202410041161.9A CN202410041161A CN118034912A CN 118034912 A CN118034912 A CN 118034912A CN 202410041161 A CN202410041161 A CN 202410041161A CN 118034912 A CN118034912 A CN 118034912A
- Authority
- CN
- China
- Prior art keywords
- capacity
- service
- platform interface
- service platform
- server
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 38
- 238000007726 management method Methods 0.000 claims abstract description 51
- 238000009530 blood pressure measurement Methods 0.000 claims abstract description 39
- 238000012360 testing method Methods 0.000 claims abstract description 20
- 230000008569 process Effects 0.000 abstract description 19
- 239000002609 medium Substances 0.000 description 14
- 238000004891 communication Methods 0.000 description 7
- 230000006870 function Effects 0.000 description 5
- 230000005856 abnormality Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 238000013500 data storage Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 238000012544 monitoring process Methods 0.000 description 3
- 230000006399 behavior Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000000737 periodic effect Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000010998 test method Methods 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000010485 coping Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000012120 mounting media Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/505—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/36—Preventing errors by testing or debugging software
- G06F11/3668—Software testing
- G06F11/3672—Test management
- G06F11/3684—Test management for test design, e.g. generating new test cases
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computer Hardware Design (AREA)
- Quality & Reliability (AREA)
- Debugging And Monitoring (AREA)
Abstract
The embodiment of the application provides a server capacity management method, a device, equipment and a storage medium, wherein the method comprises the following steps: acquiring interface pressure measurement data, and performing offline pressure application test on the service platform interface based on the interface pressure measurement data to obtain the service limit capacity of the service platform interface; determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface; and adjusting the load capacity of the server corresponding to the service platform interface based on the numerical comparison result of the service limit capacity and the service daily capacity. In the scheme, the limit load condition of the service platform interface is simulated, the service limit capacity of the service platform interface can be accurately obtained, the service daily capacity of the service platform interface can be accurately obtained by referring to the real data of the service platform interface in the actual application process, the load capacity of the server is dynamically adjusted, and good user experience of a user is ensured.
Description
Technical Field
The embodiment of the application relates to the technical field of computers, in particular to a server capacity management method, device, equipment and storage medium.
Background
With the rapid development of network technology, various service platforms in the mobile internet are rapidly increased, so that many convenience is brought to life of people, and meanwhile, the service response speed of users to the service platforms and the requirements on service stability are also higher and higher. When dealing with a large number of user groups, in order to ensure the normal operation of the service platform, a service provider can provide normal access service for users, and must ensure that the capacity of a server corresponding to a relevant interface of the service platform is sufficient, if the memory capacity or the disk capacity of the server is insufficient, abnormal service of an application system can be caused, and normal access service cannot be provided for the users.
However, in the related art, the service provider discovers the abnormality of the insufficient capacity of the server corresponding to the service platform through the use problem of the service platform fed back by the received user or the alarm information sent by a certain server due to the insufficient capacity of the service platform, and notifies the staff to verify and maintain the abnormality of the server, however, in both cases, the abnormality of the insufficient capacity of the server needs to be handled by spending human resources, and the service state of the service platform is affected because the server corresponding to the service platform is already in the state of insufficient capacity, so that the service platform cannot provide normal access service, and user experience is affected.
Disclosure of Invention
The embodiment of the application provides a server capacity management method, device, equipment and storage medium, which solve the problem that a service platform cannot provide normal azimuth service and influence user experience because abnormality of insufficient server capacity cannot be found in time. In the scheme, the service limiting capacity of the service platform interface can be accurately obtained by performing offline pressure application test on the service platform interface and simulating the limiting load condition of the service platform interface, the service daily capacity of the service platform interface can be accurately obtained by combining historical access data and referring to the real data of the service platform interface in the actual application process, the load capacity of the server can be adjusted by comparing the service limiting capacity and the service daily capacity, capacity management can be performed in time according to the access condition of the service platform interface, the load capacity of the server can be dynamically adjusted, and good user experience of a user can be ensured.
In a first aspect, an embodiment of the present application provides a server capacity management method, where the method includes:
Acquiring interface pressure measurement data, and performing offline pressure measurement on a service platform interface based on the interface pressure measurement data to obtain the service limit capacity of the service platform interface;
Determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface;
and adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
In a second aspect, an embodiment of the present application further provides a server capacity management apparatus, including:
the limit capacity determining module is configured to acquire interface pressure measurement data, and perform offline pressure application test on a service platform interface based on the interface pressure measurement data to obtain service limit capacity of the service platform interface;
The daily capacity determining module is configured to determine the daily capacity of the business platform interface according to the recorded historical access data of the business platform interface;
And the server capacity adjustment module is configured to adjust the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
In a third aspect, an embodiment of the present application further provides a server capacity management device, including:
One or more processors;
a storage device configured to store one or more programs,
When the one or more programs are executed by the one or more processors, the one or more processors implement the server capacity management method according to the embodiment of the present application.
In a fourth aspect, embodiments of the present application also provide a non-volatile storage medium storing computer-executable instructions that, when executed by a computer processor, are configured to perform the server capacity management method of embodiments of the present application.
In the embodiment of the application, the interface pressure measurement data is acquired, and the offline pressure measurement is carried out on the service platform interface based on the interface pressure measurement data, so as to obtain the service limit capacity of the service platform interface; determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface; and adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity. In the scheme, the service limiting capacity of the service platform interface can be accurately obtained by performing offline pressure application test on the service platform interface and simulating the limiting load condition of the service platform interface, the service daily capacity of the service platform interface can be accurately obtained by combining historical access data and referring to the real data of the service platform interface in the actual application process, the load capacity of the server can be adjusted by comparing the service limiting capacity and the service daily capacity, capacity management can be performed in time according to the access condition of the service platform interface, the load capacity of the server can be dynamically adjusted, and good user experience of a user can be ensured.
Drawings
Fig. 1 is a flowchart of a server capacity management method according to an embodiment of the present application;
FIG. 2 is a flowchart of a server capacity management method including a stress test procedure according to an embodiment of the present application;
FIG. 3 is a flowchart of a server capacity management method including a process of determining daily traffic capacity according to an embodiment of the present application;
FIG. 4 is a flow chart of a server capacity management method including a process of determining historical access data according to an embodiment of the present application;
FIG. 5 is a flowchart of a server capacity management method including a process of determining server capacity according to an embodiment of the present application;
FIG. 6 is a flowchart of a method for managing server capacity in a capacity early warning process according to an embodiment of the present application;
Fig. 7 is a block diagram of a server capacity management device according to an embodiment of the present application;
fig. 8 is a schematic structural diagram of a server capacity management device according to an embodiment of the present application.
Detailed Description
Embodiments of the present application will be described in further detail below with reference to the drawings and examples. It should be understood that the particular embodiments described herein are illustrative only and are not limiting of embodiments of the application. It should be further noted that, for convenience of description, only some, but not all of the structures related to the embodiments of the present application are shown in the drawings.
The terms first, second and the like in the description and in the claims, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged, as appropriate, such that embodiments of the present application may be implemented in sequences other than those illustrated or described herein, and that the objects identified by "first," "second," etc. are generally of a type, and are not limited to the number of objects, such as the first object may be one or more. Furthermore, in the description and claims, "and/or" means at least one of the connected objects, and the character "/", generally means that the associated object is an "or" relationship.
The server capacity management method provided by the embodiment of the application can be used for managing the load capacity of the server corresponding to the service platform interface, wherein the service platform can be an online game platform, an online live broadcast platform, an online mall platform and the like. Taking an online game platform as an example, the online game platform can comprise different interfaces, such as a user authentication interface, a game hall interface, a game matching interface, a real-time communication interface, a payment interface, a ranking list interface, a data storage interface, an update service interface, a social media interface and the like, and the system can still provide stable and efficient service under high load by effectively managing the load capacity of servers corresponding to the interfaces. The foregoing application scenarios are merely exemplary and explanatory, and in practical applications, the server capacity management method may also be used in a service platform in other scenarios, which is not limited in the embodiments of the present application.
In the server capacity management method provided by the embodiment of the present application, the execution main body of each step may be a computer device, where the computer device refers to any electronic device having data computing, processing and storage capabilities, such as a mobile phone, a PC (Personal Computer ), a tablet computer, or other terminal devices, or may be a server, where the embodiment of the present application is not limited to this.
Fig. 1 is a flowchart of a server capacity management method according to an embodiment of the present application, where the server capacity management method may be implemented by using a server capacity management device as an execution body. As shown in fig. 1, the server capacity management method specifically includes the following steps:
And step S101, acquiring interface pressure measurement data, and performing offline pressure application test on the service platform interface based on the interface pressure measurement data to obtain the service limit capacity of the service platform interface.
The service platform interface can be an interface for communication and data exchange among different components, modules or services in the online service platform, and is used for realizing various functions, services, interactions and the like of online service. For example, taking online gaming as an example, the business platform interfaces may include a user authentication interface, a game lobby interface, a game matching interface, a real-time communication interface, a payment interface, a leaderboard interface, a data storage interface, an update service interface, and a social media interface, among others. The interface pressure measurement data may be reference data for simulating access of concurrent users to the service platform interface, and the interface pressure measurement data may include batch of simulated user data, taking an online game platform as an example, where the simulated user data may include a plurality of user accounts, game roles, friend lists, and the like, and each simulated user may further set user behaviors, such as operations of login, game matching, payment, real-time communication, and the like. The offline pressure test can simulate concurrent users to access the service platform interface in a large amount when the service platform interface is in an offline state, wherein the number of concurrent users, the request type, the duration and the like can be set. In addition, the offline stressing test can gradually increase the number of concurrent users according to the set time interval and the set number interval, simulate the platform interface request under the condition of high load, and test various operations such as login, game matching, payment, real-time communication and the like. Of course, the duration intervals and the number intervals are adapted by the developer, and the application is not limited herein. The service limit capacity may be the maximum number of requests or the number of transactions that the service platform interface can process, and may be correspondingly set with indexes such as throughput, concurrent connection number, response time, and the like.
Step S102, determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface.
The historical access data can be user access data obtained through statistics when the service platform interface is in a normal on-line state, reflects the real load condition of the service platform interface, and can be obtained through statistics and arrangement of real-time access data of the service platform interface by setting a periodic time interval or a fixed time node for recording. The service daily capacity can be a real load condition when the service platform interface is in a normal online state, wherein the service daily capacity can be a peak access amount when the service platform interface is in the normal online state, or can be an average access amount in a peak time period when the service platform interface is in the normal online state, and the like, and the determination of the service daily capacity can be adaptively selected according to the access requirements of different application scenes on the service platform interface.
And step S103, adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
The numerical comparison result may be obtained by comparing the service limit capacity with the service daily capacity through numerical operation, and is used for reflecting the relation between the limit capacity and the daily capacity of the service platform interface, for example, the multiple relation between the service limit capacity and the service daily capacity may be determined, for example, the difference result between the service limit capacity and the service daily capacity may be determined, and of course, the adopted numerical comparison mode may be formulated by a developer according to different adjustment indexes of the load capacity, which is not limited herein. Taking an online game platform as an example, the service platform interfaces may correspond to different servers according to different types, for example, the user authentication interface may correspond to an authentication server, the game hall interface may correspond to a game server, the game matching interface may correspond to a matching server, the real-time communication interface may correspond to a real-time communication server, the data storage interface may correspond to a data server, and the like. Corresponding to the value comparison result, the load capacity of the corresponding server can be adjusted according to the difference between the daily capacity of the service and the limit capacity of the service, for example, for the comparison mode adopting the multiple relation, a preset multiple value can be set, and for the comparison mode adopting the difference, for example, a preset value range can be set. Further, in the case where the service limit capacity is far higher than the service daily capacity, the load capacity of the corresponding server may be reduced, and in the case where the service limit capacity does not reach the preset gap from the service daily capacity, the number of servers corresponding to the service platform interface may be increased, or the servers with higher load capacity may be switched to, or the like, and of course, the server capacity may be upgraded.
According to the method, the interface pressure measurement data are obtained, and the offline pressure measurement is carried out on the service platform interface based on the interface pressure measurement data, so that the service limit capacity of the service platform interface is obtained; determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface; and adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity. In the scheme, the service limiting capacity of the service platform interface can be accurately obtained by performing offline pressure application test on the service platform interface and simulating the limiting load condition of the service platform interface, the service daily capacity of the service platform interface can be accurately obtained by combining historical access data and referring to the real data of the service platform interface in the actual application process, the load capacity of the server can be adjusted by comparing the service limiting capacity and the service daily capacity, capacity management can be performed in time according to the access condition of the service platform interface, the load capacity of the server can be dynamically adjusted, and good user experience of a user can be ensured.
Fig. 2 is a flowchart of a server capacity management method including a stress test procedure according to an embodiment of the present application, where interface stress test data includes analog user data, as shown in fig. 2, including the following steps:
Step S201, concurrent users are gradually increased according to the simulated user data, and concurrent interface requests corresponding to the concurrent users are sent to the service platform interface.
The duration interval and the number interval of the concurrent users may be set at fixed intervals or may be set at increasing intervals, and may be adaptively adjusted according to the network condition and the user activity level of the actual application scenario, which is not limited in this application. The concurrent interface request may be set corresponding to a user behavior of an actual application scenario, taking an online game platform as an example, and the concurrent interface request may be a login request, a matching request, a payment request, a connection request, and the like. And gradually adjusting the concurrent interface request to the limit load capacity of the service platform interface to determine the maximum bearing capacity of the service platform interface.
And step S202, stopping adding concurrent users under the condition that the load limit of the service platform interface is reached.
Judging whether the load limit of the service platform interface is reached or not, wherein the throughput of the service platform interface, namely the number of requests processed per second, can be monitored, and when the throughput is stable at a certain value and fluctuates up and down, the interface can possibly be indicated to face performance bottlenecks; or tracking the time required by the service platform interface to respond to the request, wherein the response time gradually increases along with the increase of the load, and determining whether the response time is at an acceptable level or not so as to judge whether the performance limit is reached or not; or monitoring the error rate generated by the service platform interface under high load, if the error rate is obviously increased, the service platform interface may be indicated to have a problem when bearing pressure, or the utilization condition of system resources such as CPU utilization rate, memory utilization rate, network bandwidth and the like of the server may also be tracked, and when the resource utilization rate approaches or reaches a limit, the system performance may be indicated to reach the limit. Of course, the determination of the load limit of the service platform interface may also be performed in other manners, and the present application is not limited herein.
And step S203, recording the service limit capacity corresponding to the load limit of the service platform interface.
When the load limit of the service platform interface is reached, the maximum value of the service limit capacity of the multiple tests may be recorded, or the average value of the service limit capacity of the multiple tests may be recorded.
Step S204, according to the recorded historical access data of the service platform interface, determining the daily service capacity of the service platform interface.
Step S205, based on the value comparison result of the service limit capacity and the service daily capacity, the load capacity of the server corresponding to the service platform interface is adjusted.
By simulating a large number of concurrent user requests, the limit performance of the service platform interface under high load is tested, the performance bottleneck and the potential problem of the service platform interface are identified, the load limit which can be processed by the service platform interface under high load is effectively determined, accurate reference data is provided for subsequent server capacity adjustment, and reasonable adjustment of the load capacity of the server is facilitated.
Fig. 3 is a flowchart of a server capacity management method including a process of determining daily service capacity according to an embodiment of the present application, as shown in fig. 3, including the following steps:
And step S301, acquiring interface pressure measurement data, and performing offline pressure measurement on the service platform interface based on the interface pressure measurement data to obtain the service limit capacity of the service platform interface.
Step S302, extracting access data in a preset time period from the recorded historical access data of the service platform interface.
The historical access data may include access data of different time periods, for example, low peak time period data and high peak time period data, where the preset time period may be divided based on durations corresponding to different orders of magnitude of the access amount, for example, for a service platform interface with a rapid increase in the short-time access amount, the preset time period may be set to a peak time period with a high magnitude of the access amount, and for a service platform interface with a relatively long-time access amount, the preset time period may be set to a peak time period with a middle-high magnitude of the access amount, which is not limited herein.
Step S303, the peak access amount of the access data is determined as the daily business capacity of the business platform interface.
In order to avoid overload breakdown of the service platform interface, the peak access amount of the access data can be determined as the daily service capacity of the service platform interface, and the peak access amount is used for indicating the real maximum access amount of the service platform interface.
And step S304, adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
By setting the peak access amount of the access data as the daily service capacity of the service platform interface, the load overload condition of the service platform interface can be effectively referred, the breakdown risk of the service platform interface is reduced, the normal operation of the platform service is ensured, accurate reference data is provided for the subsequent capacity adjustment of the server, and the reasonable adjustment of the load capacity of the server is facilitated.
Fig. 4 is a flowchart of a server capacity management method including a process of determining historical access data according to an embodiment of the present application, as shown in fig. 4, including the following steps:
Step S401, statistics is carried out on real-time access quantity of the service platform interface in an on-line state based on a preset time granularity.
The preset time granularity may be a time interval of real-time access amount of the on-line state of the periodic statistics service platform interface, for example, in units of seconds, in units of minutes, and the like, and may be adaptively adjusted according to accuracy requirements of developers corresponding to different application scenarios, which is not limited in the present application.
And step S402, updating the historical access data of the service platform interface according to the latest access data obtained through statistics.
The latest access data may be a latest access amount statistic according to a normal online state of the service platform interface, which is used for continuously updating historical access data of the service platform interface, maintaining timeliness of the historical access data, and adapting to access adjustment of the service platform interface, for example, platform service adjustment, access restriction adjustment, and the like.
And step S403, acquiring interface pressure measurement data, and performing offline pressure application test on the service platform interface based on the interface pressure measurement data to obtain the service limit capacity of the service platform interface.
Step S404, determining the daily capacity of the business platform interface according to the recorded historical access data of the business platform interface.
And step 405, adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
By updating the historical access data of the service platform interface based on the latest access data, the method is effectively suitable for carrying out statistics update of the access quantity due to internal adjustment of the service platform interface or trend change of the access quantity of the user, ensures timeliness of the historical access data, provides more accurate reference data for dynamically adjusting the load capacity of a server, avoids the problem that the capacity of the server corresponding to the service platform interface is seriously insufficient or excessively redundant due to the fact that the latest access quantity cannot be adapted to the change of the latest access quantity, and maintains good operation of the service platform interface.
Fig. 5 is a flowchart of a server capacity management method including a process of determining a server capacity according to an embodiment of the present application, as shown in fig. 5, including the following steps:
And step S501, acquiring interface pressure measurement data, and performing offline pressure measurement on the service platform interface based on the interface pressure measurement data to obtain the service limit capacity of the service platform interface.
Step S502, determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface.
Step S503, under the condition that the service limit capacity is smaller than the preset multiple of the service daily capacity, determining the target expansion capacity based on the multiple value of the service limit capacity and the service daily capacity.
The target capacity expansion amount can be determined through the multiple relation between the service limit capacity and the service daily capacity. For example, the preset multiple may be set to be 5 times, and in the case that the service limiting capacity does not reach 5 times of the daily capacity of the service, the problem of platform crash caused by the rapid increase of the access amount of the service platform interface may occur, so that capacity expansion may be selected. According to the multiple values of the service limit capacity and the service daily capacity, for example, the service limit capacity may be 2 times, 3 times or 4 times of the service daily capacity, the target expansion capacity is correspondingly determined to be 50% or 30% of the original expansion capacity, and the purpose is to increase the multiple values of the service limit capacity and the service daily capacity to more than a preset multiple.
And step S504, increasing the load capacity of the server corresponding to the service platform interface according to the target capacity expansion.
The capacity expansion of the server can be increased by increasing the number of servers or improving the configuration of the servers so as to expand the calculation, storage and network resources of the service platform interface, thereby coping with the increase of the load, and the method can comprise the following steps:
increasing the number of servers corresponding to the service platform interfaces according to the target capacity expansion;
or switching the service platform interface to a server with the load capacity as the target capacity according to the target capacity.
The load can be distributed to a plurality of servers by increasing the number of servers corresponding to the service platform interfaces. For example, this may be achieved by adding new server nodes in an existing server cluster, or dynamically adding virtual machine instances in a cloud service provider. In addition, by switching the service platform interface to the server with the load capacity as the target expansion capacity, the service platform interface is equivalent to selecting a server with higher hardware performance, for example, a server with faster CPU speed, larger memory capacity or larger storage capacity. Of course, there are other server capacity expansion modes, which can be adaptively selected according to the actual cost and development difficulty, and the application is not limited herein.
Above-mentioned, can be through making the business limit capacity form the relation of predetermineeing the multiple between business daily capacity to reserve sufficient capacity space, through increasing the quantity of server or switching over the better server of performance, nimble selection dilatation mode effectively satisfies the dilatation condition restriction of different application scenes, can satisfy the demand that the user quantity that increases constantly, improves system performance, ensures stability and reply peak flow simultaneously, effectively ensures the normal function operation of service platform interface.
Fig. 6 is a flowchart of a server capacity management method for joining a capacity early warning process according to an embodiment of the present application, as shown in fig. 6, including the following steps:
and step S601, acquiring interface pressure measurement data, and performing offline pressure application test on the service platform interface based on the interface pressure measurement data to obtain the service limit capacity of the service platform interface.
Step S602, determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface.
And step 603, adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
Step S604, updating the load capacity of the server corresponding to the adjusted service platform interface to a set capacity early warning line.
After the load capacity of the server is adjusted, a real-time monitoring mechanism can be set for whether the load capacity meets the real-time access requirement of the service platform interface, for example, the capacity early-warning line can be set as the service limit capacity, so that the monitoring range of the real-time access quantity of the service platform interface can be reasonably set.
Step S605, outputting capacity early warning information under the condition that the real-time access quantity of the service platform interface in the on-line state exceeds the capacity early warning line.
The capacity early warning information may be a peak value containing real-time access amount, where the peak value does not meet a preset numerical relation with a service limit capacity, and is used to remind a developer to manually adjust the load capacity of a server corresponding to the service platform interface in time.
By setting the capacity early warning mechanism, the real-time access quantity of the service platform interface can be effectively monitored, and under the condition of dynamically adjusting the load capacity of the corresponding server, a developer can be reminded to perform manual intervention by outputting the capacity early warning information, so that the normal development of the platform service is ensured, and bad use experience of a user is avoided.
Fig. 7 is a block diagram of a server capacity management device according to an embodiment of the present application, where the device is configured to execute the server capacity management method according to the foregoing embodiment, and has functional modules and beneficial effects corresponding to the execution method. As shown in fig. 7, the apparatus specifically includes:
The limit capacity determining module 101 is configured to acquire interface pressure measurement data, and perform offline pressure application test on the service platform interface based on the interface pressure measurement data to obtain service limit capacity of the service platform interface;
The daily capacity determining module 102 is configured to determine the daily capacity of the service platform interface according to the recorded historical access data of the service platform interface;
the server capacity adjustment module 103 is configured to adjust the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
According to the method, the interface pressure measurement data are obtained, and the offline pressure measurement is carried out on the service platform interface based on the interface pressure measurement data, so that the service limit capacity of the service platform interface is obtained; determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface; and adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity. In the scheme, the service limiting capacity of the service platform interface can be accurately obtained by performing offline pressure application test on the service platform interface and simulating the limiting load condition of the service platform interface, the service daily capacity of the service platform interface can be accurately obtained by combining historical access data and referring to the real data of the service platform interface in the actual application process, the load capacity of the server can be adjusted by comparing the service limiting capacity and the service daily capacity, capacity management can be performed in time according to the access condition of the service platform interface, the load capacity of the server can be dynamically adjusted, and good user experience of a user can be ensured.
In one possible embodiment, the interface crush data comprises analog user data, the limit capacity determination module 101 is further configured to:
Gradually increasing concurrent users according to the simulated user data, and sending concurrent interface requests corresponding to the concurrent users to the service platform interface;
Stopping adding concurrent users under the condition that the load limit of the service platform interface is reached;
And recording the service limit capacity corresponding to the load limit of the service platform interface.
In one possible embodiment, the daily capacity determination module 102 is further configured to:
extracting access data positioned in a preset time period from recorded historical access data of the service platform interface;
and determining the peak access amount of the access data as the business daily capacity of the business platform interface.
In one possible embodiment, the method further includes a history data update module configured to:
Counting the real-time access quantity of the service platform interface in an online state based on a preset time granularity;
And updating the historical access data of the service platform interface according to the latest access data obtained through statistics.
In one possible embodiment, the server capacity adjustment module 103 is further configured to:
Under the condition that the service limit capacity is smaller than the preset multiple of the service daily capacity, determining a target expansion capacity based on the multiple value of the service limit capacity and the service daily capacity;
and increasing the load capacity of the server corresponding to the service platform interface according to the target capacity expansion.
In one possible embodiment, the server capacity adjustment module 103 is further configured to:
increasing the number of servers corresponding to the service platform interfaces according to the target capacity expansion;
or switching the service platform interface to a server with the load capacity as the target capacity according to the target capacity.
In one possible embodiment, the system further comprises a capacity pre-warning module configured to:
Updating the load capacity of the server corresponding to the adjusted service platform interface to a set capacity early warning line;
and outputting the capacity early warning information under the condition that the real-time access quantity of the service platform interface in the on-line state exceeds the capacity early warning line.
Fig. 8 is a schematic structural diagram of a server capacity management device according to an embodiment of the present application, where, as shown in fig. 8, the device includes a processor 201, a memory 202, an input device 203, and an output device 204; the number of processors 201 in the device may be one or more, one processor 201 being taken as an example in fig. 8; the processor 201, memory 202, input devices 203, and output devices 204 in the apparatus may be connected by a bus or other means, for example in fig. 8. The memory 202 is a computer readable storage medium, and may be configured to store a software program, a computer executable program, and modules, such as program instructions/modules corresponding to the server capacity management method in the embodiment of the present application. The processor 201 executes various functional applications of the device and data processing by running software programs, instructions and modules stored in the memory 202, i.e., implements the server capacity management method described above. The input device 203 may be configured to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the apparatus. The output device 204 may include a display device such as a display screen.
The server capacity management device provided above can be used to execute the server capacity management method provided in any of the above embodiments, and has corresponding functions and beneficial effects.
The embodiments of the present application also provide a non-volatile storage medium containing computer-executable instructions that, when executed by a computer processor, are configured to perform a server capacity management method as described in the above embodiments, comprising: acquiring interface pressure measurement data, and performing offline pressure application test on the service platform interface based on the interface pressure measurement data to obtain the service limit capacity of the service platform interface; determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface; and adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
Storage media-any of various types of memory devices or storage devices. The term "storage medium" is intended to include: mounting media such as CD-ROM, floppy disk or tape devices; computer system memory or random access memory such as DRAM, DDR RAM, SRAM, EDO RAM, lanbas (Rambus) RAM, etc.; nonvolatile memory such as flash memory, magnetic media, or optical storage; registers or other similar types of memory elements, etc. The storage medium may also include other types of memory or combinations thereof. In addition, the storage medium may be located in a first computer system in which the program is executed, or may be located in a second, different computer system connected to the first computer system through a network such as the internet. The second computer system may provide program instructions to the first computer for execution. The term "storage medium" may include two or more storage media residing in different locations (e.g., in different computer systems connected by a network). The storage medium may store program instructions (e.g., embodied as a computer program) executable by one or more processors.
Of course, the storage medium containing the computer executable instructions provided in the embodiments of the present application is not limited to the server capacity management method described above, and may also perform the relevant operations in the server capacity management method provided in any embodiment of the present application.
It should be noted that, in the embodiment of the server capacity management system, each unit and module included are only divided according to the functional logic, but not limited to the above-mentioned division, so long as the corresponding functions can be implemented; in addition, the specific names of the functional units are also only for convenience of distinguishing from each other, and are not configured to limit the protection scope of the embodiments of the present application.
It should be noted that, the numbers of the steps in the solution are only used to describe the overall design framework of the solution, and do not represent the necessary sequence relationship between the steps. On the basis that the whole implementation process accords with the whole design framework of the scheme, the method belongs to the protection scope of the scheme, and the literal sequence during description is not an exclusive limit on the specific implementation process of the scheme. It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. In one typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory. The memory may include volatile memory in a computer-readable medium, random Access Memory (RAM) and/or nonvolatile memory, etc., such as Read Only Memory (ROM) or flash RAM. Memory is an example of a computer-readable medium.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises an element.
Note that the above is only a preferred embodiment of the present invention and the technical principle applied. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, while the invention has been described in connection with the above embodiments, the invention is not limited to the embodiments, but may be embodied in many other equivalent forms without departing from the spirit or scope of the invention, which is set forth in the following claims.
Claims (10)
1. A server capacity management method, comprising:
Acquiring interface pressure measurement data, and performing offline pressure measurement on a service platform interface based on the interface pressure measurement data to obtain the service limit capacity of the service platform interface;
Determining the daily business capacity of the business platform interface according to the recorded historical access data of the business platform interface;
and adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
2. The server capacity management method of claim 1, wherein the interface crush data comprises simulated user data;
the step of performing a pressure test on the service platform interface based on the interface pressure test data to obtain the service limit capacity of the service platform interface comprises the following steps:
Gradually increasing concurrent users according to the simulated user data, and sending concurrent interface requests corresponding to the concurrent users to the service platform interface;
Stopping adding concurrent users under the condition that the load limit of the service platform interface is reached;
And recording the service limit capacity corresponding to the load limit of the service platform interface.
3. The server capacity management method according to claim 1, wherein the determining the daily capacity of the service platform interface according to the recorded historical access data of the service platform interface includes:
Extracting access data positioned in a preset time period from the recorded historical access data of the service platform interface;
and determining the peak access amount of the access data as the business daily capacity of the business platform interface.
4. The server capacity management method according to claim 1, further comprising, before said determining the business daily capacity of said business platform interface based on said recorded historical access data of said business platform interface:
counting the real-time access quantity of the service platform interface in an online state based on a preset time granularity;
And updating the historical access data of the service platform interface according to the latest access data obtained through statistics.
5. The method for managing server capacity according to claim 1, wherein the adjusting the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity includes:
Determining a target expansion capacity based on a multiple value of the service limiting capacity and the service daily capacity under the condition that the service limiting capacity is smaller than a preset multiple of the service daily capacity;
And increasing the load capacity of the server corresponding to the service platform interface according to the target capacity expansion.
6. The method for managing capacity of a server according to claim 5, wherein said increasing the load capacity of the server corresponding to the service platform interface according to the target capacity expansion includes:
Increasing the number of servers corresponding to the service platform interfaces according to the target capacity expansion;
or switching the service platform interface to a server with the load capacity as the target capacity according to the target capacity.
7. The method for managing server capacity according to any one of claims 1 to 6, further comprising, after said adjusting the load capacity of the server corresponding to the service platform interface:
Updating the load capacity of the server corresponding to the adjusted service platform interface to a set capacity early warning line;
and outputting capacity early warning information under the condition that the real-time access quantity of the service platform interface in the on-line state exceeds the capacity early warning line.
8. A server capacity management apparatus, comprising:
the limit capacity determining module is configured to acquire interface pressure measurement data, and perform offline pressure application test on a service platform interface based on the interface pressure measurement data to obtain service limit capacity of the service platform interface;
The daily capacity determining module is configured to determine the daily capacity of the business platform interface according to the recorded historical access data of the business platform interface;
And the server capacity adjustment module is configured to adjust the load capacity of the server corresponding to the service platform interface based on the service limit capacity and the value comparison result of the service daily capacity.
9. A server capacity management apparatus, the apparatus comprising: one or more processors; a storage device configured to store one or more programs that, when executed by the one or more processors, cause the one or more processors to implement the server capacity management method of any one of claims 1-7.
10. A non-transitory storage medium storing computer executable instructions which, when executed by a computer processor, are configured to perform the server capacity management method of any one of claims 1-7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410041161.9A CN118034912A (en) | 2024-01-10 | 2024-01-10 | Method, device, equipment and storage medium for server capacity management |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410041161.9A CN118034912A (en) | 2024-01-10 | 2024-01-10 | Method, device, equipment and storage medium for server capacity management |
Publications (1)
Publication Number | Publication Date |
---|---|
CN118034912A true CN118034912A (en) | 2024-05-14 |
Family
ID=91001136
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202410041161.9A Pending CN118034912A (en) | 2024-01-10 | 2024-01-10 | Method, device, equipment and storage medium for server capacity management |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN118034912A (en) |
-
2024
- 2024-01-10 CN CN202410041161.9A patent/CN118034912A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10783002B1 (en) | Cost determination of a service call | |
EP3825945A1 (en) | Method and device for performing credit evaluation on work publisher based on block chain | |
US9087156B2 (en) | Application version release management | |
US10944581B2 (en) | Increasing processing capacity of processor cores during initial program load processing | |
US11570075B2 (en) | Reverse health checks | |
CN114095567B (en) | Data access request processing method and device, computer equipment and medium | |
US11327767B2 (en) | Increasing resources for partition to compensate for input/output (I/O) recovery event | |
US20160094392A1 (en) | Evaluating Configuration Changes Based on Aggregate Activity Level | |
US10884845B2 (en) | Increasing processing capacity of processor cores during initial program load processing | |
CN112269622A (en) | Page management method, device, equipment and medium | |
CN110737548B (en) | Data request method and server | |
US9501321B1 (en) | Weighted service requests throttling | |
CN113760672A (en) | Performance test method, equipment and storage medium for multi-cloud platform resources | |
CN118034912A (en) | Method, device, equipment and storage medium for server capacity management | |
CN108418730B (en) | Network flow testing method, device, equipment and computer readable storage medium | |
CN113407337B (en) | Resource allocation method, device, server and medium | |
CN111598390B (en) | Method, device, equipment and readable storage medium for evaluating high availability of server | |
CN114697203A (en) | Network fault pre-judging method and device, electronic equipment and storage medium | |
CN113408817A (en) | Traffic distribution method, device, equipment and storage medium | |
CN117785486B (en) | Environment resource allocation method, device, equipment and medium | |
CN118409882B (en) | Service data processing method, device, computer equipment and readable storage medium | |
CN109358968B (en) | Server resource allocation method and device | |
CN116847113B (en) | Video live broadcast transfer method, device, equipment and medium based on cloud architecture module | |
CN115086147B (en) | Service data monitoring and early warning method and device, medium and electronic equipment | |
US11941421B1 (en) | Evaluating and scaling a collection of isolated execution environments at a particular geographic location |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |