CN114598658A - Flow limiting method and device

Info

Publication number: CN114598658A
Application number: CN202210222044.3A
Authority: CN
Inventors: 申大雪
Original assignee: Inspur Cloud Information Technology Co Ltd
Current assignee: Inspur Cloud Information Technology Co Ltd
Priority date: 2022-03-07
Filing date: 2022-03-07
Publication date: 2022-06-07

Abstract

The invention relates to the technical field of web applications, and in particular provides a flow limiting method. A user request first reaches an interceptor; after the interceptor processes it, a token application operation is executed, in which a token generation controller generates tokens according to a set rule. When a user accesses the service frequently and the tokens are insufficient, the request is temporarily stored in a queue to wait while the number of tokens in the token bucket is monitored; when the token bucket holds enough tokens, the request is forwarded again. Compared with the prior art, the method adopts a trigger-based token generation mode in which the token generation rule is triggered on each access request, which reduces the occupation of resources such as server threads and CPUs (central processing units).

Description

Flow limiting method and device

Technical Field

The invention relates to the technical field of web application, and particularly provides a flow limiting method and device.

Background

With the rapid development of computer technology, new technologies and architectural ideas are being applied to network products. Systems have evolved from traditional single-machine deployment to distributed deployment, more and more traditional architectures are moving towards micro-service and distributed architectures, and traditional industries are shifting towards an Internet Plus model. This places demands on both the quality and the speed of service access. In high-concurrency scenarios, the three most common technical approaches are flow limiting, caching, and service degradation.

The purpose of flow limiting is to protect the system by limiting the rate of concurrent access requests, or the rate of requested data within a time window. Once the limit is reached, requests can be rejected, queued, or made to wait. This maintains the availability and stability of the system and prevents it from slowing down or going down because of a sudden increase in traffic.

The token bucket algorithm is a common flow limiting algorithm. The system puts tokens into a bucket at a constant rate; a request must first obtain tokens from the bucket before it is processed, and when no token is available in the bucket, service is refused. By controlling the rate at which tokens are issued, the algorithm limits request frequency, capacity, and so on.

The traditional token bucket algorithm requires a dedicated thread that adds tokens to the token bucket at fixed intervals, which occupies considerable server thread and CPU resources. Traditional flow limiting is also mostly carried out within a single machine; it cannot be applied to a distributed environment and cannot share the flow limit among multiple nodes.
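For illustration only, the traditional scheme described above can be sketched as follows. The class and method names are assumptions made for this sketch; the periodic refill is modeled as an explicit `tick()` call, which in a traditional deployment a dedicated thread would invoke at a fixed period regardless of whether any request is arriving.

```python
class TimerDrivenTokenBucket:
    """Classic token bucket whose refill is pushed by an external timer."""

    def __init__(self, capacity: int, tokens_per_tick: int) -> None:
        self.capacity = capacity
        self.tokens_per_tick = tokens_per_tick
        self.tokens = capacity  # the bucket starts full

    def tick(self) -> None:
        # Called by the dedicated refill thread at a constant period.
        self.tokens = min(self.capacity, self.tokens + self.tokens_per_tick)

    def try_acquire(self, needed: int = 1) -> bool:
        # Serve the request only if enough tokens are available.
        if needed <= self.tokens:
            self.tokens -= needed
            return True
        return False


bucket = TimerDrivenTokenBucket(capacity=3, tokens_per_tick=1)
results = [bucket.try_acquire() for _ in range(4)]  # the fourth call finds no token
bucket.tick()                                       # one scheduled refill
after_tick = bucket.try_acquire()
```

The refill work in this sketch runs on every tick even when the service is idle, which is the resource cost the background attributes to the traditional approach.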

Disclosure of Invention

Aiming at the defects of the prior art, the invention provides a flow limiting method with strong practicability.

A further technical task of the present invention is to provide a flow restriction device that is reasonably designed, safe and practical.

The technical scheme adopted by the invention for solving the technical problem is as follows:

A flow limiting method, wherein all user requests first arrive at an interceptor and, after being processed by the interceptor, a token application operation is executed; in the token application operation, a token generation controller generates tokens according to a set rule; when the token application triggers the token generation rule, the token generation controller generates a certain number of tokens and puts them into a token bucket; a token bucket server issues the tokens required by the user request, and the request is forwarded to an application server to execute a data access operation;

when a user accesses the service frequently and the tokens are insufficient, the request is temporarily stored in a queue to wait, and the number of tokens in the token bucket is detected; when the token bucket holds enough tokens, the request is forwarded.

Further, before the user request reaches the interceptor, setting the maximum token number of the token bucket, the interval time of supplementing tokens by the token bucket and the quantity of supplementing tokens in each time interval of the token bucket;

and for different types of API services, setting the number of tokens required by different service access according to different sizes of occupied network resources and database resources.
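As a non-limiting illustration, the settings described above might be grouped as shown below. The class name, field names, API paths, and all numeric values are hypothetical and chosen only for this sketch; the disclosure does not prescribe a configuration format.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class BucketConfig:
    max_token_count: int      # maximum number of tokens the bucket can hold
    token_interval_ms: int    # interval time between token replenishments
    token_count_per: int      # tokens replenished in each interval
    # Tokens charged per API; services that use more network or database
    # resources are assigned a higher cost (example values only).
    api_costs: dict = field(default_factory=dict)
    default_cost: int = 1

    def cost_of(self, api: str) -> int:
        # Fall back to the default cost for APIs without an explicit entry.
        return self.api_costs.get(api, self.default_cost)


config = BucketConfig(
    max_token_count=100,
    token_interval_ms=1000,
    token_count_per=20,
    api_costs={"/report/export": 10, "/user/profile": 1},
)
```

With such a table, a heavy export request would draw ten tokens from the bucket while a lightweight profile lookup would draw one.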

Further, when the http request reaches the server, a request interceptor intercepts the http request to obtain the current limiting policy flag information of the http request, queries whether the current limiting policy flag information exists in a token bucket server, initializes a token bucket if the current limiting policy flag information does not exist, puts a preset maximum token number of the token bucket into the token bucket, sets expiration time of the token, and sets writing time of the token as current time.

Further, if the current-limiting policy flag information exists, the token writing time corresponding to the current-limiting policy flag information in the token bucket server is read and compared with the current timestamp; if the time interval between the current timestamp and the last token writing time is greater than or equal to the interval time for supplementing tokens by the token bucket, the number of tokens that can be added is calculated from that time interval and the rate of adding tokens, the tokens are supplemented into the token bucket, and the writing time of the tokens is updated to the current time.

Further, if the time interval between the current timestamp and the last token writing time is less than the interval time of token supplement of the token bucket, the number of tokens required by the current http request is judged, if the number of the required tokens is less than or equal to the remaining number of the current token bucket, the tokens required by the http request are removed from the token bucket, the http request is forwarded, and the application server executes specific operation.

Further, a queuing queue is set, together with the length of the queuing queue and the expiration time of queue elements; if the number of tokens required by the http request is larger than the remaining number of tokens in the current token bucket, the http request is put into the queuing queue.

Preferably, when the http request deposited in the queue reaches an expiration time, the request is directly rejected and removed from the queue.

Further, a timer is set and a probe is used to detect the number of tokens in the token bucket; when the number of tokens stored in the token bucket is greater than or equal to the number of tokens required by the request, the token bucket server issues the tokens required by the request and forwards the request to the application server for execution.

Further, if the number of tokens in the token bucket is smaller than the number of tokens required by the http request, the time interval between the current timestamp and the time at which the token bucket last generated tokens is calculated; if the time interval is larger than the interval time for supplementing tokens by the token bucket, the tokens in the token bucket are directly supplemented to the maximum capacity, the token generation time is updated to the current timestamp, the tokens required by the request are issued, and the request is forwarded to the application server to execute subsequent operations.

A flow restriction device, comprising: at least one memory and at least one processor;

the at least one memory to store a machine readable program;

the at least one processor is configured to invoke the machine readable program to perform the flow limiting method.

Compared with the prior art, the flow limiting method and the flow limiting device have the following outstanding beneficial effects:

(1) The invention improves the traditional token bucket algorithm by adopting a trigger-based token generation mode: the token generation rule is triggered on each access request, which reduces the occupation of resources such as server threads and CPU. The token bucket algorithm used by the invention can smoothly limit user access and control the data transmission rate to achieve the purpose of flow limiting.

(2) The invention also arranges a request interceptor, a token generation controller, a token bucket server and the like, which can break through the limitation of the current limiting function of single machine deployment and realize the current limiting requirement of the multi-node distributed environment.

In addition, a queuing queue is introduced to cache http requests that fail to acquire tokens in a high-concurrency scenario; these requests are processed after a sufficient number of tokens have been generated, which allows traffic spikes within a certain range.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.

FIG. 1 is a schematic flow diagram of a flow restriction method;

FIG. 2 is an architecture diagram of a flow restriction method.

Detailed Description

The present invention will be described in further detail with reference to specific embodiments in order to better understand the technical solutions of the present invention. It is to be understood that the described embodiments are merely exemplary of the invention, and not restrictive of the full scope of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

A preferred embodiment is given below:

As shown in FIGS. 1 and 2, in the flow limiting method of this embodiment, all user requests first reach an interceptor, and after being processed by the interceptor, a token application operation is executed. The request interceptor is located at the uppermost layer of the system and belongs to the gateway layer.

In the token application operation, a token generation controller generates tokens according to a set rule. A Redis cache or an in-memory cache can be used to ensure the consistency, robustness, and low latency of data requests.

When the token application triggers the token generation rule, the token generation controller generates a certain number of tokens according to parameters set by the system, such as the size of the token bucket, the token generation rate, and the last token generation time, and puts the tokens into the token bucket. The token bucket server issues the tokens required by the user request, and the request is forwarded to an application server to execute a data access operation.

The token bucket allows a certain amount of burst traffic. When tokens are insufficient, requests are temporarily stored in a queuing queue to wait, and a probe is used to detect the number of tokens in the token bucket. When the token bucket holds enough tokens, the request is forwarded again.

The specific operation is as follows:

Before a user request reaches the interceptor, the maximum token number of the token bucket, the interval time for supplementing tokens, and the number of tokens supplemented in each interval are set.

When the http request reaches the server, a request interceptor intercepts the http request and obtains identifying attributes of the request, such as the user IP and the accessed API, as the current-limiting policy flag information.

The token bucket server is queried for the current-limiting policy flag information. If it does not exist, the token bucket is initialized: the preset maximum token number is put into the token bucket, the expiration time of the token is set, and the writing time of the token is set to the current time.
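The lookup and initialization step can be sketched as below, as one possible reading of this embodiment. A plain dictionary stands in for the token bucket server; the key layout, the field names, and the choice to re-initialize an expired entry (mimicking a cache entry with a time-to-live) are assumptions of this sketch and are not stated in the original text.

```python
def policy_key(user_ip: str, api: str) -> str:
    # Hypothetical key layout; the embodiment only says the flag is built
    # from request attributes such as the user IP and the accessed API.
    return f"ratelimit:{user_ip}:{api}"


def get_or_init_bucket(store: dict, key: str, max_token_count: int,
                       token_ttl_ms: int, now_ms: int) -> dict:
    bucket = store.get(key)
    if bucket is None or bucket["expires_at_ms"] <= now_ms:
        # No entry for this policy flag (or the entry has expired):
        # fill the bucket to capacity and record the writing time.
        bucket = {
            "stored_token_count": max_token_count,
            "last_timestamp_ms": now_ms,
            "expires_at_ms": now_ms + token_ttl_ms,
        }
        store[key] = bucket
    return bucket
```

In a multi-node deployment the dictionary would be replaced by a shared store so that every gateway node sees the same bucket for a given policy flag.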

The maximum token number of the token bucket is at least 2 times the number of tokens supplemented in each interval; the actual value is generally determined according to the flow limiting threshold supported by the system. The expiration time of the token is greater than 2 times the interval time for supplementing tokens by the token bucket.
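These two constraints can be checked when the settings are loaded. The following is a minimal sketch; the function name and parameter names are assumptions, and "at least 2 times" is read here as greater than or equal to twice the per-interval amount.

```python
def validate_bucket_settings(max_token_count: int, token_count_per: int,
                             token_interval_ms: int, token_ttl_ms: int) -> list:
    """Return the violated constraints; an empty list means the settings pass."""
    problems = []
    if max_token_count < 2 * token_count_per:
        problems.append("max_token_count should be at least 2 * token_count_per")
    if token_ttl_ms <= 2 * token_interval_ms:
        problems.append("token_ttl_ms should be greater than 2 * token_interval_ms")
    return problems
```

For example, a capacity of 100 with 20 tokens per 1000 ms interval and a 3000 ms expiration time satisfies both conditions.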

If the current-limiting policy flag information exists, the token writing time corresponding to the current-limiting policy flag information in the token bucket server is read and compared with the current timestamp. If the time interval between the current timestamp and the last token writing time is greater than or equal to the interval time for supplementing tokens by the token bucket, the number of tokens that can be added is calculated from that time interval and the rate of adding tokens, the tokens are supplemented into the token bucket, and the writing time of the tokens is updated to the current time.

The number of tokens that can be added for the time interval between the current timestamp and the last token writing time is calculated from the token adding rate as follows:

newTokenCount = (currentTimestamp - lastTimestamp) / tokenInterval × tokenCountPer

where currentTimestamp is the timestamp of the current access, lastTimestamp is the timestamp of the last token update, tokenInterval is the interval time for supplementing tokens by the token bucket, and tokenCountPer is the number of tokens supplemented by the token bucket in each interval. newTokenCount is rounded down.

The number of tokens eventually added to the token bucket is:

resultCount = min(newTokenCount, maxTokenCount - storedTokenCount)

where maxTokenCount is the maximum number of tokens supported by the token bucket, and storedTokenCount is the number of tokens remaining in the current token bucket. resultCount tokens are then supplemented into the token bucket, and the writing time of the tokens is updated to the current time.
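The replenishment calculation can be illustrated with a short Python sketch. It assumes that the number of new tokens is the elapsed time divided by the replenishment interval, multiplied by the tokens per interval and rounded down, and that timestamps and intervals are integers in the same unit (milliseconds here). The function name is an assumption; the parameter names follow the symbols defined above.

```python
def tokens_to_add(current_timestamp: int, last_timestamp: int,
                  token_interval: int, token_count_per: int,
                  max_token_count: int, stored_token_count: int) -> int:
    elapsed = current_timestamp - last_timestamp
    if elapsed < token_interval:
        # Replenishment is only triggered once a full interval has elapsed.
        return 0
    # Integer arithmetic gives the rounded-down value of
    # elapsed / token_interval * token_count_per without float error.
    new_token_count = elapsed * token_count_per // token_interval
    # Never exceed the free space left in the bucket.
    return min(new_token_count, max_token_count - stored_token_count)
```

As a worked case, with 2500 ms elapsed, a 1000 ms interval, and 20 tokens per interval, newTokenCount is 50; if the bucket holds 70 of a maximum 100 tokens, only 30 tokens are added.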

If the time interval between the current timestamp and the last token writing time is less than the interval time for supplementing tokens by the token bucket, the number of tokens required by the current http request is determined. If the number of required tokens is less than or equal to the number remaining in the current token bucket, the tokens required by the http request are removed from the token bucket, the http request is forwarded, and the application server executes the specific operation.
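The token deduction and forwarding step can be sketched as follows. The function name, the dictionary field, and the `forward` callback are hypothetical stand-ins for the token bucket server record and the call to the application server.

```python
from typing import Callable, Optional


def handle_request(bucket: dict, required: int,
                   forward: Callable[[], str]) -> Optional[str]:
    """Forward the request if the bucket holds enough tokens, else return None."""
    if required <= bucket["stored_token_count"]:
        # Enough tokens: remove them from the bucket and let the request through.
        bucket["stored_token_count"] -= required
        return forward()
    # Not enough tokens: the caller is expected to queue the request instead.
    return None
```

When several gateway nodes share one bucket, the check and the deduction would need to be performed as a single atomic operation on the shared store; this sketch does not address that.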

A queuing queue is set, together with the length of the queuing queue and the expiration time of a queue element. If the number of tokens required by the http request is larger than the number remaining in the current token bucket, the request is put into the queue.

When the http request stored in the queue reaches the expiration time, the request is directly rejected and removed from the queue.
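A bounded queue with per-element expiration, as described in the two preceding paragraphs, might look like the sketch below. The class and method names are assumptions, as is the decision to refuse new entries once the queue is full; the text only states that the queue has a set length.

```python
from collections import deque


class WaitingQueue:
    def __init__(self, max_length: int, element_ttl_ms: int) -> None:
        self.max_length = max_length
        self.element_ttl_ms = element_ttl_ms
        self._items = deque()  # entries: (deadline_ms, request, required_tokens)

    def __len__(self) -> int:
        return len(self._items)

    def offer(self, request, required_tokens: int, now_ms: int) -> bool:
        if len(self._items) >= self.max_length:
            return False  # queue full (assumption: the caller rejects the request)
        deadline = now_ms + self.element_ttl_ms
        self._items.append((deadline, request, required_tokens))
        return True

    def evict_expired(self, now_ms: int) -> list:
        """Remove and return the requests whose expiration time has been reached."""
        rejected = []
        # All elements share one TTL and now_ms is assumed non-decreasing,
        # so deadlines are in FIFO order and only the head needs checking.
        while self._items and self._items[0][0] <= now_ms:
            rejected.append(self._items.popleft()[1])
        return rejected
```

The requests returned by `evict_expired` are the ones that would be rejected directly.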

A timer is set and a probe is used to detect the number of tokens in the token bucket. When the number of tokens stored in the token bucket is greater than or equal to the number of tokens required by the request, the token bucket server issues the tokens required by the request and forwards the request to the application server for execution.

If the number of tokens in the token bucket is smaller than the number of tokens required by the http request, the time interval between the current timestamp and the time at which the token bucket last generated tokens is calculated. If the time interval is larger than the interval time for supplementing tokens by the token bucket, the tokens in the token bucket are directly supplemented to the maximum capacity, the token generation time is updated to the current timestamp, the tokens required by the request are issued, and the request is forwarded to the application server to execute subsequent operations.
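One probe pass over the queue, covering both the sufficient-token case and the refill-to-capacity case described above, can be sketched as follows. The function name, the dictionary fields, the `(request, required_tokens)` queue entries, and the strict first-in-first-out handling are assumptions of this sketch.

```python
from collections import deque


def probe_once(bucket: dict, queue: deque, now_ms: int,
               token_interval_ms: int, max_token_count: int, forward) -> int:
    """Run one timer-driven probe pass and return how many requests were forwarded."""
    forwarded = 0
    while queue:
        request, required = queue[0]
        if bucket["stored_token_count"] < required:
            elapsed = now_ms - bucket["last_timestamp_ms"]
            if elapsed <= token_interval_ms:
                break  # still short of tokens and not yet time to refill
            # Interval exceeded: supplement the bucket directly to capacity.
            bucket["stored_token_count"] = max_token_count
            bucket["last_timestamp_ms"] = now_ms
            if bucket["stored_token_count"] < required:
                break  # the request needs more than a full bucket; leave it queued
        # Issue the required tokens and forward the request.
        bucket["stored_token_count"] -= required
        queue.popleft()
        forward(request)
        forwarded += 1
    return forwarded
```

A timer would call `probe_once` periodically; between calls no refill work is performed, so the bucket is only topped up when a waiting request actually needs it.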

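The flow restriction device in this embodiment comprises at least one memory and at least one processor; the at least one memory is configured to store a machine readable program, and the at least one processor is configured to invoke the machine readable program to perform the flow limiting method.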

The above embodiments are only specific ones of the present invention, and the scope of the present invention includes but is not limited to the above embodiments, and any suitable changes or substitutions that are consistent with the flow limiting method and apparatus claims of the present invention and are made by those of ordinary skill in the art shall fall within the scope of the present invention.

Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. A flow limiting method is characterized in that all user requests firstly reach an interceptor, after being processed by the interceptor, a token application operation is executed, the token application operation generates a token by a token generation controller according to a certain rule, when the token application triggers the token generation rule, the token generation controller generates a certain number of tokens, the tokens are put into a token bucket, a token bucket server issues the tokens required by the user requests, and the requests are forwarded to each application server to execute data access operation;

2. The traffic limitation method according to claim 1, wherein before the user request reaches the interceptor, the maximum number of tokens in the token bucket, the interval time for replenishing tokens in the token bucket, and the number of replenishing tokens in each interval in the token bucket are set;

3. The traffic limiting method according to claim 2, wherein when the http request reaches the server, a request interceptor first intercepts the http request to obtain the current limiting policy flag information of the http request, queries whether the current limiting policy flag information exists in the token bucket server, if not, initializes the token bucket, puts a preset maximum token number of the token bucket into the token bucket, sets an expiration time of the token, and sets a write time of the token as a current time.

4. The method according to claim 3, wherein if the current-limiting policy flag information exists, the token write time corresponding to the current-limiting policy flag information in the token bucket server is read and compared with a current timestamp; if the time interval between the current timestamp and the last token write time is greater than or equal to the interval time for supplementing tokens by the token bucket, the number of tokens that can be added is calculated according to the rate of adding tokens, the tokens are supplemented into the token bucket, and the write time of the tokens is updated to the current time.

5. The traffic limitation method according to claim 4, wherein if the time interval between the current timestamp and the last write time of the token is less than the interval time of the token bucket for supplementing the token, the number of tokens required for the current http request is determined, and if the number of required tokens is less than or equal to the remaining number of the current token bucket, the tokens required for the http request are removed from the token bucket, the http request is forwarded, and the application server performs a specific operation.

6. The traffic throttling method of claim 5, wherein if the number of required tokens is greater than the remaining number of the current token bucket, setting a queue, setting the length of the queue and the expiration time of the queue element, and if the number of required tokens for an http request is greater than the remaining number of the current token bucket, placing the request in the queue.

7. A method according to claim 6, characterized in that when the http request stored in the queue reaches an expiry time, the request is directly rejected and removed from the queue.

8. The traffic limitation method according to claim 6 or 7, wherein a probe is used to detect the number of tokens in the token bucket by setting a timer, and when the number of tokens stored in the token bucket is greater than or equal to the number of tokens required by the request, the token bucket server issues the number of tokens required by the request and forwards the request to the application server for execution.

9. The method of claim 8, wherein if the number of tokens in the token bucket is less than the number of tokens required for the http request, calculating a time interval between a current timestamp and a time when the token bucket last generates tokens, if the time interval is greater than the time interval when the token bucket supplements the tokens, directly supplementing the tokens in the token bucket to a maximum capacity, updating the token generation time to the current timestamp, and sending the number of tokens required for the request, and forwarding the request to the application server for subsequent operations.

10. A flow restriction device, comprising: at least one memory and at least one processor;

the at least one memory to store a machine readable program;

the at least one processor configured to invoke the machine readable program to perform the method of any of claims 1 to 9.