CN112565399B - Adaptive traffic load balancing method for online learning - Google Patents

Adaptive traffic load balancing method for online learning Download PDF

Info

Publication number
CN112565399B
CN112565399B CN202011394360.6A CN202011394360A CN112565399B CN 112565399 B CN112565399 B CN 112565399B CN 202011394360 A CN202011394360 A CN 202011394360A CN 112565399 B CN112565399 B CN 112565399B
Authority
CN
China
Prior art keywords
online
time
model
request
load balancing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011394360.6A
Other languages
Chinese (zh)
Other versions
CN112565399A (en
Inventor
张兴
徐世界
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianyi Electronic Commerce Co Ltd
Original Assignee
Tianyi Electronic Commerce Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianyi Electronic Commerce Co Ltd filed Critical Tianyi Electronic Commerce Co Ltd
Priority to CN202011394360.6A priority Critical patent/CN112565399B/en
Publication of CN112565399A publication Critical patent/CN112565399A/en
Application granted granted Critical
Publication of CN112565399B publication Critical patent/CN112565399B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/145Network analysis or design involving simulating, designing, planning or modelling of a network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/147Network analysis or design for predicting network behaviour
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1008Server selection for load balancing based on parameters of servers, e.g. available memory or workload

Abstract

The invention discloses an online learning adaptive flow load balancing method, which comprises the following two steps: model training and online prediction, wherein the model trained by the model is used for online prediction, the online prediction service predicts the label (namely the strength of service capability) of a rear-end application example through online collected characteristic data, and the load balancer takes the prediction result as a load balancing weight value, so that online learning self-adaptive load balancing is realized. The invention realizes the self-adaptive flow load balance by combining the indexes of a plurality of dimensions reflecting the real situation of the back-end example and online learning, and the flow can be more quickly and accurately distributed to healthy application nodes by utilizing the method.

Description

Adaptive traffic load balancing method for online learning
Technical Field
The invention relates to the technical field of emerging information, in particular to an online learning adaptive traffic load balancing method.
Background
Load balancing refers to balancing and distributing loads (work tasks) to a plurality of operation units for operation, such as an FTP server, a Web server, an enterprise core application server, and other main task servers, so as to cooperatively complete the work tasks, as shown in fig. 1; common load balancing algorithms include RandomLoadBalance (random balancing algorithm), roundRobinLoadBalance (weight round robin balancing algorithm), leistactionloadbalance (least active call number balancing algorithm), consistence hashloadbalance (consistent Hash balancing algorithm), and the like, and for such configuration, a conventional method is to pre-configure a designated algorithm as a load balancing strategy, but cannot dynamically adjust according to the real situation of a back-end application instance, as shown in fig. 2, a resource tension alarm occurs, and a conventional load balancing cannot make a corresponding decision according to the change.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides an online learning adaptive traffic load balancing method.
In order to solve the technical problems, the invention provides the following technical scheme:
the invention discloses an online learning adaptive flow load balancing method, which comprises the following two steps: model training and online prediction, wherein the model trained by the model is used for online prediction, the online prediction service predicts a label of a rear-end application example through online collected characteristic data, and a prediction result is used as a load balancing weight value by a load balancer, so that online learning self-adaptive load balancing is realized, and the method specifically comprises the following steps:
the specific steps of model training are as follows:
s1, collecting model training characteristics,
by adjusting the concurrency number of the requests initiated by the client and the resource state of the back-end application instance server, for example, adjusting resources such as cpu, memory and the like, the state under a real condition is simulated, and the following data with 4 dimensions are obtained and used as model training characteristics:
1) Each resource utilization rate data of the back-end application example in the T time comprises a cpu, a memory and a disk;
2) Requesting to return state code distribution within T time;
3) Average request response time within T time;
4) The health detection passing frequency of the back-end application example within T time;
s2, label definition:
the single request execution results are divided into three cases:
4) Whether the request is executed successfully;
5) Request successfully returns, but response time times out;
6) The request returns successfully and the response time is normal;
to sum up, for a single example, three values are counted: the abnormal rate (E) of requests in the T time, the timeout rate (L) in the T time and the corresponding average time length (A) ms of normal requests in the T time define label = w 1 *E+w 2 *L+w 3 * A, wherein w 1 、w 2 、w 3 Is a weighted value;
s3, training an original model off line, training a regression model by using Lightgbm through model training characteristics and sample labels confirmed by S1 and S2, wherein LightGBM (LightGradientBoosting machine) is a framework for realizing a GBDT algorithm, and performing iterative training by using a weak classifier to obtain an optimal model;
the online prediction method comprises the following specific steps:
s4, obtaining model training characteristics by an online model service;
s5, predicting the strength of service processing capacity in the following T time by utilizing the model training characteristics, and comprehensively evaluating the state of the server by using the average values of the request abnormal rate, the overtime rate and the response duration of the normal request in the T time, wherein the lower the abnormal rate is, the lower the overtime rate is, the shorter the response duration average value is, the better the service state is, and the higher the set weight is;
and S6, the online prediction service pushes the result to a load balancer, and the load balancer sets weight distribution flow according to the push result.
Compared with the prior art, the invention has the following beneficial effects:
the invention realizes the self-adaptive flow load balance by combining the indexes of multiple dimensions reflecting the real situation of the back-end instance and online learning, and the flow can be distributed to healthy application nodes more quickly and accurately by utilizing the method.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
FIG. 1 is a schematic diagram of a load balancing principle;
FIG. 2 is a diagram of a common load balancing algorithm;
FIG. 3 is an overall block diagram of the present invention;
FIG. 4 is a diagram of steps for collecting model training features according to the present invention;
fig. 5 is a schematic diagram of the online prediction principle of the present invention.
Detailed Description
The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.
Example 1
As shown in fig. 3, the present invention provides an adaptive traffic load balancing method for online learning, which includes two steps: model training and online prediction, wherein online prediction is performed through a model trained by the model, an online prediction service predicts a label (namely the strength of service capability) of a rear-end application example through characteristic data acquired online, and a load balancer takes a prediction result as a load balancing weight value, so that online learning self-adaptive load balancing is realized, specifically as follows:
the specific steps of model training are as follows:
s1, collecting model training characteristics, as shown in figure 4,
the method comprises the following steps of simulating the state under a real condition by adjusting the concurrency number of a request initiated by a client and the resource state of a back-end application instance server, such as adjusting resources of a cpu (central processing unit), a memory and the like, and obtaining the following data with 4 dimensions as model training characteristics:
1) The resource utilization rate data of the back-end application example in the T time comprise a cpu, a memory, a disk and the like;
2) Requesting to return the distribution of the state codes within T time;
3) Average request response time within T time;
4) The health detection passing frequency of the back-end application example within T time;
s2, label definition:
the single request execution results are divided into three cases:
7) Whether the request was executed successfully (state 200);
8) Request successfully returns, but response time times out (threshold can be customized, default = Q3+1.5 x (Q3-Q1) of all request response times);
note: q3:75 quantile, Q1:25 quantiles;
9) The request returns successfully and the response time is normal;
to sum up, for a single example, three values are counted: the abnormal rate (E) of requests in the T time, the overtime rate (L) in the T time and the corresponding average time length (A) ms of normal requests in the T time are defined by Label=w 1 *E+w 2 *L+w 3 * A, wherein w 1 、w 2 、w 3 Is a weighted value;
s3, training an original model offline, training a regression model by using Lightgbm through model training characteristics and sample labels confirmed by S1 and S2, wherein the LightGBM (LightGradientBoosting machine) is a framework for realizing a GBDT algorithm, and iterative training is carried out by using a weak classifier (decision tree) to obtain an optimal model, and the model has the advantages of good training effect, difficulty in overfitting and the like;
the specific steps of online prediction are as follows, as shown in fig. 5:
s4, obtaining model training characteristics by an online model service (the characteristic data of T time refers to service state data acquired online in T time, including CPU, memory, magnetic disks and the like, request return state code distribution, average request response time, back-end application health detection passing frequency and the like);
and S5, predicting the strength (namely the label) of the service processing capacity in the next T time by utilizing the model training characteristics (characteristics). And evaluating the state of the server by using the average value synthesis of the request abnormal rate, the overtime rate and the response duration of the normal request in the T time. The lower the abnormal rate, the lower the overtime rate and the shorter the average value of the response duration, the better the service state is, and the higher the set weight is;
and S6, the online prediction service pushes the result to a load balancer, and the load balancer sets weight distribution flow according to the pushed result.
Specifically, the method realizes the self-adaptive traffic load balancing by combining the indexes of multiple dimensions reflecting the real situation of the back-end instance and online learning, and can distribute the traffic to healthy application nodes more quickly and accurately.
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that changes may be made in the embodiments and/or equivalents thereof without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (1)

1. An online learning adaptive flow load balancing method is characterized by comprising the following two steps: model training and online prediction, wherein the model trained by the model is used for online prediction, the online prediction service predicts a label of a rear-end application example through online collected characteristic data, and a prediction result is used as a load balancing weight value by a load balancer, so that online learning self-adaptive load balancing is realized, and the method specifically comprises the following steps:
the specific steps of model training are as follows:
s1, collecting model training characteristics,
the method comprises the following steps of simulating the state under a real condition by adjusting the concurrency number of a client-initiated request and the resource state of a back-end application instance server, and obtaining the following 4-dimensional data as a model training characteristic:
1) Each resource utilization rate data of the back-end application example in the time T comprises a CPU, a memory and a disk;
2) Requesting to return the distribution of the state codes within T time;
3) Average request response time within T time;
4) The health detection passing frequency of the back-end application example within T time;
s2, label definition:
the single request execution results are divided into three cases:
1) Whether the request is executed successfully;
2) Request successfully returns, but response time times out;
3) The request returns successfully and the response time is normal;
to sum up, for a single backend application instance, three values are counted: defining a request anomaly rate (E) within T time, a timeout rate (L) within T time and a corresponding average duration (A) ms of normal requests within T time, label = w 1 *E+w 2 *L+w 3 * A, wherein w 1 、w 2 、w 3 Is a weighted value;
s3, training an original model off line, training a regression model by using a LightGBM (LightGradientBoosting machine) through model training characteristics and sample labels confirmed by S1 and S2, wherein the LightGBM (LightGradientBoosting machine) is a framework for realizing a GBDT algorithm, and performing iterative training by using a weak classifier to obtain an optimal model;
the online prediction comprises the following specific steps:
s4, obtaining model training characteristics by an online model service;
s5, predicting the strength of service processing capacity in the following T time by utilizing model training characteristics, and evaluating the state of the server by using the average value of the request abnormal rate, the overtime rate and the response duration of the normal request in the T time in a comprehensive mode, wherein the lower the abnormal rate, the lower the overtime rate and the shorter the average value of the response duration, the better the service state is, and the higher the set weight is;
and S6, the online prediction service pushes the result to a load balancer, and the load balancer sets weight distribution flow according to the pushed result.
CN202011394360.6A 2020-12-02 2020-12-02 Adaptive traffic load balancing method for online learning Active CN112565399B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011394360.6A CN112565399B (en) 2020-12-02 2020-12-02 Adaptive traffic load balancing method for online learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011394360.6A CN112565399B (en) 2020-12-02 2020-12-02 Adaptive traffic load balancing method for online learning

Publications (2)

Publication Number Publication Date
CN112565399A CN112565399A (en) 2021-03-26
CN112565399B true CN112565399B (en) 2022-12-09

Family

ID=75047230

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011394360.6A Active CN112565399B (en) 2020-12-02 2020-12-02 Adaptive traffic load balancing method for online learning

Country Status (1)

Country Link
CN (1) CN112565399B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8949658B1 (en) * 2012-03-02 2015-02-03 Amazon Technologies, Inc. Load balancer host selection and fault detection
WO2018076791A1 (en) * 2016-10-31 2018-05-03 华为技术有限公司 Resource load balancing control method and cluster scheduler
CN111355606A (en) * 2020-02-10 2020-06-30 天津大学 Web application-oriented container cluster self-adaptive expansion and contraction system and method
CN111930511A (en) * 2020-08-24 2020-11-13 北京工业大学 Identifier resolution node load balancing device based on machine learning

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11044199B2 (en) * 2018-06-08 2021-06-22 Cisco Technology, Inc. Inferring device load and availability in a network by observing weak signal network based metrics

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8949658B1 (en) * 2012-03-02 2015-02-03 Amazon Technologies, Inc. Load balancer host selection and fault detection
WO2018076791A1 (en) * 2016-10-31 2018-05-03 华为技术有限公司 Resource load balancing control method and cluster scheduler
CN111355606A (en) * 2020-02-10 2020-06-30 天津大学 Web application-oriented container cluster self-adaptive expansion and contraction system and method
CN111930511A (en) * 2020-08-24 2020-11-13 北京工业大学 Identifier resolution node load balancing device based on machine learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于预测模型及独立训练节点的负载均衡策略;陈大才等;《计算机系统应用》;20180930;第220-223页 *

Also Published As

Publication number Publication date
CN112565399A (en) 2021-03-26

Similar Documents

Publication Publication Date Title
CN112685170B (en) Dynamic optimization of backup strategies
Peng et al. Intelligent computation offloading and resource allocation in IIoT with end-edge-cloud computing using NSGA-III
CN107196869A (en) The adaptive load balancing method, apparatus and system of Intrusion Detection based on host actual loading
CN103412875B (en) CDN strategy adjusting method based on AHP decision model
CN104756467A (en) CDN traffic management in the cloud
CN111930511A (en) Identifier resolution node load balancing device based on machine learning
Nguyen et al. Scaling upf instances in 5g/6g core with deep reinforcement learning
Yang et al. Reducing idleness in financial cloud via multi-objective evolutionary reinforcement learning based load balancer
Shukla et al. Fault tolerance based load balancing approach for web resources in cloud environment.
CN107193632A (en) A kind of reference test method and system for simulating securities trading
CN112565399B (en) Adaptive traffic load balancing method for online learning
CN106789853A (en) The dynamic dispatching method and device of a kind of transcoder
Khedr et al. Enhancing the e-learning system based on a novel tasks’ classification load-balancing algorithm
Yang et al. Reducing idleness in financial cloud services via multi-objective evolutionary reinforcement learning based load balancer
US20170103341A1 (en) Continual learning in slowly-varying environments
Rajan Service request scheduling based on quantification principle using conjoint analysis and Z-score in cloud
CN103795788B (en) The load-balancing method and load-balanced server of instant communication server
CN106210120B (en) A kind of recommended method and its device of server
Li et al. Mining frequent patterns from dynamic data streams with data load management
CN114327925A (en) Power data real-time calculation scheduling optimization method and system
Park et al. Reinforcement Learning based load balancing in a distributed heterogeneous storage system
Liu et al. Towards dynamic reconfiguration of composite services via failure estimation of general and domain quality of services
Unuvar et al. A predictive method for identifying optimum cloud availability zones
Alyatama et al. Continuous memory allocation model for cloud services
CN114945024B (en) Method for balancing and optimizing server load based on long-term and short-term memory network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant