CN114461538A - Cloud computing application memory management method based on real-time content prediction and historical resource occupation


Info

Publication number
CN114461538A
Authority
CN
China
Prior art keywords
memory
time
application program
occupation
cloud computing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210089562.2A
Other languages
Chinese (zh)
Inventor
刘东海
徐育毅
庞辉富
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Youyun Software Co ltd
Beijing Guangtong Youyun Technology Co ltd
Original Assignee
Hangzhou Youyun Software Co ltd
Beijing Guangtong Youyun Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Youyun Software Co ltd, Beijing Guangtong Youyun Technology Co ltd filed Critical Hangzhou Youyun Software Co ltd
Priority to CN202210089562.2A priority Critical patent/CN114461538A/en
Publication of CN114461538A publication Critical patent/CN114461538A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/0223User address space allocation, e.g. contiguous or non contiguous base addressing
    • G06F12/023Free address space management
    • G06F12/0253Garbage collection, i.e. reclamation of unreferenced memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to the field of cloud computing, and in particular to a cloud computing application memory management method based on real-time content prediction and historical resource occupation. The beneficial effects of the invention are as follows: by combining real-time memory occupation prediction with statistical records of historical resource usage, the invention provides an integrated application memory scaling scheme that improves the operating efficiency of applications in a cloud computing environment while avoiding memory errors.

Description

Cloud computing application memory management method based on real-time content prediction and historical resource occupation
Technical Field
The invention relates to the field of cloud computing, in particular to a cloud computing application memory management method based on real-time content prediction and historical resource occupation.
Background
With the maturing of big data and cloud computing technologies, enterprises' demand for high-performance computing for big data processing keeps increasing. A wide variety of applications are deployed on the cloud to enable efficient parallel computing and on-demand allocation of computational resources. However, in the face of unbounded growth in data volume, application count, and computational task complexity, physical resources are ultimately limited.
In a cloud computing environment, memory management for applications has long been a difficult problem in the industry. Although the emergence of many big data computing platforms has solved the technical challenges of parallel computing, memory errors still frequently occur when processing large, long-running application computing tasks, wasting both the developer's time and the computing resources already invested. The simplest way to address such problems is to add physical resources to the cloud computing environment, but doing so is not only costly but also introduces unnecessary redundancy. Schemes also exist that schedule resources by predicting the memory occupation of an application, but such methods are not sufficiently real-time and do not effectively balance application performance against memory overhead.
At present, research on application memory management schemes for cloud computing platforms is limited. Invention CN113296880A provides a container-based application management method: by configuring a serverless computing system implemented on top of containers, two states are defined for an application, online and low-power-consumption, and an application instance is switched between the two states to meet scale-in and scale-out requirements. The method defines the online and low-power-consumption states and the switching steps in detail, but it does not provide the switching conditions and cannot achieve automatic real-time memory management, so it is not suitable for an enterprise cloud computing platform with massive application data, limited physical resources, and complex computing tasks.
Disclosure of Invention
The invention aims to overcome the above deficiencies in the prior art and provides a cloud computing application memory management method based on real-time content prediction and historical resource occupation, combining real-time memory occupation prediction with statistical records of historical resource usage.
The object of the invention is achieved by the following technical solution. Aiming at the memory management problem of applications in a cloud environment, the invention predicts the memory occupation at the next moment in real time from the memory occupation time-series data of a fixed time window of the application, combines this with the statistical record of resource usage over the application's full life cycle, and realizes an integrated automatic memory reclamation method for cloud computing applications through a reinforcement learning model. The cloud computing application memory management method based on real-time content prediction and historical resource occupation comprises the following steps:
(1) For a given application $a_i$ in the cloud computing environment, checkpoints are set at a fixed time interval T and the memory occupation of application $a_i$ at the past n checkpoints is recorded; the memory occupation record of application $a_i$ at the t-th checkpoint, i.e. the memory occupation record at time t, denoted $M_i^t$, is:
$M_i^t = (m_i^{t-n+1}, m_i^{t-n+2}, \ldots, m_i^t)$
where $m_i^k$ denotes the memory occupied by application $a_i$ at checkpoint k.
(2) Based on a long short-term memory network (LSTM), the memory occupation record $M_i^t$ of application $a_i$ at time t is input, and the predicted memory occupation $\hat{m}_i^{t+1}$ of application $a_i$ at time t+1 is output:
$\hat{m}_i^{t+1} = \mathrm{LSTM}(M_i^t)$
In each subsequent iteration step, at a given time t, the memory occupation of application $a_i$ at the next moment t+1 is predicted from the memory occupation records of the previous n checkpoints; the prediction serves as the first part of the state evaluation of application $a_i$ in the cloud computing environment at time t, namely:
$S_{i,1}^t = \hat{m}_i^{t+1}$
(3) A log manager is set up to record the resource usage of every application over its full life cycle, specifically: maximum memory occupation max_memory, minimum memory occupation min_memory, maximum CPU occupation max_cpu, minimum CPU occupation min_cpu, number of invocations call_count, total running time total_time, average running time average_time, maximum running time max_time, minimum running time min_time, average invocation interval average_interval, maximum invocation interval max_interval, and minimum invocation interval min_interval. Up to time t, the history record of application $a_i$, denoted $H_i^t$, is:
$H_i^t = (\text{max\_memory}, \text{min\_memory}, \text{max\_cpu}, \text{min\_cpu}, \text{call\_count}, \text{total\_time}, \text{average\_time}, \text{max\_time}, \text{min\_time}, \text{average\_interval}, \text{max\_interval}, \text{min\_interval})$
This vector serves as the second part of the state evaluation of the cloud computing environment, namely:
$S_{i,2}^t = H_i^t$
Therefore, the state evaluation of application $a_i$ in the cloud computing environment at time t is obtained as:
$S_i^t = (S_{i,1}^t, S_{i,2}^t) = (\hat{m}_i^{t+1}, H_i^t)$
(4) Based on the reinforcement learning model DQN (deep Q-network), the input of the DQN model is the state $S_i^t$, and memory management is performed on the given application in the cloud environment.
The invention has the following beneficial effects: by combining real-time memory occupation prediction with statistical records of historical resource usage, the invention provides an integrated application memory scaling scheme and improves the operating efficiency of applications in a cloud computing environment while avoiding memory errors.
Drawings
FIG. 1 is a schematic flow chart of the present invention.
Detailed Description
The invention will be described in detail below with reference to the accompanying drawings.
As shown in FIG. 1, the present invention provides a cloud computing application memory management method based on real-time content prediction and historical resource occupation, which comprises the following steps:
a) For a given application $a_i$ in the cloud computing environment, set checkpoints at a fixed time interval T and record the memory occupation of application $a_i$ at the past n checkpoints. The memory occupation record of application $a_i$ at the t-th checkpoint (hereinafter referred to as time t), denoted $M_i^t$, is:
$M_i^t = (m_i^{t-n+1}, m_i^{t-n+2}, \ldots, m_i^t)$
where $m_i^k$ denotes the memory occupied by application $a_i$ at checkpoint k.
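As an illustration only, the following Python sketch shows one way the fixed-interval checkpointing and the n-point sliding window of memory readings could be maintained; the sampling function (here based on psutil), the interval, and the window length are hypothetical choices not specified by the patent.

```python
import collections
import time

import psutil  # assumed here as one way to sample process memory


def sample_memory_mb(pid: int) -> float:
    """Return the resident memory (MB) of the process with the given pid."""
    return psutil.Process(pid).memory_info().rss / (1024 * 1024)


class CheckpointRecorder:
    """Keeps the last n memory readings of one application (the vector M_i^t)."""

    def __init__(self, pid: int, n: int = 10, interval_s: float = 60.0):
        self.pid = pid
        self.interval_s = interval_s               # fixed checkpoint interval T
        self.window = collections.deque(maxlen=n)  # sliding window of n checkpoints

    def checkpoint(self) -> list:
        """Take one checkpoint and return the current memory occupation record."""
        self.window.append(sample_memory_mb(self.pid))
        return list(self.window)                   # M_i^t = (m_{t-n+1}, ..., m_t)


# Usage sketch: take one checkpoint every interval_s seconds.
# recorder = CheckpointRecorder(pid=1234, n=10, interval_s=60.0)
# while True:
#     record = recorder.checkpoint()
#     time.sleep(recorder.interval_s)
```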
b) Based on a long short-term memory network (LSTM, a well-known algorithm), input the memory occupation record $M_i^t$ of application $a_i$ at time t and output the predicted memory occupation of application $a_i$ at time t+1:
$\hat{m}_i^{t+1} = \mathrm{LSTM}(M_i^t)$
In each subsequent iteration step, at a given time t, the memory occupation of application $a_i$ at the next moment t+1 is predicted from the memory occupation records of the previous n checkpoints. The prediction serves as the first part of the state evaluation of application $a_i$ in the cloud computing environment at time t, namely:
$S_{i,1}^t = \hat{m}_i^{t+1}$
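A minimal PyTorch sketch of such a one-step-ahead LSTM predictor is given below; the layer sizes, loss, and training loop are illustrative assumptions, since the patent only specifies that an LSTM maps the n-point window $M_i^t$ to the prediction $\hat{m}_i^{t+1}$.

```python
import torch
import torch.nn as nn


class MemoryLSTM(nn.Module):
    """Predicts the next-checkpoint memory occupation from the last n readings."""

    def __init__(self, hidden_size: int = 32):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, 1)

    def forward(self, window: torch.Tensor) -> torch.Tensor:
        # window: (batch, n, 1) -- the memory occupation record M_i^t
        out, _ = self.lstm(window)
        return self.head(out[:, -1, :])   # prediction m_hat_i^{t+1}: (batch, 1)


# Illustrative training step on past (window, next-value) pairs.
model = MemoryLSTM()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()


def train_step(windows: torch.Tensor, targets: torch.Tensor) -> float:
    """windows: (batch, n, 1), targets: (batch, 1)."""
    optimizer.zero_grad()
    loss = loss_fn(model(windows), targets)
    loss.backward()
    optimizer.step()
    return loss.item()
```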
c) Set up a log manager to record the resource usage of every application over its full life cycle, specifically: maximum memory occupation max_memory, minimum memory occupation min_memory, maximum CPU occupation max_cpu, minimum CPU occupation min_cpu, number of invocations call_count, total running time total_time, average running time average_time, maximum running time max_time, minimum running time min_time, average invocation interval average_interval, maximum invocation interval max_interval, and minimum invocation interval min_interval. Up to time t, the history record of application $a_i$ is:
$H_i^t = (\text{max\_memory}, \text{min\_memory}, \text{max\_cpu}, \text{min\_cpu}, \text{call\_count}, \text{total\_time}, \text{average\_time}, \text{max\_time}, \text{min\_time}, \text{average\_interval}, \text{max\_interval}, \text{min\_interval})$
This vector serves as the second part of the state evaluation of the cloud computing environment, namely:
$S_{i,2}^t = H_i^t$
The state evaluation of application $a_i$ in the cloud computing environment at time t is thus obtained as:
$S_i^t = (S_{i,1}^t, S_{i,2}^t) = (\hat{m}_i^{t+1}, H_i^t)$
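A sketch of how the log manager's per-application history and the combined state vector $S_i^t$ could be represented follows; the field names match the patent, while the dataclass layout and the NumPy concatenation are assumptions made only for illustration.

```python
from dataclasses import dataclass, astuple

import numpy as np


@dataclass
class AppHistory:
    """Full-life-cycle resource usage record H_i^t kept by the log manager."""
    max_memory: float
    min_memory: float
    max_cpu: float
    min_cpu: float
    call_count: int
    total_time: float
    average_time: float
    max_time: float
    min_time: float
    average_interval: float
    max_interval: float
    min_interval: float


def build_state(memory_prediction: float, history: AppHistory) -> np.ndarray:
    """Concatenate the t+1 memory prediction with the history into S_i^t."""
    return np.concatenate(([memory_prediction],
                           np.asarray(astuple(history), dtype=np.float64)))
```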
d) Perform memory management on the given application in the cloud environment based on the reinforcement learning model DQN (a known method). The elements that must be specified for the DQN are the state space, the action space, and the reward; the continuous state space has already been defined in detail in steps a), b), and c).
We define the memory change action space of an application as follows (a sketch of the corresponding action mapping is given after this list):
1. Unchanged: the current memory is left untouched;
2. 0.8: the memory occupation is reduced to 0.8 times its current value;
3. 0.6: the memory occupation is reduced to 0.6 times its current value;
4. 0.4: the memory occupation is reduced to 0.4 times its current value;
5. 0.2: the memory occupation is reduced to 0.2 times its current value;
6. Kill: the current application is terminated and all of its memory is reclaimed.
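For illustration, the six discrete actions could be encoded as memory-limit multipliers as in the sketch below; the apply_action helper and the way a memory limit is actually enforced (e.g. through a container runtime) are assumptions not detailed in the patent.

```python
from enum import IntEnum


class MemoryAction(IntEnum):
    """The 6-action space: keep, shrink to 0.8/0.6/0.4/0.2 of current, or kill."""
    UNCHANGED = 0
    SHRINK_80 = 1
    SHRINK_60 = 2
    SHRINK_40 = 3
    SHRINK_20 = 4
    KILL = 5


SHRINK_FACTOR = {
    MemoryAction.UNCHANGED: 1.0,
    MemoryAction.SHRINK_80: 0.8,
    MemoryAction.SHRINK_60: 0.6,
    MemoryAction.SHRINK_40: 0.4,
    MemoryAction.SHRINK_20: 0.2,
}


def apply_action(current_limit_mb: float, action: MemoryAction) -> float:
    """Return the new memory limit; 0 means the application was killed."""
    if action is MemoryAction.KILL:
        # Hypothetical: terminate the application and reclaim all of its memory.
        return 0.0
    return current_limit_mb * SHRINK_FACTOR[action]
```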
Definition of the reward function:
The purpose of the scheme is to reduce the memory overhead of an application as much as possible while guaranteeing its normal and efficient operation, so the reward function for application memory management consists of the following terms (an illustrative implementation follows):
1. The memory resource occupation M, as negative feedback: the less memory is occupied, the higher the reward;
2. The running time T of the application, as negative feedback: the shorter the running time, the higher the reward.
$r = -\left(\alpha \cdot \frac{M - M_{\min}}{M_{\max} - M_{\min}} + \beta \cdot \frac{T - T_{\min}}{T_{\max} - T_{\min}}\right)$
Here the reward is normalized using the maximum and minimum memory occupation and running time in the history records of the different applications, and $\alpha$ and $\beta$ are the weights of the two reward terms.
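A minimal sketch of this reward, with the min-max normalization drawn from the log manager's history and the weights α and β left as free hyperparameters, might look as follows; the exact normalization expression is a reconstruction, since the original formula image is not reproduced here.

```python
def normalize(value: float, lo: float, hi: float) -> float:
    """Min-max normalize value into [0, 1]; a degenerate range maps to 0."""
    return 0.0 if hi <= lo else (value - lo) / (hi - lo)


def reward(memory_used: float, run_time: float,
           mem_min: float, mem_max: float,
           time_min: float, time_max: float,
           alpha: float = 0.5, beta: float = 0.5) -> float:
    """Negative feedback on normalized memory occupation and running time."""
    m_norm = normalize(memory_used, mem_min, mem_max)
    t_norm = normalize(run_time, time_min, time_max)
    return -(alpha * m_norm + beta * t_norm)
```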
The input of the DQN model is the state $S_i^t$, i.e. the vector formed by concatenating the predicted memory occupation of the application at time t+1 with the full-life-cycle history record of the application up to time t. The output is a 6-dimensional vector whose components are the Q values of the 6 actions. Actions are selected with the ε-greedy algorithm: with a small probability ε a random action is selected, and with probability 1−ε the action corresponding to the maximum Q value output by the DQN is selected. The DQN network is optimized by experience replay over the data collected in the experience pool. In this way, a cloud computing application memory reclamation scheme based on real-time memory prediction and history records is realized. (A sketch of the Q-network and the ε-greedy selection is given below.)
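The sketch below shows one possible shape of such a DQN agent in PyTorch: a small fully connected Q-network over a 13-dimensional state (1 memory prediction + 12 history fields), ε-greedy action selection, and a plain experience-replay update. The network width, replay layout, discount factor, and ε value are illustrative assumptions only.

```python
import collections
import random

import torch
import torch.nn as nn

STATE_DIM = 13   # 1 memory prediction + 12 history fields (assumed layout)
N_ACTIONS = 6    # unchanged, 0.8, 0.6, 0.4, 0.2, kill


class QNetwork(nn.Module):
    """Maps the state S_i^t to Q values for the 6 memory actions."""

    def __init__(self, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(STATE_DIM, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, N_ACTIONS),
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)


def select_action(q_net: QNetwork, state: torch.Tensor, epsilon: float = 0.1) -> int:
    """Epsilon-greedy: random action with probability epsilon, else argmax Q."""
    if random.random() < epsilon:
        return random.randrange(N_ACTIONS)
    with torch.no_grad():
        return int(q_net(state.unsqueeze(0)).argmax(dim=1).item())


ReplayBuffer = collections.deque  # of (state, action, reward, next_state) tensors


def train_on_batch(q_net: QNetwork, target_net: QNetwork,
                   optimizer: torch.optim.Optimizer,
                   batch, gamma: float = 0.99) -> float:
    """One experience-replay update; batch is a list of tensor 4-tuples."""
    states, actions, rewards, next_states = map(torch.stack, zip(*batch))
    q_pred = q_net(states).gather(1, actions.long().unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        q_target = rewards + gamma * target_net(next_states).max(dim=1).values
    loss = nn.functional.mse_loss(q_pred, q_target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```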
The invention has the following characteristics: memory prediction is performed through the LSTM, but prediction itself is not the end point of the task; the prediction result is used as an intermediate result that ultimately serves the memory scaling action, and the scaling action is carried out through the DQN.
It should be understood that equivalent substitutions and modifications made by those skilled in the art to the technical solution and the inventive concept of the present invention shall all fall within the protection scope of the appended claims.

Claims (2)

1. A cloud computing application memory management method based on real-time content prediction and historical resource occupation, characterized by comprising the following steps:
(1) for a given application $a_i$ in the cloud computing environment, setting checkpoints at a fixed time interval T and recording the memory occupation of application $a_i$ at the past n checkpoints, the memory occupation record of application $a_i$ at the t-th checkpoint, i.e. at time t, being:
$M_i^t = (m_i^{t-n+1}, m_i^{t-n+2}, \ldots, m_i^t)$
(2) based on a long short-term memory network LSTM, inputting the memory occupation record $M_i^t$ of application $a_i$ at time t and outputting the predicted memory occupation of application $a_i$ at time t+1:
$\hat{m}_i^{t+1} = \mathrm{LSTM}(M_i^t)$
in each subsequent iteration step, at a given time t, predicting the memory occupation of application $a_i$ at the next moment t+1 from the memory occupation records of the previous n checkpoints, the prediction $\hat{m}_i^{t+1}$ serving as the first part of the state evaluation of application $a_i$ in the cloud computing environment at time t, namely:
$S_{i,1}^t = \hat{m}_i^{t+1}$
(3) setting a log manager to record the resource usage of every application over its full life cycle, specifically: maximum memory occupation max_memory, minimum memory occupation min_memory, maximum CPU occupation max_cpu, minimum CPU occupation min_cpu, number of invocations call_count, total running time total_time, average running time average_time, maximum running time max_time, minimum running time min_time, average invocation interval average_interval, maximum invocation interval max_interval, and minimum invocation interval min_interval, the history record of application $a_i$ up to time t being:
$H_i^t = (\text{max\_memory}, \text{min\_memory}, \text{max\_cpu}, \text{min\_cpu}, \text{call\_count}, \text{total\_time}, \text{average\_time}, \text{max\_time}, \text{min\_time}, \text{average\_interval}, \text{max\_interval}, \text{min\_interval})$
$H_i^t$ serving as the second part of the state evaluation of the cloud computing environment, namely:
$S_{i,2}^t = H_i^t$
whereby the state evaluation of application $a_i$ in the cloud computing environment at time t is obtained as:
$S_i^t = (S_{i,1}^t, S_{i,2}^t) = (\hat{m}_i^{t+1}, H_i^t)$
(4) based on the reinforcement learning model DQN, the input of the DQN model being the state $S_i^t$, performing memory management on the given application in the cloud environment.
2. The cloud computing application memory management method based on real-time content prediction and historical resource occupation according to claim 1, characterized in that, in the reinforcement learning model DQN:
the memory change action space of an application is defined as follows:
(1) Unchanged: the current memory is left untouched;
(2) 0.8: the memory occupation is reduced to 0.8 times its current value;
(3) 0.6: the memory occupation is reduced to 0.6 times its current value;
(4) 0.4: the memory occupation is reduced to 0.4 times its current value;
(5) 0.2: the memory occupation is reduced to 0.2 times its current value;
(6) Kill: the current application is terminated and all of its memory is reclaimed;
the reward function r is defined as:
$r = -\left(\alpha \cdot \frac{M - M_{\min}}{M_{\max} - M_{\min}} + \beta \cdot \frac{T - T_{\min}}{T_{\max} - T_{\min}}\right)$
where M is the memory resource occupation and T is the running time of the application, the reward is normalized according to the maximum and minimum memory occupation and running time in the history records of the different applications, and $\alpha$ and $\beta$ are the weights of the two reward terms.
CN202210089562.2A 2022-01-26 2022-01-26 Cloud computing application memory management method based on real-time content prediction and historical resource occupation Pending CN114461538A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210089562.2A CN114461538A (en) 2022-01-26 2022-01-26 Cloud computing application memory management method based on real-time content prediction and historical resource occupation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210089562.2A CN114461538A (en) 2022-01-26 2022-01-26 Cloud computing application memory management method based on real-time content prediction and historical resource occupation

Publications (1)

Publication Number Publication Date
CN114461538A true CN114461538A (en) 2022-05-10

Family

ID=81411022

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210089562.2A Pending CN114461538A (en) 2022-01-26 2022-01-26 Cloud computing application memory management method based on real-time content prediction and historical resource occupation

Country Status (1)

Country Link
CN (1) CN114461538A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024114484A1 (en) * 2022-12-02 2024-06-06 中国科学院深圳先进技术研究院 Serverless computing adaptive resource scheduling method and system and computer device
CN116719648A (en) * 2023-08-10 2023-09-08 泰山学院 Data management method and system for computer system
CN116719648B (en) * 2023-08-10 2023-11-07 泰山学院 Data management method and system for computer system

Similar Documents

Publication Publication Date Title
CN111309479B (en) Method, device, equipment and medium for realizing task parallel processing
CN114461538A (en) Cloud computing application memory management method based on real-time content prediction and historical resource occupation
CN110531996B (en) Particle swarm optimization-based computing task unloading method in multi-micro cloud environment
US11928599B2 (en) Method and device for model compression of neural network
CN108764458B (en) Method and system for reducing storage space consumption and calculation amount of mobile equipment
CN111274111B (en) Prediction and anti-aging method for microservice aging
CN113806094A (en) Cloud platform resource dynamic scheduling method based on deep learning
CN110728372B (en) Cluster design method and cluster system for dynamic loading of artificial intelligent model
CN113342504A (en) Intelligent manufacturing edge calculation task scheduling method and system based on cache
Loukopoulos et al. A pareto-efficient algorithm for data stream processing at network edges
US20220343146A1 (en) Method and system for temporal graph neural network acceleration
Zamirai et al. Sieve: Speculative inference on the edge with versatile exportation
CN114301911A (en) Task management method and system based on edge-edge cooperation
CN113344073A (en) Daily load curve clustering method and system based on fusion evolution algorithm
CN111427687A (en) Artificial intelligence cloud platform
WO2021262139A1 (en) Distributed machine learning models
CN110969259A (en) Processing core with data-dependent adaptive rounding
CN110647381A (en) Virtual machine resource balancing and deployment optimizing method
Wu et al. An estimation of distribution algorithm to optimize the utility of task scheduling under fog computing systems
CN117787444B (en) Intelligent algorithm rapid integration method and device for cluster countermeasure scene
CN112148464B (en) Method and system for unloading mobile edge computing task
CN116414556B (en) Heterogeneous embedded equipment power distribution system and method based on redundant calculation force
CN116842994B (en) Dynamic optimization method and system for execution efficiency of multiple neural networks
CN117319373A (en) Data transmission method, device, electronic equipment and computer readable storage medium
CN118018610A (en) Cloud edge cooperative resource scheduling method, system, electronic equipment and readable medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination