WO2021239213A1 - A method for operating a network and a network - Google Patents

A method for operating a network and a network Download PDF

Info

Publication number
WO2021239213A1
WO2021239213A1 PCT/EP2020/064465 EP2020064465W WO2021239213A1 WO 2021239213 A1 WO2021239213 A1 WO 2021239213A1 EP 2020064465 W EP2020064465 W EP 2020064465W WO 2021239213 A1 WO2021239213 A1 WO 2021239213A1
Authority
WO
WIPO (PCT)
Prior art keywords
traffic
user
prediction
network
network traffic
Prior art date
Application number
PCT/EP2020/064465
Other languages
French (fr)
Inventor
Giuseppe SIRACUSANO
Davide SANVITO
Roberto BIFULCO
Roberto GONZALEZ SANCHEZ
Original Assignee
NEC Laboratories Europe GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Laboratories Europe GmbH filed Critical NEC Laboratories Europe GmbH
Priority to US17/909,403 priority Critical patent/US11870863B2/en
Priority to PCT/EP2020/064465 priority patent/WO2021239213A1/en
Publication of WO2021239213A1 publication Critical patent/WO2021239213A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/535Tracking the activity of the user
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/24Traffic characterised by specific attributes, e.g. priority or QoS
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/16Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks using machine learning or artificial intelligence
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/50Network service management, e.g. ensuring proper service fulfilment according to agreements
    • H04L41/5003Managing SLA; Interaction between SLA and QoS
    • H04L41/5019Ensuring fulfilment of SLA
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/06Generation of reports
    • H04L43/062Generation of reports related to network traffic
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0876Network utilisation, e.g. volume of load or congestion level
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/14Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic
    • H04L63/1408Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic by monitoring network traffic
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks

Definitions

  • the present invention relates to a method for operating a network, wherein user network traffic is controlled by an operator.
  • the present invention relates to a corresponding network, wherein user network traffic is controlled by an operator.
  • controlling comprises pacing, throttling and/or shaping of user network traffic.
  • network operators may decide to apply several techniques such as pacing, throttling, etc., on a per-user basis, however, applying those techniques requires investing additional infrastructure resources. This introduces scalability challenges, which make more expensive network operations, e.g., because of the need to add additional hardware to process traffic from all users. Moreover, it is usually difficult to predict the transition of a user from light contributor of network traffic to heavy contributor, i.e., when the user starts generating/receiving large amounts of data. Thus, today operators can only deal with the problem using two approaches: 1) performing continuous traffic shaping for all the users, accepting higher infrastructure and operating costs; 2) reactively shaping traffic from users after a detection of a traffic load increase, which may negatively affect other users’ performance.
  • CN 108062561 shows a short term data stream prediction method based on long short term memory, LSTM, network model.
  • the aforementioned object is accomplished by a method for operating a network, wherein user network traffic is controlled by an operator, comprising the following steps: a) monitoring user network traffic data on a per user basis, b) using said network traffic data in a learning process for providing a prediction of user network traffic on a per user basis, and c) controlling user network traffic under consideration of said prediction, including allocating network resources under consideration of said prediction to one or more users, preferably for providing a definable Quality of Service, QoS, per at least one of said one or more users and/or per at least one other user.
  • QoS Quality of Service
  • a network wherein user network traffic is controlled by an operator, comprising: a) monitoring means which monitor user network traffic data on a per user basis, b) using means or computing means which use said network traffic data in a learning process for providing a prediction of user network traffic on a per user basis, and c) controlling means which control user network traffic under consideration of said prediction, including allocating means which allocate network resources under consideration of said prediction to one or more users, preferably for providing a definable Quality of Service, QoS, per at least one of said one or more users and/or per at least one other user.
  • QoS Quality of Service
  • a first step user network traffic data is monitored on a per user basis. Then, said network traffic data is used in a learning process for providing a prediction of user network traffic on a per user basis. This learning process results in a prediction which is used in controlling future user network traffic.
  • This controlling includes allocating network resources under consideration of said prediction to one or more users, if necessary, i.e. if a prior or a current allocation is not sufficient for providing an efficient use of the network resources.
  • the step of allocating network resources comprises maintaining or keeping pending allocations, if under consideration of said prediction an amendment of pending allocations is not necessary.
  • This prediction allows an allocation of network resources for providing an efficient use of the network resources under consideration of the prediction of future traffic load per each user.
  • a high amount of resources can be assigned to a user expected to need such a high amount of resources.
  • a small amount of resources can be assigned to a user expected to need only such a small amount of resources.
  • a definable QoS per at least one of said one or more users and/or per at least one other user can be provided.
  • said monitoring step comprises monitoring of network traffic data of current and/or historic network traffic, said network traffic data preferably comprising network flows information about several network entities, preferably hosts, and/or for several days of traffic monitoring.
  • Such data can contain network flow information about several network entities and for several days of traffic monitoring.
  • the learning process comprises machine learning techniques or techniques used in the field of Artificial Intelligence.
  • the prediction of user network traffic comprises short term user network traffic or short-term user network traffic behavior on a per user basis. Short-term user network traffic or short-term user network traffic behavior provides an efficient and seamless operation of the network.
  • the prediction of user network traffic comprises the prediction of user network traffic generated in at least one defined future time interval or in at least one future time window, preferably on a fine-grained timescale.
  • the prediction is provided for the following time window or windows.
  • the learning process comprises a learning phase and/or a classification phase, preferably within an offline pre-processing and/or training phase.
  • the learning process can - alternatively or additionally - be performed within an online phase.
  • DASM Data Analysis and Split Module
  • at least one parameter is possible to define the timeframe of the traffic prediction, configure the desired time windows splitting method or methods and specify the labels that will be assigned to the traffic time windows.
  • a traffic session is split into time windows size based, wherein the sequence of hosts visited by a user is divided into time windows of the same size, or temporal based, wherein each time window contains the end host or end hosts visited in a given interval of time.
  • LM Learning Module
  • LM is used for training a prediction model for predicting a user network traffic load that a user will generate in at least one defined future time interval or in at least one future time window, preferably in an immediate future.
  • the LM takes traffic sessions split into time windows and a corresponding label for each time window as inputs, preferably from the DASM, and/or outputs the prediction model, preferably as a trained Neural Network, NN, model or as a set of rules derived by a trained NN model.
  • the prediction model preferably as a trained Neural Network, NN, model or as a set of rules derived by a trained NN model.
  • an online prediction module, OPM for online predictions is used.
  • the OPM takes a prediction model, preferably the prediction model from the LM, live user network traffic, preferably for feeding the prediction model, and traffic window collection parameters, preferably the traffic window collection parameters from the DASM, as inputs and outputs an user network traffic load of a next time window.
  • said controlling step comprises allocating resources in or by a traffic processing module or the above OPM, wherein this traffic processing module or OPM can be designed to perform traffic controlling, preferably shaping or pacing, only for flows that are assigned label corresponding to a large amount of traffic.
  • a method and network or system is provided to predict the users that will generate a large amount of network traffic in a network or network system.
  • These embodiments provide the ability to perform the prediction on the user’s following traffic window, e.g., the next considered time interval or time window, given the navigation pattern of the user during the current traffic window, i.e. , the currently observed time interval or time window.
  • decisions on the scheduling of traffic shaping resources need to take into account what will be the traffic consumption, e.g., in the next few seconds, while prior art load prediction techniques offer only an average prediction of what will happen in e.g., hours or days.
  • Embodiments of the invention are instead tailored to provide predictions of the user’s traffic load on a fine-grained timescale, i.e. from 10s of milliseconds to 10s of seconds, which is compatible with the requirements of a traffic shaping scheduler.
  • Embodiments of the invention allow operators to predict the traffic generated by a user in a given period or to predict the load generated by an user in the next traffic window or time window using as input only the sequence of visited end hosts.
  • the granularity of the prediction e.g. from 10s of milliseconds to 10s of seconds, allows an operator to allocate correctly and efficiently the limited resources used to perform traffic controlling or shaping, and ensures the desired QoS for each user.
  • the network or system can learn the typical behavior of the network users from previously observed traffic and provides a prediction on the short-term behavior of such users’ network flows.
  • Embodiments of this invention use machine learning model results to allocate resources in a traffic processing module or OPM that performs traffic shaping or pacing only for the flows that are assigned a corresponding to a large amount of traffic.
  • Embodiments of this invention organize sequences of network destinations extracted from network traffic logs in per user sequences, and chunk each of such per user sequences in smaller subsequences or time windows of a configurable size. For each subsequence or time window S, a label is built that corresponds to the amount of data transferred by the subsequence or time window that temporarily follows S. The subsequences or time windows and the labels are used to train a machine learning classifier.
  • Embodiments of this invention extract from the machine learning model the set of network destinations whose presence is the most likely to make the machine learning classifier to predict a label corresponding to a large amount of traffic.
  • Embodiments of this invention comprise a method for predicting the traffic generated by a user in a given time period. Such embodiments can comprise the following steps:
  • Preprocessing and training phase i.e. learning phase:
  • Embodiments of methods and networks according to the invention allow network operators to predict the traffic load generated by network users.
  • the fine-grained scale of the prediction allows using the prediction as input of traffic shaping resources schedulers or of the OPM.
  • Fig. 1 shows within a diagram system components of a network according to an embodiment of the present invention
  • Fig. 2 shows within a diagram traffic window examples and their corresponding labels according to an embodiment of the present invention
  • Fig. 3 shows within a diagram input and outputs of a Neural Network used in an LM according to an embodiment of the present invention
  • Fig. 4 shows within a diagram implementation details of a network according to an embodiment of the present invention.
  • FIG. 1 An embodiment of the invention is composed by or comprises three different subsystems or system components as depicted in Fig. 1.
  • Fig. 1 shows said system components and their distinction depending on whether they are used during the learning or classification phases, while the Online Prediction Module, OPM, is used for online predictions.
  • OPM Online Prediction Module
  • the Data Analysis and Split Module, DASM, and the Learning Module, LM are used during the Offline pre-processing and training phase.
  • the DASM takes as inputs: 1.
  • a set of current and/or historical network traffic data It is assumed that the data is recent, and contains network flows information about several network entities and for several days of traffic monitoring. The actual amount of collected data depends on the deployment scenario and on the desired accuracy of the prediction.
  • a set of parameters used to tune the traffic prediction is possible to define the time frame of the traffic prediction, to configure the desired time windows splitting methods and to specify the labels that will be assigned to the traffic time windows, e.g. : predict the next Y seconds of traffic, using a fixed time window and labeling as high load traffic time windows that are bigger more than X byte/second or bigger than the X% of the time windows.
  • a set of traffic sessions organized by users - i.e. , a traffic session will contain traffic belonging to a single user - and split into time windows.
  • Fig. 2 shows traffic window examples and their corresponding labels based on two different approaches used to split a user traffic session into time windows: size based or temporal based.
  • size based approach the sequence of visited hosts is divided into chunks or time windows of the same size, e.g.: 10, 50, 100 contacted end hosts.
  • temporal based approach each chunk or time window will contain the end host or end hosts visited in a given interval of time.
  • Fig. 2 also shows the corresponding label, wherein for each time window in the session a label is associated. The label expresses the traffic load of the next time windows in the session.
  • the DASM computes the following operations:
  • Traffic sessions are divided into time windows.
  • Time windows are labeled according to the traffic load of the next time window.
  • the LM trains the prediction model.
  • Machine learning techniques are used to create a prediction model that, given a sequence of visited end hosts by the same user, is capable to predict the traffic load that the user will generate in the immediate future.
  • the prediction model can be directly used in or by the OPM, if such a module supports the execution of Machine Learning, ML, inference, if not, the LM can extract a set of rules from the ML model. These rules will then be used to predict the future traffic load.
  • the LM takes as inputs:
  • the LM outputs are:
  • the Prediction Model can be outputted as a trained Neural Network, NN, model or as a set of rules derived by a trained Neural Network model.
  • the LM computes the following operations:
  • the sequences of vectors in each time window and the corresponding labels are feed to a recurrent Neural Network, e.g.: RNN, LSTM, etc., for the learning phase.
  • a recurrent Neural Network e.g.: RNN, LSTM, etc.
  • the trained model is stored, the trained model at the previous step produces as output not only the prediction of the next time window load, but also a state machine associated with each input sequence.
  • the extracted set of end hosts per each time window is used to derive rules that can directly be applied to predict the load of a or the next time window, e.g. : if the time window contains in sequence Host X and Host Y but not Host Z.
  • the OPM keeps track of the end host or end hosts visited in the last time window by or per each network user, and feeds this information to the prediction model.
  • the output of the model is a list of users that will have a high traffic load in the next time window.
  • the OPM takes as inputs:
  • the OPM computes the following operations:
  • a time window is full, e.g. the sequence of visited end hosts is equal to the window size or the window time budget is elapsed, or a timer is expired the sequence of end hosts is fed to the prediction model.
  • the term “end host” or “host” can also be understood as or as a synonym of an “IP address” or other identifier of a visited network component.
  • Embodiments of the invention can be deployed or used as part of a telecom operator network traffic shaping system, the offline pre-processing and learning modules - DASM and LM - can be deployed into central datacenters where large storage and computation resources are available. While the OPM can be implemented in or co located with a traffic shaping function, where the prediction outputted by the OPM can be used to proactively allocate per user traffic shaping resources.

Abstract

For providing an efficient network use and resource allocation within the network a method for operating a network is provided, wherein user network traffic is controlled by an operator, comprising the following steps: a) monitoring user network traffic data on a per user basis, b) using said network traffic data in a learning process for providing a prediction of user network traffic on a per user basis, and c) controlling user network traffic under consideration of said prediction, including allocating network resources under consideration of said prediction to one or more users, preferably for providing a definable Quality of Service, QoS, per at least one of said one or more users and/or per at least one other user. Further, a corresponding network is claimed.

Description

A METHOD FOR OPERATING A NETWORK AND A NETWORK
The present invention relates to a method for operating a network, wherein user network traffic is controlled by an operator.
Further, the present invention relates to a corresponding network, wherein user network traffic is controlled by an operator.
Methods and networks for operating a network, wherein user network traffic is controlled by an operator, are widely known. It is emphasized that within this document controlling comprises pacing, throttling and/or shaping of user network traffic.
Within such known methods and networks operators, for example Mobile Network Operators, MNOs, and especially Mobile Virtual Network Operators, MVNOs, need to ensure the fairness among users in the usage of the network. Users using applications that generate a large amount of data in small time, i.e. video applications like YouTube or Netflix, p2p applications, data downloads, etc., may affect the experience of other users, in particular those with applications that have stricter quality of service requirements, e.g., low latency or jitter for audio conferences. This happens because existing congestion control mechanisms, such as those implemented by Transmission Control Protocol, TCP, only ensure fairness at the flow-level, without considering the flows aggregated on a per-user basis. Furthermore, TCP fairness is known to be ineffective when network flows traverse network segments with very different properties.
In order to ensure fairness among users, network operators may decide to apply several techniques such as pacing, throttling, etc., on a per-user basis, however, applying those techniques requires investing additional infrastructure resources. This introduces scalability challenges, which make more expensive network operations, e.g., because of the need to add additional hardware to process traffic from all users. Moreover, it is usually difficult to predict the transition of a user from light contributor of network traffic to heavy contributor, i.e., when the user starts generating/receiving large amounts of data. Thus, today operators can only deal with the problem using two approaches: 1) performing continuous traffic shaping for all the users, accepting higher infrastructure and operating costs; 2) reactively shaping traffic from users after a detection of a traffic load increase, which may negatively affect other users’ performance.
CN 108062561 , for example, shows a short term data stream prediction method based on long short term memory, LSTM, network model.
Further prior art traffic load prediction systems only observe seasonal patterns, e.g. weekly, daily or hourly utilization, to give coarse grained prediction of the future load, on large timescales. This kind of prediction cannot be used to allocate traffic shaping resources which instead have to be allocated in very short time. Said prior art load prediction techniques offer only an average prediction of what will happen in e.g., hours or days.
It is an object of the present invention to improve and further develop a method for operating a network and a corresponding network for providing an efficient network use and resource allocation within the network.
In accordance with the invention, the aforementioned object is accomplished by a method for operating a network, wherein user network traffic is controlled by an operator, comprising the following steps: a) monitoring user network traffic data on a per user basis, b) using said network traffic data in a learning process for providing a prediction of user network traffic on a per user basis, and c) controlling user network traffic under consideration of said prediction, including allocating network resources under consideration of said prediction to one or more users, preferably for providing a definable Quality of Service, QoS, per at least one of said one or more users and/or per at least one other user.
Further, the aforementioned object is accomplished by a network, wherein user network traffic is controlled by an operator, comprising: a) monitoring means which monitor user network traffic data on a per user basis, b) using means or computing means which use said network traffic data in a learning process for providing a prediction of user network traffic on a per user basis, and c) controlling means which control user network traffic under consideration of said prediction, including allocating means which allocate network resources under consideration of said prediction to one or more users, preferably for providing a definable Quality of Service, QoS, per at least one of said one or more users and/or per at least one other user.
According to the invention it has been recognized that it is possible to provide an efficient network use and resource allocation within the network by efficiently predicting future traffic load on a per user basis. In a first step user network traffic data is monitored on a per user basis. Then, said network traffic data is used in a learning process for providing a prediction of user network traffic on a per user basis. This learning process results in a prediction which is used in controlling future user network traffic. This controlling includes allocating network resources under consideration of said prediction to one or more users, if necessary, i.e. if a prior or a current allocation is not sufficient for providing an efficient use of the network resources. Thus, the step of allocating network resources comprises maintaining or keeping pending allocations, if under consideration of said prediction an amendment of pending allocations is not necessary. This prediction allows an allocation of network resources for providing an efficient use of the network resources under consideration of the prediction of future traffic load per each user. Thus, a high amount of resources can be assigned to a user expected to need such a high amount of resources. On the other side, a small amount of resources can be assigned to a user expected to need only such a small amount of resources. As a result, a definable QoS per at least one of said one or more users and/or per at least one other user can be provided.
Thus, on the basis of the inventive method an efficient resource allocation within the network is provided.
Embodiments of the present invention are specified in the depending claims.
According to such an embodiment said monitoring step comprises monitoring of network traffic data of current and/or historic network traffic, said network traffic data preferably comprising network flows information about several network entities, preferably hosts, and/or for several days of traffic monitoring. Such data can contain network flow information about several network entities and for several days of traffic monitoring.
According to a further embodiment the learning process comprises machine learning techniques or techniques used in the field of Artificial Intelligence.
Within a further embodiment the prediction of user network traffic comprises short term user network traffic or short-term user network traffic behavior on a per user basis. Short-term user network traffic or short-term user network traffic behavior provides an efficient and seamless operation of the network.
According to a further embodiment the prediction of user network traffic comprises the prediction of user network traffic generated in at least one defined future time interval or in at least one future time window, preferably on a fine-grained timescale.
Within a further embodiment - on the basis of a current time window - the prediction is provided for the following time window or windows.
According to a further embodiment of the present invention the learning process comprises a learning phase and/or a classification phase, preferably within an offline pre-processing and/or training phase. According to another embodiment the learning process can - alternatively or additionally - be performed within an online phase.
Within a further embodiment, within the learning process a Data Analysis and Split Module, DASM, is used, which takes network traffic data and at least one parameter for tuning the prediction as inputs and which outputs at least one traffic session split into time windows, a corresponding set of labels per each traffic session for classifying each time window in the respective traffic session and traffic window collection parameters for an online prediction phase. Using such at least one parameter is possible to define the timeframe of the traffic prediction, configure the desired time windows splitting method or methods and specify the labels that will be assigned to the traffic time windows. According to further embodiments a traffic session is split into time windows size based, wherein the sequence of hosts visited by a user is divided into time windows of the same size, or temporal based, wherein each time window contains the end host or end hosts visited in a given interval of time.
According to a further embodiment, within the learning process a Learning Module, LM, is used for training a prediction model for predicting a user network traffic load that a user will generate in at least one defined future time interval or in at least one future time window, preferably in an immediate future.
According to further embodiments the LM takes traffic sessions split into time windows and a corresponding label for each time window as inputs, preferably from the DASM, and/or outputs the prediction model, preferably as a trained Neural Network, NN, model or as a set of rules derived by a trained NN model.
Within a further embodiment, during providing a prediction of user network traffic on a per user basis an online prediction module, OPM, for online predictions is used.
According to a further embodiment the OPM takes a prediction model, preferably the prediction model from the LM, live user network traffic, preferably for feeding the prediction model, and traffic window collection parameters, preferably the traffic window collection parameters from the DASM, as inputs and outputs an user network traffic load of a next time window.
Within a further embodiment said controlling step comprises allocating resources in or by a traffic processing module or the above OPM, wherein this traffic processing module or OPM can be designed to perform traffic controlling, preferably shaping or pacing, only for flows that are assigned label corresponding to a large amount of traffic.
According to embodiments of the invention a method and network or system is provided to predict the users that will generate a large amount of network traffic in a network or network system. These embodiments provide the ability to perform the prediction on the user’s following traffic window, e.g., the next considered time interval or time window, given the navigation pattern of the user during the current traffic window, i.e. , the currently observed time interval or time window. In fact, decisions on the scheduling of traffic shaping resources according to embodiments of the invention need to take into account what will be the traffic consumption, e.g., in the next few seconds, while prior art load prediction techniques offer only an average prediction of what will happen in e.g., hours or days. Embodiments of the invention are instead tailored to provide predictions of the user’s traffic load on a fine-grained timescale, i.e. from 10s of milliseconds to 10s of seconds, which is compatible with the requirements of a traffic shaping scheduler.
Embodiments of the invention allow operators to predict the traffic generated by a user in a given period or to predict the load generated by an user in the next traffic window or time window using as input only the sequence of visited end hosts. The granularity of the prediction, e.g. from 10s of milliseconds to 10s of seconds, allows an operator to allocate correctly and efficiently the limited resources used to perform traffic controlling or shaping, and ensures the desired QoS for each user.
Further embodiments of the invention provide an Artificial Intelligence, Al, assisted method and system or network for intelligent network adaptation or a network or system to provide network traffic pacing, throttling or shaping based on the prediction of the short-term network traffic behavior per user. The network or system can learn the typical behavior of the network users from previously observed traffic and provides a prediction on the short-term behavior of such users’ network flows.
Advantages and aspects of embodiments of the present invention are listed as follows:
1) Embodiments of this invention use machine learning model results to allocate resources in a traffic processing module or OPM that performs traffic shaping or pacing only for the flows that are assigned a corresponding to a large amount of traffic.
2) Embodiments of this invention organize sequences of network destinations extracted from network traffic logs in per user sequences, and chunk each of such per user sequences in smaller subsequences or time windows of a configurable size. For each subsequence or time window S, a label is built that corresponds to the amount of data transferred by the subsequence or time window that temporarily follows S. The subsequences or time windows and the labels are used to train a machine learning classifier.
3) Embodiments of this invention extract from the machine learning model the set of network destinations whose presence is the most likely to make the machine learning classifier to predict a label corresponding to a large amount of traffic.
Embodiments of this invention comprise a method for predicting the traffic generated by a user in a given time period. Such embodiments can comprise the following steps:
Preprocessing and training phase, i.e. learning phase:
1) collecting network traffic and extracting contacted network hosts, IP addresses or other destination identifiers and the corresponding traffic load per each user active in the network
2) splitting contacted network hosts, IP addresses or other destination identifiers into chunks, slices or time windows and labeling each chunks, slices or time windows with the generated traffic load
3) feeding the chunks, slices or time windows obtained in this step 2) to a Machine Learning, ML, model that will be trained to predict the traffic load generated in the next chunk, slice or time window
4) extracting a set of rules from the ML model
Online traffic analysis and prediction, i.e. using phase:
5) collecting network traffic and extracting contacted network hosts, IP addresses or other destination identifiers per each user active in the network
6) using either the ML model or the set of rules to predict the user traffic load
7) using the information obtained in this step 6) to allocate - or not - resources in a traffic shaper or in the OPM.
Embodiments of methods and networks according to the invention allow network operators to predict the traffic load generated by network users. The fine-grained scale of the prediction allows using the prediction as input of traffic shaping resources schedulers or of the OPM.
There are several ways how to design and further develop the teaching of the present invention in an advantageous way. To this end it is to be referred to the following explanation of examples of embodiments of the invention, illustrated by the drawing. In the drawing
Fig. 1 shows within a diagram system components of a network according to an embodiment of the present invention,
Fig. 2 shows within a diagram traffic window examples and their corresponding labels according to an embodiment of the present invention,
Fig. 3 shows within a diagram input and outputs of a Neural Network used in an LM according to an embodiment of the present invention and
Fig. 4 shows within a diagram implementation details of a network according to an embodiment of the present invention.
An embodiment of the invention is composed by or comprises three different subsystems or system components as depicted in Fig. 1. Fig. 1 shows said system components and their distinction depending on whether they are used during the learning or classification phases, while the Online Prediction Module, OPM, is used for online predictions.
The Data Analysis and Split Module, DASM, and the Learning Module, LM, are used during the Offline pre-processing and training phase.
The DASM takes as inputs: 1. A set of current and/or historical network traffic data. It is assumed that the data is recent, and contains network flows information about several network entities and for several days of traffic monitoring. The actual amount of collected data depends on the deployment scenario and on the desired accuracy of the prediction.
2. A set of parameters used to tune the traffic prediction. Using such parameters is possible to define the time frame of the traffic prediction, to configure the desired time windows splitting methods and to specify the labels that will be assigned to the traffic time windows, e.g. : predict the next Y seconds of traffic, using a fixed time window and labeling as high load traffic time windows that are bigger more than X byte/second or bigger than the X% of the time windows.
The DASM outputs:
1. A set of traffic sessions organized by users - i.e. , a traffic session will contain traffic belonging to a single user - and split into time windows.
2. Per each traffic session the corresponding set of labels used to classify each time window in the session.
3. The traffic window collection parameters that will be then used during the Online Traffic analysis and prediction phase by the OPM to collect live traffic into time windows that will be used to predict the traffic load
Fig. 2 shows traffic window examples and their corresponding labels based on two different approaches used to split a user traffic session into time windows: size based or temporal based. When the size based approach is used the sequence of visited hosts is divided into chunks or time windows of the same size, e.g.: 10, 50, 100 contacted end hosts. Instead, when the temporal based approach is used each chunk or time window will contain the end host or end hosts visited in a given interval of time. Fig. 2 also shows the corresponding label, wherein for each time window in the session a label is associated. The label expresses the traffic load of the next time windows in the session. The DASM computes the following operations:
(1) Divides the historical network traffic trace on a per user basis, i.e. , an equal number of traffic traces and users is provided.
(2) Filters out from the historical network traffic traces the information that will be used by the LM, a. sequence of contacted end hosts, IP addresses or other destination identifiers b. timestamp of the contact c. associated data transfer size.
(3) Provides a statistical analysis of the filtered network traffic or network traffic information in order to discover the distribution of the inter arrival time between end host requests, per and across users.
(4) The distributions calculated in the previous point are used to divide the filtered traffic traces or network traffic data into traffic sessions, each session should express or expresses a user network activity for a continuous time.
(5) Provides a statistical analysis of the user session in order to discover the distribution of inter arrival window time, time window duration, number of end hosts per time window and network load per each time window.
(6) The distributions calculated in the previous step are used to select a time window size that fits with the input prediction parameters.
(7) Traffic sessions are divided into time windows.
(8) According to the distribution of the network load per time window and input parameters labels for time windows are created, e.g.: high traffic label will be used if the total traffic load is > X otherwise low traffic label will be used.
(9) Time windows are labeled according to the traffic load of the next time window.
The LM trains the prediction model. Machine learning techniques are used to create a prediction model that, given a sequence of visited end hosts by the same user, is capable to predict the traffic load that the user will generate in the immediate future. The prediction model can be directly used in or by the OPM, if such a module supports the execution of Machine Learning, ML, inference, if not, the LM can extract a set of rules from the ML model. These rules will then be used to predict the future traffic load.
The LM takes as inputs:
1. The traffic sessions divided into time windows
2. For each time window the corresponding label
The LM outputs:
1. The Prediction Model. Note that the prediction model, depending on the capability of the OPM, can be outputted as a trained Neural Network, NN, model or as a set of rules derived by a trained Neural Network model.
The LM computes the following operations:
1. Uses the sequence of end hosts present in a session to learn a vector representation for each end host present in the traffic session received as input. In order to learn such representation a skip-gram model, e.g. word2vec, or a generic embedding layer can be used.
2. The sequences of vectors in each time window and the corresponding labels are feed to a recurrent Neural Network, e.g.: RNN, LSTM, etc., for the learning phase.
3. The trained model is stored, the trained model at the previous step produces as output not only the prediction of the next time window load, but also a state machine associated with each input sequence.
4. The state machines used at the previous step are analyzed to extract the end host associated with a state change, see Fig. 3.
5. The extracted set of end hosts per each time window is used to derive rules that can directly be applied to predict the load of a or the next time window, e.g. : if the time window contains in sequence Host X and Host Y but not Host Z. The OPM keeps track of the end host or end hosts visited in the last time window by or per each network user, and feeds this information to the prediction model. The output of the model is a list of users that will have a high traffic load in the next time window.
The OPM takes as inputs:
1. Network live traffic
2. The prediction model
3. The traffic window collection parameters
The OPM outputs:
1. Traffic load of the next time window, e.g.: High, Medium, Low The OPM computes the following operations:
1. Receives live network traffic.
2. Extracts from the traffic triples of: user, end host, timestamp.
3. Keeps track of each state of each user, of a number of visited end hosts, of a sequence of visited end hosts and of an elapsed time since the start of the time window.
4. Once a time window is full, e.g. the sequence of visited end hosts is equal to the window size or the window time budget is elapsed, or a timer is expired the sequence of end hosts is fed to the prediction model.
5. Executes the prediction model, a. if the hardware supports NN inference and the time budget to compute the prediction is sufficient, the NN model is used, b. if the above conditions are not verified the rule based model is executed.
Generally, in this document, particularly in the preceding paragraphs, the term “end host” or “host” can also be understood as or as a synonym of an “IP address” or other identifier of a visited network component. Embodiments of the invention can be deployed or used as part of a telecom operator network traffic shaping system, the offline pre-processing and learning modules - DASM and LM - can be deployed into central datacenters where large storage and computation resources are available. While the OPM can be implemented in or co located with a traffic shaping function, where the prediction outputted by the OPM can be used to proactively allocate per user traffic shaping resources.
Many modifications and other embodiments of the invention set forth herein will come to mind to the one skilled in the art to which the invention pertains having the benefit of the teachings presented in the foregoing description and the associated drawings. Therefore, it is to be understood that the invention is not to be limited to the specific embodiments disclosed and that modifications and other embodiments are intended to be included within the scope of the appended claims. Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for purposes of limitation.

Claims

C l a i m s
1. A method for operating a network, wherein user network traffic is controlled by an operator, comprising the following steps: a) monitoring user network traffic data on a per user basis, b) using said network traffic data in a learning process for providing a prediction of user network traffic on a per user basis, and c) controlling user network traffic under consideration of said prediction, including allocating network resources under consideration of said prediction to one or more users, preferably for providing a definable Quality of Service, QoS, per at least one of said one or more users and/or per at least one other user.
2. A method according to claim 1 , wherein said monitoring step comprises monitoring of network traffic data of current and/or historic network traffic, said network traffic data preferably comprising network flows information about several network entities, preferably hosts, and/or for several days of traffic monitoring.
3. A method according to claim 1 or 2, wherein the learning process comprises machine learning techniques.
4. A method according to one of claims 1 to 3, wherein the prediction of user network traffic comprises short term user network traffic or short term user network traffic behavior on a per user basis.
5. A method according to one of claims 1 to 4, wherein the prediction of user network traffic comprises the prediction of user network traffic generated in at least one defined future time interval or in at least one future time window, preferably on a fine-grained timescale.
6. A method according to one of claims 1 to 5, wherein - on the basis of a current time window - the prediction is provided for the following time window or windows.
7. A method according to one of claims 1 to 6, wherein the learning process comprises a learning phase and/or a classification phase, preferably within an offline pre-processing and/or training phase.
8. A method according to one of claims 1 to 7, wherein within the learning process a Data Analysis and Split Module, DASM, is used, which takes network traffic data and at least one parameter for tuning the prediction as inputs and which outputs at least one traffic session split into time windows, a corresponding set of labels per each traffic session for classifying each time window in the respective traffic session and traffic window collection parameters for an online prediction phase.
9. A method according to claim 8, wherein a traffic session is split into time windows size based, wherein the sequence of hosts visited by a user is divided into time windows of the same size, or temporal based, wherein each time window contains the end host or end hosts visited in a given interval of time.
10. A method according to one of claims 1 to 9, wherein within the learning process a Learning Module, LM, is used for training a prediction model for predicting a user network traffic load that a user will generate in at least one defined future time interval or in at least one future time window, preferably in an immediate future.
11. A method according to claim 10, wherein the LM takes traffic sessions split into time windows and a corresponding label for each time window as inputs, preferably from the DASM, and/or outputs the prediction model, preferably as a trained Neural Network, NN, model or as a set of rules derived by a trained NN model.
12. A method according to one of claims 1 to 11 , wherein during providing a prediction of user network traffic on a per user basis an online prediction module, OPM, for online predictions is used.
13. A method according to claim 12, wherein the OPM takes a prediction model, preferably the prediction model from the LM, live user network traffic, preferably for feeding the prediction model, and traffic window collection parameters, preferably the traffic window collection parameters from the DASM, as inputs and outputs an user network traffic load of a next time window.
14. A method according to one of claims 1 to 13, wherein said controlling step comprises allocating resources in or by a traffic processing module that performs traffic controlling, preferably shaping or pacing, only for flows that are assigned label corresponding to a large amount of traffic.
15. A network, preferably for carrying out the method for operating a network according to any one of claims 1 to 14, wherein user network traffic is controlled by an operator, comprising: a) monitoring means which monitor user network traffic data on a per user basis, b) using means or computing means which use said network traffic data in a learning process for providing a prediction of user network traffic on a per user basis, and c) controlling means which control user network traffic under consideration of said prediction, including allocating means which allocate network resources under consideration of said prediction to one or more users, preferably for providing a definable Quality of Service, QoS, per at least one of said one or more users and/or per at least one other user.
PCT/EP2020/064465 2020-05-25 2020-05-25 A method for operating a network and a network WO2021239213A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US17/909,403 US11870863B2 (en) 2020-05-25 2020-05-25 Method for operating a network
PCT/EP2020/064465 WO2021239213A1 (en) 2020-05-25 2020-05-25 A method for operating a network and a network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2020/064465 WO2021239213A1 (en) 2020-05-25 2020-05-25 A method for operating a network and a network

Publications (1)

Publication Number Publication Date
WO2021239213A1 true WO2021239213A1 (en) 2021-12-02

Family

ID=71078488

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2020/064465 WO2021239213A1 (en) 2020-05-25 2020-05-25 A method for operating a network and a network

Country Status (2)

Country Link
US (1) US11870863B2 (en)
WO (1) WO2021239213A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220253689A1 (en) * 2021-02-09 2022-08-11 Hewlett Packard Enterprise Development Lp Predictive data capacity planning

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2073463A1 (en) * 2007-12-21 2009-06-24 Nokia Siemens Networks S.p.A. Method and device for setting QoS parameters for predictive scheduling in wireless communication systems
CN108062561A (en) 2017-12-05 2018-05-22 华南理工大学 A kind of short time data stream Forecasting Methodology based on long memory network model in short-term

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070033263A1 (en) * 2005-08-08 2007-02-08 Goering Scott C Methods and apparatus for providing integrated bandwidth dedicated transport services
US8934414B2 (en) * 2011-12-06 2015-01-13 Seven Networks, Inc. Cellular or WiFi mobile traffic optimization based on public or private network destination
EP2788889A4 (en) * 2011-12-07 2015-08-12 Seven Networks Inc Flexible and dynamic integration schemas of a traffic management system with various network operators for network traffic alleviation
US10374965B2 (en) * 2015-06-01 2019-08-06 Huawei Technologies Co., Ltd. Systems and methods for managing network traffic with a network operator
US10831526B2 (en) * 2017-12-29 2020-11-10 Virtual Instruments Corporation System and method of application discovery
US10536380B2 (en) * 2018-01-26 2020-01-14 Wipro Limited Method and system for intelligent link load balancing
WO2020018133A1 (en) * 2018-07-17 2020-01-23 Facebook, Inc. Detecting communication network insights of alerts
US11128536B2 (en) * 2018-10-04 2021-09-21 Sandvine Corporation System and method for intent based traffic management
US11165883B2 (en) * 2019-01-02 2021-11-02 Bank Of America Corporation Entry point classification of requests requiring access to data
US20210352095A1 (en) * 2020-05-05 2021-11-11 U.S. Army Combat Capabilities Development Command, Army Research Labortary Cybersecurity resilience by integrating adversary and defender actions, deep learning, and graph thinking

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2073463A1 (en) * 2007-12-21 2009-06-24 Nokia Siemens Networks S.p.A. Method and device for setting QoS parameters for predictive scheduling in wireless communication systems
CN108062561A (en) 2017-12-05 2018-05-22 华南理工大学 A kind of short time data stream Forecasting Methodology based on long memory network model in short-term

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
QI SUN CHINA MOBILE ET AL: "Draft new Supplement ITU-T Y.ML-IMT2020-Use-Cases: "Machine learning in future networks including IMT-2020: use cases", (output of Q20/13 meeting, Geneva, 28 June 2019);TD397/WP1", vol. 20/13, 2 August 2019 (2019-08-02), pages 1 - 57, XP044269099, Retrieved from the Internet <URL:https://www.itu.int/ifa/t/2017/sg13/exchange/wp1/q20/2019-08/contributions/C014-EM-Q20-2019/C014-EM-Q20-2019-att.docx> [retrieved on 20190802] *

Also Published As

Publication number Publication date
US11870863B2 (en) 2024-01-09
US20230100227A1 (en) 2023-03-30

Similar Documents

Publication Publication Date Title
CN113348645B (en) System and method for classifying data streams
Hamdan et al. Flow-aware elephant flow detection for software-defined networks
US20200112489A1 (en) Intelligent Network Equipment Failure Prediction System
US11689961B2 (en) Systems and methods for distribution of application logic in digital networks
CN109450790B (en) Intelligent network service function chain supply method based on flow perception
Gutierrez-Estevez et al. The path towards resource elasticity for 5G network architecture
EP3535976A1 (en) Live video analytics at scale
US11233744B2 (en) Real-time network application visibility classifier of encrypted traffic based on feature engineering
Luo et al. Intelligent Virtual Machine Provisioning in Cloud Computing.
US20220012611A1 (en) Method and machine learning manager for handling prediction of service characteristics
US11310125B2 (en) AI-enabled adaptive TCA thresholding for SLA assurance
US11870863B2 (en) Method for operating a network
EP3007407A1 (en) Configuration method, equipment, system and computer readable medium for determining a new configuration of calculation resources
Punitha et al. Traffic classification for efficient load balancing in server cluster using deep learning technique
Martín-Pérez et al. Dimensioning v2n services in 5g networks through forecast-based scaling
US20210014135A1 (en) Wan tunnel kpi/sla prediction and schedule recommender
Chintapalli et al. Wip: Impact of ai/ml model adaptation on ran control loop response time
Ahmed et al. Predicting high delays in mobile broadband networks
CN110781950A (en) Message processing method and device
JP2020136894A (en) Prediction device, prediction method, and program
JP2023520628A (en) Support Request Allocation Using Conversation Pace
Otoshi et al. Non-parametric decision-making by Bayesian attractor model for dynamic slice selection
CN115037625B (en) Network slice processing method and device, electronic equipment and readable storage medium
CN112559099B (en) Remote image display method, device and system based on user behaviors and storage medium
Gudepu et al. Generative Adversarial Networks-Based AI/ML Model Adaptive Retraining for Beyond 5G Networks

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20731802

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20731802

Country of ref document: EP

Kind code of ref document: A1