CN111415039B - Flight delay mode analysis method based on non-negative tensor decomposition - Google Patents

Flight delay mode analysis method based on non-negative tensor decomposition Download PDF

Info

Publication number
CN111415039B
CN111415039B CN202010197961.1A CN202010197961A CN111415039B CN 111415039 B CN111415039 B CN 111415039B CN 202010197961 A CN202010197961 A CN 202010197961A CN 111415039 B CN111415039 B CN 111415039B
Authority
CN
China
Prior art keywords
departure
potential
tensor
delay
flight
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010197961.1A
Other languages
Chinese (zh)
Other versions
CN111415039A (en
Inventor
杜文博
曹先彬
张明远
陈莘文
马彦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beihang University
Original Assignee
Beihang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beihang University filed Critical Beihang University
Priority to CN202010197961.1A priority Critical patent/CN111415039B/en
Publication of CN111415039A publication Critical patent/CN111415039A/en
Application granted granted Critical
Publication of CN111415039B publication Critical patent/CN111415039B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G06Q50/40

Abstract

The invention discloses a flight delay pattern analysis method based on nonnegative tensor decomposition, and belongs to the field of civil aviation delay analysis. Firstly, collecting all historical flight record data of an airport, and extracting main variables from the historical data record of each flight; all the main variables are then constructed as a fourth order tensor x. And performing pattern recognition on the fourth-order tensor x by using a non-negative tensor decomposition method, and decomposing into the product of four non-negative factor matrixes and one non-negative core tensor. Modeling the modulus of the nonnegative factor matrix and the nonnegative core tensor as an optimization problem, and solving by using a gradient descent method to obtain an optimal decomposition result
Figure DDA0002418306190000012
Finally, the optimal decomposition result is obtained
Figure DDA0002418306190000011
Corresponding to the PLSA model based on the probability, carrying out statistical analysis on the distribution situation of each potential mode; therefore, delay conditions are predicted and judged, and then targeted measures are taken for management and control. The method is easy to operate, simple in steps, accurate in result and innovative.

Description

Flight delay mode analysis method based on non-negative tensor decomposition
Technical Field
The invention belongs to the field of civil aviation delay analysis, and particularly relates to a flight delay pattern analysis method based on non-negative tensor decomposition.
Background
With the rapid development of the civil aviation industry, flight delay becomes a serious problem, and inconvenience is brought to people. The delay in flights can cause inconvenience to passengers, lead to a reluctance to take an airplane to travel or divert to select another airline, and force airlines to incur the additional cost of airplane maintenance and insufficient fleet utilization. In addition, flight delay can also lead to increase of fuel consumption and carbon dioxide emission, and harm is brought to the environment; in addition to the direct effects described above, flight delays can negatively impact various aspects of the overall economy. In summary, flight delays are a serious and common problem with significant negative impact.
The complexity and difficulty of the flight delay problem is caused by a number of factors that are commonly summarized as exceptional weather and technical causes, including air traffic control, insufficient facility capacity, poor scheduling capability, program changes and limited buffer time, etc. The various factors described above make it difficult for people to understand the potential patterns of flight delays and to design appropriate strategies.
Recently, Han and Moutarde prove that the driving method based on historical observation data is not restricted by the past and is suitable for researching the potential dynamic characteristic of flight delay; thus, a way to facilitate system cognition and decision making is to leverage and learn historical data. For example, when severe weather is encountered, people may look for days with similar past weather conditions and refer to actions taken by air traffic controllers on those days.
Some previous studies have focused on identifying fixed patterns in air traffic management. Liu et al propose a semi-supervised learning algorithm that groups similar days as determined in different patterns. The first step is to measure the similarity between hourly weather forecasts and then determine the most similar days. The authors carried out two case studies at the new wak free international airport (EWR) using this method, demonstrating the effectiveness of this method. Mukherjee et al propose a pattern classification method based on the influence of severe weather conditions, using weather indicators as input, and applying factor analysis to determine the primary weather pattern. Then, the days were clustered using Ward's least square method. Days belonging to the same constellation have similar weather patterns.
In addition to weather patterns, some studies have attempted to determine similar days from other perspectives. Grabbe et al use a k-means clustering algorithm to identify similar days in the ground delay program, applying an expected maximization algorithm to data such as the start and end times and the predetermined arrival rates of the ground delay program. The studies by Zhou et al and Sternberg et al focused on using air traffic flow and flight delays to identify similar patterns. Gorriplay et al measured the principal components of demand and capacity data, but after cluster analysis of the data, it was found that neither capacity nor demand existed in natural patterns. Abdel-Aty et al found that some patterns could not be detected by statistical methods by identifying periodic arrival delay patterns of domestic fly-by flights.
Despite the many previous efforts, problems still exist in accurately understanding flight delay patterns. As mentioned earlier, finding clusters or patterns in spatio-temporal historical data is an effective method, but due to the high dimensionality of these data, it is difficult to find different euclidean spatial patterns in a number of studies. Therefore, proposed by Mislevy, the potential distribution analysis, the potential feature analysis, the membership degree analysis and other methods do not directly mine the mode, but form a low-dimensional projection subspace by using the features obtained by tensor factorization, so that the potential clustering structure of the space-time traffic dynamic mode is enhanced. Therefore, these methods open up new fields in the field of traffic science. Such as city flow analysis, traffic speed prediction, missing traffic data completion, ship track recovery, and the like.
Disclosure of Invention
Based on the problems, the invention provides a flight delay pattern analysis method based on non-negative tensor decomposition, which can be used for establishing a predictor based on a potential pattern, making a preliminary judgment on the situation of flight delay, better making arrangement for flights of an airport and being beneficial to making better decisions for the airport.
The flight delay pattern analysis method comprises the following steps:
step one, collecting all historical flight record data of an airport, and extracting main variables from the historical data record of each flight;
the main variables include the departure airport, departure time, departure date and delay level of the flight;
manually numbering all departure airports in sequence, wherein each airport corresponds to a unique number; the set is denoted as {1,2,3, …, Z }; z is an integer.
The departure date of the flight refers to: the days of the year are numbered in sequence, namely 1 month and 1 day is numbered 1,1 month and 2 days are numbered 2, and the like, and 12 months and 31 days are numbered 365/366. The departure date of each flight is the number of the corresponding specific days on the departure day; the set is denoted as {1,2,3 … …,365/366 }.
The departure time of the flight is: dividing the time of one day into every hour, wherein the number of flights taking off from 0 point to 1 point corresponds to the number 1, the number of flights taking off from 1 point to 2 points corresponds to the number 2, and so on, and the number of flights taking off from 23 points to 24 points corresponds to the number 24; the set is denoted as {1,2,3, … …,24 }.
The delay levels are divided into four according to actual conditions: (1) <15min, (2)15-45min, (3)45-90min, (4) >90min, and represent punctuality, slight delay, moderate delay, and severe delay with {1,2,3,4} respectively.
Step two, constructing a fourth-order tensor by the main variables extracted from all flights
Figure BDA0002418306170000023
Fourth order tensor
Figure BDA0002418306170000024
The method comprises four dimensions which respectively correspond to a departure airport, departure time, departure date and delay level;
step three, using a non-negative tensor decomposition method to carry out fourth-order tensor
Figure BDA0002418306170000025
Performing pattern recognition, and decomposing into four non-negative factor matrixes and a non-negative core tensorThe product of modulus of (a).
The non-negative tensor resolution formulation is as follows:
Figure BDA0002418306170000021
wherein
Figure BDA0002418306170000022
Representing a core tensor; a. the(1),A(2),A(3),A(4)Is a matrix of four factors, A(1)Corresponding to the potential pattern matrix of flights in the dimension of the departure airport, A(2)Corresponding to the potential matrix of flights in the departure time dimension, A(3)Corresponding to the potential pattern matrix of flights in the dimension of departure date, A(4)Corresponding to the potential pattern matrix of the flight in the delay level dimension.
Set of departure airports divided into R by potential patterns1Class, set of departure times divided by potential patterns into R2Class, set of departure dates divided into R by latent patterns3Class, set of delay levels are divided into R by potential mode4Class; each potential class corresponds to a respective subset.
Step four, modeling the modulus of the non-negative factor matrix and the non-negative core tensor as an optimization problem, and solving by using a gradient descent method to obtain an optimal decomposition result
Figure BDA0002418306170000031
The optimization problem is as follows:
minimization
Figure BDA0002418306170000032
Figure BDA0002418306170000033
Wherein the constraint condition is a core tensor
Figure BDA0002418306170000034
Sum factor matrix A(n)Are all non-negative, n is 1,2,3, 4. The F-norm is the sum of the squares of the values at each position, taken in the root.
The solved optimal decomposition result is as follows:
Figure BDA0002418306170000035
Figure BDA0002418306170000036
is the original fourth order tensor
Figure BDA0002418306170000037
The set of all the elements in the tree is decomposed into a form of multiplying a plurality of matrixes; i.e. i1Some potential class r representing the dimensions of the departure airport1A corresponding subset of departure airports; i.e. i2Some potential class r representing the departure time dimension2A corresponding subset of departure times; i.e. i3Some potential class r representing the dimension of the departure date3A corresponding subset of departure dates; i.e. i4Representing a potential class r of flights in the dimension of the delay level4A corresponding delay level subset;
Figure BDA0002418306170000038
represents i1*r1The stored elements are corresponding subsets of each potential class of the departure airport;
Figure BDA0002418306170000039
represents i2*r2The stored elements are corresponding subsets of potential classes of flights at departure time;
Figure BDA00024183061700000310
represents i3*r3The stored elements are corresponding subsets of potential classes of flights on departure dates;
Figure BDA00024183061700000311
represents i4*r4The stored elements are corresponding subsets of potential classes of flights in delay level dimensions;
Figure BDA00024183061700000312
representing the core tensor
Figure BDA00024183061700000313
Decomposed sub-tensors with dimension r1*r2*r3*r4(ii) a Representing potential classes r of departure airport dimensions1Potential class of departure times r2Potential class of departure date r3And latency class r of delay level4The interaction relationship between them. r is1,r2,r3,r4Respectively, the dimensions after the non-negative tensor decomposition.
Step five, the optimal decomposition result is obtained
Figure BDA00024183061700000314
Corresponding to the PLSA model based on the probability, carrying out statistical analysis on the distribution situation of each potential mode; therefore, delay conditions are predicted and judged, and then targeted measures are taken for management and control.
The specific corresponding process is as follows:
firstly, aiming at all flight sets S with fixed departure time, departure date and departure airport, converting the number of flights in the flight sets S into the proportion of all flights;
all flights refer to all historical flights of the airport collected.
Then, decomposing the proportion according to the optimal decomposition result to obtain a core tensor
Figure BDA0002418306170000041
Seen as a probability tensor p in the PLSA modelt(ii) a Factor matrix
Figure BDA0002418306170000042
Viewed as a probabilistic form
Figure BDA0002418306170000043
Thus, the corresponding PLSA model is:
Figure BDA0002418306170000044
and T is the decomposed potential class number obtained by solving the optimization problem by using a gradient descent method.
The analysis process is as follows:
aiming at a certain flight A, when a departure airport, a departure date and a departure time are respectively determined, the potential class r 'of the departure airport can be correspondingly found'1Potential class of departure time r'2Potential class of departure date r'3Then through the core sub-tensor
Figure BDA0002418306170000045
Finding potential class r 'with the departure airport'1Potential class of departure time r'2And potential class r of departure date'3The associated delay level, in each potential mode, looks up the probability of flight A taking off on time, the probability of slight delay, the probability of moderate delay, and the probability of severe delay.
The invention has the advantages and positive effects that:
(1) the flight delay pattern analysis method based on the nonnegative tensor decomposition proves that potential class analysis based on the tensor decomposition has effectiveness in identifying the flight delay pattern through data processing and analysis.
(2) The flight delay pattern analysis method based on the non-negative tensor decomposition is easy to operate, simple in steps, accurate in result and innovative.
Drawings
FIG. 1 is a flow chart of a flight delay pattern analysis method based on non-negative tensor decomposition according to the present invention;
Detailed Description
The present invention will be described in further detail and with reference to the accompanying drawings so that those skilled in the art can understand and practice the invention.
The aim of the invention is to use a large amount of air traffic data to learn about potential flight delay patterns. Flight records are first collected using statistical methods and modeled as an n-order tensor. Considering the nature of the data set, flight records are viewed as multivariate observations sampled from a common distribution, with a latent model similar to tensor analysis being used to identify the primary mode for each mode. Then, a predictor based on the underlying patterns is proposed to show the efficiency of the framework.
Specifically, the flight delay pattern analysis method based on non-negative tensor decomposition includes the following steps, as shown in fig. 1:
step one, collecting all historical flight record data of an airport, and extracting main variables from the historical data record of each flight;
extracting a plurality of main variables from the historical data record of each flight, wherein the main variables are respectively a departure airport, departure time, departure date, delay level, weather condition and the like of the flight;
the departure airports can number all the departure airports according to a certain rule, such as an alphabetical sequence table, and the numbers correspond to the airports one to one.
The departure date of the flight can be expressed by the number of the specific days in the year, i.e., 1 month, 1 day, 1 month, 2 days, and so on, and 12 months, 31 days, 365/366 days. If the departure date of a flight is 1 month and 30 days, the departure date number of the flight is 30.
The 24-hour value of the takeoff time of the flight is represented and is divided into 24 numbers, and each number corresponds to one hour in one day; for a 14 o' clock 30 takeoff flight, the takeoff time would correspond to 15.
The delay levels are classified into four levels according to the relevant regulations and practices of the Federal Aviation Administration (FAA) in the united states: (1) <15min, (2)15-45min, (3)45-90min, (4) >90min, and represent punctuality, slight delay, moderate delay, and severe delay with {1,2,3,4} respectively.
Step two, constructing a fourth-order tensor by the main variables extracted from all flights
Figure BDA0002418306170000051
A larger nuclear tensor can contain more information reflecting the synthetic relationships between different modalities, but a small nuclear tensor can facilitate interpretation of the results.
The present invention only studies include four dimensions: fourth-order tensor for departure airport, departure time, departure date and delay level
Figure BDA0002418306170000052
An N-th order dimension that can be extended to include other variables is within the scope of the present application.
Step three, using a non-negative tensor decomposition method to carry out fourth-order tensor
Figure BDA0002418306170000053
Pattern recognition is performed and decomposed into the product of four non-negative factor matrices and a non-negative core tensor.
Assuming that flight records are multivariate variables sampled from a general distribution, each observation comprises four variables, identifying a mode of each independent variable by using a correlation algorithm, describing historical record data of each flight by using a kernel tensor, and performing statistical analysis on the distribution condition of each mode in the variables;
the method utilizes the Tucker decomposition to realize the analysis of potential modes, the Tucker decomposition is also called as high-order singular value decomposition, a tensor is decomposed into a core tensor, a factor matrix is multiplied along each mode, and the factor matrix on each mode is called as a base matrix or a principal component of the tensor on each mode. By third order tensor
Figure BDA0002418306170000054
For example, it can be obtained by Tucker decomposition
Figure BDA0002418306170000055
Three factor matrices and a kernel tensor
Figure BDA0002418306170000056
In the third order tensor form, there are:
Figure BDA0002418306170000057
a, B, C are a matrix of modulo-N factors,
Figure BDA0002418306170000058
is the nuclear tensor, which represents the degree of interaction and relation between the different components.
In a higher dimension, there are
Figure BDA0002418306170000059
Wherein the content of the first and second substances,
Figure BDA0002418306170000061
representing the core tensor: (
Figure BDA0002418306170000062
The value representing each cell in the core tensor),
Figure BDA0002418306170000063
represents a factor matrix, hdThe number of time factor matrix patterns.
The core tensor G characterizes the degree of interaction between the different modes, so each element of the core tensor can be expressed as:
Figure BDA0002418306170000064
the pattern recognition is performed for each single variable using non-Negative Tensor Decomposition (NTD), which isDecomposition for finding a non-negative N-th order tensor
Figure BDA0002418306170000065
As a non-negative core tensor
Figure BDA0002418306170000066
And N non-negative mode matrices
Figure BDA0002418306170000067
The product of modulus of (a). When the data must be decomposed into a set of additive components, it can decompose a tensor into a set of matrices and a core tensor.
The non-negative tensor resolution formula corresponding to the fourth order tensor is as follows:
Figure BDA0002418306170000068
wherein
Figure BDA0002418306170000069
Representing the core tensor. A. the(1),A(2),A(3),A(4)Is a matrix of four factors, considered as the principal component of the modulus. A. the(1)Corresponding to the potential pattern matrix of flights in the dimension of the departure airport, A(2)Corresponding to the potential matrix of flights in the departure time dimension, A(3)Corresponding to the potential pattern matrix of flights in the dimension of departure date, A(4)Corresponding to the potential pattern matrix of the flight in the delay level dimension.
Set of departure airports divided into R by potential patterns1Class, set of departure times divided by potential patterns into R2Class, set of departure dates divided into R by latent patterns3Class, set of delay levels are divided into R by potential mode4Class; each potential class corresponds to a respective subset.
Step four, modeling the modulus of the non-negative factor matrix and the non-negative core tensor as an optimization problem, and solving by using a gradient descent method to obtain an optimal decomposition result
Figure BDA00024183061700000610
The optimization problem is as follows
Minimization
Figure BDA00024183061700000611
Figure BDA00024183061700000612
Wherein the constraint condition is a core tensor
Figure BDA00024183061700000613
Sum factor matrix A(n)Are all non-negative, n is 1,2,3, 4. The F-norm is the sum of the squares of the values at each position, taken in the root.
The solved optimal decomposition result is as follows:
Figure BDA0002418306170000071
Figure BDA0002418306170000072
is the original fourth order tensor
Figure BDA0002418306170000073
The set of all the elements in the tree is decomposed into a form of multiplying a plurality of matrixes; i.e. i1Some potential class r representing the dimensions of the departure airport1A corresponding subset of departure airports; i.e. i2Some potential class r representing the departure time dimension2A corresponding subset of departure times; i.e. i3Some potential class r representing the dimension of the departure date3A corresponding subset of departure dates; i.e. i4Representing a potential class r of flights in the dimension of the delay level4A corresponding delay level subset;
Figure BDA0002418306170000074
represents i1*r1The stored elements are corresponding subsets of each potential class of the departure airport;
Figure BDA0002418306170000075
represents i2*r2The stored elements are corresponding subsets of potential classes of flights at departure time;
Figure BDA0002418306170000076
represents i3*r3The stored elements are corresponding subsets of potential classes of flights on departure dates;
Figure BDA0002418306170000077
represents i4*r4The stored elements are corresponding subsets of potential classes of flights in delay level dimensions;
Figure BDA0002418306170000078
representing the core tensor
Figure BDA00024183061700000715
Decomposed sub-tensors with dimension r1*r2*r3*r4(ii) a Representing potential classes r of departure airport dimensions1Potential class of departure times r2Potential class of departure date r3And latency class r of delay level4The interaction relationship between them. r is1,r2,r3,r4Respectively, the dimensions after the non-negative tensor decomposition.
Step five, the optimal decomposition result is obtained
Figure BDA0002418306170000079
Corresponding to the PLSA model based on the probability, carrying out statistical analysis on the distribution situation of each potential mode; therefore, delay conditions are predicted and judged, and then targeted measures are taken for management and control.
The specific corresponding process is as follows:
firstly, aiming at all flight sets S with fixed departure time, departure date and departure airport, converting the number of flights in the flight sets S into the proportion of all flights;
all flights refer to all historical flights of the airport collected.
Then, decomposing the proportion according to the optimal decomposition result to obtain a core tensor
Figure BDA00024183061700000710
Seen as a probability tensor p in the PLSA modelt;ptAs a proportion of the total number of flights for each potential class. Factor matrix
Figure BDA00024183061700000711
Viewed as a probabilistic form
Figure BDA00024183061700000712
Figure BDA00024183061700000713
Is the probability of occurrence of such potential class under a certain classification.
Thus, the corresponding PLSA model is:
Figure BDA00024183061700000714
and T is the decomposed potential class number obtained by solving the optimization problem by using a gradient descent method.
Analyzing the proposed potential classes, and performing statistical analysis on the distribution condition of each mode in the variables; the kernel tensor is used to describe the relevance of different dimensional modes of each flight, such as weather and geographic position, time and geographic position, the relationship between departure airport and departure date, and the like. And describing the distribution characteristics of each dimension of the flight by using the factor matrix. The analysis process is as follows:
aiming at a certain flight A, when a departure airport, a departure date and a departure time are respectively determined, the submarine of the departure airport can be correspondingly foundR's'1Potential class of departure time r'2Potential class of departure date r'3Then through the core sub-tensor
Figure BDA0002418306170000081
Finding potential class r 'with the departure airport'1Potential class of departure time r'2And potential class r of departure date'3The associated delay level, in each potential mode, looks up the probability of flight A taking off on time, the probability of slight delay, the probability of moderate delay, and the probability of severe delay.
From the decomposition result, the operation of the flight in the dimension of the airport can be known to be divided into several modes; the operation in the starting date dimension is divided into several modes, such as a winter mode and a summer mode; the method is beneficial to finding the operation mode of the flight from the flight big data, so that the delay condition is predicted and judged, and further, the targeted measures are taken for management and control.

Claims (3)

1. A flight delay pattern analysis method based on non-negative tensor decomposition is characterized by comprising the following steps:
step one, collecting all historical flight record data of an airport, and extracting main variables from the historical data record of each flight;
step two, constructing a fourth-order tensor by the main variables extracted from all flights
Figure FDA0002984195770000011
Fourth order tensor
Figure FDA0002984195770000012
Comprises four dimensions which respectively correspond to an departure airport and an exit airportTime of departure, date of departure, and delay level;
step three, using a non-negative tensor decomposition method to carry out fourth-order tensor
Figure FDA0002984195770000013
Performing pattern recognition, and decomposing the pattern recognition into a product of four nonnegative factor matrixes and a nonnegative core tensor;
the non-negative tensor resolution formulation is as follows:
Figure FDA0002984195770000014
wherein
Figure FDA0002984195770000015
Representing a core tensor; a. the(1),A(2),A(3),A(4)Is a matrix of four factors, A(1)Corresponding to the potential pattern matrix of flights in the dimension of the departure airport, A(2)Corresponding to the potential matrix of flights in the departure time dimension, A(3)Corresponding to the potential pattern matrix of flights in the dimension of departure date, A(4)A potential mode matrix of the corresponding flight on the dimension of the delay level;
set of departure airports divided into R by potential patterns1Class, set of departure times divided by potential patterns into R2Class, set of departure dates divided into R by latent patterns3Class, set of delay levels are divided into R by potential mode4Class; each potential class corresponds to a respective subset;
step four, modeling the modulus of the non-negative factor matrix and the non-negative core tensor as an optimization problem, and solving by using a gradient descent method to obtain an optimal decomposition result
Figure FDA0002984195770000016
The optimization problem is as follows:
minimization
Figure FDA0002984195770000017
Figure FDA0002984195770000018
Wherein the constraint condition is a core tensor
Figure FDA0002984195770000019
Sum factor matrix A(n)Are all non-negative, n is 1,2,3, 4; the F norm is obtained by summing the squares of the values of each position in an evolution mode;
the solved optimal decomposition result is as follows:
Figure FDA00029841957700000110
Figure FDA00029841957700000111
is the original fourth order tensor
Figure FDA00029841957700000112
The set of all the elements in the tree is decomposed into a form of multiplying a plurality of matrixes; i.e. i1Some potential class r representing the dimensions of the departure airport1A corresponding subset of departure airports; i.e. i2Some potential class r representing the departure time dimension2A corresponding subset of departure times; i.e. i3Some potential class r representing the dimension of the departure date3A corresponding subset of departure dates; i.e. i4Representing a potential class r of flights in the dimension of the delay level4A corresponding delay level subset;
Figure FDA00029841957700000113
represents i1*r1The stored elements are corresponding subsets of each potential class of the departure airport;
Figure FDA00029841957700000114
represents i2*r2The stored elements are corresponding subsets of potential classes of flights at departure time;
Figure FDA00029841957700000115
represents i3*r3The stored elements are corresponding subsets of potential classes of flights on departure dates;
Figure FDA00029841957700000116
represents i4*r4The stored elements are corresponding subsets of potential classes of flights in delay level dimensions;
Figure FDA00029841957700000117
representing the core tensor
Figure FDA00029841957700000118
Decomposed sub-tensors with dimension r1*r2*r3*r4(ii) a Representing potential classes r of departure airport dimensions1Potential class of departure times r2Potential class of departure date r3And latency class r of delay level4The interaction relationship between the two; r is1,r2,r3,r4Respectively representing the dimensions after the non-negative tensor decomposition;
step five, the optimal decomposition result is obtained
Figure FDA0002984195770000021
Corresponding to the PLSA model based on the probability, carrying out statistical analysis on the distribution situation of each potential mode; therefore, delay conditions are predicted and judged, and then targeted measures are taken for management and control;
the specific corresponding process is as follows:
firstly, aiming at all flight sets S with fixed departure time, departure date and departure airport, converting the number of flights in the flight sets S into the proportion of all flights;
all flights refer to all historical flights of the airport collected;
then, decomposing the proportion according to the optimal decomposition result to obtain a core tensor
Figure FDA0002984195770000022
Seen as a probability tensor p in the PLSA modelt(ii) a Factor matrix
Figure FDA0002984195770000023
Viewed as a probabilistic form
Figure FDA0002984195770000026
Thus, the corresponding PLSA model is:
Figure FDA0002984195770000024
and T is the decomposed potential class number obtained by solving the optimization problem by using a gradient descent method.
2. The flight delay pattern analysis method based on the non-negative tensor decomposition as set forth in claim 1, wherein the main variables in the first step comprise the departure airport, departure time, departure date and delay level of the flight;
manually numbering all departure airports in sequence, wherein each airport corresponds to a unique number; the set is denoted as {1,2,3, …, Z }; z is an integer;
the departure date of the flight refers to: numbering the days in the year in sequence, namely numbering 1 on 1 month and 1 day, numbering 2 on 1 month and 2 days, and so on, numbering 365/366 on 12 months and 31 days; the departure date of each flight is the number of the corresponding specific days on the departure day; the set is denoted as {1,2,3 … …,365/366 };
the departure time of the flight is: dividing the time of one day into every hour, wherein the number of flights taking off from 0 point to 1 point corresponds to the number 1, the number of flights taking off from 1 point to 2 points corresponds to the number 2, and so on, and the number of flights taking off from 23 points to 24 points corresponds to the number 24; the set is denoted as {1,2,3, … …,24 };
the delay levels are divided into four according to actual conditions: (1) <15min, (2)15-45min, (3)45-90min, (4) >90min, and represent punctuality, slight delay, moderate delay, and severe delay with {1,2,3,4} respectively.
3. The flight delay pattern analysis method based on the non-negative tensor decomposition as set forth in claim 1, wherein the analysis process in the fifth step is as follows:
aiming at a certain flight A, when a departure airport, a departure date and a departure time are respectively determined, the potential class r 'of the departure airport can be correspondingly found'1Potential class of departure time r'2Potential class of departure date r'3Then through the core sub-tensor
Figure FDA0002984195770000025
Finding potential class r 'with the departure airport'1Potential class of departure time r'2And potential class r of departure date'3The associated delay level, in each potential mode, looks up the probability of flight A taking off on time, the probability of slight delay, the probability of moderate delay, and the probability of severe delay.
CN202010197961.1A 2020-03-19 2020-03-19 Flight delay mode analysis method based on non-negative tensor decomposition Active CN111415039B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010197961.1A CN111415039B (en) 2020-03-19 2020-03-19 Flight delay mode analysis method based on non-negative tensor decomposition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010197961.1A CN111415039B (en) 2020-03-19 2020-03-19 Flight delay mode analysis method based on non-negative tensor decomposition

Publications (2)

Publication Number Publication Date
CN111415039A CN111415039A (en) 2020-07-14
CN111415039B true CN111415039B (en) 2021-05-28

Family

ID=71491305

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010197961.1A Active CN111415039B (en) 2020-03-19 2020-03-19 Flight delay mode analysis method based on non-negative tensor decomposition

Country Status (1)

Country Link
CN (1) CN111415039B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112201037B (en) * 2020-09-27 2021-09-03 同济大学 Intersection arrival rate estimation method based on sampling trajectory data
CN113970710B (en) * 2021-10-26 2023-06-09 广东电网有限责任公司佛山供电局 Method and system for monitoring DC magnetic bias running state of power transformer

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102004774A (en) * 2010-11-16 2011-04-06 清华大学 Personalized user tag modeling and recommendation method based on unified probability model
US8060512B2 (en) * 2009-06-05 2011-11-15 Xerox Corporation Hybrid tensor-based cluster analysis

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8060512B2 (en) * 2009-06-05 2011-11-15 Xerox Corporation Hybrid tensor-based cluster analysis
CN102004774A (en) * 2010-11-16 2011-04-06 清华大学 Personalized user tag modeling and recommendation method based on unified probability model

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Application of Non-Negative Tensor Factorization;HAN ZHONG等;《IEEE Access》;20191125;全文 *
基于张量分解的高维数据建模研究;唐琳瑶;《中国优秀硕士学位论文全文数据库 信息科技辑》;20180315;全文 *
基于航路网流率协同控制的航班离场优化研究;钟涵;《中国博士学位论文全文数据库 基础科学辑》;20200115;全文 *

Also Published As

Publication number Publication date
CN111415039A (en) 2020-07-14

Similar Documents

Publication Publication Date Title
CN110503245B (en) Prediction method for large-area delay risk of airport flight
Gordon Steering a new course: Transportation, energy, and the environment
CN111415039B (en) Flight delay mode analysis method based on non-negative tensor decomposition
Singh et al. Weather forecasting using machine learning techniques
CN108492561B (en) Road network traffic state space-time characteristic analysis method based on matrix decomposition
Grabbe et al. Clustering days and hours with similar airport traffic and weather conditions
Zhang et al. Characterizing flight delay profiles with a tensor factorization framework
CN112232535A (en) Flight departure average delay prediction method based on supervised learning
Zhang et al. Development of a reliable method for general aviation flight phase identification
Liu et al. Causal analysis of flight en route inefficiency
Gao et al. Multi-level aircraft feature representation and selection for aviation environmental impact analysis
CN111340536A (en) Model training method, passenger seat rate progress prediction method, system, device and medium
CN112819064B (en) Terminal area time sequence meteorological scene identification method based on spectral clustering
Zhang et al. Flight delay prediciton at an airport using maching learning
CN112256681A (en) Air traffic control digital index application system and method
CN111090793B (en) Airport day delay characteristic classification method based on average delay time sequence
Ren et al. Exploration of the evolution of airport ground delay programs
CN112883169B (en) Contradiction evolution analysis method and device based on big data
Dai et al. Having a Bad Day? Predicting High Delay Days in the National Airspace System
Du et al. Airport capacity prediction with multisource features: A temporal deep learning approach
Yang et al. A data-driven method for flight time estimation based on air traffic pattern identification and prediction
Miraç et al. Measurement of Process-Performances of Turkish Airports Using Network Data Envelopment Analysis
CN112926809B (en) Flight flow prediction method and system based on clustering and improved xgboost
Du et al. Mining multicity urban data for sustainable population relocation
Li Economic Development Analysis of China’s Coastal Industries Based on Deep Learning and SDA Model Weighting

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant