WO2022252850A1 - Seasonal classification method and apparatus for flights, and machine-readable medium - Google Patents

Seasonal classification method and apparatus for flights, and machine-readable medium Download PDF

Info

Publication number
WO2022252850A1
WO2022252850A1 PCT/CN2022/087286 CN2022087286W WO2022252850A1 WO 2022252850 A1 WO2022252850 A1 WO 2022252850A1 CN 2022087286 W CN2022087286 W CN 2022087286W WO 2022252850 A1 WO2022252850 A1 WO 2022252850A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
flight
income
revenue
target
Prior art date
Application number
PCT/CN2022/087286
Other languages
French (fr)
Chinese (zh)
Inventor
张毅
周榕
梁巍
陈思
Original Assignee
中国民航信息网络股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国民航信息网络股份有限公司 filed Critical 中国民航信息网络股份有限公司
Publication of WO2022252850A1 publication Critical patent/WO2022252850A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Definitions

  • the present application relates to the field of aviation, in particular to a method, device and machine-readable medium for seasonal classification of flights.
  • the seasonal classification of aviation flights is mainly carried out quantitatively or qualitatively.
  • artificially judging the seasonal classification of aviation flights will inevitably lead to biased results, which will lead to the accuracy of seasonal classification of aviation flights. not tall.
  • the application provides a method, device and machine-readable medium for seasonal classification of flights, which can improve the accuracy of seasonal classification and avoid bias caused by artificial seasonal classification of flights.
  • the first aspect of the embodiment of the present application provides a seasonal classification method for flights, including:
  • N N data pools for seasonal classification, wherein said N is an integer greater than or equal to 2;
  • the seasonal classification of the first flight is determined according to the second revenue data and the central data of each of the N data pools.
  • the seasonal classification corresponding to the second data pool is determined as the seasonal classification of the first flight.
  • the second aspect of the embodiment of the present application provides a flight seasonal classification device, including:
  • the first acquisition unit is used to acquire the first flight data of the first flight to be classified seasonally in the target airline company;
  • the second obtaining unit is configured to obtain the second flight data of each historical flight in the historical flight set corresponding to the first flight;
  • the first determination unit is used to determine N data pools for seasonal classification, wherein the N is an integer greater than or equal to 2;
  • a first calculation unit configured to calculate the first revenue data of each historical flight according to the second flight data
  • a second determining unit configured to determine the central data of each of the N data pools according to the first income data
  • a second calculation unit configured to calculate second revenue data of the first flight according to the first flight data
  • the third determination unit is configured to determine the seasonal classification of the first flight according to the second income data and the central data of each data pool in the N data pools.
  • the second determining unit is specifically configured to:
  • the target revenue data being the central data of the first data pool, the target revenue data being the revenue data corresponding to the second flight, the second flight being any flight in the set of historical flights, and the first data pool Any one of the N data pools;
  • the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the historical flight set except a collection of flights other than said second flight;
  • the fourth revenue data is the revenue data corresponding to the third flight
  • the third flight is the third flight in the subset of flights that is related to the target revenue data. - the flight corresponding to the nearest revenue data;
  • the calculation of the first distance between the third income data and the target income data by the second determination unit includes:
  • D(i, j) is the first distance between the target income data i and the third income data j
  • the third income data j is the income data corresponding to any flight in the flight subset
  • W(t) is the weight of the third income data
  • X it is the market demand value of the target income data
  • Y jt is the market demand value corresponding to the third income data.
  • the calculation of the center data of the target data pool after division by the second determining unit includes:
  • New_mean(t) is the central data of the target data pool after dividing the fourth income data into the target data pool
  • Old_mean(t) is dividing the fourth income input into the target data pool
  • m is the number of central data in the target data pool before the fourth income input is divided into the target data pool
  • X(t) is the number of central data in the target data pool The market demand value corresponding to any income data.
  • the first calculation unit is specifically configured to:
  • the first revenue data of each historical flight is calculated by the following formula:
  • Revenue Dcp(x) is the first revenue data of the xth flight in each of the historical flights, i represents the cabin, k is the total number of cabins, and BKG (i) is the reservation of the i-th cabin, Fare(i) is the fare of the ith cabin.
  • the third determining unit is specifically configured to:
  • the seasonal classification corresponding to the second data pool is determined as the seasonal classification of the first flight.
  • the third aspect of the present application provides a computer device, including: a memory, a processor, and a bus system; wherein, the memory is used to store programs, and the bus system is used to connect the memory and the processor so that the memory and the processor communicate; The device is used to execute the program in the memory, and execute the seasonal classification method of the flight according to the above-mentioned first aspect according to the instruction in the program code.
  • the fourth aspect of the embodiment of the present application provides a machine-readable medium, which includes instructions, and when it is run on a machine, causes the machine to execute the method for seasonally classifying flights described in the first aspect above.
  • the flight seasonal classification device determines the seasonal classification of the first flight, it can obtain the historical flight data corresponding to the first flight, and then the historical flight data The flight data is clustered to obtain the central data of N clusters, and the seasonal classification of the first flight is determined according to the central data of the N clusters and the income data of the first flight.
  • the accuracy of seasonal classification can be improved, and the bias caused by artificial seasonal classification of flights can be avoided.
  • Fig. 1 is a schematic flow chart of the seasonal classification method of flights provided by the embodiment of the present application
  • Fig. 2 is the virtual structural schematic diagram of the flight season classification device that the embodiment of the present application provides;
  • FIG. 3 is a schematic structural diagram of a machine-readable medium provided by an embodiment of the present application.
  • FIG. 4 is a schematic diagram of a hardware structure of a server provided by an embodiment of the present application.
  • the sample data after classification can better reflect the real historical market conditions and market trends in each time period, and the calculation for predicting output results is more accurate. For example, for the National Day For market forecasting of flights with long holidays, it is necessary to use the flight data of departing flights that are also National Day or long holidays of the same quality as a sample. At present, airlines divide aviation flights quantitatively or qualitatively into off-peak seasons, but through Artificial division of low and peak seasons will inevitably lead to biased results, leading to inaccurate seasonal divisions.
  • a method for seasonal classification of flights uses cluster analysis to perform cluster analysis on the historical data of the same period as the flights to be classified seasonally, and obtains N clusters class, and then classify the flights to be seasonally classified according to N clusters. Compared with the existing artificial seasonal return, it is more accurate and avoids biased classification results.
  • Clustering is to divide the sample space into multiple subspaces according to a certain similarity criterion, so that the sample points in each subspace are similar and similar, and the differences between sample points in different subspaces are far and wide.
  • the process is an unsupervised learning The process can realize the blind classification of the sample space.
  • Clustering is widely used in statistics, machine learning, pattern recognition, data analysis and other fields. At present, there are nearly a hundred kinds of clustering algorithms applied in many fields, and the processing objects range from general databases to ultra-large-scale databases, from low-dimensional data spaces to high-dimensional data spaces, and from digital attribute data to multi-attribute data.
  • FIG. 1 is a schematic flow chart of the seasonal classification method of the flight provided by the embodiment of the application. include:
  • the device for seasonal flight classification acquires the first flight data of the first flight to be classified seasonally in the target airline company.
  • the first flight includes at least one flight segment.
  • the segment of the voyage for example, the first flight is the flight corresponding to Beijing-Shanghai-San Francisco, there are three possible passenger voyages: Beijing-Shanghai, Shanghai-San Francisco and Beijing-San Francisco, that is, the first flight includes Beijing-Shanghai Flight segment, Shanghai-San Francisco flight segment and Beijing-San Francisco flight segment, a total of 3 flight segments.
  • the first flight data is the data collection points (Data collection points, DCP) data of the designated airline (that is, the target airline company), including at least one of the following: flight number, departure airport, arrival airport, flight segment departure date Reservation value and corresponding transportation value of each class of cabin and time, flight segment.
  • DCP data collection points
  • the first flight data may also include the departure date of the flight, the week corresponding to the departure date, the specific departure time, reservation value data and market demand value.
  • the flight data of the flight is used for description, and of course, the flight segment in the flight can also be used as an example for description, which is not specifically limited.
  • the market demand value refers to the demand that passengers have the ability to purchase and have actual purchase needs. Actual orders may or may not be generated.
  • the limit algorithm calculation model is used to output the market demand value, and the market demand value is obtained. It is widely used in judging market trends and dividing market seasons, and is an important input value applied to the optimization module of the core algorithm subsystem of the revenue management system.
  • the revenue management system is used by airlines to use flight plans, inventory, departure and freight data.
  • An automated management system that automatically manages the inventory of non-departure flights.
  • the method of obtaining the market demand value of the flight is not specifically limited here. For example, obtain the flight information of the specified flight of the target airline company, and obtain inventory data based on the flight information.
  • the inventory data specifically includes the inventory information of departing flights and non-departing flights Inventory information, where the inventory data of departed flights refers to the flight inventory data of departing flights in the past three years based on the current date of the specified flight, and the inventory data of non-departing flights refers to the future of the designated flight based on the current date One year's flight inventory data; judge the sales status of the specified flight class based on the inventory data.
  • the flight information of the designated airline the inventory data of the designated flight, etc., to identify the sales status of the designated cabin of the designated flight.
  • the sales status includes locked cabin, open cabin, etc., and finally based on the preset algorithm Process the sales status to obtain the market demand value of the specified flight.
  • the available status when the number of available seats in a designated cabin is less than or equal to zero, and the available status is open or closed, it means that the designated cabin is in a non-saleable state, that is, the cabin is locked; the number of available seats in a designated cabin is greater than zero, and the available If the status is closed, it means that the specified cabin is not available for sale; it is locked; if the number of available seats in the specified cabin is greater than zero, the available status is open, indicating that the specified cabin is available for sale; that is, the cabin is opened;
  • the market demand value of the data collection point DCP(n+1) is calculated by the following formula:
  • the DCP data is determined by the number of days away from the port at the current moment, and there is a one-to-one correspondence with the number of days away from the port.
  • the following examples illustrate:
  • the data collection point can be set to 24 (Dcp 1 , Dcp 2 , ..., Dcp 24 ), and the date from the port can be set to 365 days, wherein the data collection point corresponds to the number of days from the port, and the data collection point
  • the data collection point Dcp 1 corresponds to 365 days from the port date
  • Dcp 24 corresponds to 1 day from the port date.
  • the corresponding relationship of is just an example and is not specifically limited.
  • the data collection point can also include data such as cabin class, seat reservation, and ticket price, among which:
  • the seat reservation is marked as BKG, and the reservation of the passenger's reserved seat and class of boarding or the weight and volume of luggage;
  • the fare is marked as Fare, which refers to the fee charged by passengers from point A to point B, or it can be said that it is a price that passengers need to pay, and the conditions attached to this price to stipulate the use of this price (refers to freight, The sum of rules and various restrictions), the above are the basic conditions for automatic fare calculation.
  • the international freight rate mainly includes the following contents: city pair (also known as the market), rule number, freight level, footnote (optional), currency, amount, effective date, deadline, mileage, etc.
  • the device for seasonal flight classification can obtain the second flight data of each historical flight in the historical flight set corresponding to the first flight. Since the local database regularly stores the flight data of the target airline company, the first When the second flight data of each historical flight in the historical flight collection corresponding to the first flight only needs to be extracted from the local database.
  • the set of historical flights corresponding to the first flight is the departing flight associated with the current date in the three years before the current date (of course, it can also be other durations, such as 4 days, not specifically limited), such as the first The flight is a certain flight on April 25, 2021, and April 25 is a Sunday, then the historical flight collection is a collection of flights corresponding to the target flight on all Sundays in the past 3 years.
  • the device for seasonal classification of flights may determine N data pools for seasonal classification, where N is a positive integer greater than or equal to 2. It can be understood that, in this application, the number of N is set to 7, and the 7 data pools are divided into peak season 1 (denoted as peak), peak season 2 (denoted as peak1), off-season 1 (denoted as peak2), and average season 1 (denoted as peak2). Season 2 (denoted as offpeak2), off-season 1 (denoted as offpeak1), off-season 2 (denoted as offpeak) and the uncategorized data pool default. Do limited.
  • step 101 the first flight data of the first flight to be classified seasonally in the target airline company can be obtained, and through step 102, the data of each historical flight in the historical flight set corresponding to the first flight can be obtained
  • step 103 N data pools for seasonal classification can be determined through step 103.
  • Step 101 can be executed first, or step 102 can be executed first, or Step 103 is executed first, or executed at the same time, which is not specifically limited.
  • the device for seasonal flight classification can calculate the first income data of each historical flight according to the second flight data, specifically as follows The formula calculates the first revenue data for each historical flight:
  • Revenue Dcp(x) is the first revenue data of the xth flight in each of the historical flights, i represents the cabin, k is the total number of cabins, and BKG (i) is the reservation of the i-th cabin, Fare(i) is the fare of the ith cabin.
  • Dcp 19 , Dcp 20 , Dcp 21 , Dcp 22 , Dcp 23 and Dcp 24 can be used as the historical flights of the first flight, and of course other flights can also be used
  • the data collection points are not limited.
  • the device for seasonal flight classification can determine the central data of each data pool in the N data pools according to the first revenue data of each historical flight.
  • the flight seasonal classification device determines the central data of each of the N data pools according to the first revenue data including:
  • the target revenue data is the revenue data corresponding to the second flight
  • the second flight is any flight in the historical flight collection
  • the first data pool is any of the N data pools One;
  • the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the flight collection in the historical flight collection except the target flight ;
  • the fourth income data is the income data corresponding to the third flight
  • the third flight is the flight corresponding to the first income data closest to the target income data in the flight subset ;
  • the flight season classification device may first determine the target revenue data as the central data of the first data pool, the target revenue data is the revenue data of the target flight, and the target flight is any flight in the historical flight collection , the first data pool is any one of the N data pools, that is, the revenue data corresponding to the N historical flights can be randomly selected here as the central data of the N data pools.
  • the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the flight collection except the target flight in the historical flight collection, Specifically, the first distance between the third income data and the central data of the N data pools can be calculated by the following formula:
  • D(i, j) is the first distance between the target income data i and the third income data j
  • the third income data j is the income data corresponding to any flight in the flight subset
  • W(t) is the weight of the third income data
  • X it is the market demand value of the target income data
  • Y jt is the market demand value corresponding to the third income data.
  • the device for seasonal flight classification can divide the fourth revenue data into the first data pool, and the fourth revenue data is The revenue data corresponding to the third flight, the third flight is the flight corresponding to the revenue data with the closest distance to the target revenue data in the flight subset, that is to say, each flight in the flight subset can be separately Divide into the data pool closest to each flight.
  • the flight season classification device can calculate the center data of the divided first data pool, specifically, the center data of the first data pool can be calculated by the following formula:
  • New_mean(t) is the central data of the target data pool after the fourth income data is divided into the target data pool
  • Old_mean(t) is the central data of the target data pool before the fourth income input is divided into the target data pool
  • m is The amount of central data in the target data pool before the fourth income input is divided into the target data pool
  • X(t) is the market demand value corresponding to any income data in the target data pool.
  • the revenue data corresponding to each historical flight in the historical flight collection can be clustered and analyzed to divide the revenue data corresponding to each flight into N data pools, and the divided N data can be obtained pool and the central data of each of the N data pools.
  • the flight seasonal classification device can randomly select N second income data from the second income data of each flight in the historical flight collection as the central data of N data pools, and then randomly select from the flight subset A flight, calculate the distance between the second income data corresponding to the randomly selected flight and the central data of N data pools, and divide the second income data corresponding to the randomly selected flight into the The second revenue data corresponding to the flight is the closest data pool, and calculate the central data of the data pool; then select another flight from the flight subset, and repeat the above process until all the flights in the historical flight collection correspond to Until the division of the second income data is completed, the divided N data pools are obtained, and then the central data of the N data pools can be calculated.
  • the device for seasonal classification of flights can calculate the second income data of the first flight according to the data of the first flight, that is, calculate the income data of the first flight to be classified seasonally.
  • the following formula can be used Calculation:
  • Revenue Dcp(x) is the second income data
  • i represents the i-th cabin in the first flight
  • k is the total number of cabins in the first flight
  • BKG(i) is the reservation of the i-th cabin
  • Fare( i) is the fare of the i-th cabin.
  • the device for seasonal classification of flights can determine the seasonal classification of the first flight according to the second revenue data and the central data of each data pool, specifically , you can first calculate the second distance between the second income data and each of the N center data, where the N center data correspond to the N data pools, specifically, the second income data and N can be calculated by the following formula The second distance of each center data in center data:
  • D(i, j) is the first distance between the target income data i and the third income data j
  • the third income data j is the income data corresponding to any flight in the flight subset
  • W(t) is The weight of the third income data
  • X it is the market demand value of the target income data
  • Y jt is the market demand value corresponding to the third income data.
  • the second data pool is the data pool corresponding to the center data with the second smallest distance between the N center data and the second income data, that is, the second income
  • the data is divided into the most similar clusters
  • the seasonal classification corresponding to the second data pool is determined as the seasonal classification of the first flight.
  • the flight seasonal classification device determines the seasonal classification of the first flight, it can obtain the historical flight data corresponding to the first flight, and then the historical flight data The flight data is clustered to obtain the central data of N clusters, and the seasonal classification of the first flight is determined according to the central data of the N clusters and the income data of the first flight.
  • the accuracy of seasonal classification can be improved, and the bias caused by artificial seasonal classification of flights can be avoided.
  • each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more logical functions for implementing specified executable instructions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented by a dedicated hardware-based system that performs the specified functions or operations , or may be implemented by a combination of dedicated hardware and computer instructions.
  • the present application can also write computer program codes for performing the operations of the present application in one or more programming languages or combinations thereof, and the above-mentioned programming languages include but are not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and also conventional procedural programming languages—such as "C" or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer can be connected to the user computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as through an Internet service provider). Internet connection).
  • LAN local area network
  • WAN wide area network
  • Internet connection any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as through an Internet service provider). Internet connection).
  • Fig. 2 is the imaginary structural representation of the flight seasonal classification device that the embodiment of the present application provides, and this flight seasonal classification device 200 comprises:
  • the first obtaining unit 201 is used to obtain the first flight data of the first flight to be classified seasonally in the target airline company;
  • the second obtaining unit 202 is used to obtain the second flight data of each historical flight in the historical flight set corresponding to the first flight;
  • the first determination unit 203 is configured to determine N data pools for seasonal classification, wherein the N is an integer greater than or equal to 2;
  • the first calculation unit 204 is configured to calculate the first revenue data of each historical flight according to the second flight data
  • the second determination unit 205 is configured to determine the central data of each data pool in the N data pools according to the first income data
  • the second calculation unit 206 is configured to calculate the second income data of the first flight according to the first flight data
  • the third determining unit 207 is configured to determine the seasonal classification of the first flight according to the second revenue data and the central data of each of the N data pools.
  • the second determining unit 205 is specifically configured to:
  • the target revenue data being the central data of the first data pool, the target revenue data being the revenue data corresponding to the second flight, the second flight being any flight in the set of historical flights, and the first data pool Any one of the N data pools;
  • the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the historical flight set except a collection of flights other than said second flight;
  • the fourth revenue data is the revenue data corresponding to the third flight
  • the third flight is the third flight in the subset of flights that is related to the target revenue data. - the flight corresponding to the nearest revenue data;
  • the calculation of the first distance between the third income data and the target income data by the second determination unit 205 includes:
  • D(i, j) is the first distance between the target income data i and the third income data j
  • the third income data j is the income data corresponding to any flight in the flight subset
  • W(t) is the weight of the third income data
  • X it is the market demand value of the target income data
  • Y jt is the market demand value corresponding to the third income data.
  • the calculation of the center data of the target data pool after division by the second determination unit 205 includes:
  • New_mean(t) is the central data of the target data pool after dividing the fourth income data into the target data pool
  • Old_mean(t) is dividing the fourth income input into the target data pool
  • m is the number of central data in the target data pool before the fourth income input is divided into the target data pool
  • X(t) is the number of central data in the target data pool The market demand value corresponding to any income data.
  • the first computing unit 204 is specifically configured to:
  • the first revenue data of each historical flight is calculated by the following formula:
  • Revenue Dcp(x) is the first revenue data of the xth flight in each of the historical flights, i represents the cabin, k is the total number of cabins, and BKG (i) is the reservation of the i-th cabin, Fare(i) is the fare of the ith cabin.
  • the third determining unit 207 is specifically configured to:
  • the seasonal classification corresponding to the second data pool is determined as the seasonal classification of the first flight.
  • the units involved in the embodiments described in the present application may be implemented by means of software or by means of hardware.
  • the name of the unit does not constitute a limitation on the unit itself under certain circumstances, for example, the acquisition unit may also be described as "a unit that acquires the credential information of the target user".
  • FPGAs Field Programmable Gate Arrays
  • ASICs Application Specific Integrated Circuits
  • ASSPs Application Specific Standard Products
  • SOCs System on Chips
  • CPLD Complex Programmable Logical device
  • FIG. 3 is a schematic diagram of an embodiment of a machine-readable medium provided in an embodiment of the present application.
  • this embodiment provides a machine-readable medium 300 on which a computer program 311 is stored, and when the computer program 311 is executed by a processor, the seasonal classification method for flights described in FIG. 1 is implemented. A step of.
  • a machine-readable medium may be a tangible medium, which may contain or store a program for use by an instruction execution system, device, or device or in combination with an instruction execution system, device, or device.
  • a machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • a machine-readable medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing.
  • machine-readable storage media would include one or more wire-based electrical connections, portable computer discs, hard drives, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read only memory
  • EPROM or flash memory erasable programmable read only memory
  • CD-ROM compact disk read only memory
  • magnetic storage or any suitable combination of the foregoing.
  • machine-readable medium mentioned above in this application may be a computer-readable signal medium or a computer-readable storage medium or any combination of the above two.
  • a computer readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples of computer-readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer diskettes, hard disks, random access memory (RAM), read-only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which can transmit, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device .
  • Program code embodied on a computer readable medium may be transmitted by any appropriate medium, including but not limited to wires, optical cables, RF (radio frequency), etc., or any suitable combination of the above.
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device, or may exist independently without being incorporated into the electronic device.
  • FIG. 4 is a schematic diagram of the hardware structure of a server provided by the embodiment of the present application.
  • the server 400 may have relatively large differences due to different configurations or performances, and may include one or more central processing units (central processing units, CPU) 422 (eg, one or more processors) and memory 432, and one or more storage media 430 (eg, one or more mass storage devices) for storing application programs 442 or data 444.
  • the memory 432 and the storage medium 430 may be temporary storage or persistent storage.
  • the program stored in the storage medium 430 may include one or more modules (not shown in the figure), and each module may include a series of instruction operations on the server.
  • the central processing unit 422 may be configured to communicate with the storage medium 430 , and execute a series of instruction operations in the storage medium 430 on the server 400 .
  • the server 400 can also include one or more power supplies 426, one or more wired or wireless network interfaces 450, one or more input and output interfaces 458, and/or, one or more operating systems 441, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, etc.
  • the steps performed by the device for classifying flight seasons in the above embodiments may be based on the server structure shown in FIG. 4 .
  • the process of the seasonal classification method for flights described in the schematic flowchart of FIG. 1 may be implemented as a computer software program.
  • the embodiments of the present application include a computer program product, which includes a computer program carried on a non-transitory computer-readable medium, the computer program including a program for performing the method shown in the flowchart schematic diagram of FIG. 1 above code.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Tourism & Hospitality (AREA)
  • General Engineering & Computer Science (AREA)
  • Educational Administration (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Development Economics (AREA)
  • Mathematical Physics (AREA)
  • Fuzzy Systems (AREA)
  • Software Systems (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present application provides a seasonal classification method and apparatus for flights, and a machine-readable medium, which can improve the accuracy of seasonal classification and avoid the bias phenomenon caused by the artificial seasonal classification of flights. The method comprises: obtaining first flight data of a first flight to be subjected to seasonal classification in a target airline; obtaining second flight data of each historical flight in a historical flight set corresponding to the first flight; determining N data pools for seasonal classification, N being an integer greater than or equal to 2; calculating first income data of each historical flight according to the second flight data; determining center data of each data pool in the N data pools according to the first income data; calculating second income data of the first flight according to the first flight data; and determining seasonal classification of the first flight according to the second income data and the center data of each data pool in the N data pools.

Description

航班的季节性归类方法、装置及机器可读介质Flight Seasonal Classification Method, Apparatus, and Machine-Readable Medium
本申请要求于2021年5月31日提交中国专利局、申请号为202110604313.8、发明名称为“航班的季节性归类方法、装置及机器可读介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 202110604313.8 and the title of the invention "Seasonal Classification Method, Device and Machine-Readable Medium for Flights" submitted to the China Patent Office on May 31, 2021, the entire content of which Incorporated in this application by reference.
技术领域technical field
本申请涉及航空领域,尤其涉及一种航班的季节性归类方法、装置及机器可读介质。The present application relates to the field of aviation, in particular to a method, device and machine-readable medium for seasonal classification of flights.
背景技术Background technique
受气候条件、突发事件、工农业生产生活、居民节假日等风俗习惯以及国民经济发展等因素的周期性影响,民航运输业客货运量呈季节性波动。在航空运输领域。通过将已离港历史航班进行淡旺季归类,也称之为季节性归类。Due to the cyclical influence of climate conditions, emergencies, industrial and agricultural production and life, residents' holidays and other customs, as well as national economic development and other factors, the passenger and cargo volume of the civil aviation transportation industry fluctuates seasonally. in the field of air transport. By classifying the departing historical flights into low and peak seasons, it is also called seasonal classification.
目前来说,主要是通过人为定量或定性的对航空航班进行的季节性归类,然而人为判断航空航班的季节性归类难免出现偏颇的结果,进而导致对航空航班的季节性归类精确度不高。At present, the seasonal classification of aviation flights is mainly carried out quantitatively or qualitatively. However, artificially judging the seasonal classification of aviation flights will inevitably lead to biased results, which will lead to the accuracy of seasonal classification of aviation flights. not tall.
发明内容Contents of the invention
本申请提供了一种航班的季节性归类方法、装置及机器可读介质,可以提高季节性归类的准确性,避免因为人为对航班的季节性归类而造成的偏颇现象。The application provides a method, device and machine-readable medium for seasonal classification of flights, which can improve the accuracy of seasonal classification and avoid bias caused by artificial seasonal classification of flights.
本申请实施例第一方面提供了一种航班的季节性归类方法,包括:The first aspect of the embodiment of the present application provides a seasonal classification method for flights, including:
获取目标航司中待进行季节性归类的第一航班的第一航班数据;Obtain the first flight data of the first flight to be classified seasonally in the target airline company;
获取所述第一航班所对应的历史航班集合中每个历史航班的第二航班数据;Obtaining the second flight data of each historical flight in the historical flight set corresponding to the first flight;
确定季节性归类的N个数据池,其中,所述N为大于或等2的整数;Determine N data pools for seasonal classification, wherein said N is an integer greater than or equal to 2;
根据所述第二航班数据计算所述每个历史航班的第一收入数据;calculating the first revenue data of each historical flight according to the second flight data;
根据所述第一收入数据确定所述N个数据池中每个数据池的中心数据;determining the central data of each of the N data pools according to the first income data;
根据所述第一航班数据计算所述第一航班的第二收入数据;calculating second revenue data for the first flight based on the first flight data;
根据所述第二收入数据以及所述N个数据池中每个数据池的中心数据确定所述第一航班的季节性归类。The seasonal classification of the first flight is determined according to the second revenue data and the central data of each of the N data pools.
将所述第二数据池所对应的季节性归类确定为所述第一航班的季节性归类。The seasonal classification corresponding to the second data pool is determined as the seasonal classification of the first flight.
本申请实施例第二方面提供了一种航班季节性归类装置,包括:The second aspect of the embodiment of the present application provides a flight seasonal classification device, including:
第一获取单元,用于获取目标航司中待进行季节性归类的第一航班的第一航班数据;The first acquisition unit is used to acquire the first flight data of the first flight to be classified seasonally in the target airline company;
第二获取单元,用于获取所述第一航班所对应的历史航班集合中每个历史航班的第二航班数据;The second obtaining unit is configured to obtain the second flight data of each historical flight in the historical flight set corresponding to the first flight;
第一确定单元,用于确定季节性归类的N个数据池,其中,所述N为大于或等2的整数;The first determination unit is used to determine N data pools for seasonal classification, wherein the N is an integer greater than or equal to 2;
第一计算单元,用于根据所述第二航班数据计算所述每个历史航班的第一收入数据;a first calculation unit, configured to calculate the first revenue data of each historical flight according to the second flight data;
第二确定单元,用于根据所述第一收入数据确定所述N个数据池中每个数据池的中心数据;A second determining unit, configured to determine the central data of each of the N data pools according to the first income data;
第二计算单元,用于根据所述第一航班数据计算所述第一航班的第二收入数据;a second calculation unit, configured to calculate second revenue data of the first flight according to the first flight data;
第三确定单元,用于根据所述第二收入数据以及所述N个数据池中每个数据池的中心 数据确定所述第一航班的季节性归类。The third determination unit is configured to determine the seasonal classification of the first flight according to the second income data and the central data of each data pool in the N data pools.
一种可能的设计中,所述第二确定单元具体用于:In a possible design, the second determining unit is specifically configured to:
将目标收入数据确定为第一数据池的中心数据,所述目标收入数据为第二航班对应的收入数据,所述第二航班为所述历史航班集合中任意一个航班,所述第一数据池为所述N个数据池中的任意一个;Determining the target revenue data as the central data of the first data pool, the target revenue data being the revenue data corresponding to the second flight, the second flight being any flight in the set of historical flights, and the first data pool Any one of the N data pools;
计算第三收入数据与所述目标收入数据的第一距离,所述第三收入数据为航班子集合中的任意一个航班所对应的收入数据,所述航班子集合为所述历史航班集合中除所述第二航班之外的航班集合;Calculate the first distance between the third income data and the target income data, the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the historical flight set except a collection of flights other than said second flight;
将第四收入数据划分至所述第一数据池,所述第四收入数据为第三航班所对应的收入数据,所述第三航班为所述航班子集合中与所述目标收入数据的第一距离最近的收入数据所对应的航班;Divide the fourth revenue data into the first data pool, the fourth revenue data is the revenue data corresponding to the third flight, and the third flight is the third flight in the subset of flights that is related to the target revenue data. - the flight corresponding to the nearest revenue data;
计算划分后所述第一数据池的中心数据。Calculate central data of the first data pool after division.
一种可能的设计中,所述第二确定单元计算第三收入数据与所述目标收入数据的第一距离包括:In a possible design, the calculation of the first distance between the third income data and the target income data by the second determination unit includes:
根据如下公式计算计算第三收入数据与所述目标收入数据的第一距离:Calculate and calculate the first distance between the third income data and the target income data according to the following formula:
D(i,j)=W(t)×|X it-Y jt|; D(i,j)=W(t)×|X it -Y jt |;
其中,D(i,j)为所述目标收入数据i与第三收入数据j之间的第一距离,所述第三收入数据j为所述航班子集合中任意一个航班所对应的收入数据,W(t)为所述第三收入数据的权重,X it为所述目标收入数据的市场需求值,Y jt为所述第三收入数据所对应的市场需求值。 Wherein, D(i, j) is the first distance between the target income data i and the third income data j, and the third income data j is the income data corresponding to any flight in the flight subset , W(t) is the weight of the third income data, X it is the market demand value of the target income data, and Y jt is the market demand value corresponding to the third income data.
一种可能的设计中,所述第二确定单元计算划分后所述目标数据池的中心数据包括:In a possible design, the calculation of the center data of the target data pool after division by the second determining unit includes:
通过如下公式计算划分后所述目标数据池的中心数据:Calculate the center data of the target data pool after division by the following formula:
Figure PCTCN2022087286-appb-000001
Figure PCTCN2022087286-appb-000001
其中,New_mean(t)为将所述第四收入数据划分至所述目标数据池后所述目标数据池的中心数据,Old_mean(t)为将所述第四收入输入划分至所述目标数据池前所述目标数据池的中心数据,m为将所述第四收入输入划分至所述目标数据池之前,所述目标数据池的中心数据的数量,X(t)为所述目标数据池中任意一个收入数据所对应的市场需求值。Wherein, New_mean(t) is the central data of the target data pool after dividing the fourth income data into the target data pool, and Old_mean(t) is dividing the fourth income input into the target data pool The central data of the aforementioned target data pool, m is the number of central data in the target data pool before the fourth income input is divided into the target data pool, and X(t) is the number of central data in the target data pool The market demand value corresponding to any income data.
一种可能的设计中,所述第一计算单元具体用于:In a possible design, the first calculation unit is specifically configured to:
通过如下公式计算所述每个历史航班的第一收入数据:The first revenue data of each historical flight is calculated by the following formula:
Figure PCTCN2022087286-appb-000002
Figure PCTCN2022087286-appb-000002
其中,Revenue Dcp(x)为所述每个历史航班中第x个航班的第一收入数据,i表示 舱位,k为舱位的总数,BKG(i)为所述第i个舱位的订座,Fare(i)为所述第i个舱位的票价。Wherein, Revenue Dcp(x) is the first revenue data of the xth flight in each of the historical flights, i represents the cabin, k is the total number of cabins, and BKG (i) is the reservation of the i-th cabin, Fare(i) is the fare of the ith cabin.
一种可能的设计中,所述第三确定单元具体用于:In a possible design, the third determining unit is specifically configured to:
计算所述第二收入数据与N个中心数据中每个中心数据的第二距离,其中,所述N个中心数据与所述N个数据池相对应;calculating a second distance between the second income data and each of the N central data, wherein the N central data correspond to the N data pools;
将所述第二收入数据划分至第二数据池,所述第二数据池为所述N个中心数据中与所述第二收入数据之间的第二距离最小的中心数据所对应的数据池;Dividing the second income data into a second data pool, the second data pool being the data pool corresponding to the central data with the smallest second distance between the N central data and the second income data ;
将所述第二数据池所对应的季节性归类确定为所述第一航班的季节性归类。The seasonal classification corresponding to the second data pool is determined as the seasonal classification of the first flight.
本申请第三方面提供了一种计算机设备,包括:存储器、处理器以及总线系统;其中,存储器用于存储程序,总线系统用于连接存储器以及处理器,以使存储器以及处理器进行通信;处理器用于执行所述存储器中的程序,并根据程序代码中的指令执行上述第一方面所述航班的季节性归类方法。The third aspect of the present application provides a computer device, including: a memory, a processor, and a bus system; wherein, the memory is used to store programs, and the bus system is used to connect the memory and the processor so that the memory and the processor communicate; The device is used to execute the program in the memory, and execute the seasonal classification method of the flight according to the above-mentioned first aspect according to the instruction in the program code.
本申请实施例第四方面提供了一种机器可读介质,其包括指令,当其在机器上运行时,使得机器执行上述第一方面所述的航班的季节性归类方法。The fourth aspect of the embodiment of the present application provides a machine-readable medium, which includes instructions, and when it is run on a machine, causes the machine to execute the method for seasonally classifying flights described in the first aspect above.
综上所述,可以看出,本申请提供的实施例中,航班季节性归类装置在确定第一航班的季节性分类时,可以获取第一航班所对应的历史航班数据,之后对该历史航班数据进行聚类,得到N个聚类的中心数据,并根据该N个聚类的中心数据以及该第一航班的收入数据确定该第一航班的季节性分类,相对于现有的通过人为对航班进行季节性归类来说,可以提高季节性归类的准确性,避免因为人为对航班的季节性归类而造成的偏颇现象。In summary, it can be seen that in the embodiment provided by this application, when the flight seasonal classification device determines the seasonal classification of the first flight, it can obtain the historical flight data corresponding to the first flight, and then the historical flight data The flight data is clustered to obtain the central data of N clusters, and the seasonal classification of the first flight is determined according to the central data of the N clusters and the income data of the first flight. Compared with the existing artificial For the seasonal classification of flights, the accuracy of seasonal classification can be improved, and the bias caused by artificial seasonal classification of flights can be avoided.
附图说明Description of drawings
结合附图并参考以下具体实施方式,本申请各实施例的上述和其他特征、优点及方面将变得更加明显。贯穿附图中,相同或相似的附图标记表示相同或相似的元素。应当理解附图是示意性的,原件和元素不一定按照比例绘制。The above and other features, advantages and aspects of the various embodiments of the present application will become more apparent with reference to the following detailed description in conjunction with the accompanying drawings. Throughout the drawings, the same or similar reference numerals denote the same or similar elements. It should be understood that the drawings are schematic and that elements and elements are not necessarily drawn to scale.
图1为本申请实施例提供的航班的季节性归类方法的流程示意图;Fig. 1 is a schematic flow chart of the seasonal classification method of flights provided by the embodiment of the present application;
图2为本申请实施例提供的航班季节性归类装置的虚拟结构示意图;Fig. 2 is the virtual structural schematic diagram of the flight season classification device that the embodiment of the present application provides;
图3为本申请实施例提供的机器可读介质的结构示意图;FIG. 3 is a schematic structural diagram of a machine-readable medium provided by an embodiment of the present application;
图4为本申请实施例提供的服务器的硬件结构示意图。FIG. 4 is a schematic diagram of a hardware structure of a server provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将参照附图更详细地描述本申请的实施例。虽然附图中显示了本申请的某些实施例,然而应当理解的是,本申请可以通过各种形式来实现,而且不应该被解释为限于这里阐述的实施例,相反提供这些实施例是为了更加透彻和完整地理解本申请。应当理解的是,本申请的附图及实施例仅用于示例性作用,并非用于限制本申请的保护范围。Embodiments of the present application will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present application are shown in the drawings, it should be understood that the application may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; A more thorough and complete understanding of the application. It should be understood that the drawings and embodiments of the present application are for exemplary purposes only, and are not intended to limit the protection scope of the present application.
本申请中使用的术语“包括”及其变形是开放性包括,即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”;术语“另一实施例”表示“至少一个另外的实施例”;术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。The term "comprising" and its variants used in this application are open-ended, ie "including but not limited to". The term "based on" is "based at least in part on". The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one further embodiment"; the term "some embodiments" means "at least some embodiments." Relevant definitions of other terms will be given in the description below.
需要注意,本申请中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单 元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。It should be noted that concepts such as "first" and "second" mentioned in this application are only used to distinguish different devices, modules or units, and are not used to limit the sequence of functions performed by these devices, modules or units or interdependence.
需要注意,本申请中提及的“一个”、“多个”的修饰是示意性而非限制性的,本领域技术人员应当理解,除非在上下文另有明确指出,否则应该理解为“一个或多个”。It should be noted that the modifications of "one" and "multiple" mentioned in this application are illustrative and not restrictive. Those skilled in the art should understand that unless the context clearly indicates otherwise, it should be understood as "one or more" multiple".
通过将已离港历史航班进行淡旺季归类,归类后样本数据可以更好的反应真实的各时段的历史市场情况与市场走势,用于预测输出结果的计算更为精确,例如,对国庆节长假的航班进行市场预测,需要使用同样为国庆节或同质长假航班的已离港航班的航班数据作为样本,目前航空公司是通过人为对航空航班进行定量或定性的淡旺季划分,但是通过人为进行淡旺季的划分难免会出现偏颇的结果,导致季节性划分的不准确性。By classifying the departing historical flights into low and peak seasons, the sample data after classification can better reflect the real historical market conditions and market trends in each time period, and the calculation for predicting output results is more accurate. For example, for the National Day For market forecasting of flights with long holidays, it is necessary to use the flight data of departing flights that are also National Day or long holidays of the same quality as a sample. At present, airlines divide aviation flights quantitatively or qualitatively into off-peak seasons, but through Artificial division of low and peak seasons will inevitably lead to biased results, leading to inaccurate seasonal divisions.
有鉴于此,本申请实施例提供的一种航班的季节性归类方法,通过聚类分析的方式,将与待季节性归类的航班同时期的历史数据进行聚类分析,得到N个聚类,之后根据N个聚类对待季节性归类的航班进行分类。相对于现有的通过人为进行季节性归来来说,更加准确,避免出现偏颇的归类结果。In view of this, a method for seasonal classification of flights provided in the embodiment of the present application uses cluster analysis to perform cluster analysis on the historical data of the same period as the flights to be classified seasonally, and obtains N clusters class, and then classify the flights to be seasonally classified according to N clusters. Compared with the existing artificial seasonal return, it is more accurate and avoids biased classification results.
聚类为根据某种相似性准则将样本空间分为多个子空间,使每个子空间内部样本点相似相近,不同子空间内样本点之间差异性相异相远,其过程是一个无监督学习过程,能够实现样本空间的盲分类。聚类广泛应用于统计、机器学习、模式识别、数据分析等领域。目前己有应用于多个领域的聚类算法近百种,处理对象从一般数据库到超大规模数据库,从低维数据空间到高维数据空间,从数字属性数据到多种属性的数据。Clustering is to divide the sample space into multiple subspaces according to a certain similarity criterion, so that the sample points in each subspace are similar and similar, and the differences between sample points in different subspaces are far and wide. The process is an unsupervised learning The process can realize the blind classification of the sample space. Clustering is widely used in statistics, machine learning, pattern recognition, data analysis and other fields. At present, there are nearly a hundred kinds of clustering algorithms applied in many fields, and the processing objects range from general databases to ultra-large-scale databases, from low-dimensional data spaces to high-dimensional data spaces, and from digital attribute data to multi-attribute data.
下面从航班季节性归类装置的角度本申请提供的航班的季节性归类方法进行说明,请参阅图1,图1为本申请实施例提供的航班的季节性归类方法的一个流程示意图,包括:The following is an illustration of the seasonal classification method of the flight provided by the application from the perspective of the flight seasonal classification device. Please refer to FIG. 1. FIG. 1 is a schematic flow chart of the seasonal classification method of the flight provided by the embodiment of the application. include:
101、获取目标航司中待进行季节性归类的第一航班的第一航班数据。101. Obtain the first flight data of the first flight to be classified seasonally in the target airline company.
本实施例中,航班季节性归类装置获取目标航司中待进行季节性归类的第一航班的第一航班数据,该第一航班中包括至少一个航段,航段是指能够构成旅客航程的航段,例如该第一航班为北京-上海-旧金山所对应的航班,旅客航程有3种可能:北京-上海、上海-旧金山和北京-旧金山,也即该第一航班包括北京-上海航段、上海-旧金山航段和北京-旧金山航段,共3个航段。该第一航班数据为指定航空公司(也即目标航司)数据采集点(Data collection points,DCP)数据,包括以下至少之一:航班号、始发机场、达到机场、航班航段离港日期与时刻、航班航段各舱位订座值与对应运价值。In this embodiment, the device for seasonal flight classification acquires the first flight data of the first flight to be classified seasonally in the target airline company. The first flight includes at least one flight segment. The segment of the voyage, for example, the first flight is the flight corresponding to Beijing-Shanghai-San Francisco, there are three possible passenger voyages: Beijing-Shanghai, Shanghai-San Francisco and Beijing-San Francisco, that is, the first flight includes Beijing-Shanghai Flight segment, Shanghai-San Francisco flight segment and Beijing-San Francisco flight segment, a total of 3 flight segments. The first flight data is the data collection points (Data collection points, DCP) data of the designated airline (that is, the target airline company), including at least one of the following: flight number, departure airport, arrival airport, flight segment departure date Reservation value and corresponding transportation value of each class of cabin and time, flight segment.
可以理解的是,该第一航班数据还可以包括航班的离港日期、离港日期所对应的星期、具体的离港时刻、订座值数据与市场需求值。此处以航班的航班数据进行说明的,当然也还可以直接以航班中的航段为例进行说明,具体不做限定。市场需求值指的旅客具有能力购买且具有实际购买需求的需求,可以产生实际的订单,可以不产生实际的订单,在收益管理系统中应用限制算法计算模型输出市场需求值,该市场需求值被广泛应用与市场走势判断、市场淡旺季划分,并且是应用于收益管理系统核心算法子系统优化模块的重要输入值,是收益管理系统是航司利用航班计划、库存、离港与运价数据,对未离港航班的库存进行自动管理的自动化管理系统。此处具体不限定获取航班的市场需求值的方式,例如,获取目标航司的指定航班的航班信息,基于航班信息获取库存数据,该库存数据具体包括已离港航班库存信息和未离港航班库存信息,其中,已离岗航班库存数据为指定航班以当 前日期为基准的过往三年的已离港航班的航班库存数据,未离港航班的库存数据为指定航班在当前日期为基准的未来一年的航班库存数据;根据库存数据判断指定航班的指定舱位的销售状态。根据目标航司所对应的数据采集点,指定航空公司的航班信息、指定航班的库存数据等来识别指定航班的指定舱位的销售状态,销售状态包括锁舱、开舱等,最后基于预设算法对销售状态进行处理,得到指定航班的市场需求值。It can be understood that the first flight data may also include the departure date of the flight, the week corresponding to the departure date, the specific departure time, reservation value data and market demand value. Here, the flight data of the flight is used for description, and of course, the flight segment in the flight can also be used as an example for description, which is not specifically limited. The market demand value refers to the demand that passengers have the ability to purchase and have actual purchase needs. Actual orders may or may not be generated. In the revenue management system, the limit algorithm calculation model is used to output the market demand value, and the market demand value is obtained. It is widely used in judging market trends and dividing market seasons, and is an important input value applied to the optimization module of the core algorithm subsystem of the revenue management system. The revenue management system is used by airlines to use flight plans, inventory, departure and freight data. An automated management system that automatically manages the inventory of non-departure flights. The method of obtaining the market demand value of the flight is not specifically limited here. For example, obtain the flight information of the specified flight of the target airline company, and obtain inventory data based on the flight information. The inventory data specifically includes the inventory information of departing flights and non-departing flights Inventory information, where the inventory data of departed flights refers to the flight inventory data of departing flights in the past three years based on the current date of the specified flight, and the inventory data of non-departing flights refers to the future of the designated flight based on the current date One year's flight inventory data; judge the sales status of the specified flight class based on the inventory data. According to the data collection point corresponding to the target airline, the flight information of the designated airline, the inventory data of the designated flight, etc., to identify the sales status of the designated cabin of the designated flight. The sales status includes locked cabin, open cabin, etc., and finally based on the preset algorithm Process the sales status to obtain the market demand value of the specified flight.
需要说明的是,当指定舱位的可利用座位数小于等于零,可利用状态为开放、关闭,说明指定舱位处于非可销售状态,即为锁舱;指定舱位的可利用座位数大于零,可利用状态为关闭,说明指定舱位处于非可销售状态;即为锁舱;指定舱位的可利用座位数大于零,可利用状态为开放,说明指定舱位处于可销售状态;即为开舱;It should be noted that when the number of available seats in a designated cabin is less than or equal to zero, and the available status is open or closed, it means that the designated cabin is in a non-saleable state, that is, the cabin is locked; the number of available seats in a designated cabin is greater than zero, and the available If the status is closed, it means that the specified cabin is not available for sale; it is locked; if the number of available seats in the specified cabin is greater than zero, the available status is open, indicating that the specified cabin is available for sale; that is, the cabin is opened;
下面对如何基于预设算法对销售状态进行处理,得到指定航班的时长需求值进行说明:The following describes how to process the sales status based on the preset algorithm to obtain the demand value of the specified flight duration:
如果数据采集点DCP(n+1)的销售状态为开舱,且数据采集点DCP(n)的销售状态为开舱:If the sales status of the data collection point DCP(n+1) is open, and the sales status of the data collection point DCP(n) is open:
若数据采集点(n)较数据采集点(n+1)订座数是增加的,则使用如下公式计算数据采集点DCP(n+1)的市场需求值计算:If the data collection point (n) has more seats than the data collection point (n+1), use the following formula to calculate the market demand value of the data collection point DCP (n+1):
市场需求值DCP(n+1)=市场需求值DCP(n)+订座值增加变动值DCP(n),其中,订座增加变动值=实际订座值DCP(n+1)-实际订座值DCP(n)。Market demand value DCP(n+1)=market demand value DCP(n)+reservation value increase change value DCP(n), wherein, reservation increase change value=actual reservation value DCP(n+1)-actual reservation value Block value DCP(n).
若数据采集点(n)较数据采集点(n+1)订座数是减少的,则通过如下公式计算数据采集点DCP(n+1)的市场需求值:If the data collection point (n) has less reservations than the data collection point (n+1), the market demand value of the data collection point DCP(n+1) is calculated by the following formula:
市场需求值DCP(n+1)=市场需求值DCP(n)+订座减少变动值DCP(n),其中,订座减少变动值=(实际订座值DCP(n+1)x市场需求值(n))/实际订座值DCP(n)-市场需求值(n)。Market demand value DCP(n+1)=market demand value DCP(n)+reservation reduction fluctuation value DCP(n), wherein, reservation reduction fluctuation value=(actual reservation value DCP(n+1)×market demand value (n))/actual reservation value DCP(n)-market demand value (n).
若数据采集点DCP(n+1)的销售状态为锁舱,且数据采集点DCP(n)的销售状态为开舱:If the sales status of the data collection point DCP(n+1) is locked, and the sales status of the data collection point DCP(n) is open:
当数据采集点(n)较数据采集点(n+1)订座数是减少时,则通过如下公式计算When the number of reservations at the data collection point (n) is less than that of the data collection point (n+1), it is calculated by the following formula
市场需求值DCP(n+1)=市场需求值DCP(n)+订座减少变动值DCP(n),其中,订座减少变动值=(实际订座值DCP(n+1)x市场需求值(n))/实际订座值DCP(n)-市场需求值(n)。可以理解的是,上述市场需求值的计算是一个迭代的过程,即为DCP(1)的市场需求值等于实际订座值,迭代计算DCP+=1的市场需求值。Market demand value DCP(n+1)=market demand value DCP(n)+reservation reduction fluctuation value DCP(n), wherein, reservation reduction fluctuation value=(actual reservation value DCP(n+1)×market demand value (n))/actual reservation value DCP(n)-market demand value (n). It can be understood that the above calculation of the market demand value is an iterative process, that is, the market demand value of DCP(1) is equal to the actual reservation value, and the market demand value of DCP+=1 is iteratively calculated.
需要说明的是,DCP数据由当前时刻距离港天数而决定,与距离港天数为一一对应关系,下面进行举例说明:It should be noted that the DCP data is determined by the number of days away from the port at the current moment, and there is a one-to-one correspondence with the number of days away from the port. The following examples illustrate:
例如该数据采集点可以设置为24个(Dcp 1,Dcp 2,……,Dcp 24),距离港日期可以设置为365天,其中该数据采集点与距离港天数一一对应,数据采集点数据采集点Dcp 1对应距离港日期365天,Dcp 24对应距离港日期1天,可以理解的是,上述数据采集点的数量、距离港日期的天数以及数据采集点与距离港日期的天数之间的对应关系仅为举例说明,具体不做限定。 For example, the data collection point can be set to 24 (Dcp 1 , Dcp 2 , ..., Dcp 24 ), and the date from the port can be set to 365 days, wherein the data collection point corresponds to the number of days from the port, and the data collection point The data collection point Dcp 1 corresponds to 365 days from the port date, and Dcp 24 corresponds to 1 day from the port date. The corresponding relationship of is just an example and is not specifically limited.
还需要说明的是,数据采集点还可以包括舱位、订座以及票价等数据,其中:It should also be noted that the data collection point can also include data such as cabin class, seat reservation, and ticket price, among which:
舱位为支付价格、服务内容与设置的同类别的总称;Seat is the general term of the same category of payment price, service content and settings;
订座标记为BKG,对旅客预定的座位、舱位登机或对行李的重量、体积的预留;The seat reservation is marked as BKG, and the reservation of the passenger's reserved seat and class of boarding or the weight and volume of luggage;
票价标记为Fare,指旅客从地点A到地点B的被收取的费用,或者可以说是一个需要旅客付费的价格,以及附加在这个价格上、规定允许使用这个价格的条件(指运价、规则 和各种限制的总和),上述这些都是可以进行自动化票价计算的基础条件。国际运价主要包含以下内容:城市对(也成为市场)、规则号、运价等级、脚注(可选)、货币、金额、生效日期、截止日期、里程等。The fare is marked as Fare, which refers to the fee charged by passengers from point A to point B, or it can be said that it is a price that passengers need to pay, and the conditions attached to this price to stipulate the use of this price (refers to freight, The sum of rules and various restrictions), the above are the basic conditions for automatic fare calculation. The international freight rate mainly includes the following contents: city pair (also known as the market), rule number, freight level, footnote (optional), currency, amount, effective date, deadline, mileage, etc.
102、获取第一航班所对应的历史航班集合中每个历史航班的第二航班数据。102. Obtain the second flight data of each historical flight in the historical flight set corresponding to the first flight.
本实施例中,航班季节性归类装置可以获取第一航班所对应的历史航班集合中每个历史航班的第二航班数据,由于本地数据库会定期存储目标航司的航班数据,因此在获取第一航班所对应的历史航班集合中每个历史航班的第二航班数据时,只需要从本地数据库中提取即可。另外,该第一航班所对应的历史航班集合为当前日期之前三年(当然也还可以是其他时长,例如4天,具体不限定)中与当前日期相关联的已离港航班,例如第一航班为2021年4月25日中的某一个航班,4月25日为星期天,那么该历史航班集合中为过去3年中所有星期天中与该目标航班相对应的航班的集合。In this embodiment, the device for seasonal flight classification can obtain the second flight data of each historical flight in the historical flight set corresponding to the first flight. Since the local database regularly stores the flight data of the target airline company, the first When the second flight data of each historical flight in the historical flight collection corresponding to the first flight only needs to be extracted from the local database. In addition, the set of historical flights corresponding to the first flight is the departing flight associated with the current date in the three years before the current date (of course, it can also be other durations, such as 4 days, not specifically limited), such as the first The flight is a certain flight on April 25, 2021, and April 25 is a Sunday, then the historical flight collection is a collection of flights corresponding to the target flight on all Sundays in the past 3 years.
103、确定季节性归类的N个数据池。103. Determine N data pools for seasonal classification.
本实施例中,航班季节性归类装置可以确定季节性归类的N个数据池,其中,N为大于或等于2的正整数。可以理解的是,本申请中,将N的数量设置为7,将7个数据池划分为旺季1(记作peak)、旺季2(记作peak1)、平季1(记作peak2)、平季2(记作offpeak2)、淡季1(记作offpeak1)、淡季2(记作offpeak)以及无归类数据池default,此处的数据池的数量以及数据值的分类仅为举例说明,具体不做限定。In this embodiment, the device for seasonal classification of flights may determine N data pools for seasonal classification, where N is a positive integer greater than or equal to 2. It can be understood that, in this application, the number of N is set to 7, and the 7 data pools are divided into peak season 1 (denoted as peak), peak season 2 (denoted as peak1), off-season 1 (denoted as peak2), and average season 1 (denoted as peak2). Season 2 (denoted as offpeak2), off-season 1 (denoted as offpeak1), off-season 2 (denoted as offpeak) and the uncategorized data pool default. Do limited.
需要说明的是,通过步骤101可以获取目标航司中待进行季节性归类的第一航班的第一航班数据,通过步骤102可以获取第一航班所对应的历史航班集合中每个历史航班的第二航班数据,通过步骤103可以确定季节性归类的N个数据池,然而这三个步骤之间并没有先后执行顺序的限制,可以先执行步骤101,也可以先执行步骤102,也可以先执行步骤103,或者同时执行,具体不做限定。It should be noted that, through step 101, the first flight data of the first flight to be classified seasonally in the target airline company can be obtained, and through step 102, the data of each historical flight in the historical flight set corresponding to the first flight can be obtained For the second flight data, N data pools for seasonal classification can be determined through step 103. However, there is no restriction on the sequence of execution between these three steps. Step 101 can be executed first, or step 102 can be executed first, or Step 103 is executed first, or executed at the same time, which is not specifically limited.
104、根据第二航班数据计算每个历史航班的第一收入数据。104. Calculate the first revenue data of each historical flight according to the second flight data.
本实施例中,航班季节性归类装置在得到历史航班集合中每个历史航班的第二航班数据之后,可以根据第二航班数据计算每个历史航班的第一收入数据,具体的可以通过如下公式计算每个历史航班的第一收入数据:In this embodiment, after obtaining the second flight data of each historical flight in the historical flight collection, the device for seasonal flight classification can calculate the first income data of each historical flight according to the second flight data, specifically as follows The formula calculates the first revenue data for each historical flight:
Figure PCTCN2022087286-appb-000003
Figure PCTCN2022087286-appb-000003
其中,Revenue Dcp(x)为所述每个历史航班中第x个航班的第一收入数据,i表示舱位,k为舱位的总数,BKG(i)为所述第i个舱位的订座,Fare(i)为所述第i个舱位的票价。Wherein, Revenue Dcp(x) is the first revenue data of the xth flight in each of the historical flights, i represents the cabin, k is the total number of cabins, and BKG (i) is the reservation of the i-th cabin, Fare(i) is the fare of the ith cabin.
可以理解的是,为了计算简便,可以使用Dcp 19,Dcp 20,Dcp 21,Dcp 22,Dcp 23以及Dcp 24所对应的距离港天数的航班作为第一航班的历史航班,当然也还可以使用其他的数据采集点,具体不做限定。 It can be understood that, for the sake of simplicity of calculation, Dcp 19 , Dcp 20 , Dcp 21 , Dcp 22 , Dcp 23 and Dcp 24 can be used as the historical flights of the first flight, and of course other flights can also be used The data collection points are not limited.
105、根据第一收入数据确定N个数据池中每个数据池的中心数据。105. Determine the central data of each data pool in the N data pools according to the first income data.
本实施例中,航班季节性归类装置在得到每个历史航班的第一收入数据之后,可以根 据每个历史航班的第一收入数据确定N个数据池中每个数据池的中心数据。In this embodiment, after obtaining the first revenue data of each historical flight, the device for seasonal flight classification can determine the central data of each data pool in the N data pools according to the first revenue data of each historical flight.
一个实施例中,航班季节性归类装置根据第一收入数据确定N个数据池中每个数据池的中心数据包括:In one embodiment, the flight seasonal classification device determines the central data of each of the N data pools according to the first revenue data including:
将目标收入数据确定为第一数据池的中心数据,目标收入数据为第二航班对应的收入数据,第二航班为历史航班集合中任意一个航班,第一数据池为N个数据池中的任意一个;Determine the target revenue data as the central data of the first data pool, the target revenue data is the revenue data corresponding to the second flight, the second flight is any flight in the historical flight collection, and the first data pool is any of the N data pools One;
计算第三收入数据与目标收入数据的第一距离,所述第三收入数据为航班子集合中任意一个航班所对应的收入数据,航班子集合为历史航班集合中除目标航班之外的航班集合;Calculate the first distance between the third income data and the target income data, the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the flight collection in the historical flight collection except the target flight ;
将第四收入数据划分至第一数据池,第四收入数据为第三航班所对应的收入数据,第三航班为航班子集合中与目标收入数据的第一距离最近的收入数据所对应的航班;Divide the fourth income data into the first data pool, the fourth income data is the income data corresponding to the third flight, and the third flight is the flight corresponding to the first income data closest to the target income data in the flight subset ;
计算划分后第一数据池的中心数据。Calculate the central data of the divided first data pool.
本实施例中,航班季节性归类装置可以首先将目标收入数据确定为第一数据池的中心数据,该目标收入数据为目标航班的收入数据,该目标航班为历史航班集合中的任意一个航班,该第一数据池为N个数据池中的任意一个,也就是说,此处可以随机选择N个历史航班所对应的收入数据作为N个数据池的中心数据。In this embodiment, the flight season classification device may first determine the target revenue data as the central data of the first data pool, the target revenue data is the revenue data of the target flight, and the target flight is any flight in the historical flight collection , the first data pool is any one of the N data pools, that is, the revenue data corresponding to the N historical flights can be randomly selected here as the central data of the N data pools.
之后计算第三收入数据与目标收入数据的第一距离,第三收入数据为航班子集合中任意一个航班所对应的收入数据,航班子集合为历史航班集合中除目标航班之外的航班集合,具体的可以通过如下公式计算第三收入数据与N个数据池的中心数据之间的第一距离:Then calculate the first distance between the third income data and the target income data, the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the flight collection except the target flight in the historical flight collection, Specifically, the first distance between the third income data and the central data of the N data pools can be calculated by the following formula:
D(i,j)=W(t)×|X it-Y jt|; D(i,j)=W(t)×|X it -Y jt |;
其中,D(i,j)为所述目标收入数据i与第三收入数据j之间的第一距离,所述第三收入数据j为所述航班子集合中任意一个航班所对应的收入数据,W(t)为所述第三收入数据的权重,X it为所述目标收入数据的市场需求值,Y jt为所述第三收入数据所对应的市场需求值。 Wherein, D(i, j) is the first distance between the target income data i and the third income data j, and the third income data j is the income data corresponding to any flight in the flight subset , W(t) is the weight of the third income data, X it is the market demand value of the target income data, and Y jt is the market demand value corresponding to the third income data.
航班季节性归类装置在计算得到航班子集合中任意一个航班所对应的收入数据与目标收入数据的第一距离之后,可以将第四收入数据划分至第一数据池,该第四收入数据为第三航班所对应的收入数据,该第三航班为航班子集合中与目标收入数据的第一距离最近的收入数据所对应的航班,也就是说,可以将航班子集合中的每个航班分别划分至与每个航班距离最近的数据池中。After calculating the first distance between the revenue data corresponding to any flight in the flight subset and the target revenue data, the device for seasonal flight classification can divide the fourth revenue data into the first data pool, and the fourth revenue data is The revenue data corresponding to the third flight, the third flight is the flight corresponding to the revenue data with the closest distance to the target revenue data in the flight subset, that is to say, each flight in the flight subset can be separately Divide into the data pool closest to each flight.
最后航班季节性归类装置,可以计算划分后的第一数据池的中心数据,具体的可以通过如下公式计算第一数据池的中心数据:Finally, the flight season classification device can calculate the center data of the divided first data pool, specifically, the center data of the first data pool can be calculated by the following formula:
Figure PCTCN2022087286-appb-000004
Figure PCTCN2022087286-appb-000004
其中,New_mean(t)为将第四收入数据划分至目标数据池后目标数据池的中心数据,Old_mean(t)为将第四收入输入划分至目标数据池前目标数据池的中心数据,m为将 第四收入输入划分至目标数据池之前目标数据池的中心数据的数量,X(t)为目标数据池中任意一个收入数据所对应的市场需求值。Among them, New_mean(t) is the central data of the target data pool after the fourth income data is divided into the target data pool, Old_mean(t) is the central data of the target data pool before the fourth income input is divided into the target data pool, and m is The amount of central data in the target data pool before the fourth income input is divided into the target data pool, and X(t) is the market demand value corresponding to any income data in the target data pool.
由此,可以将历史航班集合中每个历史航班所对应的收入数据进行聚类分析,以将每个航班所对应的收入数据分别划分至N个数据池中,得到了划分后的N个数据池以及N个数据池中每个数据池的中心数据。Therefore, the revenue data corresponding to each historical flight in the historical flight collection can be clustered and analyzed to divide the revenue data corresponding to each flight into N data pools, and the divided N data can be obtained pool and the central data of each of the N data pools.
也就是说,航班季节性归类装置可以随机从历史航班集合中每个航班的第二收入数据中选取N个第二收入数据作为N个数据池的中心数据,之后从航班子集合中随机选取一个航班,计算该随机选取的航班所对应的第二收入数据与N个数据池的中心数据之间的距离,并将该随机选取的航班所对应的第二收入数据划分至与该随机选取的航班所对应的第二收入数据距离最近的数据池,并计算该数据池的中心数据;之后在从航班子集合再选取一个航班,重复执行上述过程,直至历史航班集合中的所有航班所对应的第二收入数据均划分完毕为止,得到划分后的N个数据池,之后计算N个数据池的中心数据即可。That is to say, the flight seasonal classification device can randomly select N second income data from the second income data of each flight in the historical flight collection as the central data of N data pools, and then randomly select from the flight subset A flight, calculate the distance between the second income data corresponding to the randomly selected flight and the central data of N data pools, and divide the second income data corresponding to the randomly selected flight into the The second revenue data corresponding to the flight is the closest data pool, and calculate the central data of the data pool; then select another flight from the flight subset, and repeat the above process until all the flights in the historical flight collection correspond to Until the division of the second income data is completed, the divided N data pools are obtained, and then the central data of the N data pools can be calculated.
106、根据第一航班数据计算第一航班的第二收入数据。106. Calculate the second income data of the first flight according to the data of the first flight.
本实施例中,航班季节性归类装置可以根据第一航班数据计算第一航班的第二收入数据,也即计算待进行季节性归类的第一航班的收入数据,具体的可以通过如下公式进行计算:In this embodiment, the device for seasonal classification of flights can calculate the second income data of the first flight according to the data of the first flight, that is, calculate the income data of the first flight to be classified seasonally. Specifically, the following formula can be used Calculation:
Figure PCTCN2022087286-appb-000005
Figure PCTCN2022087286-appb-000005
其中,Revenue Dcp(x)为第二收入数据,i表示第一航班中的第i个舱位,k为第一航班中舱位的总数,BKG(i)为第i个舱位的订座,Fare(i)为第i个舱位的票价。Among them, Revenue Dcp(x) is the second income data, i represents the i-th cabin in the first flight, k is the total number of cabins in the first flight, BKG(i) is the reservation of the i-th cabin, Fare( i) is the fare of the i-th cabin.
107、根据第二收入数据以及每个数据池的中心数据确定第一航班的季节性归类。107. Determine the seasonal classification of the first flight according to the second income data and the central data of each data pool.
本实施例中,航班季节性归类装置在计算得到第一航班的第二收入数据之后,可以根据第二收入数据以及每个数据池的中心数据确定第一航班的季节性归类,具体的,可以首先计算第二收入数据与N个中心数据中每个中心数据的第二距离,其中,N个中心数据与N个数据池相对应,具体的可以通过如下公式计算第二收入数据与N个中心数据中每个中心数据的第二距离:In this embodiment, after calculating the second revenue data of the first flight, the device for seasonal classification of flights can determine the seasonal classification of the first flight according to the second revenue data and the central data of each data pool, specifically , you can first calculate the second distance between the second income data and each of the N center data, where the N center data correspond to the N data pools, specifically, the second income data and N can be calculated by the following formula The second distance of each center data in center data:
D(i,j)=W(t)×|X it-Y jt|; D(i,j)=W(t)×|X it -Y jt |;
其中,D(i,j)为目标收入数据i与第三收入数据j之间的第一距离,第三收入数据j为航班子集合中任意一个航班所对应的收入数据,W(t)为第三收入数据的权重,X it为目标收入数据的市场需求值,Y jt为第三收入数据所对应的市场需求值。 Among them, D(i, j) is the first distance between the target income data i and the third income data j, the third income data j is the income data corresponding to any flight in the flight subset, W(t) is The weight of the third income data, X it is the market demand value of the target income data, and Y jt is the market demand value corresponding to the third income data.
之后将第二收入数据划分至第二数据池,该第二数据池为N个中心数据中与第二收入数据之间的第二距离最小的中心数据所对应的数据池,也即将第二收入数据划分至与之最相似的聚类中;Then divide the second income data into the second data pool, the second data pool is the data pool corresponding to the center data with the second smallest distance between the N center data and the second income data, that is, the second income The data is divided into the most similar clusters;
将第二数据池所对应的季节性归类确定为第一航班的季节性归类。The seasonal classification corresponding to the second data pool is determined as the seasonal classification of the first flight.
综上所述,可以看出,本申请提供的实施例中,航班季节性归类装置在确定第一航班的季节性分类时,可以获取第一航班所对应的历史航班数据,之后对该历史航班数据进行聚类,得到N个聚类的中心数据,并根据该N个聚类的中心数据以及该第一航班的收入数据确定该第一航班的季节性分类,相对于现有的通过人为对航班进行季节性归类来说,可以提高季节性归类的准确性,避免因为人为对航班的季节性归类而造成的偏颇现象。In summary, it can be seen that in the embodiment provided by this application, when the flight seasonal classification device determines the seasonal classification of the first flight, it can obtain the historical flight data corresponding to the first flight, and then the historical flight data The flight data is clustered to obtain the central data of N clusters, and the seasonal classification of the first flight is determined according to the central data of the N clusters and the income data of the first flight. Compared with the existing artificial For the seasonal classification of flights, the accuracy of seasonal classification can be improved, and the bias caused by artificial seasonal classification of flights can be avoided.
可以理解的是,附图中的流程图和框图,图示了按照本申请各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。It should be understood that the flowcharts and block diagrams in the figures illustrate the architecture, functions and operations of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more logical functions for implementing specified executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved. It should also be noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by a dedicated hardware-based system that performs the specified functions or operations , or may be implemented by a combination of dedicated hardware and computer instructions.
本申请实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的,而并不是用于对这些消息或信息的范围进行限制。The names of messages or information exchanged between multiple devices in the embodiments of the present application are used for illustrative purposes only, and are not used to limit the scope of these messages or information.
虽然采用特定次序描绘了各操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。Although operations are depicted in a particular order, this should not be understood as requiring that the operations be performed in the particular order shown or to be performed in sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous.
应当理解,本申请的方法实施方式中记载的各个步骤可以按照不同的顺序执行,和/或并行执行。此外,方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。本申请的范围在此方面不受限制。It should be understood that the various steps described in the method implementations of the present application may be executed in different orders, and/or executed in parallel. Additionally, method embodiments may include additional steps and/or omit performing illustrated steps. The scope of the application is not limited in this respect.
另外,本申请还可以以一种或多种程序设计语言或其组合来编写用于执行本申请的操作的计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。In addition, the present application can also write computer program codes for performing the operations of the present application in one or more programming languages or combinations thereof, and the above-mentioned programming languages include but are not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and also conventional procedural programming languages—such as "C" or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In cases involving a remote computer, the remote computer can be connected to the user computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as through an Internet service provider). Internet connection).
上面从航班的季节性归类方法的角度对本申请实施例进行说明,下面从航班季节性归类装置的角度对本申请实施例进行说明。The embodiment of the present application is described above from the perspective of the seasonal flight classification method, and the embodiment of the present application is described below from the perspective of the flight seasonal classification device.
请参阅图2,图2为本申请实施例提供的航班季节性归类装置的虚拟结构意图,该航班季节性归类装置200包括:Please refer to Fig. 2, Fig. 2 is the imaginary structural representation of the flight seasonal classification device that the embodiment of the present application provides, and this flight seasonal classification device 200 comprises:
第一获取单元201,用于获取目标航司中待进行季节性归类的第一航班的第一航班数据;The first obtaining unit 201 is used to obtain the first flight data of the first flight to be classified seasonally in the target airline company;
第二获取单元202,用于获取所述第一航班所对应的历史航班集合中每个历史航班的 第二航班数据;The second obtaining unit 202 is used to obtain the second flight data of each historical flight in the historical flight set corresponding to the first flight;
第一确定单元203,用于确定季节性归类的N个数据池,其中,所述N为大于或等2的整数;The first determination unit 203 is configured to determine N data pools for seasonal classification, wherein the N is an integer greater than or equal to 2;
第一计算单元204,用于根据所述第二航班数据计算所述每个历史航班的第一收入数据;The first calculation unit 204 is configured to calculate the first revenue data of each historical flight according to the second flight data;
第二确定单元205,用于根据所述第一收入数据确定所述N个数据池中每个数据池的中心数据;The second determination unit 205 is configured to determine the central data of each data pool in the N data pools according to the first income data;
第二计算单元206,用于根据所述第一航班数据计算所述第一航班的第二收入数据;The second calculation unit 206 is configured to calculate the second income data of the first flight according to the first flight data;
第三确定单元207,用于根据所述第二收入数据以及所述N个数据池中每个数据池的中心数据确定所述第一航班的季节性归类。The third determining unit 207 is configured to determine the seasonal classification of the first flight according to the second revenue data and the central data of each of the N data pools.
一种可能的设计中,所述第二确定单元205具体用于:In a possible design, the second determining unit 205 is specifically configured to:
将目标收入数据确定为第一数据池的中心数据,所述目标收入数据为第二航班对应的收入数据,所述第二航班为所述历史航班集合中任意一个航班,所述第一数据池为所述N个数据池中的任意一个;Determining the target revenue data as the central data of the first data pool, the target revenue data being the revenue data corresponding to the second flight, the second flight being any flight in the set of historical flights, and the first data pool Any one of the N data pools;
计算第三收入数据与所述目标收入数据的第一距离,所述第三收入数据为航班子集合中的任意一个航班所对应的收入数据,所述航班子集合为所述历史航班集合中除所述第二航班之外的航班集合;Calculate the first distance between the third income data and the target income data, the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the historical flight set except a collection of flights other than said second flight;
将第四收入数据划分至所述第一数据池,所述第四收入数据为第三航班所对应的收入数据,所述第三航班为所述航班子集合中与所述目标收入数据的第一距离最近的收入数据所对应的航班;Divide the fourth revenue data into the first data pool, the fourth revenue data is the revenue data corresponding to the third flight, and the third flight is the third flight in the subset of flights that is related to the target revenue data. - the flight corresponding to the nearest revenue data;
计算划分后所述第一数据池的中心数据。Calculate central data of the first data pool after division.
一种可能的设计中,所述第二确定单元205计算第三收入数据与所述目标收入数据的第一距离包括:In a possible design, the calculation of the first distance between the third income data and the target income data by the second determination unit 205 includes:
根据如下公式计算计算第三收入数据与所述目标收入数据的第一距离:Calculate and calculate the first distance between the third income data and the target income data according to the following formula:
D(i,j)=W(t)×|X it-Y jt|; D(i,j)=W(t)×|X it -Y jt |;
其中,D(i,j)为所述目标收入数据i与第三收入数据j之间的第一距离,所述第三收入数据j为所述航班子集合中任意一个航班所对应的收入数据,W(t)为所述第三收入数据的权重,X it为所述目标收入数据的市场需求值,Y jt为所述第三收入数据所对应的市场需求值。 Wherein, D(i, j) is the first distance between the target income data i and the third income data j, and the third income data j is the income data corresponding to any flight in the flight subset , W(t) is the weight of the third income data, X it is the market demand value of the target income data, and Y jt is the market demand value corresponding to the third income data.
一种可能的设计中,所述第二确定单元205计算划分后所述目标数据池的中心数据包括:In a possible design, the calculation of the center data of the target data pool after division by the second determination unit 205 includes:
通过如下公式计算划分后所述目标数据池的中心数据:Calculate the center data of the target data pool after division by the following formula:
Figure PCTCN2022087286-appb-000006
Figure PCTCN2022087286-appb-000006
其中,New_mean(t)为将所述第四收入数据划分至所述目标数据池后所述目标数据池的中心数据,Old_mean(t)为将所述第四收入输入划分至所述目标数据池前所述目标数据池的中心数据,m为将所述第四收入输入划分至所述目标数据池之前,所述目标数据池的中心数据的数量,X(t)为所述目标数据池中任意一个收入数据所对应的市场需求值。Wherein, New_mean(t) is the central data of the target data pool after dividing the fourth income data into the target data pool, and Old_mean(t) is dividing the fourth income input into the target data pool The central data of the aforementioned target data pool, m is the number of central data in the target data pool before the fourth income input is divided into the target data pool, and X(t) is the number of central data in the target data pool The market demand value corresponding to any income data.
一种可能的设计中,所述第一计算单元204具体用于:In a possible design, the first computing unit 204 is specifically configured to:
通过如下公式计算所述每个历史航班的第一收入数据:The first revenue data of each historical flight is calculated by the following formula:
Figure PCTCN2022087286-appb-000007
Figure PCTCN2022087286-appb-000007
其中,Revenue Dcp(x)为所述每个历史航班中第x个航班的第一收入数据,i表示舱位,k为舱位的总数,BKG(i)为所述第i个舱位的订座,Fare(i)为所述第i个舱位的票价。Wherein, Revenue Dcp(x) is the first revenue data of the xth flight in each of the historical flights, i represents the cabin, k is the total number of cabins, and BKG (i) is the reservation of the i-th cabin, Fare(i) is the fare of the ith cabin.
一种可能的设计中,所述第三确定单元207具体用于:In a possible design, the third determining unit 207 is specifically configured to:
计算所述第二收入数据与N个中心数据中每个中心数据的第二距离,其中,所述N个中心数据与所述N个数据池相对应;calculating a second distance between the second income data and each of the N central data, wherein the N central data correspond to the N data pools;
将所述第二收入数据划分至第二数据池,所述第二数据池为所述N个中心数据中与所述第二收入数据之间的第二距离最小的中心数据所对应的数据池;Dividing the second income data into a second data pool, the second data pool being the data pool corresponding to the central data with the smallest second distance between the N central data and the second income data ;
将所述第二数据池所对应的季节性归类确定为所述第一航班的季节性归类。The seasonal classification corresponding to the second data pool is determined as the seasonal classification of the first flight.
需要说明的是,描述于本申请实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,单元的名称在某种情况下并不构成对该单元本身的限定,例如,获取单元还可以被描述为“获取目标用户的证件信息的单元”。It should be noted that the units involved in the embodiments described in the present application may be implemented by means of software or by means of hardware. Wherein, the name of the unit does not constitute a limitation on the unit itself under certain circumstances, for example, the acquisition unit may also be described as "a unit that acquires the credential information of the target user".
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、片上系统(SOC)、复杂可编程逻辑设备(CPLD)等等。The functions described herein above may be performed at least in part by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), System on Chips (SOCs), Complex Programmable Logical device (CPLD) and so on.
请参阅图3,图3为本申请实施例提供的一种机器可读介质的实施例示意图。Please refer to FIG. 3 . FIG. 3 is a schematic diagram of an embodiment of a machine-readable medium provided in an embodiment of the present application.
如图3所示,本实施例提供了一种机器可读介质300,其上存储有计算机程序311,该计算机程序311被处理器执行时实现上述图1中所述航班的季节性归类方法的步骤。As shown in FIG. 3 , this embodiment provides a machine-readable medium 300 on which a computer program 311 is stored, and when the computer program 311 is executed by a processor, the seasonal classification method for flights described in FIG. 1 is implemented. A step of.
需要说明的是,本申请的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可 编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。It should be noted that, in the context of the present application, a machine-readable medium may be a tangible medium, which may contain or store a program for use by an instruction execution system, device, or device or in combination with an instruction execution system, device, or device. . A machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer discs, hard drives, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
需要说明的是,本申请上述的机器可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本申请中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本申请中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(射频)等等,或者上述的任意合适的组合。It should be noted that the machine-readable medium mentioned above in this application may be a computer-readable signal medium or a computer-readable storage medium or any combination of the above two. A computer readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples of computer-readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer diskettes, hard disks, random access memory (RAM), read-only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. In the present application, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. In this application, however, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which can transmit, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device . Program code embodied on a computer readable medium may be transmitted by any appropriate medium, including but not limited to wires, optical cables, RF (radio frequency), etc., or any suitable combination of the above.
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。The above-mentioned computer-readable medium may be included in the above-mentioned electronic device, or may exist independently without being incorporated into the electronic device.
请参阅图4,图4是本申请实施例提供的一种服务器的硬件结构示意图,该服务器400可因配置或性能不同而产生比较大的差异,可以包括一个或一个以上中央处理器(central processing units,CPU)422(例如,一个或一个以上处理器)和存储器432,一个或一个以上存储应用程序442或数据444的存储介质430(例如一个或一个以上海量存储设备)。其中,存储器432和存储介质430可以是短暂存储或持久存储。存储在存储介质430的程序可以包括一个或一个以上模块(图示没标出),每个模块可以包括对服务器中的一系列指令操作。更进一步地,中央处理器422可以设置为与存储介质430通信,在服务器400上执行存储介质430中的一系列指令操作。Please refer to FIG. 4. FIG. 4 is a schematic diagram of the hardware structure of a server provided by the embodiment of the present application. The server 400 may have relatively large differences due to different configurations or performances, and may include one or more central processing units (central processing units, CPU) 422 (eg, one or more processors) and memory 432, and one or more storage media 430 (eg, one or more mass storage devices) for storing application programs 442 or data 444. Wherein, the memory 432 and the storage medium 430 may be temporary storage or persistent storage. The program stored in the storage medium 430 may include one or more modules (not shown in the figure), and each module may include a series of instruction operations on the server. Furthermore, the central processing unit 422 may be configured to communicate with the storage medium 430 , and execute a series of instruction operations in the storage medium 430 on the server 400 .
服务器400还可以包括一个或一个以上电源426,一个或一个以上有线或无线网络接口450,一个或一个以上输入输出接口458,和/或,一个或一个以上操作系统441,例如Windows ServerTM,Mac OS XTM,UnixTM,LinuxTM,FreeBSDTM等等。The server 400 can also include one or more power supplies 426, one or more wired or wireless network interfaces 450, one or more input and output interfaces 458, and/or, one or more operating systems 441, such as Windows Server™, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, etc.
上述实施例中由航班季节性归类装置所执行的步骤可以基于该图4所示的服务器结构。The steps performed by the device for classifying flight seasons in the above embodiments may be based on the server structure shown in FIG. 4 .
还需要说明的,根据本申请的实施例,上述图1的流程示意图描述的所述航班的季节性归类方法的过程可以被实现为计算机软件程序。例如,本申请的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包含用于执行上述图1的流程示意图中所示的方法的程序代码。It should also be noted that, according to an embodiment of the present application, the process of the seasonal classification method for flights described in the schematic flowchart of FIG. 1 may be implemented as a computer software program. For example, the embodiments of the present application include a computer program product, which includes a computer program carried on a non-transitory computer-readable medium, the computer program including a program for performing the method shown in the flowchart schematic diagram of FIG. 1 above code.
尽管已经采用特定于结构特征和/或方法逻辑动作的语言描述了本主题,但是应当理解所附权利要求书中所限定的主题未必局限于上面描述的特定特征或动作。相反,上面所描 述的特定特征和动作仅仅是实现权利要求书的示例形式。Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are merely example forms of implementing the claims.
虽然在上面论述中包含了若干具体实现细节,但是这些不应当被解释为对本申请的范围的限制。在单独的实施例的上下文中描述的某些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的各种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。While several specific implementation details are contained in the above discussion, these should not be construed as limitations on the scope of the application. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
以上描述仅为本申请的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本申请中所涉及的公开范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本申请中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is only a preferred embodiment of the present application and an illustration of the applied technical principle. Those skilled in the art should understand that the scope of disclosure involved in this application is not limited to the technical solutions formed by the specific combination of the above technical features, but also covers the technical solutions made by the above technical features or Other technical solutions formed by any combination of equivalent features. For example, a technical solution formed by replacing the above-mentioned features with technical features with similar functions disclosed in (but not limited to) this application.

Claims (10)

  1. 一种航班的季节性归类方法,其特征在于,包括:A seasonal classification method for flights, characterized in that it includes:
    获取目标航司中待进行季节性归类的第一航班的第一航班数据;Obtain the first flight data of the first flight to be classified seasonally in the target airline company;
    获取所述第一航班所对应的历史航班集合中每个历史航班的第二航班数据;Obtaining the second flight data of each historical flight in the historical flight set corresponding to the first flight;
    确定季节性归类的N个数据池,其中,所述N为大于或等2的整数;Determine N data pools for seasonal classification, wherein said N is an integer greater than or equal to 2;
    根据所述第二航班数据计算所述每个历史航班的第一收入数据;calculating the first revenue data of each historical flight according to the second flight data;
    根据所述第一收入数据确定所述N个数据池中每个数据池的中心数据;determining the central data of each of the N data pools according to the first income data;
    根据所述第一航班数据计算所述第一航班的第二收入数据;calculating second revenue data for the first flight based on the first flight data;
    根据所述第二收入数据以及所述N个数据池中每个数据池的中心数据确定所述第一航班的季节性归类。The seasonal classification of the first flight is determined according to the second revenue data and the central data of each of the N data pools.
  2. 根据权利要求1所述的方法,其特征在于,所述根据所述第一收入数据确定所述N个数据池中每个数据池的中心数据包括:The method according to claim 1, wherein said determining the central data of each data pool in said N data pools according to said first income data comprises:
    将目标收入数据确定为第一数据池的中心数据,所述目标收入数据为第二航班对应的收入数据,所述第二航班为所述历史航班集合中任意一个航班,所述第一数据池为所述N个数据池中的任意一个;Determining the target revenue data as the central data of the first data pool, the target revenue data being the revenue data corresponding to the second flight, the second flight being any flight in the set of historical flights, and the first data pool Any one of the N data pools;
    计算第三收入数据与所述目标收入数据的第一距离,所述第三收入数据为航班子集合中的任意一个航班所对应的收入数据,所述航班子集合为所述历史航班集合中除所述第二航班之外的航班集合;Calculate the first distance between the third income data and the target income data, the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the historical flight set except a collection of flights other than said second flight;
    将第四收入数据划分至所述第一数据池,所述第四收入数据为第三航班所对应的收入数据,所述第三航班为所述航班子集合中与所述目标收入数据的第一距离最近的收入数据所对应的航班;Divide the fourth revenue data into the first data pool, the fourth revenue data is the revenue data corresponding to the third flight, and the third flight is the third flight in the subset of flights that is related to the target revenue data. - the flight corresponding to the nearest revenue data;
    计算划分后所述第一数据池的中心数据。Calculate central data of the first data pool after division.
  3. 根据权利要求2所述的方法,其特征在于,所述计算第三收入数据与所述目标收入数据的第一距离包括:The method according to claim 2, wherein the calculating the first distance between the third income data and the target income data comprises:
    根据如下公式计算计算第三收入数据与所述目标收入数据的第一距离:Calculate and calculate the first distance between the third income data and the target income data according to the following formula:
    D(i,j)=W(t)×|X it-Y jt|; D(i,j)=W(t)×|X it -Y jt |;
    其中,D(i,j)为所述目标收入数据i与第三收入数据j之间的第一距离,所述第三收入数据j为所述航班子集合中任意一个航班所对应的收入数据,W(t)为所述第三收入数据的权重,X it为所述目标收入数据的市场需求值,Y jt为所述第三收入数据所对应的市场需求值。 Wherein, D(i, j) is the first distance between the target income data i and the third income data j, and the third income data j is the income data corresponding to any flight in the flight subset , W(t) is the weight of the third income data, X it is the market demand value of the target income data, and Y jt is the market demand value corresponding to the third income data.
  4. 根据权利要求2所述的方法,其特征在于,所述计算划分后所述目标数据池的中心数据包括:The method according to claim 2, wherein the calculation of the center data of the target data pool after division comprises:
    通过如下公式计算划分后所述目标数据池的中心数据:Calculate the center data of the target data pool after division by the following formula:
    Figure PCTCN2022087286-appb-100001
    Figure PCTCN2022087286-appb-100001
    其中,New_mean(t)为将所述第四收入数据划分至所述目标数据池后所述目标数据池的中心数据,Old_mean(t)为将所述第四收入输入划分至所述目标数据池前所述目标数据池的中心数据,m为将所述第四收入输入划分至所述目标数据池之前,所述目标数据池的中心数据的数量,X(t)为所述目标数据池中任意一个收入数据所对应的市场需求值。Wherein, New_mean(t) is the central data of the target data pool after dividing the fourth income data into the target data pool, and Old_mean(t) is dividing the fourth income input into the target data pool The central data of the aforementioned target data pool, m is the number of central data in the target data pool before the fourth income input is divided into the target data pool, and X(t) is the number of central data in the target data pool The market demand value corresponding to any income data.
  5. 根据权利要求1至4中任一项所述的方法,其特征在于,所述根据所述第二航班数据计算所述每个历史航班的第一收入数据包括:The method according to any one of claims 1 to 4, wherein the calculating the first revenue data of each historical flight according to the second flight data comprises:
    通过如下公式计算所述每个历史航班的第一收入数据:The first revenue data of each historical flight is calculated by the following formula:
    Figure PCTCN2022087286-appb-100002
    Figure PCTCN2022087286-appb-100002
    其中,Revenue Dcp(x)为所述每个历史航班中第x个航班的第一收入数据,i表示舱位,k为舱位的总数,BKG(i)为所述第i个舱位的订座,Fare(i)为所述第i个舱位的票价。Wherein, Revenue Dcp(x) is the first revenue data of the xth flight in each of the historical flights, i represents the cabin, k is the total number of cabins, and BKG (i) is the reservation of the i-th cabin, Fare(i) is the fare of the ith cabin.
  6. 根据权利要求1至4中任一项所述的方法,其特征在于,所述根据所述第二收入数据以及所述N个数据池中每个数据池的中心数据确定所述第一航班的季节性归类包括:The method according to any one of claims 1 to 4, characterized in that the said first flight is determined according to said second income data and the central data of each data pool in said N data pools Seasonal classifications include:
    计算所述第二收入数据与N个中心数据中每个中心数据的第二距离,其中,所述N个中心数据与所述N个数据池相对应;calculating a second distance between the second income data and each of the N central data, wherein the N central data correspond to the N data pools;
    将所述第二收入数据划分至第二数据池,所述第二数据池为所述N个中心数据中与所述第二收入数据之间的第二距离最小的中心数据所对应的数据池;Dividing the second income data into a second data pool, the second data pool being the data pool corresponding to the central data with the smallest second distance between the N central data and the second income data ;
    将所述第二数据池所对应的季节性归类确定为所述第一航班的季节性归类。The seasonal classification corresponding to the second data pool is determined as the seasonal classification of the first flight.
  7. 一种航班季节性归类装置,其特征在于,包括:A flight season classification device is characterized in that it comprises:
    第一获取单元,用于获取目标航司中待进行季节性归类的第一航班的第一航班数据;The first acquisition unit is used to acquire the first flight data of the first flight to be classified seasonally in the target airline company;
    第二获取单元,用于获取所述第一航班所对应的历史航班集合中每个历史航班的第二航班数据;The second obtaining unit is configured to obtain the second flight data of each historical flight in the historical flight set corresponding to the first flight;
    第一确定单元,用于确定季节性归类的N个数据池,其中,所述N为大于或等2的整数;The first determination unit is used to determine N data pools for seasonal classification, wherein the N is an integer greater than or equal to 2;
    第一计算单元,用于根据所述第二航班数据计算所述每个历史航班的第一收入数据;a first calculation unit, configured to calculate the first revenue data of each historical flight according to the second flight data;
    第二确定单元,用于根据所述第一收入数据确定所述N个数据池中每个数据池的中心数据;A second determining unit, configured to determine the central data of each of the N data pools according to the first income data;
    第二计算单元,用于根据所述第一航班数据计算所述第一航班的第二收入数据;a second calculation unit, configured to calculate second revenue data of the first flight according to the first flight data;
    第三确定单元,用于根据所述第二收入数据以及所述N个数据池中每个数据池的中心数据确定所述第一航班的季节性归类。A third determining unit, configured to determine the seasonal classification of the first flight according to the second revenue data and the central data of each of the N data pools.
  8. 根据权利要求7所述的装置,其特征在于,所述第二确定单元具体用于:The device according to claim 7, wherein the second determination unit is specifically configured to:
    将目标收入数据确定为第一数据池的中心数据,所述目标收入数据为第二航班对应的 收入数据,所述第二航班为所述历史航班集合中任意一个航班,所述第一数据池为所述N个数据池中的任意一个;Determining the target revenue data as the central data of the first data pool, the target revenue data being the revenue data corresponding to the second flight, the second flight being any flight in the set of historical flights, and the first data pool Any one of the N data pools;
    计算第三收入数据与所述目标收入数据的第一距离,所述第三收入数据为航班子集合中的任意一个航班所对应的收入数据,所述航班子集合为所述历史航班集合中除所述第二航班之外的航班集合;Calculate the first distance between the third income data and the target income data, the third income data is the income data corresponding to any flight in the flight subset, and the flight subset is the historical flight set except a collection of flights other than said second flight;
    将第四收入数据划分至所述第一数据池,所述第四收入数据为第三航班所对应的收入数据,所述第三航班为所述航班子集合中与所述目标收入数据的第一距离最近的收入数据所对应的航班;Divide the fourth revenue data into the first data pool, the fourth revenue data is the revenue data corresponding to the third flight, and the third flight is the third flight in the subset of flights that is related to the target revenue data. - the flight corresponding to the nearest revenue data;
    计算划分后所述第一数据池的中心数据。Calculate central data of the first data pool after division.
  9. 一种计算机设备,其特征在于,包括:存储器、处理器以及总线系统;A computer device, characterized in that it includes: a memory, a processor, and a bus system;
    其中,所述存储器用于存储程序;Wherein, the memory is used to store programs;
    所述总线系统用于连接所述存储器以及所述处理器,以使所述存储器以及所述处理器进行通信;The bus system is used to connect the memory and the processor, so that the memory and the processor can communicate;
    所述处理器用于执行所述存储器中的程序,并根据程序代码中的指令执行权利要求1至6中任一项所述的季节性归类方法。The processor is used to execute the program in the memory, and execute the seasonal classification method according to any one of claims 1 to 6 according to the instructions in the program code.
  10. 一种机器可读介质,其特征在于,包括指令,当所述指令在机器上运行时,使得机器执行上述权利要求1至6中任一项所述的季节性归类方法。A machine-readable medium, which is characterized by comprising instructions, and when the instructions are run on the machine, the machine is made to execute the seasonal classification method according to any one of claims 1 to 6 above.
PCT/CN2022/087286 2021-05-31 2022-04-18 Seasonal classification method and apparatus for flights, and machine-readable medium WO2022252850A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110604313.8A CN113297336B (en) 2021-05-31 2021-05-31 Seasonal classification method, device and machine-readable medium for flights
CN202110604313.8 2021-05-31

Publications (1)

Publication Number Publication Date
WO2022252850A1 true WO2022252850A1 (en) 2022-12-08

Family

ID=77326542

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/087286 WO2022252850A1 (en) 2021-05-31 2022-04-18 Seasonal classification method and apparatus for flights, and machine-readable medium

Country Status (2)

Country Link
CN (1) CN113297336B (en)
WO (1) WO2022252850A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113297336B (en) * 2021-05-31 2023-12-19 中国民航信息网络股份有限公司 Seasonal classification method, device and machine-readable medium for flights

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107808308A (en) * 2017-10-19 2018-03-16 天津伊翔运达网络科技有限公司 A kind of yield management method of intellectualized detection dull and rush season
CN109446275A (en) * 2018-09-03 2019-03-08 厦门快商通信息技术有限公司 A kind of aeronautical data analysis method, equipment and storage medium based on big data
CN111737634A (en) * 2020-06-23 2020-10-02 携程旅游网络技术(上海)有限公司 Flight income prediction method, system, electronic equipment and readable storage medium
CN112396244A (en) * 2020-11-30 2021-02-23 中国民航信息网络股份有限公司 Flight booking value processing method and system based on zero booking model
CN113297336A (en) * 2021-05-31 2021-08-24 中国民航信息网络股份有限公司 Method, device and machine readable medium for seasonality classification of flight

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SG10201506680YA (en) * 2015-08-24 2017-03-30 Mastercard International Inc Method and System For Predicting Lowest Airline Ticket Fares
CN107038886B (en) * 2017-05-11 2019-05-28 厦门大学 A kind of taxi based on track data is cruised path recommended method and system
CN107944625B (en) * 2017-11-23 2021-09-07 南京航空航天大学 Single-airport flight season change time slot optimization method based on historical operation data driving
CN112308616B (en) * 2020-11-02 2024-05-28 沈阳民航东北凯亚有限公司 Group division method and device for avionics passengers
CN112307342B (en) * 2020-11-02 2023-10-10 沈阳民航东北凯亚有限公司 Flight recommendation method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107808308A (en) * 2017-10-19 2018-03-16 天津伊翔运达网络科技有限公司 A kind of yield management method of intellectualized detection dull and rush season
CN109446275A (en) * 2018-09-03 2019-03-08 厦门快商通信息技术有限公司 A kind of aeronautical data analysis method, equipment and storage medium based on big data
CN111737634A (en) * 2020-06-23 2020-10-02 携程旅游网络技术(上海)有限公司 Flight income prediction method, system, electronic equipment and readable storage medium
CN112396244A (en) * 2020-11-30 2021-02-23 中国民航信息网络股份有限公司 Flight booking value processing method and system based on zero booking model
CN113297336A (en) * 2021-05-31 2021-08-24 中国民航信息网络股份有限公司 Method, device and machine readable medium for seasonality classification of flight

Also Published As

Publication number Publication date
CN113297336A (en) 2021-08-24
CN113297336B (en) 2023-12-19

Similar Documents

Publication Publication Date Title
WO2022252847A1 (en) Method and apparatus for predicting seasonal classification of flights, and machine readable medium
CN110503245B (en) Prediction method for large-area delay risk of airport flight
EP3258430A1 (en) Transport capacity scheduling method and system
WO2022252846A1 (en) Method and apparatus for predicting market demand value of flight segment, and machine-readable medium
WO2022252850A1 (en) Seasonal classification method and apparatus for flights, and machine-readable medium
CN109460871A (en) Airport passenger amount prediction technique based on the identification of typical day
Grabbe et al. Clustering days with similar airport weather conditions
CN113284369B (en) Prediction method for actually measured airway data based on ADS-B
Hatıpoğlu et al. Flight delay prediction based with machine learning
CN116956757A (en) Departure flight taxi time prediction method, electronic device, and storage medium
CN112926809B (en) Flight flow prediction method and system based on clustering and improved xgboost
Alla et al. Flight arrival delay prediction using supervised machine learning algorithms
Vadlamani et al. Using machine learning to analyze and predict entry patterns of low-cost airlines: a study of Southwest Airlines
Wang et al. A data-driven prediction model for aircraft taxi time by considering time series about gate and real-time factors
Zhou et al. The UAV landing quality evaluation research based on combined weight and VIKOR algorithm
Du et al. Airport capacity prediction with multisource features: A temporal deep learning approach
Kang et al. Quantile Regression–Based Estimation of Dynamic Statistical Contingency Fuel
CN108986554B (en) Airspace sector crowding degree dynamic identification method based on fuzzy comprehensive judgment
Jiang et al. A multi-index prediction method for flight delay based on long short-term memory network model
CN111737634A (en) Flight income prediction method, system, electronic equipment and readable storage medium
CN117591919B (en) Passenger flow prediction method, passenger flow prediction device, electronic equipment and storage medium
Xia et al. Forecast of Traffic Vehicle Demand Based on AHP Decision Model
CN115712850B (en) Airport similar day selection method based on improved k-prototype and gray correlation analysis
Panigrahi et al. Flight Price Prediction Using Machine Learning.
Gandikota et al. Detailed Study of Unsupervised Machine Learning Clustering Efficacy in Identifying Unstable Approaches of Flight Energy Signature Profiles

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22814897

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22814897

Country of ref document: EP

Kind code of ref document: A1