CN111626479B

CN111626479B - Taxi passenger flow distribution point passenger carrying decision method and system based on real-time data

Info

Publication number: CN111626479B
Application number: CN202010361903.8A
Authority: CN
Inventors: 叶竞成; 包兴达; 方筠捷; 王浩宇; 邹辉
Original assignee: Nanjing University of Posts and Telecommunications
Current assignee: Nanjing University of Posts and Telecommunications
Priority date: 2020-04-30
Filing date: 2020-04-30
Publication date: 2022-08-26
Anticipated expiration: 2040-04-30
Also published as: CN111626479A

Abstract

The invention provides a valuation vehicle passenger flow distribution point passenger carrying decision method and a system based on real-time data, wherein the method comprises the following steps: (S1) establishing a multivariate linear regression model in which the decision value varies with the demand factor, the return income factor, the wait income factor and the personal ability factor; (S2) initializing the multiple linear regression model, providing a decision for a driver according to the multiple linear regression model based on the current demand factor, return income factor, waiting income factor and personal ability factor, and training the multiple linear regression model according to a decision satisfaction sample fed back by the driver after the decision; (S3) providing a decision for the driver using the trained multiple linear regression model; the system provides decision advice to the taxi driver according to the method. The invention can realize the real-time prediction of supply and demand relationship with higher accuracy so as to help a driver make a decision and improve the passenger trip efficiency and the driver income efficiency.

Description

Valuation vehicle passenger flow distribution point passenger carrying decision method and system based on real-time data

Technical Field

The invention relates to a decision-making method and a decision-making system, in particular to a taxi passenger flow distribution point passenger carrying decision-making method and a system based on real-time data.

Background

After a taxi driver sends a passenger to an airport, the taxi driver often needs to judge whether the passenger stays in the airport to carry a taxi to the airport or returns to the urban area to search for a passenger source. The prior art generally has the passenger to judge whether to receive the order according to the demand that sends through the APP for the taxi driver. The disadvantage of this approach is that it is generally difficult to predict the supply-demand relationship in real time for this type of APP, resulting in slightly lower reaction time and efficiency after the driver takes an order than if the driver were moving forward along with the supply-demand relationship. Only when the supply-demand relation is predicted in real time, the balance between the passenger flow of large-scale passenger distribution places and the supply and demand of taxi drivers can be ensured, and therefore the taxi can meet the requirements of the passengers in real time.

The invention patent application with the application number of 201911036243X discloses a decision-making method for passenger carrying problems of taxis in airports. The method comprises the steps of establishing a profit maximization model, comparing waiting net profits and returning net profits of taxi drivers, and selecting a decision that the net profits are the maximum. However, this application only considers objective factors such as flight and season factors and revenue factors (including waiting for net revenue and returning net revenue), but does not consider at all individual difference factors in driver passenger capacity and decision tendency under the same objective factors. Compared with objective factors, individual difference factors of drivers are difficult to grasp, but the individual difference factors directly cause that different individuals obtain obvious differences under the same decision. For example, the individual passenger capacity of a driver can be visualized by the empty rate. The greater the idle rate, indicating that it is more difficult to pull to the passenger at ordinary times, the less money is earned by the passenger, and the greater the idle rate the driver may be more inclined to remain at the airport for more consistent revenue. In addition, the decision method provided by the application is too rational and lacks elasticity, and the decision proportion of each factor cannot be adjusted according to the experience and feedback of a driver.

Disclosure of Invention

The purpose of the invention is as follows: in order to overcome the defects of the prior art, the invention aims to provide a taxi pricing car route planning suggestion method based on real-time data.

The invention also provides a pricing vehicle route planning and suggesting system based on the real-time data.

The technical scheme is as follows: in one aspect, the invention discloses a valuation vehicle passenger flow distribution point passenger carrying decision method based on real-time data. The method comprises the following steps:

(S1) establishing a multiple linear regression model:

wherein p is a decision value, when p is less than 0.5, waiting decision is made for the driver, and when p is more than 0.5, a decision for returning to the urban area is made for the driver; x ₁ 、X ₂ 、X ₃ 、X ₄ Respectively corresponding to demand factor, return income factor, waiting income factor and personal ability factor, beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ First to fifth decision factors, respectively; the demand factor is the passenger flow volume after the weather influence is considered; the return income factor is the expected return net income B of the driver from the passenger flow distribution point to the urban area _{Return _ net} (ii) a The waiting income factor is the expected waiting net income B of drivers returning to the urban area from the passenger flow distribution point _{Wait _ net} (ii) a The personal ability factor is a driver idle rate;

(S2) is beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Setting an initial value based on the current demand factor X ₁ Returning a profit factor X ₂ Waiting for revenue factor X ₃ And personal ability factor X ₄ The decision value p is calculated by the formula (1) to provide a decision for a driver, and beta is adjusted according to the decision satisfaction fed back by the driver after the decision ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Is repeated until β before and after the correction, by correcting the decision value p ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Is less than a threshold;

(S3) finally correcting the beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Is used for the subsequent calculation of the decision value p, instead of the equation (1).

Further, in step (S2), β is adjusted according to the decision satisfaction of the driver feedback after the decision ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ To modify the decision value p, in particular by adjusting β according to the following equation ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ ：

h _β ＝β ₀ +β ₁ X ₁ +β ₂ X ₂ +β ₃ X ₃ ，

Wherein, beta' _j Is beta _j The adjustment value of (d); α is the learning rate and is a positive number; m is the total number of samples fed back by the driver; x is the number of ⁱ And y ⁱ Satisfaction and corresponding decision value in the ith sample, and x, respectively, of driver feedback ⁱ Can only take a value of 0 or 1, when x ⁱ When the value of (A) is 0, the driver does not recognize the decision, and when x is equal to ⁱ When the value of (1) represents a driver acceptance decision;

and the demand factor, the return income factor, the waiting income factor and the personal ability factor which correspond to the ith sample are respectively fed back by the driver.

Further, the passenger flow volume after considering the weather influence is equal to the product of the expected passenger flow volume when not considering the weather influence and the weather influence factor; the expected passenger flow volume is a variable which changes along with time; the weather impact factor is a ratio of a passenger volume that takes into account the current weather impact to an expected passenger volume that does not take into account the weather impact.

Further, the expectation returns a net profit B _{Return _ Net} Calculated by the following way:

B _{return _ Net} ＝B _{Return to} -C _{Return to} ，

B _{Return to} ＝B _h ×T _w ，

C _{Return to} ＝B _h ×T _r ，

Wherein, B _{Return to} Return the expected return revenue, i.e. the expected waiting time consumed by leaving the airport for the revenue gained by the passenger after the driver returns to the downtown empty; c _{Return to} No-load cost for drivers to return to the urban area from the passenger flow distribution points in no-load mode; b is _h For average hourly income of drivers, B _h The calculation method is as follows: calculating the average passenger-carrying mileage of a driver per hour through the historical data of the real-time positioning information of the valuation vehicle and the historical data of the real-time passenger-carrying information of the valuation vehicle, and calculating by combining with the valuation rule to obtain the average passenger-carrying mileage；T _w Expected waiting time consumed for staying at an airport, T _r The time for returning from the passenger flow distribution point to the downtown; and the urban area positioning is obtained by calculating real-time position data of the taxi and historical data of real-time passenger carrying information.

Further, the expected wait net gain B _{Wait _ net} By the following equations:

B _{wait _ net} ＝B _{Wait for} -C _{Wait for} ，

C _{Wait for} ＝β×B _h ×T _w ，

Wherein, B _{Wait for} The average waiting income of the urban area is returned from the passenger flow distribution point after the driver stays at the airport and carries passengers for the expected waiting income; c _{Wait for} Wait costs for expected wait durations consumed in the case of staying at an airport; beta is a rest factor and is a constant; the expected waiting revenue B _{Wait for} Determining the average distance from the passenger flow distribution point to the urban area through the pricing rule; and the urban area positioning is obtained by calculating real-time position data of the taxi and historical data of real-time passenger carrying information.

Further, the urban location is calculated by real-time taxi position data and historical real-time passenger information data, and specifically comprises the following steps:

(a) analyzing to obtain all passenger getting-off points according to the real-time position data of the taxi and the historical data of the real-time passenger carrying information;

(b) clustering the getting-off points of the passengers into 4 central points by using a K-means clustering method, which comprises the following steps: selecting 4 alighting points from all passenger alighting points randomly as initial central points by a random point selection mode; forming a cluster by each initial central point and the nearest departure point, so as to form 4 clusters, calculating the central positioning point of each cluster by combining the occurrence times and the positioning of all departure points in each cluster to replace the previous central point, and repeating the calculation until the positioning of the central point is not changed;

(c) and determining the positioning information of the urban area according to the finally obtained positioning of the 4 clustering center points and the corresponding number of the points closest to the getting-off point.

Further, the driver idle rate is calculated by:

(S41) determining the average daily idle distance and the average daily total driving distance of the driver according to the real-time position data of the taxi and the historical data of the real-time passenger carrying information;

(S42) dividing the daily average idling distance by the daily average total driving distance to obtain the idling rate of the driver.

Further, the passenger distribution point includes an airport or a passenger station.

In another aspect, the invention relates to a taxi passenger flow distribution point passenger carrying decision making system based on real-time data. The system comprises:

the demand factor calculation module is used for calculating the passenger flow volume after the weather influence is considered as a demand factor;

the return income factor calculation module is used for calculating expected return net income of the driver from the passenger flow distribution point to the urban area as a return income factor;

the waiting income factor calculating module is used for calculating expected waiting net income of the driver returned to the urban area from the passenger flow distribution point as a waiting income factor;

the personal ability factor calculation module is used for calculating the no-load rate of the driver as a personal ability factor;

a training module, configured to train the established multiple linear regression model, where the multiple linear regression model is expressed as:

wherein p is a decision value, when p is less than 0.5, waiting decision is made for the driver, and when p is more than 0.5, a decision for returning to the urban area is made for the driver; x ₁ 、X ₂ 、X ₃ 、X ₄ Corresponding to demand factor, return income factor, waiting income factor and personal ability factor, beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ First to fifth decisions, respectivelyA factor; training the established multiple linear regression model comprises: is beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Setting an initial value based on the current demand factor X ₁ Return income factor X ₂ Waiting for revenue factor X ₃ And personal ability factor X ₄ A decision value p is calculated according to the multiple linear regression model to provide a decision for a driver, and beta is adjusted according to the decision satisfaction degree fed back by the driver after the decision ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Is repeated until β before and after the correction, by correcting the decision value p ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ The difference between the values of (a) is less than a threshold value;

a decision module for adjusting the final beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ And (3) calculating a decision value p by combining the multiple linear regression model.

Further, the training module is further configured to adjust β according to the following equation ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ ：

h _β ＝β ₀ +β ₁ X ₁ +β ₂ X ₂ +β ₃ X ₃ +β ₄ X ₄ ，

Wherein is beta' _j Is beta _j The adjustment value of (d); α is a positive scale factor and is a positive number; m is the total number of samples fed back by the driver; x is the number of ⁱ And y ⁱ Satisfaction and corresponding decision value in the ith sample fed back by the driver, respectively, and x ⁱ Can only take a value of 0 or 1, when x is ⁱ When the value of (A) is 0, the driver does not recognize the decision, and when x is equal to ⁱ When the value of (1) represents a driver acceptance decision;

Has the advantages that: compared with the prior art, the invention has the following advantages:

1. the system can predict the supply and demand relationship in real time to help a driver to make a decision, so that taxis near a large-scale passenger collecting and distributing place basically meet the requirement of the passenger, overflow is avoided, and the traveling efficiency of the passenger and the income efficiency of the driver are improved;

2. meanwhile, weather factors, expected return net income, expected waiting net income and individual difference factors of drivers are considered, so that the decision result is closer to the ideal condition;

3. the influence ratio of model benefit factors, personal factors and demand factors on the final decision result can be gradually adjusted according to the decision satisfaction information fed back by the user, and the decision result is corrected, so that the requirements of a driver or a client are better met.

4. The urban area can be divided more effectively by utilizing the K-means clustering, and compared with the rough estimation of the distance returned to the urban area, the more accurate distance returned to the urban area can be obtained more effectively, so that the decision which is as correct as possible and has better user experience can be made.

Drawings

FIG. 1 is a data relationship diagram of one embodiment of the method of the present invention.

Detailed Description

The invention will be explained in more detail below with reference to specific embodiments and the accompanying drawings.

In one embodiment of the invention, an airport is selected as the passenger flow distribution point to illustrate the decision method of the invention. The decision method comprises the following steps:

the method comprises the following steps: and establishing a multivariate linear regression model with the decision value varying with the demand factor, the return income factor, the waiting income factor and the personal ability factor.

Firstly, establishing a multivariate linear regression model of logit (p) and X:

where Y ═ 1 denotes the driver decision return, p (Y ═ 1| X) is the probability that Y ═ 1 occurs under the condition that X occurs, X being the set of factors. Meanwhile, according to the regression relationship, there are

logit(p)＝β ₀ +β ₁ X ₁ +β ₂ X ₂ +…+β _k X _k

Wherein p is a decision value, X ₁ 、X ₂ ...X _k Represents k influencing factors, beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β _k Is related to the decision proportion of each factor. Since the factors considered in this embodiment include demand factors, return income factors, waiting income factors, and personal ability factors, the value of k is 4, and the finally obtained multiple linear regression model is:

wherein, X ₁ 、X ₂ 、X ₃ 、X ₄ Respectively corresponding to demand factor, return income factor, waiting income factor and personal ability factor, beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Are the first to fifth decision factors. When p is less than 0.5, a waiting decision is made for the driver, and when p is more than 0.5, a return-to-urban decision is made for the driver.

The calculation of the factors is described below

1. Demand factors

On the one hand, the traffic volume at an airport generally varies with the number of flights, regardless of weather, and the taxi drivers who come to and go to the airport generally have some knowledge of the flights arriving at the airport. When the driver takes the passenger to the airport, if it is found that there are a lot of flights arriving at the airport, or it is found that it is the prime time to arrive at the airport. For example, because the late flight has a lower fare and the nighttime temperature is cooler and more comfortable than the afternoon and midday, people often do not choose the airplane arriving in the midday and the afternoon, but choose the airplane arriving at the evening to achieve higher cost performance. Thus, depending on the driver's experience, the expected traffic at night may be greater, and when the driver takes the passenger to the airport at night, there is a greater likelihood that the passenger will remain at the airport and move to a "pool" ready for pick-up. Likewise, the expected traffic volume at an airport is constantly in dynamic change at different times, i.e. the expected traffic volume is a variable f (t) that changes over time. To simplify the complexity of airport traffic data, it is also believed that the expected traffic volume at the airport will not change significantly every two hours. Thus, t is set to a time period divided every two hours, and f (t) represents the average expected passenger volume at the airport during the corresponding time period.

On the other hand, the weather conditions have various obvious influences on the flight of the airplane, and once the weather conditions are poor, the airplane can be delayed, stopped flying and the like. Thus, the influence of weather factors on passenger traffic does not vary appreciably. In this embodiment, different weather information is classified into 4 categories, and the specific categories are shown in table I:

TABLE I

Weather type	Description of the invention
		Weather of the first category	Sunny and cloudy days
Weather of the second category	Rain gust, rain in the small to medium, snow in the small to medium
		Weather of type III	Rain and snow, fog and haze
Weather of the IV class	Heavy rain, heavy snow, thunderstorm, floating dust and fog

The impact of different weather types on passenger flow is indicated by a weather impact factor. For example, a weather effect factor may be defined as: the ratio of the passenger flow volume taking into account the current weather effect to the expected passenger flow volume without taking into account the weather effect. For weather types I, II, III, and IV in Table I, the weather impact factors may be 100%, 77%, 75%, 73%, respectively, based on historical airport statistics.

After comprehensively considering the expected passenger flow volume without considering the weather influence and the weather influence factor, determining the demand factor as the passenger flow volume with considering the weather influence, which is equal to the product of the expected passenger flow volume without considering the weather influence and the weather influence factor.

2. Expected return net gain

The driver's decision can be influenced by the magnitude of their expected net gain. If the driver expects that the passenger can get more net income when returning to the urban area, he is likely to be more inclined to return to the urban area to search for the passenger source; while the driver may be more inclined to leave a stable source of passengers at the reception airport if he believes that he may receive a greater net gain in leaving the "storage pool".

The return of net profit B is expected according to the general principles of economics _{Return _ Net} Is equal to the expected return revenue B _{Return to} And no load cost C _{Return to} The difference of (a) is:

B _{return _ Net} ＝B _{Return to} -C _{No load}

For drivers who choose to return to urban passenger carriers, return is expectedYi B _{Return to} I.e. the expected waiting time T consumed in the case of leaving the airport after the driver returns to the urban area without load _w The revenue obtained for carrying the passenger, namely:

B _{return to} ＝B _h ×T _w ，

Wherein, B _h Average driver gain per hour, B _h The calculation method is as follows: the average passenger-carrying mileage of the driver per hour is calculated through the historical data of the real-time positioning information of the valuation vehicle and the historical data of the real-time passenger-carrying information of the valuation vehicle, and then the average passenger-carrying mileage is calculated by combining with the valuation rule.

And no-load cost C _{Return to} The time cost in returning to the downtown area is mainly the idling cost of the driver returning to the downtown area from the airport where the driver is located without load. C _{Return to} Equal to the product of the time to return to downtown and the average revenue per hour of the driver:

C _{return to} ＝B _h ×T _r

Wherein, T _r The time to return from the airport to the downtown.

Further explanation is needed regarding urban location. Generally, the urban area is a large area, and the departure point of the passenger is uncertain after the driver returns to the urban area. In order to overcome the problem and make the model more accurate, the invention utilizes a K-means clustering method to determine the location of the urban area. The specific method comprises the following steps:

(a) and analyzing to obtain the getting-off points of all passengers according to the real-time position data of the valuation vehicle and the historical data of the real-time passenger carrying information. For example, in the real-time passenger information, no load can be represented by "0", a passenger can be represented by "1", and the position of the pricing car when changing from "1" to "0" is determined by combining the real-time position data, that is, the location of the getting-off point of the passenger can be determined.

(b) Clustering the passenger getting-off points into 4 central points by using a K-means clustering method, comprising the following steps: selecting 4 alighting points from all passenger alighting points randomly as initial central points by a random point selection mode; forming a cluster by each initial central point and the nearest departure point, so as to form 4 clusters, calculating the central positioning point of each cluster by combining the occurrence times and the positioning of all departure points in each cluster to replace the previous central point, and repeating the calculation until the positioning of the central point is not changed;

After the positioning information of the urban area is obtained through calculation, the average distance of the driver for carrying passengers to the urban area in the airport can be determined, and then the related income is calculated by combining with the pricing rule.

3. Expected wait for net gain

Expected waiting net profit B _{Wait _ net} Is equal to the expected waiting gain B _{Wait for} And waiting cost C _{Wait for} The difference of (a) is:

B _{wait _ net} ＝B _{Wait for} -C _{Wait for}

Waiting revenue B is expected for drivers who choose to stay at the airport for pickup _{Wait for} The average waiting income is the average waiting income returned to the urban area from the airport after the driver stays at the airport to carry the passenger, and the income is determined as the average passenger carrying income through the empirical judgment of the taxi driver and can be determined by the pricing rule of taxi charging and the average distance of returning the passenger from the airport to the urban area. While waiting for a cost C _{Wait for} The latency cost is mainly due to the expected latency consumed in case of staying at the airport. The driver also has a certain rest time of relaxing while waiting in line, and when the driver feels a little tired, the driver may be more inclined to stay at the airport to wait, and take a rest while waiting in line. Therefore, wait for cost C _{Wait for} The calculation method of (A) is as follows:

C _{wait for} ＝β×B _h ×T _w

Where β is a rest factor, the rest factor β can be statistically analyzed according to questionnaires, and this embodiment is set to 70%.

4. Personal ability factors

The personal ability factor of the driver is mainly determined by the empty load rate. The idling rate of the driver is also an important factor influencing the decision, and is determined by the real-time positioning information and passenger carrying information of the vehicle. The driver's idling rate, and thus the amount of traffic on average, can affect his/her expectation of revenue. For example, a driver with a high idle rate may not receive a large number of orders at ordinary times, may prefer to leave them when transporting passengers to an airport, and earns a certain amount of revenue in the "pool". And if the driver is a driver with extremely strong working capacity, the idling rate is extremely low at ordinary times, and when the driver is confronted with selection, the driver can think that more net benefits can be obtained by returning to the urban area according to the working capacity of the driver, so that the driver is more inclined to return to the urban area. The driver idle rate is set to a variable a. It is defined as the ratio of the average daily idle distance of the driver to the average daily total distance traveled by the driver, i.e.:

the daily average idle distance and the daily average total driving distance of the driver can be determined according to the real-time position data of the taxi and the historical data of the real-time passenger carrying information.

Step two: determining each decision proportion beta in the multiple linear regression model established in the step one ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ The value of (c). The method mainly comprises the following steps: is beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Setting an initial value based on the current demand factor X ₁ Return income factor X ₂ Waiting for a revenue factor X ₃ And personal ability factor X ₄ The decision value p is calculated by the formula (1) to provide a decision for a driver, and beta is adjusted according to the decision satisfaction degree fed back by the driver after the decision ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Is repeated until β is before and after correction ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Is less than the threshold value. Wherein, beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ The initial value setting of (A) can be random, or can be analyzed by the information of the valuation vehicle provided by the company and the likeAn approximate value was obtained.

The correction process mainly comprises adjusting the parameter beta by using a random ascending gradient method _j The iterative formula is:

where α is a positive scale factor, and is the "learning rate" for setting the step size, J (β) _j ) Is a function of the criteria. Alignment rule function J (β) _j ) Differentiation was performed to obtain:

h _β ＝β ₀ +β ₁ X ₁ +β ₂ X ₂ +β ₃ X ₃ +β ₄ X ₄ ， (4)

wherein m is the total number of samples fed back by the driver; x is the number of ⁱ And y ⁱ Satisfaction and corresponding decision value in the ith sample fed back by the driver, respectively, and x ⁱ Can only take a value of 0 or 1, when x is ⁱ When the value of (A) is 0, the driver does not recognize the decision, and when x is equal to ⁱ When the value of (1) represents that the driver accepts the decision. Therefore, the above iterative equation (1) can be further converted into:

the above formula (1/m) is removed without affecting the result, and is equivalent to the following formula:

finally, the parameter β is given by the equation (6) _j And (4) adjusting.

Step three: will finally correct beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Substituting the value of (a) into the multiple linear regression model (i.e. the model in equation (1)) built in the step one for the subsequent calculation of the decision value p, so as to provide the driver with a decision according to the decision value p.

Correspondingly, the system for deciding the passenger carrying of the valuation vehicle passenger flow distribution point based on the real-time data comprises:

wherein, p is a decision value, when p is less than 0.5, a waiting decision is made for the driver, and when p is more than 0.5, a city return decision is made for the driver; x ₁ 、X ₂ 、X ₃ 、X ₄ Respectively corresponding to demand factor, return income factor, waiting income factor and personal ability factor, beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Are each X ₁ 、X ₂ 、X ₃ 、X ₄ The decision ratio is occupied; training the established multiple linear regression model comprises: is beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Setting an initial value based on the current demand factor X ₁ Return income factor X ₂ Waiting for a revenue factor X ₃ And personal ability factor X ₄ According toThe multiple linear regression model calculates a decision value p to provide a decision for a driver, and beta is adjusted according to the decision satisfaction degree fed back by the driver after the decision ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Is repeated until β before and after the correction, by correcting the decision value p ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ The difference between the values of (a) is less than a threshold value;

h _β ＝β ₀ +β ₁ X ₁ +β ₂ X ₂ +β ₃ X ₃ +β ₄ X ₄ ，

Wherein is beta' _j Is beta _j The adjustment value of (d); α is a positive scale factor and is a positive number; m is the total number of samples fed back by the driver; x is a radical of a fluorine atom ⁱ And y ⁱ Satisfaction and corresponding decision value in the ith sample fed back by the driver, respectively, and x ⁱ Can only take a value of 0 or 1, when x is ⁱ When the value of (b) is 0, the driver does not recognize the decision, and when x is ⁱ When the value of (1) represents a driver acceptance decision;

demand factors, return income factors, waiting income factors and personal ability factors corresponding to the ith sample fed back by the driver respectively

The above-described system may be implemented by software and/or programmable hardware.

Claims

1. A valuation vehicle passenger flow distribution point passenger carrying decision method based on real-time data is characterized by comprising the following steps:

(S1) establishing a multiple linear regression model, said multiple linear regression model being represented by:

wherein, p is a decision value, when p is less than 0.5, a waiting decision is made for the driver, and when p is more than 0.5, a city return decision is made for the driver; x ₁ 、X ₂ 、X ₃ 、X ₄ Respectively corresponding to demand factor, return income factor, waiting income factor and personal ability factor, beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ First to fifth decision scale factors, respectively; the demand factor is the passenger flow volume after the weather influence is considered; the return income factor is the expected return net income B of the driver from the passenger flow distribution point to the urban area _{Return _ Net} (ii) a The waiting income factor is the expected waiting net income B of drivers returning to the urban area from the passenger flow distribution point _{Wait _ net} (ii) a The personal ability factor is a driver idle rate;

(S2) is beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Setting an initial value based on the current demand factor X ₁ Returning a profit factor X ₂ Waiting for a revenue factor X ₃ And personal ability factor X ₄ A decision value p is calculated according to the multiple linear regression model to provide a decision for a driver, and beta is adjusted according to the decision satisfaction degree fed back by the driver after the decision ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Is repeated until β is adjusted before and after ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ The difference between the values of (a) is less than a threshold value;

(S3) according to the finally adjusted beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ In combination with said multiple linear regression modelThe calculation of the subsequent decision value p is performed.

2. The real-time data-based valuation vehicle passenger flow distribution point passenger carrying decision making method according to claim 1, wherein in the step (S2), β is adjusted according to decision satisfaction fed back by a driver after decision making ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ To modify the decision value p, in particular by adjusting β according to the following equation ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ ：

h _β ＝β ₀ +β ₁ X ₁ +β ₂ X ₂ +β ₃ X ₃ +β ₄ X ₄ ，

Wherein is beta' _j Is beta _j The adjustment value of (d); alpha is a positive scale factor and is a positive number; m is the total number of samples fed back by the driver; x is the number of ⁱ And y ⁱ Satisfaction and corresponding decision value in the ith sample, and x, respectively, of driver feedback ⁱ Can only take a value of 0 or 1, when x is ⁱ When the value of (b) is 0, the driver does not recognize the decision, and when x is ⁱ When the value of (1) represents a driver acceptance decision;

and the demand factor, the return income factor, the waiting income factor and the personal ability factor corresponding to the ith sample are respectively fed back by the driver.

3. The real-time data-based pricing vehicle passenger flow distribution point passenger carrying decision method according to claim 1, characterized in that the passenger flow volume after considering weather influence is equal to the product of the expected passenger flow volume without considering weather influence and a weather influence factor; the expected passenger flow volume is a variable which changes along with time; the weather impact factor is a ratio of a passenger volume that takes into account the current weather impact to an expected passenger volume that does not take into account the weather impact.

4. The real-time data based pricing vehicle passenger flow distribution point passenger carrying decision method of claim 1, wherein the expected return net gain B _{Return _ Net} Calculated by the following way:

B _{return _ Net} ＝B _{Return to} -C _{Return to} ，

B _{Return to} ＝B _h ×T _w ，

C _{Return to} ＝B _h ×T _r ，

Wherein, B _{Return to} The expected return income, namely the expected waiting time consumed in the case of leaving the airport after the driver returns to the urban area without load is used for the income obtained by carrying passengers; c _{Return to} No-load cost for drivers to return to the urban area from the passenger flow distribution points in no-load mode; b _h For average hourly income of drivers, B _h The calculation method is as follows: calculating the average passenger-carrying mileage of a driver per hour according to the historical data of the real-time positioning information of the valuation vehicle and the historical data of the real-time passenger-carrying information of the valuation vehicle, and then calculating by combining with the valuation rule; t is _w Expected waiting time consumed for staying at an airport, T _r The time for returning from the passenger flow distribution point to the downtown; and the urban area positioning is obtained by calculating real-time position data of the taxi and historical data of real-time passenger carrying information.

5. The real-time data based pricing vehicle passenger flow distribution point passenger carrying decision method of claim 1, wherein the expected waiting net gain B _{Wait _ net} By the following equations:

B _{wait _ net} ＝B _{Wait for} -C _{Wait for} ，

C _{Wait for} ＝β×B _h ×T _w ，

6. The taxi passenger flow distribution point passenger carrying decision method based on real-time data as claimed in claim 4 or 5, wherein the urban location is calculated from taxi real-time position data and historical data of real-time passenger carrying information, and specifically comprises:

(b) clustering the passenger getting-off points into 4 central points by using a K-means clustering method, comprising the following steps: selecting 4 alighting points from all passenger alighting points randomly as initial central points by a random point selection mode; forming a cluster by each initial central point and the nearest departure point, so as to form 4 clusters, calculating the central positioning point of each cluster by combining the occurrence times and positioning of all departure points in each cluster to replace the central point of the last time, and repeating the calculation until the positioning of the central point is not changed;

7. The real-time data-based pricing vehicle passenger flow distribution point passenger carrying decision method according to claim 1, characterized in that the driver idle rate is calculated by:

8. A real-time data based fare collection and distribution point passenger loading decision method as claimed in claim 1, characterized in that the passenger flow collection and distribution point comprises an airport or a passenger station.

9. A valuation vehicle passenger flow distribution point passenger carrying decision making system based on real-time data is characterized by comprising:

wherein, p is a decision value, when p is less than 0.5, a waiting decision is made for the driver, and when p is more than 0.5, a city return decision is made for the driver; x ₁ 、X ₂ 、X ₃ 、X ₄ Respectively corresponding to demand factor, return income factor, waiting income factor and personal ability factor, beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ First to fifth decision scale factors, respectively; training the established multiple linear regression model comprises: is beta ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Setting an initial value based on the current demand factor X ₁ Return income factor X ₂ Waiting for a revenue factor X ₃ And personal ability factor X ₄ A decision value p is calculated according to the multiple linear regression model to provide a decision for a driver, and beta is adjusted according to the decision satisfaction degree fed back by the driver after the decision ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ Is repeated until β is before and after correction ₀ 、β ₁ 、β ₂ 、β ₃ 、β ₄ The difference between the values of (a) is less than a threshold value;

a decision module for adjusting the final beta ₁ 、β ₂ 、β ₃ 、β ₄ And (3) calculating a decision value p by combining the multiple linear regression model.

10. The real-time data based fare collection and distribution point passenger loading decision system of claim 9, characterized in that the training module is further configured to adjust β according to the following equation ₁ 、β ₂ 、β ₃ 、β ₄ ：

h _β ＝β ₀ +β ₁ X ₁ +β ₂ X ₂ +β ₃ X ₃ +β ₄ X ₄ ，

Wherein, beta' _j Is beta _j The adjustment value of (a); α is a positive scale factor and is a positive number; m is the total number of samples fed back by the driver; x is a radical of a fluorine atom ⁱ And y ⁱ Satisfaction and corresponding decision value in the ith sample fed back by the driver, respectively, and x ⁱ Can only take a value of 0 or 1, when x is ⁱ When the value of (A) is 0, the driver does not recognize the decision, and when x is equal to ⁱ When the value of (1) represents a driver acceptance decision;

demand factors, return income factors, waiting income factors and personal ability factors corresponding to the ith sample fed back by the driver respectivelyAnd (4) element.