JP2012050241A

JP2012050241A - Demand predicting system and demand predicting method

Info

Publication number: JP2012050241A
Application number: JP2010189867A
Authority: JP
Inventors: Motonari Kobayashi; 基成小林; Tomohiro Nagata; 智大永田; Daizo Ikeda; 大造池田; Ichiro Okajima; 一郎岡島
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2010-08-26
Filing date: 2010-08-26
Publication date: 2012-03-08
Anticipated expiration: 2030-08-26
Also published as: JP5480755B2

Abstract

PROBLEM TO BE SOLVED: To provide a demand predicting system and a demand predicting method that can perform demand prediction while taking geographical correlation into consideration with a small calculation amount.SOLUTION: According to a demand predicting system 1, when a final demand prediction value (a second demand prediction value) for a prediction target area is to be determined, the correlation of actual demand values between the prediction target area and each of other areas is calculated, and an area having the correlation of a fixed value or more with the target area is extracted as a correlated area. The final demand prediction value (the second demand prediction value) of the prediction target area is calculated by using a regression equation (a second regression equation) obtained by regression analysis using the actual demand value of the correlated area and a first demand prediction value of the correlated area. As described above, the demand prediction of the prediction target area can be performed with a small calculation amount by using only the demand value of the area having high correlation with the prediction target area when geographical correlation is considered.

Description

本発明は、需要予測システム及び需要予測方法に関する。 The present invention relates to a demand prediction system and a demand prediction method.

従来から、需要予測に関する様々な手法が知られている。例えば下記特許文献１には、需要予測値（具体的にはタイヤの販売予測本数）を地域毎に算出する装置が記載されている。具体的には、この装置は、所定期間において販売されるタイヤの予測本数を、新車のタイヤの予測本数と、買替え時のタイヤの予測本数と、過去のタイヤの販売率とに基づいて算出する。 Conventionally, various methods relating to demand prediction are known. For example, Patent Document 1 described below describes an apparatus that calculates a demand prediction value (specifically, a predicted sales number of tires) for each region. Specifically, this device calculates the predicted number of tires sold in a given period based on the predicted number of new tires, the estimated number of tires at the time of replacement, and the past tire sales rate. To do.

特開２００６−８５６４５号公報JP 2006-85645 A

上記特許文献１に記載されているような地域毎の需要予測値を求める際には、地理的な相関が考慮される場合がある。しかし、このような地理的回帰手法は、必要となる計算量が多く、処理時間が長くなってしまう。 When obtaining a demand forecast value for each region as described in Patent Document 1, geographical correlation may be considered. However, such a geographical regression method requires a large amount of calculation and requires a long processing time.

そこで本発明は、地理的な相関を考慮した需要予測を少ない計算量で行うことが可能な需要予測システム及び需要予測方法を提供することを目的とする。 Therefore, an object of the present invention is to provide a demand prediction system and a demand prediction method capable of performing demand prediction in consideration of geographical correlation with a small amount of calculation.

本発明の需要予測システムは、複数のエリアのそれぞれについて、該エリアの過去の需要実績値を記憶する実績値記憶手段と、複数のエリアのそれぞれについて、該エリアの需要実績値を実績値記憶手段から読み出し、該需要実績値を用いた回帰分析を実行して第１の回帰式を生成し、該第１の回帰式を用いて該エリアの第１の需要予測値を算出する第１算出手段と、複数のエリアのそれぞれについて、第１算出手段により算出された第１の需要予測値を記憶する予測値記憶手段と、複数のエリアのうちの一つを予測対象エリアとして、実績値記憶手段から読み出した該予測対象エリアの需要実績値と他のエリアの需要実績値との相関を算出し、該相関が所定の閾値以上である他のエリアを相関エリアとして抽出する抽出手段と、抽出手段により抽出された相関エリアの需要実績値を用いた回帰分析を実行して第２の回帰式を生成する生成手段と、生成手段により生成された第２の回帰式と、予測値記憶手段から読み出した相関エリアの需要予測値とを用いて、予測対象エリアの第２の需要予測値を算出する第２算出手段と、第２算出手段により抽出された第２の需要予測値を出力する出力手段と、を備える。 The demand prediction system according to the present invention includes, for each of a plurality of areas, a record value storage means for storing a past demand record value of the area, and a record of the demand record value of the area for each of the plurality of areas. First calculation means for generating a first regression equation by executing a regression analysis using the actual demand value, and calculating a first demand forecast value for the area using the first regression equation And for each of the plurality of areas, the predicted value storage means for storing the first demand predicted value calculated by the first calculation means, and the actual value storage means with one of the plurality of areas as the prediction target area An extraction means for calculating a correlation between the actual demand value of the prediction target area read from the demand actual value of another area and extracting the other area having the correlation equal to or greater than a predetermined threshold as a correlation area; In A generation unit that generates a second regression equation by executing a regression analysis using the actual demand value of the correlation area extracted, a second regression equation generated by the generation unit, and a read from the predicted value storage unit Second calculation means for calculating the second demand prediction value of the prediction target area using the demand prediction value of the correlation area, and output means for outputting the second demand prediction value extracted by the second calculation means And comprising.

本発明の需要予測方法は、需要予測システムにより実行される需要予測方法であって、複数のエリアのそれぞれについて、該エリアの過去の需要実績値を実績値記憶手段に記憶する実績値記憶ステップと、複数のエリアのそれぞれについて、該エリアの需要実績値を実績値記憶手段から読み出し、該需要実績値を用いた回帰分析を実行して第１の回帰式を生成し、該第１の回帰式を用いて該エリアの第１の需要予測値を算出する第１算出ステップと、複数のエリアのそれぞれについて、第１算出ステップにおいて算出された第１の需要予測値を予測値記憶手段に記憶する予測値記憶ステップと、複数のエリアのうちの一つを予測対象エリアとして、実績値記憶手段から読み出した該予測対象エリアの需要実績値と他のエリアの需要実績値との相関を算出し、該相関が所定の閾値以上である他のエリアを相関エリアとして抽出する抽出ステップと、抽出ステップにおいて抽出された相関エリアの需要実績値を用いた回帰分析を実行して第２の回帰式を生成する生成ステップと、生成ステップにおいて生成された第２の回帰式と、予測値記憶手段から読み出した相関エリアの需要予測値とを用いて、予測対象エリアの第２の需要予測値を算出する第２算出ステップと、第２算出ステップにおいて抽出された第２の需要予測値を出力する出力ステップと、を含むことを特徴とする。 The demand prediction method of the present invention is a demand prediction method executed by a demand prediction system, and for each of a plurality of areas, an actual value storage step of storing past demand actual values of the area in an actual value storage means; For each of the plurality of areas, the actual demand value of the area is read from the actual value storage means, a regression analysis is performed using the actual demand value, and a first regression equation is generated, and the first regression equation is generated. The first calculation step of calculating the first demand prediction value of the area using the first and the first demand prediction value calculated in the first calculation step for each of the plurality of areas is stored in the prediction value storage means. A predicted value storage step, and a demand actual value of the prediction target area read from the actual value storage means and a demand actual value of another area as one of a plurality of areas as a prediction target area A second step of calculating a function, extracting another area having the correlation equal to or greater than a predetermined threshold as a correlation area, and performing a regression analysis using the actual demand value of the correlation area extracted in the extraction step. The second demand prediction of the prediction target area using the generation step of generating the regression equation, the second regression equation generated in the generation step, and the demand prediction value of the correlation area read from the prediction value storage means A second calculation step of calculating a value; and an output step of outputting the second demand prediction value extracted in the second calculation step.

このような発明によれば、ある一つの予測対象エリアの最終的な需要予測値（第２の需要予測値）を求める際に、その予測対象エリアと他のエリアとの間で需要実績値の相関が算出され、その相関が一定以上であるエリアが相関エリアとして抽出される。そして、当該相関エリアの需要実績値を用いた回帰分析で得られた回帰式（第２の回帰式）と、当該相関エリアの第１の需要予測値とを用いて、予測対象エリアの最終的な需要予測値（第２の需要予測値）が算出される。このように、地理的な相関を考慮する際に、予測対象エリアとの相関が高い他のエリアの需要値のみを用いることで、予測対象エリアの需要予測を少ない計算量で行うことが可能になる。 According to such an invention, when the final demand forecast value (second demand forecast value) of a certain forecast target area is obtained, the demand actual value between the forecast target area and other areas is calculated. A correlation is calculated, and an area where the correlation is a certain level or more is extracted as a correlation area. Then, using the regression equation (second regression equation) obtained by the regression analysis using the actual demand value of the correlation area and the first demand prediction value of the correlation area, the final prediction target area is obtained. A demand forecast value (second demand forecast value) is calculated. In this way, when considering geographical correlation, it is possible to perform demand prediction of the prediction target area with a small amount of calculation by using only the demand values of other areas having high correlation with the prediction target area. Become.

本発明の需要予測システムでは、抽出手段が、予測対象エリアとの距離が所定値以下の他のエリアを周辺エリアとして抽出し、該予測対象エリアの需要実績値と該周辺エリアの需要実績値との相関を算出し、該相関が閾値以上である該周辺エリアを相関エリアとして抽出してもよい。 In the demand prediction system of the present invention, the extracting means extracts another area whose distance from the prediction target area is a predetermined value or less as a peripheral area, the demand actual value of the prediction target area and the demand actual value of the peripheral area, And the surrounding area where the correlation is equal to or greater than a threshold may be extracted as a correlation area.

この場合には、相関エリアの候補となるエリアが予測対象エリアの近くに位置するものに予め絞り込まれた上で、絞り込まれた候補についてのみ予測対象エリアとの相関が計算される。これにより、相関を計算する量を少なくすることができる。 In this case, after the area that is a candidate for the correlation area is narrowed down in advance to be located near the prediction target area, the correlation with the prediction target area is calculated only for the narrowed candidates. Thereby, the amount of calculating the correlation can be reduced.

このような需要予測システム及び需要予測方法によれば、予測対象エリアとの相関が高い他のエリアの需要値のみを用いて予測対象エリアの需要予測値が算出されるので、地理的な相関を考慮した需要予測を少ない計算量で行うことができる。 According to such a demand forecasting system and demand forecasting method, the demand forecast value of the forecast target area is calculated using only the demand value of the other area having a high correlation with the forecast target area. It is possible to carry out demand forecast in consideration with a small amount of calculation.

実施形態に係る需要予測システムの機能構成を示す図である。It is a figure which shows the function structure of the demand prediction system which concerns on embodiment. 図１に示す需要予測システムのハードウェア構成を示す図である。It is a figure which shows the hardware constitutions of the demand prediction system shown in FIG. 分析用データの生成方法を示すフローチャートである。It is a flowchart which shows the production | generation method of the data for analysis. 第１の回帰式データの生成方法を示すフローチャートである。It is a flowchart which shows the production | generation method of 1st regression type data. 予測用データの生成方法を示すフローチャートである。It is a flowchart which shows the production | generation method of the data for prediction. 第１の予測結果データの生成方法を示すフローチャートである。It is a flowchart which shows the production | generation method of 1st prediction result data. 相関エリアの抽出方法を示すフローチャートである。It is a flowchart which shows the extraction method of a correlation area. 第２の回帰式データの生成方法を示すフローチャートである。It is a flowchart which shows the production | generation method of 2nd regression type data. 第２の予測結果データの生成方法を示すフローチャートである。It is a flowchart which shows the production | generation method of 2nd prediction result data. 図２に示す需要予測システムの動作を示すフローチャートである。It is a flowchart which shows operation | movement of the demand prediction system shown in FIG.

以下、添付図面を参照しながら本発明の実施形態を詳細に説明する。なお、図面の説明において同一又は同等の要素には同一の符号を付し、重複する説明を省略する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the description of the drawings, the same or equivalent elements are denoted by the same reference numerals, and redundant description is omitted.

まず、図１〜９を用いて、実施形態に係る需要予測システム１の機能及び構成を説明する。需要予測システム１は、複数のエリアのそれぞれにおける需要を予測するコンピュータシステムである。特に本実施形態では、需要予測システム１はタクシー需要を予測するものとして説明する（目的変数は、タクシーの乗客数）。 First, the function and configuration of the demand prediction system 1 according to the embodiment will be described with reference to FIGS. The demand prediction system 1 is a computer system that predicts demand in each of a plurality of areas. In particular, in the present embodiment, the demand prediction system 1 will be described as predicting taxi demand (the objective variable is the number of taxi passengers).

図１に示すように、需要予測システム１は機能的構成要素として分析用データ生成部１１、分析用データ記憶部（実績値記憶手段）１２、第１回帰式生成部（第１算出手段）１３、予測用データ生成部（第１算出手段）１４、予測用データ記憶部１５、第１予測結果生成部（第１算出手段）１６、第１予測結果記憶部（予測値記憶手段）１７、エリア抽出部（抽出手段）１８、第２回帰式生成部（生成手段）１９、第２予測結果生成部（第２算出手段、出力手段）２０、及び第２予測結果記憶部２１を備えている。 As shown in FIG. 1, the demand prediction system 1 includes, as functional components, an analysis data generation unit 11, an analysis data storage unit (actual value storage unit) 12, and a first regression equation generation unit (first calculation unit) 13. , Prediction data generation unit (first calculation unit) 14, prediction data storage unit 15, first prediction result generation unit (first calculation unit) 16, first prediction result storage unit (prediction value storage unit) 17, area An extraction unit (extraction unit) 18, a second regression equation generation unit (generation unit) 19, a second prediction result generation unit (second calculation unit, output unit) 20, and a second prediction result storage unit 21 are provided.

この需要予測システム１は１台のサーバで構成してもよいし、複数のサーバ（分散システム）で構成してもよい。いずれにしても、需要予測システム１で用いられるサーバＶは、図２に示すようなハードウェア構成を備えている。すなわち、サーバＶは、オペレーティングシステムやアプリケーションプログラムなどを実行するＣＰＵ１０１と、ＲＯＭ及びＲＡＭで構成される主記憶部１０２と、ハードディスクなどで構成される補助記憶部１０３と、ネットワークカードなどで構成される通信制御部１０４と、キーボードやマウスなどの入力部１０５と、モニタなどの出力部１０６とで構成される。 The demand prediction system 1 may be configured with a single server or a plurality of servers (distributed systems). In any case, the server V used in the demand prediction system 1 has a hardware configuration as shown in FIG. That is, the server V includes a CPU 101 that executes an operating system, application programs, and the like, a main storage unit 102 that includes a ROM and a RAM, an auxiliary storage unit 103 that includes a hard disk, a network card, and the like. The communication control unit 104 includes an input unit 105 such as a keyboard and a mouse, and an output unit 106 such as a monitor.

以下に示す需要予測システム１の各機能的構成要素は、ＣＰＵ１０１や主記憶部１０２の上に所定のソフトウェアを読み込ませ、ＣＰＵ１０１の制御の下で通信制御部１０４や入力部１０５、出力部１０６などを動作させ、主記憶部１０２や補助記憶部１０３におけるデータの読み出し及び書き込みを行うことで実現される。処理に必要なデータやデータベースは主記憶部１０２や補助記憶部１０３内に格納される。 Each functional component of the demand prediction system 1 shown below reads predetermined software on the CPU 101 or the main storage unit 102, and under the control of the CPU 101, the communication control unit 104, the input unit 105, the output unit 106, and the like. This is realized by reading and writing data in the main storage unit 102 and the auxiliary storage unit 103. Data and databases necessary for processing are stored in the main storage unit 102 and the auxiliary storage unit 103.

図１に戻って、分析用データ生成部１１は、図３に示すような手順で分析用データを生成する手段である。すなわち、分析用データ生成部１１はまずメッシュデータＤ１１と乗車データＤ１２とに基づいて、所定の範囲のエリア毎に実績乗車数（タクシーの乗客数の実績値）を集計したメッシュ乗車データを生成する（ステップＳ１１）。続いて、分析用データ生成部１１はそのメッシュ乗車データと、人口分布データＤ１３、天気データＤ１４、気温データＤ１５、及び時間帯データＤ１６とに基づいて分析用データＤ１７を生成する（ステップＳ１２）。 Returning to FIG. 1, the analysis data generation unit 11 is means for generating analysis data in the procedure as shown in FIG. 3. That is, the analysis data generation unit 11 first generates mesh ride data in which the actual number of rides (actual value of the number of taxi passengers) is tabulated for each area within a predetermined range based on the mesh data D11 and the ride data D12. (Step S11). Subsequently, the analysis data generation unit 11 generates the analysis data D17 based on the mesh boarding data, the population distribution data D13, the weather data D14, the temperature data D15, and the time zone data D16 (step S12).

メッシュデータＤ１１は、エリアに関するデータであり、需要予測システム１内に予め記憶されている。設定される各エリアは、例えば５００Ｍ四方の正方形（メッシュ）であるが、エリアの形状や大きさはこれに限定されない。例えば、各エリアを１００Ｍ四方の正方形としたり、矩形以外の任意の形状のエリアを設定したりしてもよい。また、すべてのエリアの形状及び大きさを揃えなくてもよく、例えば地域の人口密度や地理的状況などに基づいて、各地域に合ったエリアを設定してもよい。いずれにしても、複数のエリアに関する複数のメッシュデータＤ１１が用意される。 The mesh data D11 is data relating to the area, and is stored in the demand prediction system 1 in advance. Each set area is, for example, a 500M square (mesh), but the shape and size of the area are not limited to this. For example, each area may be a 100M square, or an area having an arbitrary shape other than a rectangle may be set. In addition, the shape and size of all the areas need not be uniform. For example, an area suitable for each region may be set based on the population density or the geographical situation of the region. In any case, a plurality of mesh data D11 relating to a plurality of areas are prepared.

メッシュデータＤ１１は、エリアを特定するエリアＩＤと、エリアの形状や大きさを示すエリアポリゴンと、エリアの中心点の緯度経度を示す中心点ポイントとが互いに関連付けられて成るレコードである。 The mesh data D11 is a record in which an area ID that identifies an area, an area polygon that indicates the shape and size of the area, and a center point that indicates the latitude and longitude of the center point of the area are associated with each other.

乗車データＤ１２は、タクシーに客が乗車したポイント及び時間が互いに関連付けられて成るレコードである。なお、乗車データＤ１２の収集方法は限定されない。例えば、移動体通信網から得られる乗客の携帯電話機の位置の変化に基づいて乗車の位置及び時間を推定して乗車データとして記録してもよいし、アンケート調査などで得られた静的な情報を乗車データとしてデータ化してもよい。 The boarding data D12 is a record formed by associating points and times when passengers get on the taxi. In addition, the collection method of boarding data D12 is not limited. For example, the position and time of the boarding may be estimated based on the change in the position of the passenger's mobile phone obtained from the mobile communication network, and recorded as boarding data, or static information obtained through questionnaire surveys, etc. May be converted into data as boarding data.

分析用データ生成部１１は、メッシュデータＤ１１及び乗車データＤ１２から、エリア毎及び時間毎の乗車数を集計することで、メッシュ乗車データを生成する。このメッシュ乗車データは、エリアＩＤ、エリアポリゴン、中心点ポイント、時間、及び実績乗車数が互いに関連付けられて成るレコードである。なお、乗車数を集計する際の単位時間は任意に決めてよい。例えば、集計単位時間を５分、１時間、１日などと設定してよい。 The analysis data generation unit 11 generates mesh ride data by counting the number of rides for each area and time from the mesh data D11 and the ride data D12. This mesh boarding data is a record in which an area ID, area polygon, center point, time, and actual boarding number are associated with each other. The unit time for counting the number of rides may be arbitrarily determined. For example, the total unit time may be set to 5 minutes, 1 hour, 1 day, and the like.

続いて、分析用データ生成部１１は分析用データＤ１７を生成する。ここで、このときに用いる各データについて説明する。 Subsequently, the analysis data generation unit 11 generates analysis data D17. Here, each data used at this time will be described.

人口分布データＤ１３は、エリアＩＤ、エリアポリゴン、時間、及びその時間におけるエリア内の人口が互いに関連付けられて成るデータである。天気データＤ１４は、エリアＩＤ、エリアポリゴン、時間、及びその時間におけるエリア内の降水量が互いに関連付けられて成るデータである。気温データＤ１５は、エリアＩＤ、エリアポリゴン、時間、及びその時間におけるエリア内の気温が互いに関連付けられて成るデータである。時間帯データＤ１６は、時間と、曜日と、平日又は休日の区分とが互いに関連付けられて成るデータである。なお、これら４種類のデータの収集方法は限定されず、例えば需要予測システム１はこれらのデータを提供する他のシステムから各データを受信して予め記憶しておいてもよい。なお、これら各データにおける時間は、上記メッシュ乗車データにおける時間と同様に設定される。 The population distribution data D13 is data in which an area ID, an area polygon, time, and a population in the area at that time are associated with each other. The weather data D14 is data in which the area ID, area polygon, time, and precipitation in the area at that time are associated with each other. The temperature data D15 is data in which an area ID, an area polygon, a time, and the temperature in the area at that time are associated with each other. The time zone data D16 is data in which time, day of the week, and weekday or holiday classification are associated with each other. In addition, the collection method of these four types of data is not limited, For example, the demand prediction system 1 may receive each data from the other system which provides these data, and may memorize | store it beforehand. The time in each data is set in the same manner as the time in the mesh ride data.

分析用データ生成部１１は、生成したメッシュ乗車データと、人口分布データＤ１３、天気データＤ１４、気温データＤ１５、及び時間帯データＤ１６とに基づいて、エリア毎及び時間毎の乗車数、人口、降水量、気温を示す分析用データＤ１７を生成する。すなわち、分析用データＤ１７は、エリアＩＤ、エリアポリゴン、中心点ポイント、時間、乗車数、人口、降水量、気温が互いに関連付けられて成るデータである。分析用データＤ１７の乗車数は、過去の需要実績値ということができる。分析用データ生成部１１は、生成した分析用データＤ１７を分析用データ記憶部１２に格納する。 Based on the generated mesh ride data and the population distribution data D13, weather data D14, temperature data D15, and time zone data D16, the analysis data generation unit 11 is configured to calculate the number of passengers, population, Analytical data D17 indicating the amount and temperature is generated. That is, the analysis data D17 is data in which the area ID, area polygon, center point, time, number of passengers, population, precipitation, and temperature are associated with each other. The number of rides in the analysis data D17 can be said to be a past demand actual value. The analysis data generation unit 11 stores the generated analysis data D17 in the analysis data storage unit 12.

なお、分析用データＤ１７を生成する際に用いるデータは、上記のような人口、降水量、気温、曜日に限定されず、他のデータを用いてもよい。例えば、株価などの他の要素を用いて分析用データを生成してもよい。 The data used when generating the analysis data D17 is not limited to the population, precipitation, temperature, and day of the week as described above, and other data may be used. For example, analysis data may be generated using other factors such as stock prices.

分析用データ記憶部１２は、分析用データＤ１７を記憶する手段である。分析用データ生成部１１により各エリアの各時間における分析用データが生成されるので、分析用データ記憶部１２は、複数のエリアのそれぞれについて、時間毎の乗車数の実績値を記憶することになる。 The analysis data storage unit 12 is means for storing the analysis data D17. Since the analysis data generation unit 11 generates analysis data at each time in each area, the analysis data storage unit 12 stores the actual value of the number of passengers per hour for each of the plurality of areas. Become.

第１回帰式生成部１３は、図４に示すような手順で、回帰分析に用いる変数（説明変数）を決定し、その変数を用いた第１の回帰式を生成する手段である。 The first regression equation generation unit 13 is means for determining a variable (explanatory variable) to be used for regression analysis in the procedure as shown in FIG. 4 and generating a first regression equation using the variable.

すなわち、第１回帰式生成部１３は、分析用データ記憶部１２に記憶されている分析用データＤ１７について、時間毎に、乗車数の合計と、人口、降水量、及び気温それぞれの平均値とを求める（ステップＳ２１）。続いて、第１回帰式生成部１３は分析用データＤ１７からいくつかの変数を選び（ステップＳ２２）、選んだ変数を用いて回帰分析を行った結果を交差検証により評価する（ステップＳ２３）。第１回帰式生成部１３はステップＳ２２，Ｓ２３の処理を様々な変数の組み合わせについて実行し、最終的に、最も高い評価を得られた変数の組合せを決定する（ステップＳ２４）。 That is, the first regression equation generation unit 13 calculates the total number of passengers and the average values of population, precipitation, and temperature for each time for the analysis data D17 stored in the analysis data storage unit 12. Is obtained (step S21). Subsequently, the first regression equation generation unit 13 selects some variables from the analysis data D17 (step S22), and evaluates the result of the regression analysis using the selected variables by cross-validation (step S23). The first regression equation generation unit 13 executes the processing of steps S22 and S23 for various combinations of variables, and finally determines the combination of variables that has obtained the highest evaluation (step S24).

続いて、第１回帰式生成部１３は決定した変数の組合せを用いて分析用データＤ１７に対する回帰分析を実行することで第１の回帰式データＤ１８を生成する（ステップＳ２５）。第１の回帰式データＤ１８は、エリアＩＤ、エリアポリゴン、中心点ポイント、及び得られた第１の回帰式が互いに関連付けられて成るデータである。各エリアに関する第１の回帰式データを生成すると、第１回帰式生成部１３はこれらの第１の回帰式データを第１予測結果生成部１６に出力する。 Subsequently, the first regression equation generation unit 13 performs the regression analysis on the analysis data D17 using the determined combination of variables to generate the first regression equation data D18 (step S25). The first regression equation data D18 is data obtained by associating the area ID, the area polygon, the center point, and the obtained first regression equation. When the first regression equation data relating to each area is generated, the first regression equation generation unit 13 outputs the first regression equation data to the first prediction result generation unit 16.

なお、本実施形態では重回帰分析を用いているが、回帰分析の具体的な手法は限定されるものではなく、例えばニューラル・ネットワークや、特願２００９−２６５５１２号明細書に記載されている手法を用いてもよい。 Although multiple regression analysis is used in the present embodiment, the specific method of regression analysis is not limited. For example, a neural network or a method described in Japanese Patent Application No. 2009-265512 is described. May be used.

予測用データ生成部１４は、図５に示すような手順で予測用データを生成する手段である。すなわち、予測用データ生成部１４は、メッシュデータＤ１１、人口分布予測データＤ１９、天気予報データＤ２０、気温予測データＤ２１、及び時間帯データＤ１６から予測用データＤ２２を生成する（ステップＳ３１）。予測用データＤ２２の生成は分析用データＤ１７の生成と似ているが、分析用データＤ１７は過去のデータ、すなわち既知の実績値を用いて生成するのに対して、予測用データＤ２２は未来のデータであり、したがって予測データや予報データを用いて生成する。 The prediction data generation unit 14 is means for generating prediction data in the procedure as shown in FIG. That is, the prediction data generation unit 14 generates the prediction data D22 from the mesh data D11, the population distribution prediction data D19, the weather prediction data D20, the temperature prediction data D21, and the time zone data D16 (step S31). The generation of the prediction data D22 is similar to the generation of the analysis data D17. However, the analysis data D17 is generated using past data, that is, a known actual value, whereas the prediction data D22 is generated in the future. Therefore, it is generated using forecast data and forecast data.

人口分布予測データＤ１９、天気予報データＤ２０、気温予測データＤ２１のデータ構造はそれぞれ人口分布データＤ１３、天気データＤ１４、気温データＤ１５と同じであるが、その具体的な人口、降水量、及び気温の値は予測値である。予測用データＤ２２は、エリアＩＤ、エリアポリゴン、中心点ポイント、時間、予測人口、予想降水量、及び予想気温が互いに関連付けられて成るデータである。 The data structures of the population distribution prediction data D19, the weather prediction data D20, and the temperature prediction data D21 are the same as the population distribution data D13, the weather data D14, and the temperature data D15, respectively. The value is a predicted value. The prediction data D22 is data in which an area ID, an area polygon, a center point, time, a predicted population, an estimated precipitation amount, and an estimated temperature are associated with each other.

各エリアの各時間における予測用データを生成すると、予測用データ生成部１４はこれらの予測用データを予測用データ記憶部１５に格納する。 When the prediction data at each time in each area is generated, the prediction data generation unit 14 stores these prediction data in the prediction data storage unit 15.

予測用データ記憶部１５は、予測用データＤ２２を記憶する手段である。予測用データ記憶部１５に格納された予測用データＤ２２は、後述する第１予測結果生成部１６及び第２予測結果生成部２０により参照される。 The prediction data storage unit 15 is means for storing the prediction data D22. The prediction data D22 stored in the prediction data storage unit 15 is referred to by a first prediction result generation unit 16 and a second prediction result generation unit 20 described later.

第１予測結果生成部１６は、図６に示すような手順で、第１の回帰式を用いて各エリアの第１の予測乗車数（第１の需要予測値）を算出する手段である。すなわち、第１予測結果生成部１６は、一のエリアの第１の回帰式データＤ１８で示される第１の回帰式に当該エリアの予測用データＤ２２を適用することで、当該エリアの第１の予測結果データＤ２３を生成し（ステップＳ４１）、その結果データＤ２３を第１予測結果記憶部１７に格納する。第１予測結果生成部１６はこのような処理を各エリアについて実行する。第１の予測結果データＤ２３は、エリアＩＤ、エリアポリゴン、中心点ポイント、時間、及び第１の予測乗車数（第１の需要予測値）が互いに関連付けられて成るデータである。第１予測結果生成部１６は、各エリアの各時間における乗車数の予測値を求め、第１の予測結果データＤ２３として第１予測結果記憶部１７に格納する。 The 1st prediction result production | generation part 16 is a means which calculates the 1st prediction boarding number (1st demand prediction value) of each area using a 1st regression equation in the procedure as shown in FIG. That is, the first prediction result generation unit 16 applies the prediction data D22 of the area to the first regression equation indicated by the first regression equation data D18 of the one area, so that the first prediction result of the area Prediction result data D23 is generated (step S41), and the result data D23 is stored in the first prediction result storage unit 17. The 1st prediction result production | generation part 16 performs such a process about each area. The first prediction result data D23 is data obtained by associating the area ID, area polygon, center point, time, and first predicted number of rides (first demand predicted value). The 1st prediction result production | generation part 16 calculates | requires the predicted value of the boarding number in each time of each area, and stores it in the 1st prediction result memory | storage part 17 as 1st prediction result data D23.

第１予測結果記憶部１７は、第１の予測結果データＤ２３を記憶する手段である。第１予測結果記憶部１７は、複数のエリアのそれぞれについて、時間毎の第１の予測乗車数を記憶する。第１予測結果記憶部１７に記憶される予測乗車数は、最終結果である第２の予測乗車数（第２の予測需要値）を得る過程で導かれる中間結果である。 The first prediction result storage unit 17 is means for storing the first prediction result data D23. The 1st prediction result storage part 17 memorizes the 1st prediction boarding number for every time about each of a plurality of areas. The predicted number of passengers stored in the first prediction result storage unit 17 is an intermediate result derived in the process of obtaining the second predicted number of passengers (second predicted demand value) that is the final result.

エリア抽出部１８は、図７に示すような手順で、予測対象エリアとの相関が高い他のエリアを抽出する手段である。まず、エリア抽出部１８は複数のエリアのうちの一つを予測対象エリアとして設定する。続いて、エリア抽出部１８はメッシュデータＤ１１を参照してその予測対象エリアとの距離が所定値以下の他のエリアを抽出する。以下ではここで抽出されたエリアを「周辺エリア」という。周辺エリアを抽出するのは、ある一のエリアとの相関が高いエリアは当該一のエリアの周辺に存在することが多く、また、すべてのエリアとの相関を求めるとなると計算量が多くなることから、相関を求める対象を予め絞り込むためである。 The area extracting unit 18 is means for extracting another area having a high correlation with the prediction target area in the procedure as shown in FIG. First, the area extraction unit 18 sets one of a plurality of areas as a prediction target area. Subsequently, the area extraction unit 18 refers to the mesh data D11 and extracts another area whose distance from the prediction target area is a predetermined value or less. Hereinafter, the extracted area is referred to as a “peripheral area”. The reason for extracting the surrounding area is that an area that has a high correlation with a certain area often exists in the vicinity of the one area, and that the calculation amount increases when the correlation with all the areas is obtained. This is because the objects for which correlation is to be obtained are narrowed down in advance.

なお、エリア間の距離の求め方としては、例えば、中心点ポイント間の距離をエリア間の距離として設定する手法や、エリアの境界間の距離のうち最小のものをエリア間の距離として設定する採用する手法などが挙げられるが、これらに限定されるものではない。 In addition, as a method for obtaining the distance between areas, for example, a method of setting a distance between center point points as a distance between areas, or a minimum distance among the boundaries between areas is set as a distance between areas. Although the method etc. to employ | adopt are mentioned, it is not limited to these.

続いて、エリア抽出部１８は予測対象エリアの分析用データＤ１７ａ及び周辺エリアの分析用データＤ１７ｂを分析用データ記憶部１２から読み出し、予測対象エリアと周辺エリアとの間の実績乗車数の相関係数を下記式（１）により求める（ステップＳ５１）。なお、相関係数の範囲は−１〜１である。

ここで、ｘ_ｉは予測対象エリアの実績乗車数であり、ｙ_ｉは一の周辺エリアの実績乗車数である。また、

は予測対象エリアの全時間における実績乗車数の相加平均であり、

は一の周辺エリアの全時間における実績乗車数の相加平均である。 Subsequently, the area extraction unit 18 reads the analysis data D17a for the prediction target area and the analysis data D17b for the peripheral area from the analysis data storage unit 12, and the correlation of the actual number of rides between the prediction target area and the peripheral area. The number is obtained by the following equation (1) (step S51). The range of the correlation coefficient is −1 to 1.

Here, x _i is the actual number of rides in the prediction target area, and y _i is the actual number of rides in one peripheral area. Also,

Is the arithmetic average of the actual number of rides over the entire forecast area,

Is the arithmetic average of the actual number of passengers in the entire surrounding area.

続いて、エリア抽出部１８は、求めた相関係数の絶対値が所定の閾値以上であるか否かを判定し（ステップＳ５２）、当該絶対値が閾値以上である周辺エリアを相関エリアとして抽出する（ステップＳ５３）。なお、閾値として任意の値を用いてよい。 Subsequently, the area extraction unit 18 determines whether or not the absolute value of the obtained correlation coefficient is equal to or greater than a predetermined threshold (step S52), and extracts a peripheral area whose absolute value is equal to or greater than the threshold as a correlation area. (Step S53). An arbitrary value may be used as the threshold value.

エリア抽出部１８はこのような処理をすべての周辺エリアについて実行し（ステップＳ５４）、予測対象エリアのエリアＩＤ（予測対象エリアＩＤ）と一以上の相関エリアのエリアＩＤ（相関エリアＩＤ）との組合せである相関エリアデータＤ２４を生成する。エリア抽出部１８は、メッシュデータＤ１１で規定されたエリアのそれぞれについてこのような相関エリアデータＤ２４を生成し、第２回帰式生成部１９に出力する。 The area extraction unit 18 performs such processing for all peripheral areas (step S54), and calculates the area ID (prediction target area ID) of the prediction target area and the area ID (correlation area ID) of one or more correlation areas. Correlation area data D24 that is a combination is generated. The area extraction unit 18 generates such correlation area data D24 for each of the areas defined by the mesh data D11 and outputs the correlation area data D24 to the second regression equation generation unit 19.

第２回帰式生成部１９は、図８に示すような手順で、第１回帰式生成部１３で用いた説明変数と相関エリアの実績乗車数とを用いた回帰分析を実行して第２の回帰式を生成する手段である。すなわち、第２回帰式生成部１９は相関エリアの実績乗車数も説明変数に加えた上で新たに回帰式を生成する。 The second regression equation generation unit 19 executes the regression analysis using the explanatory variables used in the first regression equation generation unit 13 and the actual number of rides in the correlation area in the procedure as shown in FIG. A means for generating a regression equation. That is, the second regression equation generation unit 19 newly generates a regression equation after adding the actual number of rides in the correlation area to the explanatory variables.

第２回帰式生成部１９は各予測対象エリアについて以下の処理を行う。すなわち、第２回帰式生成部１９は相関エリアデータで示される予測対象エリアＩＤに対応する分析用データＤ１７ａを分析用データ記憶部１２から読み出す。また、第２回帰式生成部１９はその相関エリアデータで示される相関エリアＩＤに対応する分析用データを分析用データ記憶部１２から読み出して、一以上の相関エリアの実績乗車数を取得する。続いて、第２回帰式生成部１９は第１回帰式生成部１３で用いた説明変数及び読み出した実績乗車数を今回の説明変数として、予測対象エリアの分析用データＤ１７ａに対する回帰分析を実行し、第２の回帰式データＤ２５を生成する（ステップＳ６１）。第２の回帰式データＤ２５は、予測対象エリアのエリアＩＤ、エリアポリゴン及び中心点ポイントと、得られた第２の回帰式と、当該予測対象エリアに対応する一以上の相関エリアＩＤとが互いに関連付けられて成るデータである。 The second regression equation generation unit 19 performs the following processing for each prediction target area. That is, the second regression equation generation unit 19 reads out the analysis data D17a corresponding to the prediction target area ID indicated by the correlation area data from the analysis data storage unit 12. In addition, the second regression equation generation unit 19 reads analysis data corresponding to the correlation area ID indicated by the correlation area data from the analysis data storage unit 12 and acquires the actual number of rides in one or more correlation areas. Subsequently, the second regression equation generation unit 19 performs a regression analysis on the analysis data D17a in the prediction target area using the explanatory variables used in the first regression equation generation unit 13 and the read actual number of rides as the current explanatory variables. Second regression equation data D25 is generated (step S61). In the second regression equation data D25, the area ID, area polygon and center point of the prediction target area, the obtained second regression equation, and one or more correlation area IDs corresponding to the prediction target area are mutually connected. It is data that is related.

第２回帰式生成部１９は、このような処理をすべての予測対象エリアについて実行し、各予測対象エリアの第２の回帰式データＤ２５を第２予測結果生成部２０に出力する。 The second regression equation generation unit 19 executes such processing for all the prediction target areas, and outputs the second regression equation data D25 of each prediction target area to the second prediction result generation unit 20.

第２予測結果生成部２０は、図９に示すような手順で、第２の回帰式と相関エリアの第１の予測乗車数とを用いて、予測対象エリアの最終的な第２の予測乗車数を算出する手段である。 The second prediction result generation unit 20 uses the second regression equation and the first predicted number of rides in the correlation area in the procedure as shown in FIG. 9 to obtain the final second predicted ride in the prediction target area. It is a means for calculating the number.

第２予測結果生成部２０は各予測対象エリアについて以下の処理を行う。すなわち、第２予測結果生成部２０は、予測対象エリアのエリアＩＤに対応する予測用データＤ２２ａを予測用データ記憶部１５から読み出す。また、第２予測結果生成部２０は当該予測対象エリアに関連付けられた相関エリアＩＤに対応する第１の予測結果データＤ２３ａを第１予測結果記憶部１７から読み出す。続いて、第２予測結果生成部２０は読み出した予測用データＤ２２ａ及び第１の予測結果データＤ２３ａを、予測対象エリアの第２の回帰式データＤ２５ａで示される第２の回帰式に適用することで第２の予測結果データＤ２６を生成する（ステップＳ７１）。そして、第２予測結果生成部２０は第２の予測結果データＤ２６を第２予測結果記憶部２１に格納する。第２の予測結果データＤ２６は、予測対象エリアのエリアＩＤ、エリアポリゴン、中心点ポイント、時間、及び第２の予測乗車数（第２の需要予測値）が互いに関連付けられて成るデータである。 The second prediction result generation unit 20 performs the following process for each prediction target area. That is, the second prediction result generation unit 20 reads the prediction data D22a corresponding to the area ID of the prediction target area from the prediction data storage unit 15. In addition, the second prediction result generation unit 20 reads the first prediction result data D23a corresponding to the correlation area ID associated with the prediction target area from the first prediction result storage unit 17. Subsequently, the second prediction result generation unit 20 applies the read prediction data D22a and the first prediction result data D23a to the second regression equation indicated by the second regression equation data D25a of the prediction target area. Then, the second prediction result data D26 is generated (step S71). Then, the second prediction result generation unit 20 stores the second prediction result data D26 in the second prediction result storage unit 21. The second prediction result data D26 is data in which the area ID, area polygon, center point, time, and second predicted number of rides (second demand predicted value) of the prediction target area are associated with each other.

第２予測結果生成部２０は、このような処理をすべての予測対象エリアについて実行し、各予測対象エリアの第２の予測結果データＤ２６を第２予測結果記憶部２１に格納する。 The second prediction result generation unit 20 performs such processing for all the prediction target areas, and stores the second prediction result data D26 of each prediction target area in the second prediction result storage unit 21.

第２予測結果記憶部２１は、第２の予測結果データＤ２６を記憶する手段である。第２予測結果記憶部２１は、複数のエリアのそれぞれについて、時間毎の第２の予測乗車数を記憶する。第２予測結果記憶部２１に記憶される予測乗車数は最終の予測結果である。 The second prediction result storage unit 21 is means for storing second prediction result data D26. The 2nd prediction result storage part 21 memorizes the 2nd prediction boarding number for every time about each of a plurality of areas. The predicted number of passengers stored in the second prediction result storage unit 21 is the final prediction result.

次に、図１０を用いて、図１に示す需要予測システム１の動作を説明するとともに本実施形態に係る需要予測方法について説明する。需要予測システム１は、二段階の予測処理を行うことで各エリアの需要予測値を確定する。 Next, the operation of the demand prediction system 1 shown in FIG. 1 will be described using FIG. 10 and the demand prediction method according to the present embodiment will be described. The demand prediction system 1 determines the demand prediction value of each area by performing a two-stage prediction process.

１段階目の予測処理では、まず、分析用データ生成部１１が各エリアについて分析用データを生成し、分析用データ記憶部１２に格納する（ステップＳ８１、実績値記憶ステップ）。続いて、第１回帰式生成部１３が、各エリアについて、当該エリアの分析用データに対する回帰分析を行うことで第１の回帰式を生成する（ステップＳ８２、第１算出ステップ）。続いて、予測用データ生成部１４が各エリアについて予測用データを生成する（ステップＳ８３、第１算出ステップ）。続いて、第１予測結果生成部１６が、各エリアについて、分析用データを第１の回帰式に適用することで第１の予測乗車数を算出し（ステップＳ８４、第１算出ステップ）、第１の予測結果データとして第１予測結果記憶部１７に格納する（予測値記憶ステップ）。 In the first-stage prediction process, first, the analysis data generation unit 11 generates analysis data for each area and stores it in the analysis data storage unit 12 (step S81, actual value storage step). Then, the 1st regression type production | generation part 13 produces | generates a 1st regression formula about each area by performing the regression analysis with respect to the data for an analysis of the said area (step S82, 1st calculation step). Subsequently, the prediction data generation unit 14 generates prediction data for each area (step S83, first calculation step). Subsequently, the first prediction result generation unit 16 calculates the first predicted number of rides by applying the analysis data to the first regression equation for each area (step S84, first calculation step), The first prediction result data is stored in the first prediction result storage unit 17 (prediction value storage step).

次に２段階目の予測処理が行われる。まず、エリア抽出部１８が各予測対象エリアについて相関エリアを抽出する（ステップＳ８５、抽出ステップ）。この相関エリアは、実績乗車数に関する予測対象エリアとの相関が所定値以上の周辺エリアである。続いて、第２回帰式生成部１９が、各予測対象エリアについて、相関エリアの実績乗車数が説明変数として加わった回帰分析を当該予測対象エリアの分析用データに対して行うことで第２の回帰式を生成する（ステップＳ８６、生成ステップ）。そして、第２予測結果生成部２０が、各予測対象エリアについて、当該予測対象エリアの分析用データと相関エリアの第１の予測結果データとを第２の回帰式に適用することで第２の予測乗車数を算出する（ステップＳ８７、第２算出ステップ）。第２予測結果生成部２０は、算出した各第２の予測乗車数を第２予測結果記憶部２１に格納する（出力ステップ）。これにより、各エリアの予測乗車数が確定する。 Next, a second stage prediction process is performed. First, the area extraction unit 18 extracts a correlation area for each prediction target area (step S85, extraction step). This correlation area is a peripheral area whose correlation with the prediction target area regarding the actual number of rides is a predetermined value or more. Subsequently, the second regression equation generation unit 19 performs, for each prediction target area, a regression analysis in which the actual number of rides in the correlation area is added as an explanatory variable to the analysis data of the prediction target area. A regression equation is generated (step S86, generation step). Then, the second prediction result generation unit 20 applies, for each prediction target area, the analysis data of the prediction target area and the first prediction result data of the correlation area to the second regression equation. The predicted number of passengers is calculated (step S87, second calculation step). The second prediction result generation unit 20 stores the calculated second predicted number of rides in the second prediction result storage unit 21 (output step). Thereby, the estimated number of boarding of each area is decided.

以上説明したように、本実施形態によれば、ある一つの予測対象エリアの最終的な予測乗車数を求める際に、その予測対象エリアと周辺エリアとの間で実績乗車数の相関が算出され、その相関が一定以上であるエリアが相関エリアとして抽出される。そして、当該相関エリアの需要実績値を用いた回帰分析で得られた第２の回帰式と、当該相関エリアの第１の予測乗車数とを用いて、予測対象エリアの最終的な第２の予測乗車数が算出される。このように、地理的な相関を考慮する際に、予測対象エリアとの相関が高い他のエリアの需要値のみを用いることで、予測対象エリアの需要予測を少ない計算量で行うことが可能になる。 As described above, according to the present embodiment, when obtaining the final predicted number of passengers in a certain prediction target area, the correlation of the actual number of passengers is calculated between the prediction target area and the surrounding area. , Areas whose correlation is above a certain level are extracted as correlation areas. Then, using the second regression equation obtained by the regression analysis using the actual demand value of the correlation area and the first predicted number of rides of the correlation area, the final second of the prediction target area is obtained. The predicted number of rides is calculated. In this way, when considering geographical correlation, it is possible to perform demand prediction of the prediction target area with a small amount of calculation by using only the demand values of other areas having high correlation with the prediction target area. Become.

また、本実施形態によれば、相関エリアの候補となるエリアが予測対象エリアの近くに位置するものに予め絞り込まれた上で、絞り込まれた候補（周辺エリア）についてのみ予測対象エリアとの相関が計算される。これにより、相関を計算する量を少なくすることができる。 Further, according to the present embodiment, the correlation area candidates are narrowed down in advance to those that are close to the prediction target area, and only the narrowed candidates (peripheral areas) are correlated with the prediction target area. Is calculated. Thereby, the amount of calculating the correlation can be reduced.

以上、本発明をその実施形態に基づいて詳細に説明した。しかし、本発明は上記実施形態に限定されるものではない。本発明は、その要旨を逸脱しない範囲で様々な変形が可能である。 The present invention has been described in detail based on the embodiments. However, the present invention is not limited to the above embodiment. The present invention can be variously modified without departing from the gist thereof.

上記実施形態では需要予測システム１がタクシーの乗客数を予測したが、予測する需要の種類はこれに限定されず、本発明は様々な需要の予測に適用可能である。例えば、商品の売上の予測（目的変数は売上の個数又は金額）や、将来の人口予測（目的変数は人口）などに本発明を適用できる。 In the above embodiment, the demand prediction system 1 predicts the number of taxi passengers. However, the type of demand to be predicted is not limited to this, and the present invention can be applied to various demand predictions. For example, the present invention can be applied to prediction of sales of products (the objective variable is the number or amount of sales) and future population prediction (the objective variable is population).

上記実施形態では、エリア抽出部１８が、まず一の予測対象エリアに対する周辺エリアを抽出し、各周辺エリアについて当該予測対象エリアとの間の実績乗車数の相関を求めることで相関エリアを抽出したが、相関エリアの抽出方法はこれに限定されない。例えば、抽出手段は、周辺エリアを抽出することなく、一の予測対象エリアについて他のすべてのエリアとの間の実績乗車数の相関を求め、当該他のすべてのエリアから相関エリアを抽出してもよい。すなわち、抽出手段はエリア間の距離を考慮することなく実績乗車数の相関のみに基づいて相関エリアを抽出してもよい。 In the above embodiment, the area extraction unit 18 first extracts a peripheral area for one prediction target area, and extracts a correlation area by obtaining a correlation of the actual number of rides with the prediction target area for each peripheral area. However, the method of extracting the correlation area is not limited to this. For example, the extraction means obtains the correlation of the actual number of rides with all other areas for one prediction target area without extracting the surrounding area, and extracts the correlation area from all the other areas. Also good. That is, the extracting means may extract the correlation area based only on the correlation of the actual number of rides without considering the distance between the areas.

上記実施形態では、最終結果である第２の予測乗車数が第２予測結果記憶部２１に格納されたが、第２の需要予測値の出力方法はこれに限定されない。例えば、他の情報処理システムや端末などに第２の需要予測値が送信されてもよい。 In the above embodiment, the second predicted number of rides that is the final result is stored in the second prediction result storage unit 21, but the method of outputting the second demand predicted value is not limited to this. For example, the second demand prediction value may be transmitted to another information processing system or a terminal.

１…需要予測システム、１１…分析用データ生成部、１２…分析用データ記憶部（実績値記憶手段）、１３…第１回帰式生成部（第１算出手段）、１４…予測用データ生成部（第１算出手段）、１５…予測用データ記憶部、１６…第１予測結果生成部（第１算出手段）、１７…第１予測結果記憶部（予測値記憶手段）、１８…エリア抽出部（抽出手段）、１９…第２回帰式生成部（生成手段）、２０…第２予測結果生成部（第２算出手段、出力手段）、２１…第２予測結果記憶部。 DESCRIPTION OF SYMBOLS 1 ... Demand prediction system, 11 ... Analytical data generation part, 12 ... Analytical data storage part (actual value storage means), 13 ... 1st regression type production | generation part (1st calculation means), 14 ... Prediction data generation part (First calculation means), 15 ... prediction data storage section, 16 ... first prediction result generation section (first calculation means), 17 ... first prediction result storage section (prediction value storage means), 18 ... area extraction section (Extraction means), 19... Second regression equation generation section (generation means), 20... Second prediction result generation section (second calculation means, output means), 21.

Claims

For each of the plurality of areas, actual value storage means for storing the past demand actual value of the area;
For each of the plurality of areas, the actual demand value of the area is read from the actual value storage means, a regression analysis using the actual demand value is performed, a first regression equation is generated, and the first regression First calculating means for calculating a first demand forecast value of the area using an equation;
Predicted value storage means for storing the first demand predicted value calculated by the first calculation means for each of the plurality of areas;
Using one of the plurality of areas as a prediction target area, a correlation between the demand actual value of the prediction target area read from the actual value storage unit and the demand actual value of another area is calculated, and the correlation is predetermined. Extracting means for extracting the other area that is equal to or greater than the threshold value as a correlation area;
Generating means for generating a second regression equation by performing regression analysis using the actual demand value of the correlation area extracted by the extracting means;
A second demand calculation value for the prediction target area is calculated using the second regression equation generated by the generation means and the demand prediction value for the correlation area read from the prediction value storage means. A calculation means;
Output means for outputting a second demand forecast value extracted by the second calculation means;
A demand forecasting system comprising:

The extraction means extracts another area whose distance from the prediction target area is a predetermined value or less as a peripheral area, calculates a correlation between the demand actual value of the prediction target area and the demand actual value of the peripheral area, Extracting the peripheral area where the correlation is equal to or greater than the threshold as the correlation area;
The demand forecasting system according to claim 1 characterized by things.

A demand forecast method executed by a demand forecast system,
For each of the plurality of areas, an actual value storage step of storing the past demand actual value of the area in the actual value storage means;
For each of the plurality of areas, the actual demand value of the area is read from the actual value storage means, a regression analysis using the actual demand value is performed, a first regression equation is generated, and the first regression A first calculation step of calculating a first demand forecast value of the area using an equation;
For each of the plurality of areas, a predicted value storage step of storing the first demand predicted value calculated in the first calculation step in a predicted value storage means;
Using one of the plurality of areas as a prediction target area, a correlation between the demand actual value of the prediction target area read from the actual value storage unit and the demand actual value of another area is calculated, and the correlation is predetermined. An extraction step of extracting the other area that is equal to or greater than the threshold value as a correlation area;
A generation step of generating a second regression equation by performing a regression analysis using the actual demand value of the correlation area extracted in the extraction step;
A second demand calculation value for the prediction target area is calculated using the second regression equation generated in the generation step and the demand prediction value for the correlation area read from the prediction value storage means. A calculation step;
An output step of outputting the second demand forecast value extracted in the second calculation step;
Demand forecasting method characterized by including.