JP6855359B2

JP6855359B2 - Congestion prediction model creation device

Info

Publication number: JP6855359B2
Application number: JP2017188737A
Authority: JP
Inventors: 裕人赤塚; 寺田　雅之; 雅之寺田; 永田　智大; 智大永田
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2017-09-28
Filing date: 2017-09-28
Publication date: 2021-04-07
Anticipated expiration: 2037-09-28
Also published as: JP2019066930A

Description

本発明は、交通渋滞の予測に用いられる渋滞予測モデルを作成する渋滞予測モデル作成装置に関する。 The present invention relates to a traffic congestion prediction model creating device that creates a traffic congestion prediction model used for predicting traffic congestion.

交通情報を取得し、取得された交通情報に基づいて所望地点の交通渋滞を予測する技術が知られている（例えば、特許文献１参照）。特許文献１では、過去の渋滞データに基づいて渋滞予測モデルを作成し、渋滞予測モデルに現在の交通情報を入力して交通渋滞を予測することを記載している。 A technique for acquiring traffic information and predicting traffic congestion at a desired point based on the acquired traffic information is known (see, for example, Patent Document 1). Patent Document 1 describes that a traffic congestion prediction model is created based on past traffic congestion data, and current traffic information is input to the traffic congestion prediction model to predict traffic congestion.

特開２００４−２７２４０８号公報Japanese Unexamined Patent Publication No. 2004-272408

交通渋滞には様々な要因があり、従来の渋滞予測では、予期していない要因によって、予測した渋滞と実際の渋滞とが乖離することが懸念される。例えば特許文献１に記載の手法では、過去の渋滞データ、すなわち既に走行状態にある自動車のデータから渋滞予測モデルが作成されており、過去の渋滞データをサンプリングした時点で走行状態にない自動車は考慮されていない。このため、過去の渋滞データのみから交通渋滞を予測する場合、現在の時刻から予測時刻までの時間が長いほど、現在走行状態にないが予測時刻に走行状態となる自動車に起因して渋滞予測の誤差が増加するおそれがある。 There are various factors in traffic congestion, and in the conventional congestion prediction, there is a concern that the predicted congestion and the actual congestion may deviate due to unexpected factors. For example, in the method described in Patent Document 1, a traffic jam prediction model is created from past traffic jam data, that is, data of a car that is already in a running state, and a car that is not in the running state at the time of sampling the past traffic jam data is considered. It has not been. For this reason, when predicting traffic congestion only from past congestion data, the longer the time from the current time to the predicted time, the more the congestion prediction is caused by a vehicle that is not currently in the driving state but is in the driving state at the predicted time. The error may increase.

一方、交通渋滞の様々な要因を考慮して渋滞予測モデルを構築すれば、渋滞予測の誤差を低減して渋滞予測の精度が向上され得る。しかし、この場合、渋滞予測モデルの構築に要する演算量が劇的に増加するおそれがある。 On the other hand, if a traffic congestion prediction model is constructed in consideration of various factors of traffic congestion, the error of the traffic congestion prediction can be reduced and the accuracy of the traffic congestion prediction can be improved. However, in this case, the amount of calculation required to build the congestion prediction model may increase dramatically.

本発明は、上記の問題に鑑みてなされたものであり、渋滞予測の精度が向上され得ると共に渋滞予測モデルの構築に要する演算量の増加が抑制され得る渋滞予測モデル作成装置を提供することを目的とする。 The present invention has been made in view of the above problems, and provides a traffic jam prediction model creating device capable of improving the accuracy of traffic jam prediction and suppressing an increase in the amount of calculation required for constructing a traffic jam prediction model. The purpose.

上記の目的を達成するために、本発明に係る渋滞予測モデル作成装置は、交通渋滞の予測に用いる渋滞予測モデルを作成する渋滞予測モデル作成装置であって、人の位置と属性とに応じて分類された複数のグループの人口データを取得する人口データ取得部と、位置に応じた渋滞状況を示す渋滞データを取得する渋滞データ取得部と、人口データをグループごとに正規化した後に、グループごとの正規化された人口データを説明変数とし、交通渋滞を予測する渋滞予測位置における渋滞データを目的変数とする回帰分析を行うことで、複数のグループの中から、渋滞予測位置における渋滞データに対する寄与度が相対的に高いグループの人口データを選択し、選択された人口データと渋滞予測位置における渋滞データとに基づく回帰分析によって渋滞予測モデルを構築する渋滞予測モデル構築部と、を備える。 In order to achieve the above object, the traffic jam prediction model creating device according to the present invention is a traffic jam prediction model creating device that creates a traffic jam prediction model used for predicting traffic congestion, and is a traffic jam prediction model creating device according to the position and attribute of a person. The population data acquisition unit that acquires the population data of multiple classified groups, the congestion data acquisition unit that acquires the congestion data indicating the congestion status according to the position, and the congestion data acquisition unit that acquires the congestion data according to the position, and after normalizing the population data for each group, each group Contribution to the traffic congestion data at the traffic congestion prediction position from multiple groups by performing regression analysis using the normalized population data of the above as the explanatory variable and the traffic congestion data at the traffic congestion prediction position as the objective variable. degrees selects population data of relatively high group, and a traffic jam prediction model constructing section for constructing the traffic jam prediction model by congestion data and the based rather regression analysis on selected population data traffic jam predicted position.

本発明に係る渋滞予測モデル作成装置では、人口データと渋滞データとに基づく回帰分析によって渋滞予測モデルを構築している。人口データをグループごとに正規化した後に、グループごとの正規化された人口データを用いることで、交通渋滞の様々な要因に対応し得る。例えば、人口データは、現在走行状態にないが予測時刻に走行状態になる自動車の数にも相関している。このため、上記渋滞予測モデルを用いて渋滞予測を行えば、渋滞予測の誤差を低減して渋滞予測の精度が向上され得る。また、回帰分析を用いて、渋滞予測位置における渋滞データに対する寄与度が高いグループの人口データを選択し、選択された人口データと渋滞データとに基づく回帰分析によって渋滞予測モデルを構築する。このため、渋滞予測の精度が確保されながら、渋滞予測モデルの構築に要する演算量の増加が抑制され得る。すなわち、モデル構築部によって構築された渋滞予測モデルを用いて渋滞予測を行うことで、渋滞予測度の向上と演算量の増加の抑制とが両立され得る。 In the traffic jam prediction model generating apparatus according to the present invention is to construct a traffic jam prediction model by based rather regression analysis and population data and traffic congestion data. By normalizing the population data for each group and then using the normalized population data for each group, it is possible to deal with various factors of traffic congestion. For example, population data also correlates with the number of vehicles that are not currently in driving but will be in driving at the predicted time. Therefore, if the traffic jam is predicted using the above-mentioned traffic jam prediction model, the error of the traffic jam prediction can be reduced and the accuracy of the traffic jam prediction can be improved. Further, using regression analysis, select the population data of contribution higher group for congestion data in the traffic jam prediction position, building a traffic jam prediction model by a selected population data congestion data and the based rather regression analysis. Therefore, while ensuring the accuracy of the traffic jam prediction, it is possible to suppress an increase in the amount of calculation required for constructing the traffic jam prediction model. That is, by performing the traffic jam prediction using the traffic jam prediction model constructed by the model construction unit, it is possible to both improve the degree of traffic jam prediction and suppress the increase in the amount of calculation.

本発明によれば、渋滞予測の精度が向上され得ると共に渋滞予測モデルの構築に要する演算量の増加が抑制され得る渋滞予測モデル作成装置を提供することができる。 According to the present invention, it is possible to provide a traffic jam prediction model creating device that can improve the accuracy of traffic jam prediction and suppress an increase in the amount of calculation required for constructing a traffic jam prediction model.

一の実施形態に係る渋滞予測モデル作成装置の機能を説明するための図である。It is a figure for demonstrating the function of the traffic jam prediction model creation apparatus which concerns on one Embodiment. 人口データの構成例を示す図である。It is a figure which shows the composition example of the population data. 渋滞データの構成例を示す図である。It is a figure which shows the composition example of the traffic jam data. 渋滞がある又は渋滞がないと予測された場合の渋滞予測結果の出力の構成例を示す図である。It is a figure which shows the composition example of the output of the traffic jam prediction result when it is predicted that there is a traffic jam or there is no traffic jam. 渋滞がある又は渋滞がないと予測された場合の渋滞予測結果の表示例を示す図である。It is a figure which shows the display example of the traffic jam prediction result when it is predicted that there is a traffic jam or there is no traffic jam. 一の実施形態に係る渋滞予測モデル作成装置で実行される処理を示すフローチャートである。It is a flowchart which shows the process executed by the traffic jam prediction model creation apparatus which concerns on one Embodiment. 一の実施形態に係る渋滞予測モデル作成装置で実行される処理を示すフローチャートである。It is a flowchart which shows the process executed by the traffic jam prediction model creation apparatus which concerns on one Embodiment. 一の実施形態に係る渋滞予測モデル作成装置のハードウェアの構成を示す図である。It is a figure which shows the hardware structure of the traffic jam prediction model creation apparatus which concerns on one Embodiment.

以下、図面と共に本発明に係る渋滞予測モデル作成装置の実施形態について詳細に説明する。なお、図面の説明においては同一要素には同一符号を付し、重複する説明を省略する。 Hereinafter, embodiments of the congestion prediction model creating device according to the present invention will be described in detail together with the drawings. In the description of the drawings, the same elements are designated by the same reference numerals, and duplicate description will be omitted.

図１は、本実施形態に係る渋滞予測モデル作成装置１００を示す。渋滞予測モデル作成装置１００は、学習用人口データと学習用渋滞データとに基づいて、交通渋滞の予測に用いる渋滞予測モデルを作成する。渋滞予測モデル作成装置１００は、作成された渋滞予測モデルを用いて、人口データを入力として交通渋滞を予測する。以下では、予め定められた期間（以下、「サンプリング期間」という）内にサンプリングされた元データに基づいて渋滞予測モデルを構築し、当日の全国の地域メッシュごとの人口データを入力として、当日の交通渋滞を予測する場合を例に説明するが、これに限定されない。 FIG. 1 shows a traffic jam prediction model creating device 100 according to the present embodiment. The congestion prediction model creating device 100 creates a congestion prediction model used for predicting traffic congestion based on the learning population data and the learning congestion data. The traffic congestion prediction model creating device 100 uses the created traffic congestion prediction model to predict traffic congestion by inputting population data. In the following, a congestion prediction model will be constructed based on the original data sampled within a predetermined period (hereinafter referred to as "sampling period"), and the population data for each regional mesh of the day will be input as the input for the day. The case of predicting traffic congestion will be described as an example, but the present invention is not limited to this.

人口データとは、ある時刻にある地域にどれくらい人が存在したかを示すデータである。本実施形態では、人口データは、サンプリング時刻と、人の位置（たとば、地域メッシュなど）と、人の属性（例えば、性別、年代、居住地など）とに応じて分類された複数のグループの各々に属する人口を示している。サンプリング時刻は、データのサンプリングを行った時刻である。人の位置及び属性は、データのサンプリングの対象者（以下、単に「サンプリング対象者」という）の位置及び属性である。例えば、人口データは、外部のデータベース等に保存されている、予め携帯端末の定期的にサンプリングされた位置情報と当該携帯端末のユーザの属性についての登録情報とから生成される。本実施形態では、人口データは、図２に示したように、「時刻」、「位置識別番号」、「性別」、「年代」、「居住地」、及び「人口」の項目で構成される。 Population data is data showing how many people existed in an area at a certain time. In this embodiment, the population data is a plurality of groups classified according to the sampling time, the position of the person (such as a tabby, the area mesh), and the attribute of the person (for example, gender, age, place of residence, etc.). Shows the population belonging to each of. The sampling time is the time when the data was sampled. The position and attribute of a person are the position and attribute of a person whose data is to be sampled (hereinafter, simply referred to as "sampling target person"). For example, the population data is generated from the location information that is periodically sampled in advance of the mobile terminal and the registration information about the attributes of the user of the mobile terminal, which are stored in an external database or the like. In the present embodiment, as shown in FIG. 2, the population data is composed of the items of "time", "location identification number", "gender", "age", "residential place", and "population". ..

「時刻」では、データのサンプリングを行った日付と時刻が示される。「位置識別番号」では、サンプリング対象者の位置を識別する番号が示される。例えば、「位置識別番号」として、「ＪＩＳＸ０４１０地域メッシュコード」が用いられる。「位置識別番号」には、第２次メッシュ、第３次メッシュ、又は２分の１地域メッシュ等の地域メッシュコードが用いられてもよいし、居住地の都道府県コード又は市区町村コードが用いられてもよい。本実施形態では、「位置識別番号」として２分の１地域メッシュが用いられるため、位置識別番号は約１５０万種類に分類されている。 "Time" indicates the date and time when the data was sampled. The "position identification number" indicates a number that identifies the position of the sampling target person. For example, "JIS X 0410 area mesh code" is used as the "position identification number". For the "position identification number", a regional mesh code such as a secondary mesh, a tertiary mesh, or a half regional mesh may be used, or a prefecture code or a city / ward / town / village code of the place of residence may be used. You may. In the present embodiment, since the half area mesh is used as the "position identification number", the position identification numbers are classified into about 1.5 million types.

「性別」では、サンプリング対象者の性別が示される。「年代」では、サンプリングの対象者の年齢区分が示される。本実施形態では、「年代」は、２０歳以下、２１〜２５歳、２６〜３０歳、３１〜３５歳、３６〜４０歳、４１〜４５歳、４６〜５０歳、５１〜５５歳、５６〜６０歳、６１〜６５歳、６６〜７０歳、７１〜７５歳、７６歳以上の１３種類に分類されている。「居住地」では、サンプリングの対象者の居住地が示される。「居住地」には、第２次メッシュ、第３次メッシュ、又は２分の１地域メッシュ等の地域メッシュコードが用いられてもよいし、居住地の都道府県コード又は市区町村コードが用いられてもよい。本実施形態では、「居住地」として、約２０００個に分類された市区町村コードが用いられる。 “Gender” indicates the gender of the sampling target. "Age" indicates the age group of the person to be sampled. In the present embodiment, the "age" is 20 years old or younger, 21-25 years old, 26-30 years old, 31-35 years old, 36-40 years old, 41-45 years old, 46-50 years old, 51-55 years old, 56 years old. It is classified into 13 types: ~ 60 years old, 61-65 years old, 66-70 years old, 71-75 years old, 76 years old and over. “Residence” indicates the residence of the person to be sampled. For the "place of residence", a regional mesh code such as a secondary mesh, a tertiary mesh, or a half area mesh may be used, or a prefecture code or a city / ward / town / village code of the place of residence is used. May be good. In the present embodiment, as the "residential place", the city, ward, town, and village codes classified into about 2000 are used.

本実施形態では、人口データは、人の位置に相当する「位置識別番号」と、人の属性に相当する、「性別」、「年代」、及び「居住地」とに応じて複数のグループに分類される。このため、人口データは、約１５０万×２×１３×約２０００のグループに分類され得る。すなわち、人の位置及び属性の組み合わせを変数とする空間は、理論上、約８００億次元に及ぶ。 In the present embodiment, the population data is divided into a plurality of groups according to the "position identification number" corresponding to the position of the person and the "gender", "age", and "place of residence" corresponding to the attribute of the person. being classified. Therefore, the population data can be classified into groups of about 1.5 million x 2 x 13 x about 2000. That is, the space in which the combination of the position and the attribute of a person is a variable theoretically extends to about 80 billion dimensions.

渋滞データとは、位置に応じた渋滞状況を示すデータであり、渋滞情報を管理する外部のデータベースに予め保存されている。例えば、渋滞データは、道路（例えば、全国の主要な道路）が予め区分けされた区間（いわゆるリンク）ごとに、図３に示したように、「渋滞発生日付」、「開始時刻」、「終了時刻」、「最大渋滞長」、及び「渋滞ピーク時刻」の項目で構成されている。本実施形態では、上述した予め区分けされた区間（以下、単に「位置」という）が予測の対象とされる。「渋滞発生日付」では、対象の位置で渋滞が発生した日付が示される。「開始時刻」では、対象の位置で渋滞が発生した時刻が示される。「終了時刻」では、対象の位置で渋滞が終了した時刻が示される。「最大渋滞長」では、対象の位置で発生した渋滞が開始時刻から終了時刻までの間で、最大となった渋滞の長さが示される。「渋滞ピーク時刻」では、最大渋滞長に達した時刻が示される。 The traffic jam data is data showing the traffic jam situation according to the position, and is stored in advance in an external database that manages the traffic jam information. For example, in the traffic jam data, as shown in FIG. 3, the "traffic jam occurrence date", "start time", and "end" are collected for each section (so-called link) in which roads (for example, major roads nationwide) are divided in advance. It consists of items such as "time", "maximum congestion length", and "congestion peak time". In the present embodiment, the above-mentioned pre-divided section (hereinafter, simply referred to as “position”) is the target of prediction. In the "traffic jam occurrence date", the date when the traffic jam occurred at the target position is indicated. The "start time" indicates the time when the traffic jam occurred at the target position. The "end time" indicates the time when the traffic jam ended at the target position. The "maximum congestion length" indicates the maximum congestion length between the start time and the end time of the congestion that occurred at the target position. "Traffic congestion peak time" indicates the time when the maximum congestion length is reached.

学習用人口データ取得部１１１は、渋滞予測モデル作成装置１００の外部から、渋滞予測モデルの構築に用いられる学習用の人口データ（以下、「学習用人口データ」という）を取得する。学習用人口データ取得部１１１は、予め定められたサンプリング期間内にサンプリングされた学習用人口データを渋滞予測に用いる元データとして取得する。学習用人口データは、例えば、外部のデータベースに記録された全国の過去の人口データであり、サンプリング時刻と人の位置と人の属性とに応じて分類された複数のグループに分かれている。本実施形態では、学習用人口データ取得部１１１は、日毎に正午にサンプリングされた学習用人口データのみを取得する。学習用人口データ格納部１１２は、学習用人口データ取得部１１１に入力された学習用人口データを格納する。 The learning population data acquisition unit 111 acquires the learning population data (hereinafter, referred to as “learning population data”) used for constructing the congestion prediction model from the outside of the congestion prediction model creating device 100. The learning population data acquisition unit 111 acquires the learning population data sampled within a predetermined sampling period as the original data used for the congestion prediction. The learning population data is, for example, the past population data of the whole country recorded in an external database, and is divided into a plurality of groups classified according to the sampling time, the position of the person, and the attribute of the person. In the present embodiment, the learning population data acquisition unit 111 acquires only the learning population data sampled at noon every day. The learning population data storage unit 112 stores the learning population data input to the learning population data acquisition unit 111.

学習用渋滞データ取得部１１３は、渋滞予測モデル作成装置１００の外部から、渋滞予測モデルの構築に用いられる学習用の渋滞データ（以下、「学習用渋滞データ」という）を取得する。学習用渋滞データ取得部１１３は、予め定められたサンプリング期間内にサンプリングされた、渋滞予測位置における学習用渋滞データを渋滞予測に用いる元データとして取得する。学習用渋滞データは、例えば、外部のデータベースに記録された過去の渋滞データのうち、交通渋滞を予測する渋滞予測位置に応じた渋滞状況を示す。学習用渋滞データ格納部１１４は、学習用渋滞データ取得部１１３に入力された学習用渋滞データを格納する。 The learning congestion data acquisition unit 113 acquires learning congestion data (hereinafter, referred to as “learning congestion data”) used for constructing the congestion prediction model from the outside of the congestion prediction model creating device 100. The learning congestion data acquisition unit 113 acquires the learning congestion data at the congestion prediction position sampled within a predetermined sampling period as the original data used for the congestion prediction. The learning congestion data indicates, for example, the congestion status according to the congestion prediction position for predicting the traffic congestion among the past congestion data recorded in the external database. The learning congestion data storage unit 114 stores the learning congestion data input to the learning congestion data acquisition unit 113.

渋滞予測モデル構築部１１５は、学習用人口データ格納部１１２及び学習用渋滞データ格納部１１４を参照して、位置ごとに、交通渋滞の予測に用いる渋滞予測モデルを構築する。渋滞予測モデル構築部１１５は、変数選択部１１６、次元圧縮部１１７、予測用回帰式作成部１１８を有する。 The congestion prediction model construction unit 115 constructs a congestion prediction model used for predicting traffic congestion for each position with reference to the learning population data storage unit 112 and the learning congestion data storage unit 114. The congestion prediction model construction unit 115 includes a variable selection unit 116, a dimension compression unit 117, and a prediction regression equation creation unit 118.

以下、渋滞予測モデル構築部１１５が、ある渋滞予測位置について、学習用渋滞データの「渋滞発生日付」、「開始時刻」、「終了時刻」、「最大渋滞長」、及び「渋滞ピーク時刻」の項目のそれぞれに基づいて、渋滞予測モデルを構築する例について説明する。具体的には、渋滞予測モデル構築部１１５は、当該渋滞予測位置についての「渋滞発生日付」ごとの学習用渋滞データに基づいて渋滞予測対象となる日の渋滞の有無を予測する渋滞予測モデルを構築する。また、渋滞予測モデル構築部１１５は、当該渋滞予測位置についての「開始時刻」に基づいて渋滞の開始時刻を予測する渋滞予測モデルを構築する。また、渋滞予測モデル構築部１１５は、当該渋滞予測位置についての「終了時刻」に基づいて渋滞の終了時刻を予測する渋滞予測モデルを構築する。また、渋滞予測モデル構築部１１５は、「最大渋滞長」に基づいて渋滞の最大渋滞長を予測する渋滞予測モデルを構築する。また、渋滞予測モデル構築部１１５は、「渋滞ピーク時刻」に基づいて渋滞ピーク時刻を予測する渋滞予測モデルを構築する。即ち、渋滞予測モデル構築部１１５は、渋滞予測位置ごとに５つの渋滞予測モデルを構築する。 Hereinafter, the traffic jam prediction model construction unit 115 describes the "traffic jam occurrence date", "start time", "end time", "maximum traffic jam length", and "traffic jam peak time" of the learning traffic jam data for a certain traffic jam prediction position. An example of constructing a congestion prediction model based on each of the items will be described. Specifically, the traffic jam prediction model construction unit 115 calculates a traffic jam prediction model that predicts the presence or absence of traffic jam on the day to be traffic jam prediction target based on the learning traffic jam data for each "traffic jam occurrence date" for the traffic jam prediction position. To construct. Further, the traffic jam prediction model building unit 115 builds a traffic jam prediction model that predicts the start time of the traffic jam based on the "start time" of the traffic jam prediction position. In addition, the traffic jam prediction model building unit 115 builds a traffic jam prediction model that predicts the end time of the traffic jam based on the "end time" of the traffic jam prediction position. In addition, the traffic jam prediction model construction unit 115 builds a traffic jam prediction model that predicts the maximum traffic jam length based on the "maximum traffic jam length". In addition, the traffic jam prediction model building unit 115 builds a traffic jam prediction model that predicts the traffic jam peak time based on the “traffic jam peak time”. That is, the traffic jam prediction model construction unit 115 builds five traffic jam prediction models for each traffic jam prediction position.

渋滞予測モデル構築部１１５では、渋滞予測位置における学習用渋滞データの上述した各項目の値に基づいて求めた数値を用いて、渋滞予測モデルを構築する。例えば、渋滞の有無を予測する渋滞予測モデルを構築する場合には、渋滞予測モデル構築部１１５は、上記サンプリング期間内のうち、学習用渋滞データにおいて渋滞予測位置に渋滞があったと判断される日付（当該渋滞予測位置についての学習用渋滞データがある日付）における渋滞の有無を示す数値を「１」とし、学習用渋滞データにおいて渋滞予測位置に渋滞がなかったと判断される日付（当該渋滞予測位置についての学習用渋滞データがない日付）における渋滞の有無を示す数値を「０」として取り扱う。渋滞の開始時刻を予測する渋滞予測モデルを構築する場合には、渋滞予測モデル構築部１１５は、サンプリング期間内の渋滞予測位置における学習用渋滞データの「開始時刻」の値を周知の手法で数値化する。渋滞の終了時刻を予測する渋滞予測モデルを構築する場合には、渋滞予測モデル構築部１１５は、サンプリング期間内の渋滞予測位置における学習用渋滞データの「終了時刻」の値を周知の手法で数値化する。渋滞の最大渋滞長を予測する渋滞予測モデルを構築する場合には、渋滞予測モデル構築部１１５は、サンプリング期間内の渋滞予測位置における学習用渋滞データの「最大渋滞長」の値を上記数値として取り扱う。渋滞ピーク時刻を予測する渋滞予測モデルを構築する場合には、渋滞予測モデル構築部１１５は、サンプリング期間内の渋滞予測位置における学習用渋滞データの「渋滞ピーク時刻」の値を周知の手法で数値化する。 The traffic jam prediction model construction unit 115 builds a traffic jam prediction model using the numerical values obtained based on the values of the above-mentioned items of the traffic jam data for learning at the traffic jam prediction position. For example, when constructing a congestion prediction model that predicts the presence or absence of congestion, the congestion prediction model construction unit 115 determines that there was congestion at the congestion prediction position in the learning congestion data within the above sampling period. The numerical value indicating the presence or absence of traffic jam in (the date when there is the traffic jam data for learning about the predicted traffic jam position) is set to "1", and the date when it is judged that there is no traffic jam in the predicted traffic jam position in the learning traffic jam position (the predicted traffic jam position). The numerical value indicating the presence or absence of traffic congestion on the date when there is no traffic congestion data for learning is treated as "0". When constructing a traffic jam prediction model that predicts the start time of traffic jam, the traffic jam prediction model building unit 115 numerically calculates the value of the "start time" of the training traffic jam data at the traffic jam prediction position within the sampling period by a well-known method. To become. When constructing a traffic jam prediction model that predicts the end time of traffic jam, the traffic jam prediction model building unit 115 numerically calculates the value of the "end time" of the training traffic jam data at the traffic jam prediction position within the sampling period by a well-known method. To become. When constructing a congestion prediction model that predicts the maximum congestion length of congestion, the congestion prediction model construction unit 115 uses the value of the "maximum congestion length" of the learning congestion data at the congestion prediction position within the sampling period as the above numerical value. handle. When constructing a traffic jam prediction model for predicting a traffic jam peak time, the traffic jam prediction model building unit 115 numerically calculates the value of the "traffic jam peak time" of the learning traffic jam data at the traffic jam prediction position within the sampling period by a well-known method. To become.

変数選択部１１６は、学習用人口データ格納部１１２に格納されている学習用人口データ（例えば、地域メッシュごとの全国の学習用人口データ）から渋滞と関連のあるグループの学習用人口データ（説明変数）を選択する。以下に、変数選択部１１６が行う処理を具体的に説明する。 The variable selection unit 116 is a learning population data (explanation) of a group related to traffic congestion from the learning population data (for example, the national learning population data for each regional mesh) stored in the learning population data storage unit 112. Variable) is selected. The processing performed by the variable selection unit 116 will be specifically described below.

変数選択部１１６は、まず、サンプリング期間内の学習用人口データをグループごとに正規化する。例えば、変数選択部１１６は、学習用人口データの各グループにおける人口を、上記サンプリング期間内の全時刻に関し、各グループにおいて平均が０で分散が１となる値に変換する。学習用人口データを正規化することによって、各グループの普段の人口の規模に応じた、人口の増減の割合が適切に判断され得る。 First, the variable selection unit 116 normalizes the learning population data within the sampling period for each group. For example, the variable selection unit 116 converts the population in each group of the learning population data into a value having an average of 0 and a variance of 1 in each group with respect to all the times within the sampling period. By normalizing the learning population data, the rate of increase or decrease in the population can be appropriately determined according to the size of the normal population of each group.

以降、正規化された学習用人口データを、単に「学習用人口データ」という。渋滞の有無を示す数値、「開始時刻」を示す数値、「終了時刻」を示す数値、「最大渋滞長」を示す数値、及び「渋滞ピーク時刻」を示す数値を纏めて、単に「学習用渋滞データ」という。 Hereinafter, the normalized population data for learning is simply referred to as "population data for learning". A numerical value indicating the presence or absence of traffic congestion, a numerical value indicating "start time", a numerical value indicating "end time", a numerical value indicating "maximum traffic congestion length", and a numerical value indicating "congestion peak time" are simply "congestion for learning". It is called "data".

変数選択部１１６は、グループごとの学習用人口データを説明変数とし、学習用渋滞データを目的変数とするロジスティック回帰を行う。変数選択部１１６は、構築する渋滞予測モデルごとに、互いに同日の情報を示す学習用人口データと学習用渋滞データとを１セットとして、サンプリング期間の複数セットを用いて、ロジスティック回帰を行う。例えば、変数選択部１１６は、「開始時刻」を示す数値を目的変数としてロジスティック回帰を行う。変数選択部１１６は、ロジスティック回帰の結果において、上位ｎ位までの係数の絶対値が大きい説明変数を選択する。ｎは、パラメータであり、ユーザが任意に変更できる。ｎは、例えば、数百に設定される。変数選択部１１６が行われる変数選択は、上述したロジスティック回帰を用いた手法に限定されない。例えば、ｌａｓｓｏ回帰などのパラメトリックな回帰を用いてもよいし、相関係数の値によって選択してもよい。 The variable selection unit 116 performs logistic regression using the learning population data for each group as an explanatory variable and the learning congestion data as an objective variable. The variable selection unit 116 performs logistic regression using a plurality of sets of sampling periods, with the learning population data and the learning congestion data showing information on the same day as one set for each congestion prediction model to be constructed. For example, the variable selection unit 116 performs logistic regression using a numerical value indicating the "start time" as an objective variable. The variable selection unit 116 selects an explanatory variable having a large absolute value of the coefficients up to the upper nth position in the result of logistic regression. n is a parameter and can be arbitrarily changed by the user. n is set to, for example, several hundreds. The variable selection performed by the variable selection unit 116 is not limited to the method using the logistic regression described above. For example, parametric regression such as lasso regression may be used, or selection may be made according to the value of the correlation coefficient.

次元圧縮部１１７は、変数選択部１１６で選択されたグループの学習用人口データを説明変数とし、学習用渋滞データを目的変数として、ＰＬＳ回帰によって座標変換を行う。次元圧縮部１１７は、構築する渋滞予測モデルごとに、互いに同日の情報を示す変数選択部１１６で選択された学習用人口データと学習用渋滞データとを１セットとして、サンプリング期間の複数セットを用いて、ＰＬＳ回帰を行う。例えば、次元圧縮部１１７は、「開始時刻」を示す数値を目的変数としてＰＬＳ回帰を行う。これによって、次元圧縮部１１７は、目的変数に対する寄与度が上位ｍ位の説明変数に次元圧縮を行う。ｍは、パラメータであり、ユーザが任意に変更できる。ｍは、例えば、５〜１００程度に設定される。この結果、変数選択部１１６で選択された説明変数で似た動きをする説明変数が組み合わされて低次元のデータに変換されるため、予測用回帰式作成部１１８において、次元の呪いによる精度悪化を防ぐことができる。次元圧縮部１１７で用いられる回帰は、ＰＬＳ回帰に限られない。ＰＬＳ回帰の代わりに、例えば、ＰＣＡ、ＮＭＦなどを用いてもよい。 The dimension compression unit 117 performs coordinate conversion by PLS regression using the learning population data of the group selected by the variable selection unit 116 as an explanatory variable and the learning congestion data as an objective variable. The dimension compression unit 117 uses a plurality of sets of sampling periods, with the learning population data and the learning congestion data selected by the variable selection unit 116 showing information on the same day as one set for each congestion prediction model to be constructed. Then, PLS regression is performed. For example, the dimension compression unit 117 performs PLS regression using a numerical value indicating the "start time" as an objective variable. As a result, the dimensional compression unit 117 performs dimensional compression on the explanatory variable having the higher degree of contribution to the objective variable. m is a parameter and can be arbitrarily changed by the user. m is set to, for example, about 5 to 100. As a result, the explanatory variables that behave similarly with the explanatory variables selected by the variable selection unit 116 are combined and converted into low-dimensional data. Therefore, in the prediction regression equation creation unit 118, the accuracy deteriorates due to the curse of dimensionality. Can be prevented. The regression used in the dimensional compression unit 117 is not limited to PLS regression. For example, PCA, NMF, or the like may be used instead of PLS regression.

予測用回帰式作成部１１８は、次元圧縮部１１７で次元圧縮を行った学習用人口データと学習用渋滞データとに基づいて、渋滞予測に用いる予測用回帰式（渋滞予測モデル）を作成する。予測用回帰式作成部１１８は、次元圧縮部１１７で変換された変数を説明変数とし、学習用渋滞データを目的変数として、渋滞予測モデルを作成する。予測用回帰式作成部１１８は、構築する渋滞予測モデルごとに、互いに同日の情報を示す次元圧縮部１１７で変換された変数の値と学習用渋滞データとを１セットとして、サンプリング期間の複数セットを用いて、渋滞予測モデルを作成する。例えば、予測用回帰式作成部１１８は、次元圧縮部１１７で次元圧縮を行った学習用人口データと「開始時刻」を示す数値とに基づいて、渋滞の開始時刻を予測する予測用回帰式（渋滞予測モデル）を作成する。すなわち、渋滞予測モデル構築部１１５は、低次元化されたデータから渋滞の値を予測するための渋滞予測モデルを作成する。具体的には、予測用回帰式作成部１１８は、最近傍のサンプルの距離による重み付平均を用いたｋ近傍回帰を行う。 The prediction regression equation creation unit 118 creates a prediction regression equation (congestion prediction model) used for congestion prediction based on the learning population data and the learning congestion data that have been dimensionally compressed by the dimension compression unit 117. The prediction regression equation creation unit 118 creates a congestion prediction model using the variable converted by the dimension compression unit 117 as an explanatory variable and the training congestion data as an objective variable. The prediction regression equation creation unit 118 sets a plurality of sampling periods for each congestion prediction model to be constructed, with the values of variables converted by the dimension compression unit 117 showing information on the same day and the congestion data for learning as one set. Create a traffic jam prediction model using. For example, the prediction regression equation creation unit 118 predicts the start time of congestion based on the learning population data subjected to dimension compression by the dimension compression unit 117 and the numerical value indicating the "start time" (prediction regression equation (prediction regression equation creation unit 118). Create a congestion prediction model). That is, the traffic jam prediction model construction unit 115 creates a traffic jam prediction model for predicting the traffic jam value from the low-dimensional data. Specifically, the prediction regression equation creation unit 118 performs k-nearest neighbor regression using a weighted average based on the distance of the nearest neighbor sample.

最近傍点の数は、パラメータであり、ユーザが任意に変更できる。最近傍点の数は、例えば、２〜１００程度に設定される。予測用回帰式作成部１１８で用いられる回帰は、ｋ近傍回帰に限られない。ｋ近傍回帰の代わりに、例えば、線形回帰、ＳＶＲ回帰などのパラメトリックな手法を用いてもよいし、決定木などノンパラメトリックな手法を用いてもよい。なお、人口データと渋滞データとの関係は非線形であるため、ｋ近傍回帰などのノンパラメトリックな手法で回帰分析を行うことで、パラメトリックな手法を用いるよりも高い精度で渋滞予測を行うことが可能な渋滞予測モデルを作成できる。 The number of nearest neighbor points is a parameter and can be arbitrarily changed by the user. The number of nearest neighbor points is set to, for example, about 2 to 100. The regression used in the prediction regression equation creation unit 118 is not limited to the k-nearest neighbor regression. Instead of k-nearest neighbor regression, for example, a parametric method such as linear regression or SVR regression may be used, or a nonparametric method such as a decision tree may be used. Since the relationship between population data and congestion data is non-linear, it is possible to predict congestion with higher accuracy than using parametric methods by performing regression analysis using non-parametric methods such as k-nearest neighbor regression. You can create a traffic jam prediction model.

渋滞予測モデル格納部１１９は、渋滞予測モデル構築部１１５によって構築された渋滞予測モデルを格納する。本実施形態では、渋滞予測モデル格納部１１９は、渋滞予測位置ごとに、渋滞予測対象となる日の渋滞の有無を予測する渋滞予測モデル、渋滞の開始時刻を予測する渋滞予測モデル、渋滞の終了時刻を予測する渋滞予測モデル、渋滞の最大渋滞長を予測する渋滞予測モデル、及び渋滞ピーク時刻を予測する渋滞予測モデルを格納する。渋滞予測モデル格納部１１９は、渋滞予測モデルごとに、変数選択部１１６によって選択されたグループと、次元圧縮部１１７で行われた座標変換を行うための情報とを格納する。 The traffic jam prediction model storage unit 119 stores the traffic jam prediction model constructed by the traffic jam prediction model construction unit 115. In the present embodiment, the traffic jam prediction model storage unit 119 includes a traffic jam prediction model that predicts the presence or absence of traffic jam on the day that is the target of traffic jam prediction, a traffic jam prediction model that predicts the start time of traffic jam, and the end of traffic jam for each traffic jam prediction position. It stores a traffic jam prediction model that predicts the time, a traffic jam prediction model that predicts the maximum traffic jam length, and a traffic jam prediction model that predicts the peak traffic time. The traffic jam prediction model storage unit 119 stores the group selected by the variable selection unit 116 and the information for performing the coordinate conversion performed by the dimension compression unit 117 for each traffic jam prediction model.

以上のように、渋滞予測モデル構築部１１５は、グループごとの学習用人口データを説明変数とし、学習用渋滞データを目的変数とする回帰分析を行う。これによって、渋滞予測モデル構築部１１５は、複数のグループの中から、渋滞予測位置における渋滞データに対する寄与度が相対的に高いグループの学習用人口データを選択する。渋滞予測モデル構築部１１５は、選択された人口データと渋滞予測位置における渋滞データとに基づいて、回帰分析で渋滞予測モデルを構築する。 As described above, the congestion prediction model construction unit 115 performs regression analysis using the learning population data for each group as an explanatory variable and the learning congestion data as an objective variable. As a result, the traffic jam prediction model construction unit 115 selects the learning population data of the group having a relatively high contribution to the traffic jam data at the traffic jam prediction position from the plurality of groups. The traffic jam prediction model construction unit 115 builds a traffic jam prediction model by regression analysis based on the selected population data and the traffic jam data at the traffic jam prediction position.

渋滞予測位置取得部１２０は、渋滞予測モデル作成装置１００の外部から渋滞予測位置を取得する。渋滞予測位置取得部１２０は、不図示の入力デバイスによって、ユーザが入力した渋滞予測位置を取得してもよい。 The traffic jam prediction position acquisition unit 120 acquires the traffic jam prediction position from the outside of the traffic jam prediction model creation device 100. The traffic jam prediction position acquisition unit 120 may acquire the traffic jam prediction position input by the user by an input device (not shown).

予測用人口データ取得部１２１は、交通渋滞の予測に用いる予測用人口データを取得する。予測用人口データは、予め決められた所定日時の人口データであり、例えば、取得可能な人口データのうち最新の人口データ、当日の所定時刻の人口データ等である。本実施形態では、予測用人口データ取得部１２１は、渋滞予測を行う当日の正午にサンプリングされた人口データを取得する。 The forecasting population data acquisition unit 121 acquires the forecasting population data used for forecasting traffic congestion. The forecasting population data is population data at a predetermined date and time, and is, for example, the latest population data among the available population data, population data at a predetermined time on the day, and the like. In the present embodiment, the forecasting population data acquisition unit 121 acquires the population data sampled at noon on the day when the traffic congestion is predicted.

交通渋滞予測部１２２は、渋滞予測モデル格納部１１９に格納されている渋滞予測モデル、並びに、渋滞予測位置取得部１２０及び予測用人口データ取得部１２１によって取得された各データを用いて交通渋滞を予測する。交通渋滞予測部１２２は、渋滞予測位置取得部１２０によって取得された渋滞予測位置に対応する渋滞予測モデルを渋滞予測モデル格納部１１９から取得する。本実施形態では、交通渋滞予測部１２２は、予測用人口データ（例えば、当日の人口データ）を入力とし、渋滞予測モデル格納部１１９から取得された渋滞予測モデルを用いて、渋滞予測時間における渋滞予測位置の渋滞予測結果を出力する。渋滞予測対象となる日の渋滞の有無を予測する渋滞予測モデル、渋滞の開始時刻を予測する渋滞予測モデル、渋滞の終了時刻を予測する渋滞予測モデル、渋滞の最大渋滞長を予測する渋滞予測モデル、及び渋滞ピーク時刻を予測する渋滞予測モデルの各々に対して予測用人口データを当て嵌めることで、各渋滞予測が行われる。例えば、交通渋滞予測部１２２は、予測用人口データにおける複数のグループのうち、変数選択部１１６によって選択された学習用人口データのグループに対応するグループを選択し、次元圧縮部１１７で行われた座標変換と同様の座標変換を行い、予測用回帰式作成部１１８で作成された回帰式に代入する。 The traffic congestion prediction unit 122 uses the traffic congestion prediction model stored in the traffic congestion prediction model storage unit 119 and the data acquired by the traffic congestion prediction position acquisition unit 120 and the prediction population data acquisition unit 121 to detect traffic congestion. Predict. The traffic jam prediction unit 122 acquires a traffic jam prediction model corresponding to the traffic jam prediction position acquired by the traffic jam prediction position acquisition unit 120 from the traffic jam prediction model storage unit 119. In the present embodiment, the traffic congestion prediction unit 122 inputs the forecast population data (for example, the population data of the day), and uses the traffic congestion prediction model acquired from the traffic congestion prediction model storage unit 119 to use the traffic congestion prediction model at the traffic congestion prediction time. Outputs the congestion prediction result of the predicted position. Congestion prediction model that predicts the presence or absence of congestion on the target day, congestion prediction model that predicts the start time of congestion, congestion prediction model that predicts the end time of congestion, congestion prediction model that predicts the maximum congestion length of congestion , And by applying the forecasting population data to each of the congestion prediction models that predict the congestion peak time, each congestion prediction is performed. For example, the traffic congestion prediction unit 122 selects a group corresponding to the group of the learning population data selected by the variable selection unit 116 from a plurality of groups in the prediction population data, and is performed by the dimension compression unit 117. Performs the same coordinate transformation as the coordinate transformation, and substitutes it into the regression equation created by the prediction regression equation creation unit 118.

交通渋滞予測部１２２は、例えば、渋滞予測対象となる日の渋滞の有無を予測する渋滞予測モデルに予測用人口データを当て嵌めることで予測値を得る。予測値は、予測用回帰式作成部１１８で作成された回帰式から得られる０〜１の数値である。交通渋滞予測部１２２は、得られた予測値が予め定められた閾値以上である場合に渋滞ありと判断し、当該予測値が予め定められた閾値未満である場合に渋滞なしと判断する。 The traffic congestion prediction unit 122 obtains a predicted value by, for example, applying the forecasting population data to a traffic congestion prediction model that predicts the presence or absence of traffic congestion on the day of the traffic congestion prediction target. The predicted value is a numerical value of 0 to 1 obtained from the regression formula created by the prediction regression formula creating unit 118. The traffic congestion prediction unit 122 determines that there is a traffic jam when the obtained predicted value is equal to or higher than a predetermined threshold value, and determines that there is no traffic jam when the predicted value is less than a predetermined threshold value.

交通渋滞予測部１２２は、渋滞ありと判断した場合に、渋滞の開始時刻を予測する渋滞予測モデル、渋滞の終了時刻を予測する渋滞予測モデル、渋滞の最大渋滞長を予測する渋滞予測モデル、及び渋滞ピーク時刻を予測する渋滞予測モデルに、予測用人口データを当て嵌めることで予測値を得る。図４（ａ）は、渋滞ありと判断された場合における、交通渋滞予測部１２２からの予測結果の出力例を示している。例えば、図４（ａ）に示されているように、「渋滞発生有無」の項目については「渋滞あり」を示す値が出力され、「開始時刻」、「終了時刻」、「最大渋滞長」、及び「渋滞ピーク時刻」の項目についてはそれぞれ導出された結果を出力する。 The traffic congestion prediction unit 122 determines that there is a traffic jam, a traffic jam prediction model that predicts the start time of the traffic jam, a traffic jam prediction model that predicts the end time of the traffic jam, a traffic jam prediction model that predicts the maximum traffic jam length, and a traffic jam prediction model. Predicted values are obtained by applying the population data for prediction to the traffic jam prediction model that predicts the peak time of traffic jam. FIG. 4A shows an output example of the prediction result from the traffic congestion prediction unit 122 when it is determined that there is congestion. For example, as shown in FIG. 4A, a value indicating "congestion" is output for the item "presence or absence of congestion", and "start time", "end time", and "maximum congestion length" are output. , And for the item of "traffic jam peak time", the derived results are output respectively.

交通渋滞予測部１２２は、渋滞なしと判断した場合に、「開始時刻」、「終了時刻」、「最大渋滞長」、及び「渋滞ピーク時刻」の予測を行わない。図４（ｂ）は、渋滞なしと判断された場合における、交通渋滞予測部１２２からの予測結果の出力例を示している。例えば、図４（ｂ）に示されているように、「渋滞発生有無」の項目については「渋滞なし」を示す値が出力され、予測を行っていない「開始時刻」、「終了時刻」、「最大渋滞長」、及び「渋滞ピーク時刻」の項目はＮＵＬＬとなる。 When it is determined that there is no traffic jam, the traffic jam prediction unit 122 does not predict the "start time", "end time", "maximum traffic jam length", and "traffic jam peak time". FIG. 4B shows an output example of the prediction result from the traffic congestion prediction unit 122 when it is determined that there is no congestion. For example, as shown in FIG. 4B, for the item of "presence or absence of traffic jam", a value indicating "no traffic jam" is output, and "start time", "end time", and "end time", which are not predicted, are output. The items of "maximum traffic jam length" and "traffic jam peak time" are NULL.

交通渋滞予測結果表示部１２３は、交通渋滞予測部１２２によって予測された交通渋滞の予測結果をディスプレイ１２４に表示する。図５（ａ）は、渋滞ありと判断され図４（ａ）に示した値が交通渋滞予測部１２２から出力された場合にディスプレイ１２４に表示される表示例を示している。図５（ｂ）は、渋滞なしと判断された場合にディスプレイ１２４に表示される表示例を示している。 The traffic congestion prediction result display unit 123 displays the traffic congestion prediction result predicted by the traffic congestion prediction unit 122 on the display 124. FIG. 5A shows a display example displayed on the display 124 when it is determined that there is a traffic jam and the value shown in FIG. 4A is output from the traffic jam prediction unit 122. FIG. 5B shows a display example displayed on the display 124 when it is determined that there is no traffic congestion.

次に、図６を参照して、本実施形態に係る渋滞予測モデル作成装置１００の動作について説明する。図６は、渋滞予測モデル作成装置１００が行う処理を示すフローチャートである。 Next, the operation of the traffic jam prediction model creating device 100 according to the present embodiment will be described with reference to FIG. FIG. 6 is a flowchart showing a process performed by the traffic jam prediction model creating device 100.

ステップＳ１０１において、渋滞予測モデル作成装置１００は、渋滞予測位置取得部１２０によって渋滞予測位置を取得する。続いて、渋滞予測モデル作成装置１００は、ステップＳ１０２に処理を進める。 In step S101, the traffic jam prediction model creating device 100 acquires the traffic jam prediction position by the traffic jam prediction position acquisition unit 120. Subsequently, the traffic jam prediction model creating device 100 proceeds to the process in step S102.

ステップＳ１０２において、渋滞予測モデル作成装置１００は、予測用人口データ取得部１２１によって予測用人口データを取得する。続いて、渋滞予測モデル作成装置１００は、ステップＳ１０３に処理を進める。 In step S102, the traffic jam prediction model creating device 100 acquires the forecasting population data by the forecasting population data acquisition unit 121. Subsequently, the traffic jam prediction model creating device 100 proceeds to the process in step S103.

ステップＳ１０３において、渋滞予測モデル作成装置１００は、ステップＳ１０１で取得された渋滞予測位置に対応する渋滞予測モデルが渋滞予測モデル格納部１１９に存在するか判定する。ステップＳ１０１で取得された渋滞予測位置に対応する渋滞予測モデルが渋滞予測モデル格納部１１９に存在する場合（ステップＳ１０３でＹＥＳ）、渋滞予測モデル作成装置１００は、ステップＳ１０５に処理を進める。ステップＳ１０１で取得された渋滞予測位置に対応する渋滞予測モデルが渋滞予測モデル格納部１１９に存在しない場合（ステップＳ１０３でＮＯ）、渋滞予測モデル作成装置１００は、ステップＳ１０４に処理を進める。 In step S103, the traffic jam prediction model creating device 100 determines whether or not the traffic jam prediction model corresponding to the traffic jam prediction position acquired in step S101 exists in the traffic jam prediction model storage unit 119. When the traffic jam prediction model corresponding to the traffic jam prediction position acquired in step S101 exists in the traffic jam prediction model storage unit 119 (YES in step S103), the traffic jam prediction model creating device 100 proceeds to the process in step S105. When the traffic jam prediction model corresponding to the traffic jam prediction position acquired in step S101 does not exist in the traffic jam prediction model storage unit 119 (NO in step S103), the traffic jam prediction model creating device 100 proceeds to the process in step S104.

ステップＳ１０４において、渋滞予測モデル作成装置１００は、後述する渋滞予測モデル構築処理を行う。続いて、渋滞予測モデル作成装置１００は、ステップＳ１０５に処理を進める。 In step S104, the traffic jam prediction model creating device 100 performs the traffic jam prediction model construction process described later. Subsequently, the traffic jam prediction model creating device 100 proceeds to step S105.

ステップＳ１０５において、渋滞予測モデル作成装置１００は、ステップＳ１０１で取得された渋滞予測位置と、ステップＳ１０２で取得された予測用人口データと、渋滞予測モデル格納部１１９に格納されている渋滞予測モデルとから、渋滞予測位置における交通渋滞の予測を行う。続いて、渋滞予測モデル作成装置１００は、ステップＳ１０６に処理を進める。 In step S105, the traffic jam prediction model creating device 100 includes the traffic jam prediction position acquired in step S101, the forecast population data acquired in step S102, and the traffic jam prediction model stored in the traffic jam prediction model storage unit 119. Therefore, the traffic congestion at the congestion prediction position is predicted. Subsequently, the traffic jam prediction model creating device 100 proceeds to the process in step S106.

ステップＳ１０６において、渋滞予測モデル作成装置１００は、ステップＳ１０５において行った交通渋滞の予測結果をディスプレイ１２４に表示する。渋滞予測モデル作成装置１００は、ステップＳ１０６が終了すると、全体の処理を終了する。 In step S106, the traffic congestion prediction model creating device 100 displays the traffic congestion prediction result performed in step S105 on the display 124. When the traffic jam prediction model creating device 100 finishes step S106, the entire process is finished.

次に、図７を参照して、ステップＳ１０４における渋滞予測モデル構築処理について説明する。図７は、渋滞予測モデル作成装置１００が行う渋滞予測モデル構築処理を示すフローチャートである。 Next, the congestion prediction model construction process in step S104 will be described with reference to FIG. 7. FIG. 7 is a flowchart showing a traffic jam prediction model construction process performed by the traffic jam prediction model creation device 100.

ステップＳ１１１において、渋滞予測モデル作成装置１００は、学習用人口データが学習用人口データ格納部１１２に存在するか判定する。学習用人口データが学習用人口データ格納部１１２に存在する場合（ステップＳ１１１でＹＥＳ）、渋滞予測モデル作成装置１００は、ステップＳ１１４に処理を進める。学習用人口データが学習用人口データ格納部１１２に存在しない場合（ステップＳ１１１でＮＯ）、渋滞予測モデル作成装置１００は、ステップＳ１１２に処理を進める。 In step S111, the congestion prediction model creating device 100 determines whether the learning population data exists in the learning population data storage unit 112. When the learning population data exists in the learning population data storage unit 112 (YES in step S111), the congestion prediction model creating device 100 proceeds to the process in step S114. When the learning population data does not exist in the learning population data storage unit 112 (NO in step S111), the congestion prediction model creating device 100 proceeds to the process in step S112.

ステップＳ１１２において、渋滞予測モデル作成装置１００は、学習用人口データ取得部１１１によって予め定められたサンプル期間内にサンプリングされた学習用人口データを取得し、ステップＳ１１３に処理を進める。 In step S112, the congestion prediction model creating device 100 acquires the learning population data sampled within the sample period predetermined by the learning population data acquisition unit 111, and proceeds to the process in step S113.

ステップＳ１１３において、渋滞予測モデル作成装置１００は、ステップＳ１１２で取得した学習用人口データを学習用人口データ格納部１１２に格納し、ステップＳ１１４に処理を進める。 In step S113, the congestion prediction model creating device 100 stores the learning population data acquired in step S112 in the learning population data storage unit 112, and proceeds to the process in step S114.

ステップＳ１１４において、渋滞予測モデル作成装置１００は、ステップＳ１０１で取得した渋滞予測位置の学習用渋滞データが学習用渋滞データ格納部１１４に存在するか判定する。渋滞予測モデル作成装置１００は、ステップＳ１０１で取得した渋滞予測位置の学習用渋滞データが学習用渋滞データ格納部１１４に存在する場合（ステップＳ１１４でＹＥＳ）、ステップＳ１１７に処理を進める。渋滞予測モデル作成装置１００は、ステップＳ１０１で取得した渋滞予測位置の学習用渋滞データが学習用渋滞データ格納部１１４に存在しない場合（ステップＳ１１４でＮＯ）、ステップＳ１１５に処理を進める。 In step S114, the traffic jam prediction model creating device 100 determines whether the learning traffic jam data of the traffic jam prediction position acquired in step S101 exists in the learning traffic jam data storage unit 114. When the learning congestion data of the congestion prediction position acquired in step S101 exists in the learning congestion data storage unit 114 (YES in step S114), the congestion prediction model creating device 100 proceeds to the process in step S117. The congestion prediction model creating device 100 proceeds to step S115 when the learning congestion data of the congestion prediction position acquired in step S101 does not exist in the learning congestion data storage unit 114 (NO in step S114).

ステップＳ１１５において、渋滞予測モデル作成装置１００は、学習用渋滞データ取得部１１３によって、予め定められたサンプリング期間内にサンプリングされた、ステップＳ１０１で取得した渋滞予測位置における学習用渋滞データを取得し、ステップＳ１１６に処理を進める。 In step S115, the congestion prediction model creating device 100 acquires the learning congestion data at the congestion prediction position acquired in step S101, which is sampled within a predetermined sampling period by the learning congestion data acquisition unit 113. The process proceeds to step S116.

ステップＳ１１６において、渋滞予測モデル作成装置１００は、ステップＳ１１５で取得した学習用渋滞データを学習用渋滞データ格納部１１４に格納し、ステップＳ１１７に処理を進める。 In step S116, the congestion prediction model creating device 100 stores the learning congestion data acquired in step S115 in the learning congestion data storage unit 114, and proceeds to the process in step S117.

ステップＳ１１７において、渋滞予測モデル作成装置１００は、学習用人口データ格納部１１２からサンプリング期間内の学習用人口データを取得し、学習用渋滞データ格納部１１４からサンプリング期間内の渋滞予測位置における学習用渋滞データを取得する。続いて、渋滞予測モデル作成装置１００は、ステップＳ１１８に処理を進める。 In step S117, the congestion prediction model creation device 100 acquires learning population data within the sampling period from the learning population data storage unit 112, and learns from the learning congestion data storage unit 114 at the congestion prediction position within the sampling period. Get traffic jam data. Subsequently, the traffic jam prediction model creating device 100 proceeds to the process in step S118.

ステップＳ１１８において、渋滞予測モデル作成装置１００は、変数選択部１１６で回帰分析を行うことで、渋滞予測位置における渋滞データに対する寄与度が相対的に高いグループの学習用人口データを選択する。続いて、渋滞予測モデル作成装置１００は、ステップＳ１１９に処理を進める。 In step S118, the traffic jam prediction model creating device 100 selects the learning population data of the group having a relatively high contribution to the traffic jam data at the traffic jam prediction position by performing regression analysis in the variable selection unit 116. Subsequently, the traffic jam prediction model creating device 100 proceeds to step S119.

ステップＳ１１９において、渋滞予測モデル作成装置１００は、次元圧縮部１１７で回帰分析を行うことで、次元圧縮を行う。続いて、渋滞予測モデル作成装置１００は、ステップＳ１２０に処理を進める。 In step S119, the congestion prediction model creating device 100 performs dimensional compression by performing regression analysis in the dimensional compression unit 117. Subsequently, the traffic jam prediction model creating device 100 proceeds to the process in step S120.

ステップＳ１２０において、渋滞予測モデル作成装置１００は、次元圧縮部１１７で次元圧縮を行った学習用人口データと学習用渋滞データとに基づいて、渋滞予測モデルである予測用回帰式を作成する。続いて、渋滞予測モデル作成装置１００は、ステップＳ１２１に処理を進める。 In step S120, the congestion prediction model creating device 100 creates a prediction regression equation, which is a congestion prediction model, based on the learning population data and the learning congestion data that have been dimensionally compressed by the dimension compression unit 117. Subsequently, the traffic jam prediction model creating device 100 proceeds to the process in step S121.

ステップＳ１２１において、渋滞予測モデル作成装置１００は、変数選択、次元圧縮、予測用回帰式の作成の３つのステップ（ステップＳ１１８、ステップＳ１１９、ステップＳ１２０）で構築された渋滞予測モデルを渋滞予測モデル格納部１１９に格納する。渋滞予測モデル作成装置１００は、ステップＳ１２１が終了すると、渋滞予測モデル構築処理を終了する。 In step S121, the congestion prediction model creating device 100 stores the congestion prediction model constructed in the three steps (step S118, step S119, and step S120) of variable selection, dimension compression, and creation of the prediction regression equation. Store in unit 119. When the traffic jam prediction model creating device 100 finishes step S121, the traffic jam prediction model building process ends.

次に、本実施形態に係る渋滞予測モデル作成装置１００の作用及び効果について説明する。 Next, the operation and effect of the traffic congestion prediction model creating device 100 according to the present embodiment will be described.

渋滞予測モデル作成装置１００は、人口データと渋滞データとに基づいて、回帰分析で渋滞予測モデルを構築する。人口データを用いることで、交通渋滞の様々な要因に対応し得る。このため、渋滞予測の誤差を低減することができる。例えば、人口データは、現在走行状態にないが予測時刻に走行状態になる自動車の数にも相関している。渋滞予測モデル作成装置１００は、上記渋滞予測モデルと交通渋滞の予測に用いる予測用人口データとに基づいて交通渋滞を予測するため、渋滞予測の誤差を低減して渋滞予測の精度が向上され得る。 The traffic jam prediction model creating device 100 builds a traffic jam prediction model by regression analysis based on the population data and the traffic jam data. By using population data, it is possible to deal with various factors of traffic congestion. Therefore, it is possible to reduce the error of the congestion prediction. For example, population data also correlates with the number of vehicles that are not currently in driving but will be in driving at the predicted time. Since the traffic congestion prediction model creating device 100 predicts traffic congestion based on the above-mentioned congestion prediction model and the population data for prediction used for predicting traffic congestion, it is possible to reduce the error of the congestion prediction and improve the accuracy of the congestion prediction. ..

また、上述したように、本実施形態では、人の位置に相当する「位置識別番号」と、人の属性に相当する、「性別」、「年代」、及び「居住地」とに応じてグループを分類する。本実施形態では、「位置識別番号」は約１５０万通り、「性別」は２通り、「年代」は１３通り、及び「居住地」は約２０００通りある。この場合、人口データは、約１５０万×２×１３×約２０００のグループに分類され得る。すなわち、人の位置及び属性の組み合わせを変数とする空間は、理論上、約８００億次元に及ぶ。これらのグループの人口データの多くは、予測対象位置の渋滞への影響が非常に小さく予測のノイズとなる。 Further, as described above, in the present embodiment, there are groups according to the "position identification number" corresponding to the position of the person and the "gender", "age", and "place of residence" corresponding to the attribute of the person. To classify. In this embodiment, there are about 1.5 million "position identification numbers", 2 "genders", 13 "ages", and about 2000 "residential areas". In this case, the population data can be divided into groups of about 1.5 million x 2 x 13 x about 2000. That is, the space in which the combination of the position and the attribute of a person is a variable theoretically extends to about 80 billion dimensions. Most of the population data of these groups has a very small effect on the congestion of the predicted target position and becomes noise for prediction.

渋滞予測モデル作成装置１００は、回帰分析を用いて、渋滞予測位置における渋滞データに対する寄与度が高いグループの人口データを選択し、選択された人口データと渋滞データとに基づいて回帰分析で渋滞予測モデルを構築する。すなわち、渋滞予測モデル作成装置１００は、渋滞に関連のあるグループの人口データ（説明変数）を選択する。このため、渋滞予測の精度が確保されながら、渋滞予測モデルの構築に要する演算量の増加が抑制され得る。例えば、東京の道路の渋滞に沖縄にいる人はほとんど影響しないため、人の位置が沖縄であるグループ以外の人口データを選択することで、渋滞予測モデルの構築に要する演算量を低減できる。以上のように、渋滞予測モデル作成装置１００では、渋滞予測精度の向上と演算量の増加の抑制とが両立され得る。 The traffic jam prediction model creation device 100 uses regression analysis to select population data of a group having a high contribution to the traffic jam data at the traffic jam prediction position, and predicts traffic jam by regression analysis based on the selected population data and the traffic jam data. Build a model. That is, the traffic jam prediction model creating device 100 selects the population data (explanatory variable) of the group related to the traffic jam. Therefore, while ensuring the accuracy of the traffic jam prediction, it is possible to suppress an increase in the amount of calculation required for constructing the traffic jam prediction model. For example, since people in Okinawa have almost no effect on road congestion in Tokyo, the amount of calculation required to build a congestion prediction model can be reduced by selecting population data other than the group in which the position of the person is Okinawa. As described above, in the traffic jam prediction model creating device 100, improvement of the traffic jam prediction accuracy and suppression of an increase in the amount of calculation can be achieved at the same time.

人口データにおける人の属性は、当該人（サンプリング対象者）の居住地を含んでいる。この場合、居住地に帰る人によって生じる渋滞、すなわち、所謂Ｕターンラッシュの予測を行うことができる。例えば、お昼に木更津のアウトレットモールにいる東京及び神奈川に居住の人の人口から、アクアラインの夕方の渋滞を予測することができる。 The attributes of a person in the population data include the place of residence of the person (sampling target). In this case, it is possible to predict the congestion caused by the person returning to the place of residence, that is, the so-called U-turn rush. For example, from the population of people living in Tokyo and Kanagawa at the outlet mall in Kisarazu at noon, it is possible to predict the traffic congestion in the evening of Aqualine.

渋滞予測モデル作成装置１００は、渋滞予測位置を取得し、取得された渋滞予測位置に対応する渋滞予測モデルが渋滞予測モデル格納部１１９に存在しない場合に、取得された渋滞予測位置に対応する渋滞予測モデルを構築する。すなわち、必要に応じて渋滞予測モデルを構築するため、渋滞予測モデルを構築する演算量が低減され得る。また、最新の学習用人口データを用いて、渋滞予測モデルを構築することができる。 The traffic jam prediction model creating device 100 acquires the traffic jam prediction position, and when the traffic jam prediction model corresponding to the acquired traffic jam prediction position does not exist in the traffic jam prediction model storage unit 119, the traffic jam corresponding to the acquired traffic jam prediction position. Build a predictive model. That is, since the congestion prediction model is constructed as needed, the amount of calculation for constructing the congestion prediction model can be reduced. In addition, a congestion prediction model can be constructed using the latest learning population data.

人口の増減数が同じでも、普段の人口の規模が大きい地域ほど、人口の増減数と渋滞との関連性は低いと考えられる。例えば、普段の人口が１００００人の地域と普段の人口が５００人の地域とでは、５００人の増減があった場合の渋滞発生への影響度合いは異なると想定される。変数選択部１１６は、学習用人口データの各グループにおける人口を、上記サンプリング期間内の全時刻に関し、各グループにおいて平均が０で分散が１となる値に変換する。このように、上記サンプリング期間内の学習用人口データをグループごとに正規化することよって、各グループの普段の人口の規模に応じた人口の増減の割合が適切に判断され得る。したがって、上述したように学習用人口データを正規化した後に、変数選択を行うことで、各グループの普段の人口の規模を考慮して渋滞予測の精度を更に向上することができる。 Even if the number of population increases / decreases is the same, it is considered that the larger the size of the population, the lower the relationship between the number of population increases / decreases and traffic congestion. For example, it is assumed that an area with a normal population of 10,000 and an area with a normal population of 500 have different degrees of influence on the occurrence of traffic congestion when an increase or decrease of 500 people occurs. The variable selection unit 116 converts the population in each group of the learning population data into a value in which the average is 0 and the variance is 1 in each group with respect to all the times within the sampling period. In this way, by normalizing the learning population data within the sampling period for each group, the rate of increase or decrease in the population according to the size of the normal population of each group can be appropriately determined. Therefore, by performing variable selection after normalizing the learning population data as described above, it is possible to further improve the accuracy of traffic congestion prediction in consideration of the size of the normal population of each group.

なお、上記実施の形態の説明に用いたブロック図は、機能単位のブロックを示している。これらの機能ブロック（構成部）は、ハードウェア及び／又はソフトウェアの任意の組み合わせによって実現される。また、各機能ブロックの実現手段は特に限定されない。すなわち、各機能ブロックは、物理的及び／又は論理的に結合した１つの装置により実現されてもよいし、物理的及び／又は論理的に分離した２つ以上の装置を直接的及び／又は間接的に(例えば、有線及び／又は無線)で接続し、これら複数の装置により実現されてもよい。 The block diagram used in the description of the above embodiment shows a block of functional units. These functional blocks (components) are realized by any combination of hardware and / or software. Further, the means for realizing each functional block is not particularly limited. That is, each functional block may be realized by one physically and / or logically coupled device, or directly and / or indirectly by two or more physically and / or logically separated devices. (For example, wired and / or wireless) may be connected and realized by these a plurality of devices.

例えば、本発明の一実施の形態における渋滞予測モデル作成装置は、本実施形態における処理を行うコンピュータとして機能してもよい。図８は、本発明の一実施の形態に係る渋滞予測モデル作成装置のハードウェア構成の一例を示す図である。上述した渋滞予測モデル作成装置１００は、物理的には、プロセッサ１００１、メモリ１００２、ストレージ１００３、通信装置１００４、入力装置１００５、出力装置１００６、バス１００７などを含むコンピュータ装置として構成されてもよい。 For example, the congestion prediction model creating device according to the embodiment of the present invention may function as a computer that performs the processing according to the present embodiment. FIG. 8 is a diagram showing an example of a hardware configuration of a traffic jam prediction model creating device according to an embodiment of the present invention. The congestion prediction model creating device 100 described above may be physically configured as a computer device including a processor 1001, a memory 1002, a storage 1003, a communication device 1004, an input device 1005, an output device 1006, a bus 1007, and the like.

なお、以下の説明では、「装置」という文言は、回路、デバイス、ユニットなどに読み替えることができる。渋滞予測モデル作成装置１００のハードウェア構成は、図に示した各装置を１つ又は複数含むように構成されてもよいし、一部の装置を含まずに構成されてもよい。 In the following description, the word "device" can be read as a circuit, a device, a unit, or the like. The hardware configuration of the traffic congestion prediction model creating device 100 may be configured to include one or more of the devices shown in the figure, or may be configured not to include some of the devices.

渋滞予測モデル作成装置１００における各機能は、プロセッサ１００１、メモリ１００２などのハードウェア上に所定のソフトウェア（プログラム）を読み込ませることで、プロセッサ１００１が演算を行い、通信装置１００４による通信や、メモリ１００２及びストレージ１００３におけるデータの読み出し及び／又は書き込みを制御することで実現される。 For each function of the congestion prediction model creating device 100, by loading predetermined software (program) on hardware such as the processor 1001 and the memory 1002, the processor 1001 performs an operation, and communication by the communication device 1004 and the memory 1002 are performed. And by controlling the reading and / or writing of data in the storage 1003.

プロセッサ１００１は、例えば、オペレーティングシステムを動作させてコンピュータ全体を制御する。プロセッサ１００１は、周辺装置とのインターフェース、制御装置、演算装置、レジスタなどを含む中央処理装置（ＣＰＵ：Central Processing Unit）で構成されてもよい。例えば、変数選択部１１６、次元圧縮部１１７、予測用回帰式作成部１１８、交通渋滞予測部１２２、渋滞予測結果表示部１２３などは、プロセッサ１００１で実現されてもよい。 Processor 1001 operates, for example, an operating system to control the entire computer. The processor 1001 may be composed of a central processing unit (CPU) including an interface with a peripheral device, a control device, an arithmetic unit, a register, and the like. For example, the variable selection unit 116, the dimension compression unit 117, the prediction regression equation creation unit 118, the traffic congestion prediction unit 122, the congestion prediction result display unit 123, and the like may be realized by the processor 1001.

また、プロセッサ１００１は、プログラム（プログラムコード）、ソフトウェアモジュールやデータを、ストレージ１００３及び／又は通信装置１００４からメモリ１００２に読み出し、これらに従って各種の処理を実行する。プログラムとしては、上述の実施の形態で説明した動作の少なくとも一部をコンピュータに実行させるプログラムが用いられる。例えば、変数選択部１１６、次元圧縮部１１７、予測用回帰式作成部１１８、交通渋滞予測部１２２、渋滞予測結果表示部１２３は、メモリ１００２に格納され、プロセッサ１００１で動作する制御プログラムによって実現されてもよく、他の機能ブロックについても同様に実現されてもよい。上述の各種処理は、１つのプロセッサ１００１で実行される旨を説明してきたが、２以上のプロセッサ１００１により同時又は逐次に実行されてもよい。プロセッサ１００１は、１以上のチップで実装されてもよい。なお、プログラムは、電気通信回線を介してネットワークから送信されても良い。 Further, the processor 1001 reads a program (program code), a software module, and data from the storage 1003 and / or the communication device 1004 into the memory 1002, and executes various processes according to these. As the program, a program that causes a computer to execute at least a part of the operations described in the above-described embodiment is used. For example, the variable selection unit 116, the dimension compression unit 117, the regression equation creation unit 118 for prediction, the traffic congestion prediction unit 122, and the traffic congestion prediction result display unit 123 are stored in the memory 1002 and realized by a control program that operates on the processor 1001. It may be realized in the same manner for other functional blocks. Although it has been described that the various processes described above are executed by one processor 1001, they may be executed simultaneously or sequentially by two or more processors 1001. Processor 1001 may be mounted on one or more chips. The program may be transmitted from the network via a telecommunication line.

メモリ１００２は、コンピュータ読み取り可能な記録媒体であり、例えば、ＲＯＭ（Read Only Memory）、ＥＰＲＯＭ（Erasable Programmable ＲＯＭ）、ＥＥＰＲＯＭ（Electrically Erasable Programmable ＲＯＭ）、ＲＡＭ（Random Access Memory）などの少なくとも１つで構成されてもよい。メモリ１００２は、レジスタ、キャッシュ、メインメモリ（主記憶装置）などと呼ばれてもよい。メモリ１００２は、本発明の一実施の形態に係る方法を実施するために実行可能なプログラム（プログラムコード）、ソフトウェアモジュールなどを保存することができる。 The memory 1002 is a computer-readable recording medium, and is composed of at least one such as a ROM (Read Only Memory), an EPROM (Erasable Programmable ROM), an EPROM (Electrically Erasable Programmable ROM), and a RAM (Random Access Memory). May be done. The memory 1002 may be referred to as a register, a cache, a main memory (main storage device), or the like. The memory 1002 can store a program (program code), a software module, or the like that can be executed to carry out the method according to the embodiment of the present invention.

ストレージ１００３は、コンピュータ読み取り可能な記録媒体であり、例えば、ＣＤ−ＲＯＭ（Compact Disc ＲＯＭ）などの光ディスク、ハードディスクドライブ、フレキシブルディスク、光磁気ディスク(例えば、コンパクトディスク、デジタル多用途ディスク、Ｂｌｕ−ｒａｙ（登録商標）ディスク)、スマートカード、フラッシュメモリ(例えば、カード、スティック、キードライブ)、フロッピー（登録商標）ディスク、磁気ストリップなどの少なくとも１つで構成されてもよい。ストレージ１００３は、補助記憶装置と呼ばれてもよい。上述の記憶媒体は、例えば、メモリ１００２及び／又はストレージ１００３を含むデータベース、サーバその他の適切な媒体であってもよい。 The storage 1003 is a computer-readable recording medium, for example, an optical disk such as a CD-ROM (Compact Disc ROM), a hard disk drive, a flexible disk, an optical magnetic disk (for example, a compact disk, a digital versatile disk, a Blu-ray). It may consist of at least one (registered trademark) disk), smart card, flash memory (eg, card, stick, key drive), floppy (registered trademark) disk, magnetic strip, and the like. The storage 1003 may be referred to as an auxiliary storage device. The storage medium described above may be, for example, a database, server or other suitable medium containing memory 1002 and / or storage 1003.

通信装置１００４は、有線及び／又は無線ネットワークを介してコンピュータ間の通信を行うためのハードウェア（送受信デバイス）であり、例えばネットワークデバイス、ネットワークコントローラ、ネットワークカード、通信モジュールなどともいう。例えば、学習用人口データ取得部１１１、学習用渋滞データ取得部１１３、渋滞予測位置取得部１２０、予測用人口データ取得部１２１などは、通信装置１００４で実現されてもよい。 The communication device 1004 is hardware (transmission / reception device) for performing communication between computers via a wired and / or wireless network, and is also referred to as, for example, a network device, a network controller, a network card, a communication module, or the like. For example, the learning population data acquisition unit 111, the learning congestion data acquisition unit 113, the congestion prediction position acquisition unit 120, the prediction population data acquisition unit 121, and the like may be realized by the communication device 1004.

入力装置１００５は、外部からの入力を受け付ける入力デバイス（例えば、キーボード、マウス、マイクロフォン、スイッチ、ボタン、センサなど）である。出力装置１００６は、外部への出力を実施する出力デバイス（例えば、ディスプレイ、スピーカー、LEDランプなど）である。なお、入力装置１００５及び出力装置１００６は、一体となった構成（例えば、タッチパネル）であってもよい。 The input device 1005 is an input device (for example, a keyboard, a mouse, a microphone, a switch, a button, a sensor, etc.) that receives an input from the outside. The output device 1006 is an output device (for example, a display, a speaker, an LED lamp, etc.) that outputs to the outside. The input device 1005 and the output device 1006 may have an integrated configuration (for example, a touch panel).

また、プロセッサ１００１やメモリ１００２などの各装置は、情報を通信するためのバス１００７で接続される。バス１００７は、単一のバスで構成されてもよいし、装置間で異なるバスで構成されてもよい。 Further, each device such as the processor 1001 and the memory 1002 is connected by a bus 1007 for communicating information. Bus 1007 may be composed of a single bus, or may be composed of different buses between devices.

また、渋滞予測モデル作成装置１００は、マイクロプロセッサ、デジタル信号プロセッサ（ＤＳＰ：Digital Signal Processor）、ＡＳＩＣ（Application Specific Integrated Circuit）、ＰＬＤ（Programmable Logic Device）、ＦＰＧＡ（Field Programmable Gate Array）などのハードウェアを含んで構成されてもよく、当該ハードウェアにより、各機能ブロックの一部又は全てが実現されてもよい。例えば、プロセッサ１００１は、これらのハードウェアの少なくとも１つで実装されてもよい。 Further, the congestion prediction model creation device 100 is hardware such as a microprocessor, a digital signal processor (DSP: Digital Signal Processor), an ASIC (Application Specific Integrated Circuit), a PLD (Programmable Logic Device), and an FPGA (Field Programmable Gate Array). It may be configured to include, and a part or all of each functional block may be realized by the hardware. For example, processor 1001 may be implemented on at least one of these hardware.

以上、本発明について詳細に説明したが、当業者にとっては、本発明が本明細書中に説明した実施形態に限定されるものではないということは明らかである。本発明は、特許請求の範囲の記載により定まる本発明の趣旨及び範囲を逸脱することなく修正及び変更態様として実施することができる。したがって、本明細書の記載は、例示説明を目的とするものであり、本発明に対して何ら制限的な意味を有するものではない。 Although the present invention has been described in detail above, it is clear to those skilled in the art that the present invention is not limited to the embodiments described herein. The present invention can be implemented as modifications and modifications without departing from the spirit and scope of the invention as defined by the claims. Therefore, the description of the present specification is for the purpose of exemplification and does not have any limiting meaning to the present invention.

例えば、上述した実施形態では、渋滞予測モデル作成装置１００は、渋滞予測モデルを作成し、当該渋滞予測モデルを用いて交通渋滞の予測を行うことで交通渋滞予測装置として機能する。しかし、渋滞予測モデルの作成と交通渋滞の予測とを別の装置で行う構成であってもよい。 For example, in the above-described embodiment, the traffic congestion prediction model creating device 100 functions as a traffic congestion prediction device by creating a traffic congestion prediction model and predicting traffic congestion using the traffic congestion prediction model. However, a configuration may be configured in which the creation of the traffic congestion prediction model and the prediction of traffic congestion are performed by different devices.

学習用人口データ及び予測用人口データは、現実の人口データであってもよいし、架空の人口データであってもよい。 The learning population data and the forecasting population data may be real population data or fictitious population data.

上述した実施形態では、学習用人口データ取得部１１１が、外部から学習用人口データを取得し、学習用人口データ格納部１１２に格納する構成について説明したが、これに限定されない。例えば、学習用人口データ格納部１１２が予め学習用人口データを格納しており、学習用人口データ取得部１１１が学習用人口データ格納部１１２から学習用人口データを取得して渋滞予測モデル構築部１１５に当該データを渡す構成であってもよい。 In the above-described embodiment, the configuration in which the learning population data acquisition unit 111 acquires the learning population data from the outside and stores it in the learning population data storage unit 112 has been described, but the present invention is not limited to this. For example, the learning population data storage unit 112 stores the learning population data in advance, and the learning population data acquisition unit 111 acquires the learning population data from the learning population data storage unit 112 and is a congestion prediction model construction unit. The data may be passed to 115.

上述した実施形態では、学習用渋滞データ取得部１１３が、外部から学習用渋滞データを取得し、学習用渋滞データ格納部１１４に格納する構成について説明したが、これに限定されない。例えば、学習用渋滞データ格納部１１４が予め渋滞データを格納しており、学習用渋滞データ取得部１１３が学習用渋滞データ格納部１１４から学習用渋滞データを取得して渋滞予測モデル構築部１１５に当該データを渡す構成であってもよい。 In the above-described embodiment, the configuration in which the learning congestion data acquisition unit 113 acquires the learning congestion data from the outside and stores it in the learning congestion data storage unit 114 has been described, but the present invention is not limited to this. For example, the learning congestion data storage unit 114 stores the congestion data in advance, and the learning congestion data acquisition unit 113 acquires the learning congestion data from the learning congestion data storage unit 114 and causes the congestion prediction model construction unit 115. It may be configured to pass the data.

上述した実施形態では、学習用人口データ取得部１１１は、各日で正午にサンプリングされた人口データを取得するが、数日置きの人口データを取得してもよいし、各日で複数の時刻にサンプリングされた人口データを取得してもよい。学習用人口データ取得部１１１が各日で複数の時刻にサンプリングされた人口データを取得する場合、人口データが分類され得るグループの数は更に増加する。この場合であっても、渋滞予測モデル作成装置１００は、渋滞に関連のあるグループの人口データを選択するため、渋滞予測モデルの構築に要する演算量の増加が抑制され得る。 In the above-described embodiment, the learning population data acquisition unit 111 acquires the population data sampled at noon on each day, but may acquire the population data every few days, or at a plurality of times on each day. Population data sampled in may be obtained. When the learning population data acquisition unit 111 acquires population data sampled at a plurality of times on each day, the number of groups in which the population data can be classified further increases. Even in this case, since the traffic jam prediction model creating device 100 selects the population data of the group related to the traffic jam, the increase in the amount of calculation required for constructing the traffic jam prediction model can be suppressed.

予測用人口データ取得部１２１によって取得される人口データのサンプリング時刻は、学習用人口データ取得部１１１によって取得された人口データのサンプリング時刻に対応していればよい。したがって、予測用人口データ取得部１２１によって取得される人口データは、正午にサンプリングされた人口データに限定されない。 The sampling time of the population data acquired by the prediction population data acquisition unit 121 may correspond to the sampling time of the population data acquired by the learning population data acquisition unit 111. Therefore, the population data acquired by the forecasting population data acquisition unit 121 is not limited to the population data sampled at noon.

上述した、渋滞予測モデル構築部１１５は、予測日の交通渋滞を予測するものであるとしたが、それ以外の予め定めた所定時間幅内における交通渋滞を予測する渋滞予測モデルを構築するように構成されてもよい。この場合、変数選択部１１６、次元圧縮部１１７、予測用回帰式作成部１１８では、当該所定時間幅に応じた渋滞データを学習用渋滞データとして用いる。 The traffic congestion prediction model construction unit 115 described above predicts the traffic congestion on the prediction date, but other than that, the traffic congestion prediction model for predicting the traffic congestion within a predetermined time width is constructed. It may be configured. In this case, the variable selection unit 116, the dimension compression unit 117, and the prediction regression equation creation unit 118 use the congestion data according to the predetermined time width as the learning congestion data.

上述した実施形態では、渋滞予測モデル構築部１１５は、次元圧縮部１１７によって次元圧縮を行った。しかし、渋滞予測モデル構築部１１５が次元圧縮部１１７を有していなくともよく、次元圧縮は行われなくてもよい。 In the above-described embodiment, the congestion prediction model construction unit 115 performs dimensional compression by the dimensional compression unit 117. However, the congestion prediction model construction unit 115 does not have to have the dimensional compression unit 117, and the dimensional compression may not be performed.

上述した実施形態では、変数選択部１１６は、学習用人口データをグループごとに正規化した後に、変数選択を行った。しかし、学習用人口データの正規化は、行われなくてもよい。 In the above-described embodiment, the variable selection unit 116 performs variable selection after normalizing the learning population data for each group. However, normalization of learning population data does not have to be performed.

渋滞予測モデル作成装置１００は、渋滞予測位置を取得する度に、取得された渋滞予測位置に対応する渋滞予測モデルを構築してもよい。これによって、毎回、最新の学習用人口データを用いて、渋滞予測モデルを構築することができる。 The traffic jam prediction model creating device 100 may build a traffic jam prediction model corresponding to the acquired traffic jam prediction position each time the traffic jam prediction position is acquired. This makes it possible to build a congestion prediction model using the latest learning population data each time.

本明細書で説明した各態様／実施形態は、ＬＴＥ（Long Term Evolution）、ＬＴＥ−Ａ（LTE-Advanced）、ＳＵＰＥＲ３Ｇ、ＩＭＴ−Ａｄｖａｎｃｅｄ、４Ｇ、５Ｇ、ＦＲＡ（Future Radio Access）、Ｗ−ＣＤＭＡ（登録商標）、ＧＳＭ（登録商標）、ＣＤＭＡ２０００、ＵＭＢ（Ultra Mobile Broadband）、ＩＥＥＥ８０２．１１（Ｗｉ−Ｆｉ）、ＩＥＥＥ８０２．１６（ＷｉＭＡＸ）、ＩＥＥＥ８０２．２０、ＵＷＢ（Ultra-WideBand）、Ｂｌｕｅｔｏｏｔｈ（登録商標）、その他の適切なシステムを利用するシステム及び／又はこれらに基づいて拡張された次世代システムに適用されてもよい。 Each aspect / embodiment described herein includes LTE (Long Term Evolution), LTE-A (LTE-Advanced), SUPER 3G, IMT-Advanced, 4G, 5G, FRA (Future Radio Access), W-CDMA. (Registered Trademarks), GSM (Registered Trademarks), CDMA2000, UMB (Ultra Mobile Broadband), IEEE 802.11 (Wi-Fi), IEEE 802.16 (WiMAX), IEEE 802.20, UWB (Ultra-WideBand), It may be applied to systems utilizing Bluetooth®, other suitable systems and / or next-generation systems extended based on them.

本明細書で説明した各態様／実施形態の処理手順、シーケンス、フローチャートなどは、矛盾の無い限り、順序を入れ替えてもよい。例えば、本明細書で説明した方法については、例示的な順序で様々なステップの要素を提示しており、提示した特定の順序に限定されない。 The order of the processing procedures, sequences, flowcharts, etc. of each aspect / embodiment described in the present specification may be changed as long as there is no contradiction. For example, the methods described herein present elements of various steps in an exemplary order, and are not limited to the particular order presented.

入出力された情報等は特定の場所(例えば、メモリ)に保存されてもよいし、管理テーブルで管理してもよい。入出力される情報等は、上書き、更新、または追記され得る。出力された情報等は削除されてもよい。入力された情報等は他の装置へ送信されてもよい。 The input / output information and the like may be stored in a specific location (for example, memory) or may be managed by a management table. Input / output information and the like can be overwritten, updated, or added. The output information and the like may be deleted. The input information or the like may be transmitted to another device.

判定は、１ビットで表される値（０か１か）によって行われてもよいし、真偽値（Boolean：trueまたはfalse）によって行われてもよいし、数値の比較（例えば、所定の値との比較）によって行われてもよい。 The determination may be made by a value represented by 1 bit (0 or 1), by a true / false value (Boolean: true or false), or by comparing numerical values (for example, a predetermined value). It may be done by comparison with the value).

本明細書で説明したデータは、様々な異なる技術のいずれかを使用して表されてもよい。例えば、上記の説明全体に渡って言及され得るデータは、電圧、電流、電磁波、磁界若しくは磁性粒子、光場若しくは光子、又はこれらの任意の組み合わせによって表されてもよい。 The data described herein may be represented using any of a variety of different techniques. For example, the data that can be referred to throughout the above description may be represented by voltage, current, electromagnetic waves, magnetic fields or magnetic particles, light fields or photons, or any combination thereof.

また、本明細書で説明したデータ、パラメータなどは、絶対値で表されてもよいし、所定の値からの相対値で表されてもよいし、対応する別の情報で表されてもよい。例えば、無線リソースはインデックスで指示されるものであってもよい。上述したパラメータに使用する名称はいかなる点においても限定的なものではない。 Further, the data, parameters, etc. described in the present specification may be represented by absolute values, relative values from predetermined values, or other corresponding information. .. For example, the radio resource may be indexed. The names used for the above parameters are not limited in any way.

本明細書で使用する「判断(determining)」、「決定(determining)」という用語は、多種多様な動作を包含する場合がある。「判断」、「決定」は、例えば、判定（judging）、計算(calculating)、算出(computing)、処理(processing)、導出(deriving)、調査(investigating)、探索(looking up)（例えば、テーブル、データベースまたは別のデータ構造での探索）、確認(ascertaining)した事を「判断」「決定」したとみなす事などを含み得る。また、「判断」、「決定」は、受信(receiving)（例えば、情報を受信すること）、送信(transmitting)(例えば、情報を送信すること)、入力(input)、出力(output)、アクセス(accessing)（例えば、メモリ中のデータにアクセスすること）した事を「判断」「決定」したとみなす事などを含み得る。また、「判断」、「決定」は、解決(resolving)、選択(selecting)、選定(choosing)、確立(establishing)、比較(comparing)などした事を「判断」「決定」したとみなす事を含み得る。つまり、「判断」「決定」は、何らかの動作を「判断」「決定」したとみなす事を含み得る。 The terms "determining" and "determining" as used herein may include a wide variety of actions. "Judgment" and "decision" are, for example, judgment, calculation, computing, processing, deriving, investigating, looking up (for example, table). , Searching in a database or another data structure), ascertaining can be considered as a "judgment" or "decision". Also, "judgment" and "decision" are receiving (for example, receiving information), transmitting (for example, transmitting information), input (input), output (output), and access. (Accessing) (for example, accessing data in memory) may be regarded as "judgment" or "decision". In addition, "judgment" and "decision" mean that the things such as solving, selecting, choosing, establishing, and comparing are regarded as "judgment" and "decision". Can include. That is, "judgment" and "decision" may include considering some action as "judgment" and "decision".

本明細書で使用する「に基づいて」という記載は、別段に明記されていない限り、「のみに基づいて」を意味しない。言い換えれば、「に基づいて」という記載は、「のみに基づいて」と「に少なくとも基づいて」の両方を意味する。 The phrase "based on" as used herein does not mean "based on" unless otherwise stated. In other words, the statement "based on" means both "based only" and "at least based on".

「含む(include)」、「含んでいる（including）」、およびそれらの変形が、本明細書あるいは特許請求の範囲で使用されている限り、これら用語は、用語「備える」と同様に、包括的であることが意図される。さらに、本明細書あるいは特許請求の範囲において使用されている用語「または（or）」は、排他的論理和ではないことが意図される。 As long as "include", "including", and variations thereof are used within the scope of the present specification or claims, these terms are as comprehensive as the term "include". Intended to be targeted. Moreover, the term "or" as used herein or in the claims is intended not to be an exclusive OR.

本開示の全体において、文脈から明らかに単数を示したものではなければ、複数のものを含むものとする。 In the whole of the present disclosure, if the context clearly does not indicate the singular, it shall include more than one.

１００…渋滞予測モデル作成装置、１１１…学習用人口データ取得部、１１３…学習用渋滞データ取得部、１１５…渋滞予測モデル構築部、１２１…予測用人口データ取得部、１２２…交通渋滞予測部。 100 ... Congestion prediction model creation device, 111 ... Learning population data acquisition unit, 113 ... Learning congestion data acquisition unit, 115 ... Congestion prediction model construction unit, 121 ... Prediction population data acquisition unit, 122 ... Traffic congestion prediction unit.

Claims

It is a traffic jam prediction model creation device that creates a traffic jam prediction model used for traffic jam prediction.
A population data acquisition unit that acquires population data of multiple groups classified according to the position and attributes of people,
The traffic jam data acquisition unit that acquires traffic jam data indicating the traffic jam status according to the position,
After normalizing the population data for each group, regression analysis is performed using the normalized population data for each group as an explanatory variable and the congestion data at the congestion prediction position for predicting traffic congestion as an objective variable. Therefore, the population data of the group having a relatively high contribution to the traffic jam data at the traffic jam prediction position is selected from the plurality of groups, and the selected population data and the traffic jam data at the traffic jam prediction position are selected. and a congestion prediction model constructing section for constructing the traffic jam prediction model by based rather regression analysis DOO, traffic jam prediction model generating apparatus.

The traffic congestion prediction model creating device according to claim 1, wherein the attribute includes the place of residence of the person.

The forecasting population data acquisition department, which acquires population data used for forecasting traffic congestion,
The traffic congestion prediction model creating device according to claim 1 or 2, further comprising a traffic congestion prediction unit that predicts traffic congestion based on the congestion prediction model and population data used for predicting traffic congestion.