JP5602283B1

JP5602283B1 - Prediction device, prediction method, and computer program

Info

Publication number: JP5602283B1
Application number: JP2013125933A
Authority: JP
Inventors: 克彦永倉
Original assignee: Yanmar Co Ltd
Current assignee: Yanmar Co Ltd
Priority date: 2013-06-14
Filing date: 2013-06-14
Publication date: 2014-10-08
Anticipated expiration: 2033-06-14
Also published as: WO2014200086A1; JP2015001821A

Abstract

【課題】時系列的に変化する事象に係る値を予測するための予測装置、予測方法及びコンピュータプログラムの提供。
【解決手段】時系列的に変化する事象に係る値と、事象に関連する複数の関連事象に係る項目値とを対応付けて時系列的に記録する記録手段、事象に係る値を対数変換する手段、対数変換した値に対し、項目値を２次変数変換する手段、変換後の事象に係る値及び項目値を用いて、相関の強さを導出する導出手段、及び導出した相関の強さに基づき、予測対象期間における事象に係る予測値を算出する予測手段を備える。
【選択図】図１０A prediction apparatus, a prediction method, and a computer program for predicting a value related to an event that changes in time series are provided.
Recording means for associating values related to events that change in time series with item values related to a plurality of related events related to the events in time series, logarithmically converting the values related to the events Means for converting the item value into a secondary variable for the logarithmically converted value, a derivation means for deriving the strength of the correlation using the value and the item value related to the event after the conversion, and the strength of the derived correlation And a prediction means for calculating a predicted value related to the event in the prediction target period.
[Selection] Figure 10

Description

本発明は、時系列的に変化する事象に係る値を予測するための予測装置、予測方法及びコンピュータプログラムに関する。 The present invention relates to a prediction apparatus, a prediction method, and a computer program for predicting a value related to an event that changes in time series.

製品の需要予測及び販売予測等の経済予測は、会社経営における方向性及び戦略を検証する上で、極めて重要である。そして、予測した需要を販売、在庫、生産、物流、開発等の各部門の計画にどのように結び付けるかが経営課題となっている。更に、需要予測、販売予測等の経済的な事象に対する予測に限らず、経時的に変化する事象についてそれまでの情報を用いて以降の事象について予測することは様々な分野で重要な課題である。 Economic forecasts such as product demand forecasts and sales forecasts are extremely important in examining the direction and strategy in company management. Then, how to connect the predicted demand to the plans of each department such as sales, inventory, production, distribution, and development is a management issue. Furthermore, not only forecasts for economic events such as demand forecasts and sales forecasts, but forecasting future events using information that has changed over time is an important issue in various fields. .

製品の需要台数、又は販売台数等の時系列的に推移する事象の以後の変化を予測するための方法として、種々の時系列分析方法が提案されている。このような分析方法としては、重回帰分析、Ｔ法等の多変量解析が挙げられる（例えば、特許文献１、非特許文献１）。更に、これらの分析方法に対して種々の応用案が提案されている（非特許文献２等）。 Various time-series analysis methods have been proposed as methods for predicting subsequent changes in time-series events such as the number of products demanded or the number of products sold. Examples of such an analysis method include multivariate analysis such as multiple regression analysis and T method (for example, Patent Document 1 and Non-Patent Document 1). Furthermore, various application proposals have been proposed for these analysis methods (Non-Patent Document 2, etc.).

特許第３１４１１６４号公報Japanese Patent No. 3141164

立林和夫編著、手島昌一、長谷川良子著、「入門ＭＴシステム」、日科技連出版社、２００８年１２月Edited by Kazuo Tatebayashi, Shoichi Teshima, Ryoko Hasegawa, “Introductory MT System”, Nikka Giren Shuppansha, December 2008 増田雪也、「非線形成分を考慮したＴ法の研究」、第１７回品質工学研究発表大会論文集、ｐ．４２２−４２５、２００９年Yukiya Masuda, “Research of T Method Considering Nonlinear Components”, Proceedings of 17th Quality Engineering Research Conference, p. 422-425, 2009

Ｔ法は元来、時系列データの予測のための方法ではない。時間軸の概念は特に無く、多数の要因が関連する事象の値に対し、いずれの要因が最も影響があるかを選択するための方法である。Ｔ法を時系列データの予測方法に用いる場合、ある期間における各項目の項目値と信号値との関係を特定して、次の期間の信号値を予測するのである。このため、各項目の項目値における推定誤差が重なって、信号値の予測精度を悪化させるという問題点を有していた。 The T method is not originally a method for prediction of time series data. There is no particular concept of the time axis, and it is a method for selecting which factor has the most influence on the value of an event related to a large number of factors. When the T method is used as a time series data prediction method, the relationship between the item value of each item and the signal value in a certain period is specified, and the signal value in the next period is predicted. For this reason, there has been a problem that estimation errors in the item values of the respective items overlap to deteriorate the prediction accuracy of the signal values.

本発明は、斯かる事情に鑑みてなされたものであり、信号における経時変化の傾向を反映させて予測精度を向上させることができる予測装置、予測方法及びコンピュータプログラムを提供することを目的とする。 The present invention has been made in view of such circumstances, and an object of the present invention is to provide a prediction device, a prediction method, and a computer program capable of improving prediction accuracy by reflecting a tendency of a signal to change over time. .

本願の予測装置は、時系列的に変化する事象に係る値と、該事象に関連する複数の関連事象の夫々を項目とする各項目の項目値とを対応付けて時系列的に記録する記録手段を備え、該記録手段に記録されている期間内の第１期間における前記事象に係る値と、前記第１期間よりも所定期間前の第２期間における前記関連事象に係る項目値との相関の強さに基づき、予測対象期間における前記事象に係る値を予測する予測装置であって、前記事象に係る値を対数変換する手段と、対数変換した値に対し、前記項目に係る項目値を２次変数変換する手段と、変換後の前記事象に係る値及び前記項目に係る項目値を用いて、前記事象に係る値の変化に対する各項目の相関の強さを示す要因効果値を算出する手段と、算出した要因効果値が大きいものから順に選択した複数の項目に対する予測対象との相関の強さを、項目数毎に算出する手段と、算出した相関の強さに基づき、前記予測対象との相関の強さを最大化する項目数を決定する手段と、前記要因効果値が大きいものから順に、決定した項目数分の項目を選択する手段と、選択した項目の項目値に基づき、前記予測対象期間における前記事象に係る予測値を算出する予測手段とを備えることを特徴とする。 The prediction device according to the present application records a value related to an event that changes in time series and an item value of each item that includes each of a plurality of related events related to the event in a time series as a record. A value related to the event in a first period within a period recorded in the recording means, and an item value related to the related event in a second period before the first period. A prediction device that predicts a value related to the event in a prediction target period based on the strength of the correlation, and a unit that logarithmically converts the value related to the event and relates to the item for the logarithmically converted value Factor effect indicating the strength of correlation of each item with respect to a change in the value related to the event, using the means for converting the item value into a secondary variable, the value related to the event after the conversion, and the item value related to the item Means for calculating the value and the factor effect value calculated is large A means for calculating the correlation strength with the prediction target for each of a plurality of items selected in order from the item number, and an item for maximizing the correlation strength with the prediction target based on the calculated correlation strength A means for determining the number, a means for selecting items for the determined number of items in descending order of the factor effect value, and a prediction relating to the event in the prediction target period based on the item values of the selected items And a predicting means for calculating a value.

本願の予測方法は、時系列的に変化する事象に係る値と、該事象に関連する複数の関連事象の夫々を項目とする各項目の項目値とを対応付けて時系列的に記録する記録手段を備えたコンピュータにより、該記録手段に記録されている期間内の第１期間における前記事象に係る値と、前記第１期間よりも所定期間前の第２期間における前記関連事象に係る項目値との相関の強さに基づき、予測対象期間における前記事象に係る値を予測する予測方法であって、前記コンピュータは、前記事象に係る値を対数変換し、対数変換した値に対し、前記項目に係る項目値を２次変数変換し、変換後の前記事象に係る値及び前記項目に係る項目値を用いて、前記事象に係る値の変化に対する各項目の相関の強さを示す要因効果値を算出し、算出した要因効果値が大きいものから順に選択した複数の項目に対する予測対象との相関の強さを、項目数毎に算出し、算出した相関の強さに基づき、前記予測対象との相関の強さを最大化する項目数を決定し、前記要因効果値が大きいものから順に、決定した項目数分の項目を選択し、選択した項目の項目値に基づき、前記予測対象期間における前記事象に係る予測値を算出することを特徴とする。 The prediction method of the present application is a recording in which a value related to an event that changes in time series and an item value of each item that includes each of a plurality of related events related to the event are associated and recorded in time series A value relating to the event in the first period within the period recorded in the recording means by the computer provided with the means, and an item relating to the related event in the second period before the first period. A prediction method for predicting a value related to the event in a prediction target period based on the strength of correlation with the value, wherein the computer logarithmically converts the value related to the event, The item value related to the item is converted into a secondary variable, and the strength of the correlation of each item with respect to the change of the value related to the event is calculated using the value related to the event and the item value related to the item after the conversion. The factor effect value shown is calculated and the calculated factor effect Calculate the strength of the correlation with the prediction target for each item selected in descending order of the value for each number of items, and maximize the strength of the correlation with the prediction target based on the calculated correlation strength The number of items to be determined is determined, the items corresponding to the determined number of items are selected in descending order of the factor effect value , and based on the item value of the selected item, the predicted value related to the event in the prediction target period is determined. It is characterized by calculating.

本願のコンピュータプログラムは、時系列的に変化する事象に係る値と、該事象に関連する複数の関連事象の夫々を項目とする各項目の項目値とを対応付けて時系列的に記録する記録手段を備えたコンピュータに、該記録手段に記録されている期間内の第１期間における前記事象に係る値と、前記第１期間よりも所定期間前の第２期間における前記関連事象に係る項目値との相関の強さに基づき、予測対象期間における前記事象に係る値を予測させるコンピュータプログラムであって、前記コンピュータを、前記事象に係る値を対数変換する手段、対数変換した値に対し、前記項目に係る項目値を２次変数変換する手段、変換後の前記事象に係る値及び前記項目に係る項目値を用いて、前記事象に係る値の変化に対する各項目の相関の強さを示す要因効果値を算出する手段と、算出した要因効果値が大きいものから順に選択した複数の項目に対する予測対象との相関の強さを、項目数毎に算出する手段と、算出した相関の強さに基づき、前記予測対象との相関の強さを最大化する項目数を決定する手段と、前記要因効果値が大きいものから順に、決定した項目数分の項目を選択する手段と、選択した項目の項目値に基づき、前記予測対象期間における前記事象に係る予測値を算出する予測手段として機能させる。 The computer program of the present application records a time series in which values related to events that change in time series are associated with item values of each item having each of a plurality of related events related to the events as items. And a value related to the event in the first period within the period recorded in the recording means, and an item related to the related event in the second period before the first period. A computer program for predicting a value related to the event in a prediction target period based on the strength of correlation with the value, wherein the computer is configured to logarithmically convert the value related to the event to a logarithmically converted value. On the other hand, using the means for secondary variable conversion of the item value related to the item, the value related to the event after conversion, and the item value related to the item, the correlation strength of each item with respect to the change in the value related to the event The A means for calculating the factor effect value, a means for calculating, for each number of items, the strength of correlation with the prediction target for a plurality of items selected in descending order of the calculated factor effect value, and a strength of the calculated correlation. And a means for determining the number of items that maximizes the strength of correlation with the prediction target, a means for selecting items for the determined number of items in order from the largest factor effect value, and Based on the item value of an item, it is made to function as a prediction means for calculating a predicted value related to the event in the prediction target period.

本願では、時系列的に変化する事象に係る値と、該事象に関連する複数の関連事象の夫々を項目とする各項目の項目値とを対応付けて時系列的に記録する記録手段から、前記事象に係る値及び各項目の項目値を読出し、事象に係る値については対数変換を行い、対数変換した値に対して項目値についての２次変数変換を行う。信号（事象に係る値）に対して対数変換を行い、対数変換後の信号に対し項目を２次変数変換するので、信号に対する項目の３次以上の偶然誤差が除去され、２次成分までを主効果とすることで、信号及び項目の双方について分散が安定化する。 In the present application, from the recording means for recording the value related to the event that changes in time series and the item value of each item having each of the related events related to the event as items, in time series, The value related to the event and the item value of each item are read, logarithmic conversion is performed on the value related to the event, and secondary variable conversion is performed on the item value for the logarithmically converted value. Since logarithmic transformation is performed on the signal (value related to the event), and the second-order variable transformation is performed on the item after the logarithmic transformation, the third-order or higher-order accidental error of the item on the signal is removed, and up to the second-order component. By setting it as the main effect, dispersion | distribution is stabilized about both a signal and an item.

本願によれば、信号に対する項目の３次以上の偶然誤差を除去して２次成分までを主効果とすることで、信号及び項目の双方について分散が安定化し、予測精度を向上させることができる。 According to the present application, by removing the third or higher order coincidence error of an item with respect to a signal and using a secondary component as a main effect, dispersion can be stabilized for both the signal and the item, and prediction accuracy can be improved. .

Ｔ法で用いる信号及び各項目の内容例を示す説明図である。It is explanatory drawing which shows the signal used by T method, and the example of the content of each item. Ｔ法で用いる各項目の比例定数β及びＳＮ比η（２乗比）の一例を示す説明図である。It is explanatory drawing which shows an example of the proportionality constant (beta) and SN ratio (eta) (square ratio) of each item used by T method. 各メンバーの実際の信号値又は基準化した信号値の実値と、各メンバーについて求めた総合推定値とを一覧形式で示した図である。It is the figure which showed the actual signal value of each member or the actual value of the standardized signal value, and the comprehensive estimated value calculated | required about each member in list format. Ｔ法における各項目の総合推定値のＳＮ比を示す要因効果図である。It is a factor effect figure which shows the S / N ratio of the comprehensive estimated value of each item in T method. Ｔ法における各項目の要因効果に対する影響を示すグラフである。It is a graph which shows the influence with respect to the factor effect of each item in T method. 時間差モデルを概念的に説明する説明図である。It is explanatory drawing which illustrates a time difference model notionally. 項目選択数を決定するための処理を概念的に示すグラフである。It is a graph which shows notionally the processing for determining the number of item selection. 項目選択数と総合推定値のＳＮ比との対応を示すグラフである。It is a graph which shows a response | compatibility with the number of item selections, and SN ratio of a comprehensive estimated value. 本実施の形態に係る予測装置の構成を示すブロック図である。It is a block diagram which shows the structure of the prediction apparatus which concerns on this Embodiment. 予測装置による予測処理手順の一例を示すフローチャートである。It is a flowchart which shows an example of the prediction process procedure by a prediction apparatus. 変数変換の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of variable conversion. 総合推定値のＳＮ比の算出手順を示すフローチャートである。It is a flowchart which shows the calculation procedure of SN ratio of a comprehensive estimated value. 本実施例における項目の内容を示す説明図である。It is explanatory drawing which shows the content of the item in a present Example. 変数変換前の信号値及び項目値の関係を示すグラフである。It is a graph which shows the relationship between the signal value before variable conversion, and an item value. 変数変換後の信号値及び項目値の関係を示すグラフである。It is a graph which shows the relationship between the signal value after variable conversion, and an item value. 項目選択数と総合推定値のＳＮ比との対応を示すグラフである。It is a graph which shows a response | compatibility with the number of item selections, and SN ratio of a comprehensive estimated value. 本願の予測方法による予測結果の精度評価を示すグラフである。It is a graph which shows the accuracy evaluation of the prediction result by the prediction method of this application. 比較例の予測方法による予測結果の精度評価を示すグラフである。It is a graph which shows accuracy evaluation of the prediction result by the prediction method of a comparative example. 比較例の予測方法による予測結果の精度評価を示すグラフである。It is a graph which shows accuracy evaluation of the prediction result by the prediction method of a comparative example.

本発明をその実施の形態を示す図面に基づいて具体的に説明する。
本願の予測方法は、従前から利用されているＭＴシステム（例えば、Ｔ法）を用いた方法を基本として、時系列的に変化する事象に係る値（信号）の経時変化を予測するために、時間差モデルを導入したものである。本願では、信号の要因となり得る複数の関連事象の夫々を項目とした項目値を、信号に対応付けて時系列的に記録しておき、信号及び項目値の相関の強さに基づき、予測に用いる信号及び項目の取捨選択を行うことを特徴の１つとしている。 The present invention will be specifically described with reference to the drawings showing the embodiments thereof.
The prediction method of the present application is based on a method using an MT system (for example, T method) that has been used in the past, in order to predict a change over time of a value (signal) related to an event that changes in time series, A time difference model is introduced. In the present application, item values each having a plurality of related events that may cause a signal are recorded in time series in association with the signal, and prediction is made based on the strength of correlation between the signal and the item value. One of the features is to select signals and items to be used.

なお、以下の予測方法は、予測に用いるデータを記録している記録手段から各データを読み出すことができるコンピュータ（後述する予測装置）などの演算手段により実施されるものである。 In addition, the following prediction methods are implemented by calculation means, such as a computer (prediction apparatus mentioned later) which can read each data from the recording means which has recorded the data used for prediction.

まず、本願の予測方法の基本となるＴ法の概要について説明する。図１はＴ法で用いる信号及び各項目の内容例を示す説明図である。図１において、メンバーとは、単位期間毎のデータであることを示すインデックスであり、１，２，…，ｎとして示している。事象に係る値は、信号値Ｍとして記録される。「項目１」、「項目２」、…、「項目ｋ」は、信号値Ｍに関連する複数の関連事象の夫々を示している。Ｘ₁₁，Ｘ₁₂，…，Ｘ_1kは、メンバー１の信号値Ｍ₁ に関連する各項目の項目値を示している。例えば、メンバーが月毎の電気使用量である場合、月毎の電気使用量（Ｗ）が信号値Ｍであり、項目値として、「月」、「気温」、「風速」、「降水量（月平均）」、「日照時間（月平均）」、「最高気温（月平均）」、「最低気温（月平均）」…等の電気使用量の上下に関連するであろう変量を記録する。 First, an outline of the T method which is the basis of the prediction method of the present application will be described. FIG. 1 is an explanatory diagram showing examples of signals used in the T method and the contents of each item. In FIG. 1, a member is an index indicating data for each unit period, and is indicated as 1, 2,..., N. The value relating to the event is recorded as a signal value M. “Item 1”, “Item 2”,..., “Item k” indicate each of a plurality of related events related to the signal value M. X ₁₁ , X ₁₂ ,..., X _1k indicate item values of each item related to the signal value M ₁ of member 1. For example, if the member has monthly electricity usage, the monthly electricity usage (W) is the signal value M, and the item values are “month”, “temperature”, “wind speed”, “precipitation ( Record the variables that would be related to the ups and downs of electricity usage, such as “monthly average”, “sunshine hours (monthly average)”, “maximum temperature (monthly average)”, “minimum temperature (monthly average)”.

なお、信号値Ｍ及び各項目値Ｘは、実際の値を用いてもよいが、演算手段を用いて規準化しておいてから用いることが好ましい。規準化の方法は例えば、演算手段が、各項目の項目値Ｘ（例えば月平均の気温）から、当該項目の項目値の平均値（用いる全メンバーの月平均の気温の平均値）を減算しておくなどの方法がある。これにより、信号値に対して項目値をプロットした場合に（単位空間とよぶ）、原点を通る直線として予測式を表現することが可能となる。 In addition, although the actual value may be used for the signal value M and each item value X, it is preferable to use them after normalization using a calculation means. In the normalization method, for example, the calculation means subtracts the average value of the item value of each item (for example, the average value of the monthly average temperature of all members to be used) from the item value X of each item (for example, the monthly average temperature). There is a way to keep it. Thereby, when the item value is plotted against the signal value (referred to as a unit space), the prediction formula can be expressed as a straight line passing through the origin.

Ｔ法では、各メンバーの信号値Ｍと、関連する多変数である項目値との関係を特定する特定式を求め、予測対象である次の単位期間の項目値を予測し、その予測した項目値を特定式にあてはめて、予測するものである。しかしながら、項目のデータは多変量である。例えばメンバーが１２個であって、項目の数が２０（ｋ＝２０）存在する場合がある。このような場合、次の信号値を予測するのに特定式を求めることが難しくなる。Ｔ法は、多数の項目夫々に対して、信号値の変動に対する要因効果の強さを示す値を評価し、要因効果を示す値を重み付けする。これにより、効果的な項目を選択することにより、全ての項目のデータを使用せずとも、選択された項目を使用することで十分に精度の高い予測等を行うことができるというものである。 In the T method, a specific formula that specifies the relationship between the signal value M of each member and the related multi-variable item value is obtained, the item value of the next unit period to be predicted is predicted, and the predicted item A value is applied to a specific formula to predict. However, the item data is multivariate. For example, there may be 12 members and there are 20 items (k = 20). In such a case, it is difficult to obtain a specific expression for predicting the next signal value. In the T method, for each of a large number of items, a value indicating the strength of the factor effect with respect to signal value fluctuation is evaluated, and a value indicating the factor effect is weighted. Thereby, by selecting effective items, it is possible to perform prediction with sufficiently high accuracy by using the selected items without using data of all items.

そこで、Ｔ法では、演算手段によって、信号値Ｍを持つメンバーを用いて、項目毎に比例定数βと、ＳＮ比η（２乗比）とを下記の式１及び式２を適用して算出する。ＳＮ比とは、式２に示すような分散の逆数を用いて示される値であり、各項目に対する信号値の感度であり、各項目と信号値との相関の強さを示す。 Therefore, in the T method, the calculation means calculates the proportionality constant β and the SN ratio η (square ratio) for each item using the members having the signal value M by applying the following formulas 1 and 2. To do. The S / N ratio is a value indicated by using the reciprocal of dispersion as shown in Equation 2, is the sensitivity of the signal value for each item, and indicates the strength of correlation between each item and the signal value.

なお、上述の式１及び式２は、項目１についての比例定数β及びＳＮ比η（２乗比）を求める式である。演算手段は、項目２から項目ｋについても項目１と同様の計算を行なう。図２はＴ法で用いる各項目の比例定数β及びＳＮ比η（２乗比）の一例を示す説明図である。図２では、上述の式１及び式２を各項目に適用することによって算出した項目毎の比例定数β及びＳＮ比η（２乗比）を表形式で示している。 In addition, the above-mentioned Formula 1 and Formula 2 are formulas for obtaining the proportionality constant β and the SN ratio η (square ratio) for Item 1. The calculation means performs the same calculation as for item 1 for item 2 to item k. FIG. 2 is an explanatory diagram showing an example of the proportionality constant β and SN ratio η (square ratio) of each item used in the T method. In FIG. 2, the proportionality constant β and SN ratio η (square ratio) for each item calculated by applying Equation 1 and Equation 2 described above to each item are shown in a table format.

次に、項目毎の比例定数β及びＳＮ比η（２乗比）を用いて、各メンバーについて、演算手段によって項目毎の出力の推定値を求める。第ｉ番目メンバーについて、項目１による出力の推定値は、下記の式３にて示すことができる。また、同様に、演算手段により、項目２から項目ｋによる出力の推定値を求める。 Next, using the proportionality constant β and the SN ratio η (square ratio) for each item, an estimated value of the output for each item is obtained for each member by the computing means. For the i-th member, the estimated output value of item 1 can be expressed by the following equation 3. Similarly, the estimated value of the output from item 2 to item k is obtained by the calculation means.

これにより、演算手段は、項目値と信号値の総合推定値との関係を示す予測式として、総合推定式（式４）を導出することができる。ただし、予測対象の信号値に対し、全ての項目（１〜ｋ）を用いた総合推定式が最も予測精度が高いわけではない。そこで、予測対象への影響に対する寄与を高くし、予測精度を高めるべく、演算手段によって全ての項目のうちから適切な項目の組み合わせを選択する。 Thereby, the calculating means can derive the comprehensive estimation formula (Formula 4) as a prediction formula indicating the relationship between the item value and the total estimated value of the signal value. However, the comprehensive estimation formula using all items (1 to k) with respect to the signal value to be predicted does not have the highest prediction accuracy. Therefore, in order to increase the contribution to the influence on the prediction target and increase the prediction accuracy, an appropriate combination of items is selected from all items by the calculation means.

そこで、演算手段によって各項目の推定値についての推定精度であるＳＮ比η₁ ，η₂ ，…，η_k （２乗比）を重み付け係数として用いた総合推定値を算出する。従って、第ｉ番目のメンバーの総合推定値は、下記の式４にて示すことができる。 Therefore, the calculation means calculates an overall estimated value using the SN ratios η ₁ , η ₂ ,..., Η _k (square ratio), which is the estimation accuracy for the estimated value of each item, as a weighting coefficient. Therefore, the total estimated value of the i-th member can be expressed by the following Equation 4.

図３は各メンバーの実際の信号値又は基準化した信号値の実値と、各メンバーについて求めた総合推定値とを一覧形式で示した図である。そして、図３に示したように得られた各メンバーの実値と総合推定値とを用い、演算手段は下記式５によって、各項目について、総合推定値のＳＮ比η（ｄｂ）を算出する。 FIG. 3 is a diagram showing, in a list form, the actual signal values of each member or the actual values of the standardized signal values and the total estimated values obtained for each member. Then, using the actual value and the total estimated value of each member obtained as shown in FIG. 3, the computing means calculates the SN ratio η (db) of the total estimated value for each item by the following equation 5. .

Ｔ法では、上述したように求めた各項目についての総合推定値を用いて更に、要因効果値という値を求め、要因効果値を元に項目を選択するなどの方法がとられている。図４はＴ法における各項目の総合推定値のＳＮ比を示す要因効果図であり、図５はＴ法における各項目の要因効果に対する影響を示すグラフである。 In the T method, a method of obtaining a value called a factor effect value by using the total estimated value obtained for each item as described above, and selecting an item based on the factor effect value is used. FIG. 4 is a factor effect diagram showing the SN ratio of the overall estimated value of each item in the T method, and FIG. 5 is a graph showing the influence of each item on the factor effect in the T method.

図４は、横軸に選択の対象となる項目を示し、縦軸に総合推定値のＳＮ比をとって、各項目についてのＳＮ比を示している。また図４では、各項目について、左側にその項目のデータを含む各データに対する総合推定値のＳＮ比、即ち信号値との相関の強さを示し、右側にその項目のデータを除いた各データに対する総合推定値のＳＮ比を示している。図４に示す例では、項目は３６個あり、選択の組み合わせは２³⁶通り存在する。演算手段は、これらの各組み合わせについて１又は複数の項目に対する予測対象のＳＮ比を導出する。そして、演算手段は、対象となる当該項目を含む組み合わせのＳＮ比の平均値と、当該項目を含まない組み合わせのＳＮ比の平均値とを算出する。図４では、このようにして算出された当該項目のデータを含むＳＮ比の平均値を左側に、当該項目を含まないＳＮ比の平均値を右側に、項目毎に示したものである。 In FIG. 4, the horizontal axis indicates items to be selected, and the vertical axis indicates the SN ratio of each item by taking the SN ratio of the comprehensive estimated value. In FIG. 4, for each item, the SN ratio of the total estimated value for each data including the data of the item, that is, the strength of the correlation with the signal value is shown on the left side, and each data excluding the data of the item is shown on the right side. The SN ratio of the comprehensive estimated value with respect to is shown. In the example shown in FIG. 4, there are 36 items, and there are 2 ³⁶ combinations of selections. The computing means derives the SN ratio of the prediction target for one or a plurality of items for each of these combinations. And a calculating means calculates the average value of SN ratio of the combination containing the said item used as object, and the average value of S / N ratio of the combination which does not contain the said item. In FIG. 4, the average value of the SN ratio including the data of the item calculated in this way is shown on the left side, and the average value of the SN ratio not including the item is shown on the right side for each item.

図５は、横軸に選択の対象となる項目を示し、縦軸に要因効果値をとって、各項目についての要因効果値の度合いを示している。図５の縦軸の要因効果値は、図４における予測対象の相関の強さをｄｂ単位で示した各項目の総合推定値のＳＮ比について、右側の（項目を含まない）ＳＮ比に対する左側の（項目を含む）ＳＮ比の度合い、即ち、左側のＳＮ比から右側のＳＮ比を減じた値を示している。つまり、項目毎に、その項目を含む場合の、含まない場合に対するＳＮ比への効果度を示している。したがって算出された要因効果値が正である項目は、その項目を使用することにより、総合推定値のＳＮ比が上昇することを示している。両側Ｔ法と呼ばれる方法においては、このような要因効果値が正である項目のみを選択し、上述に示したようなＴ法による解析が行なわれる。 In FIG. 5, the horizontal axis indicates items to be selected, and the vertical axis indicates the factor effect value, and the degree of the factor effect value for each item is indicated. The factor effect value on the vertical axis in FIG. 5 is the left side of the SN ratio on the right side (excluding items) of the SN ratio of the overall estimated value of each item indicating the strength of the correlation of the prediction target in FIG. Of the SN ratio (including items), that is, a value obtained by subtracting the right SN ratio from the left SN ratio. That is, for each item, the degree of effect on the S / N ratio when the item is included is not shown. Therefore, an item for which the calculated factor effect value is positive indicates that the SN ratio of the comprehensive estimated value is increased by using the item. In a method called a two-sided T method, only items having such a positive factor effect value are selected, and analysis by the T method as described above is performed.

上述に示したようなＴ法（両側Ｔ法）を用いて予測を行なう場合には依然として、実際の予測精度が高まらない。そこで発明者はまず、上述のＴ法に、時間差モデルという考え方を適用し、信号値と項目値との非線形性を考慮した変換処理を行ない、更に、項目選択方法として、要因効果値の正負によって選択する方法ではなく、総合推定値のＳＮ比を最大化する方法を用いることとした。 When prediction is performed using the T method (two-sided T method) as described above, the actual prediction accuracy still does not increase. Therefore, the inventor first applied the concept of a time difference model to the above-described T method, and performed conversion processing taking into account the nonlinearity between the signal value and the item value. Instead of the method of selection, a method of maximizing the SN ratio of the comprehensive estimated value is used.

次に、本願の予測方法にて導入する時間差モデルについて説明する。
Ｔ法は元来、時系列データの予測のための方法ではない。時間軸の概念は特に無く、多数の要因が関連する事象の値に対し、いずれの要因が最も影響があるかを選択するための方法である。Ｔ法をそのまま予測方法に用いる場合、上述したように、各メンバーの項目毎の項目値と信号値との関係を特定して、次のメンバーの信号値を予測するのである。したがって、次のメンバーの信号値を予測するには、次のメンバーの項目毎の項目値を予測しなければならない。このときの各項目の項目値の推定誤差が重なって、信号値の予測精度を悪化させることは容易に想定される。また、時系列に推移していくデータを元に、次の信号値を予測する場合、特に経済予測又は販売予測のような分野では、未来に起こる事象には、過去にその予兆があるはずである。そこで、発明者は、上述したような信号値と項目毎の項目値との対応付けを、同時期のメンバーではなく、所定時間ずらしたものどうしで対応付けることとした。具体的には、本実施の形態の予測方法を実施する演算手段は、ある期間における項目毎の項目値に対し、所定期間後の信号値を対応づけ、これらの間で式１〜５の計算を行ない、結果得られた総合推定式を用い、現在（直近）の任意の期間の項目毎の項目値に基づき、所定期間後の信号値を推定（予測）する。 Next, the time difference model introduced by the prediction method of the present application will be described.
The T method is not originally a method for prediction of time series data. There is no particular concept of the time axis, and it is a method for selecting which factor has the most influence on the value of an event related to a large number of factors. When the T method is used as it is in the prediction method, as described above, the relationship between the item value and the signal value for each item of each member is specified, and the signal value of the next member is predicted. Therefore, in order to predict the signal value of the next member, the item value for each item of the next member must be predicted. It is easily assumed that the estimation error of the item value of each item at this time overlaps and the prediction accuracy of the signal value is deteriorated. In addition, when predicting the next signal value based on time-series data, especially in fields such as economic forecasting or sales forecasting, there should be signs of future events. is there. Accordingly, the inventor decided to associate the association between the signal value and the item value for each item as described above not by the members of the same period but by the ones shifted by a predetermined time. Specifically, the computing means that implements the prediction method of the present embodiment associates the signal value after a predetermined period with the item value for each item in a certain period, and calculates Equations 1 to 5 between these values. And the signal value after a predetermined period is estimated (predicted) based on the item value for each item in the present (most recent) period using the comprehensive estimation formula obtained as a result.

図６は時間差モデルを概念的に説明する説明図である。図６では、図の左から右へ向かって時間の経過を示している。図６の下部の各矩形は、１年毎の各項目の項目値を示し、上部の各矩形は、各項目と同時期の１年毎の信号を示している。時間差モデルでは、同時期の信号と各項目の項目値とを対応付けるのではなく、信号Ｍ_i に対して、所定期間前の各項目の項目値Ｘ_i-t ₁ ，Ｘ_i-t ₂ ，…，Ｘ_i-t _k（図６の例では、ｔ＝１年）を対応付ける。
なお、図６に示す時間差モデルの例では、１年間の信号を前年の１年間の項目値に対応付ける構成としたが、これらの期間は図６に示す期間に限定されるものではなく、適宜設定し得るものである。 FIG. 6 is an explanatory diagram conceptually illustrating the time difference model. FIG. 6 shows the passage of time from the left to the right in the figure. Each rectangle in the lower part of FIG. 6 indicates an item value of each item for each year, and each rectangle in the upper part indicates a signal for each year at the same time as each item. The time difference model, rather than associating the item value of the signal and each item in the same period, with respect to the signal M _i, item values of the items of predetermined period before _{_{_{_{X it 1, X it 2,}}}} ..., X it k (In the example of FIG. 6, t = 1 year) is associated.
In the example of the time difference model shown in FIG. 6, the signal of one year is configured to correspond to the item value of the previous year. However, these periods are not limited to the periods shown in FIG. It is possible.

本実施の形態の予測方法では、演算手段は、時間差モデルを適用した信号及び各項目の項目値を、上述した式１〜式５における信号値Ｍ₁ 及び項目値Ｘ₁₁，Ｘ₁₂，…，Ｘ_1kの関係に当てはめてＴ法を適用する。そして、予測対象期間の信号値は、所定期間前の各項目の項目値（図６の例では２００９年１月から１２月までの項目値）を総合推定式（式４）に入力することによって求める。つまり、本実施の形態の予測方法における時間差モデルでは、演算手段は、複数項目の項目値と所定期間後の信号値との関係を特定することにより、過去又は現在の項目値から、未来の信号値を予測することを可能とするものである。 In the prediction method of the present embodiment, the calculation means uses the signal to which the time difference model is applied and the item value of each item as the signal value M ₁ and the item values X ₁₁ , X ₁₂ ,. The T method is applied by applying the relationship of X _1k . The signal value of the prediction target period is obtained by inputting the item value of each item before the predetermined period (the item value from January to December 2009 in the example of FIG. 6) into the comprehensive estimation formula (Formula 4). Ask. That is, in the time difference model in the prediction method of the present embodiment, the computing means specifies the relationship between the item values of a plurality of items and the signal values after a predetermined period, thereby determining the future signal from the past or current item value. It is possible to predict the value.

以上の、本実施の形態では、予測対象期間である２０１０年の信号（未知信号）を予測するために、２００５年〜２００９年までの各年の信号（既知信号）と、２００４年〜２００８年までの各年の項目（既知項目）とを対応付ける時間差モデルを導入する構成とした。２００９年の各項目（すなわち、予測対象期間よりも所定期間前の対応期間の項目）は、既に取得済みのデータではあるが、時間差モデルには使用していない項目であるため、以下の説明では「未知項目」と呼ぶこととする。 In the present embodiment, in order to predict a 2010 signal (unknown signal) that is a prediction target period, signals of each year (known signal) from 2005 to 2009, and 2004 to 2008 are used. It was set as the structure which introduces the time difference model which matches the item (known item) of each year until. Each item in 2009 (that is, the item corresponding to the predetermined period before the prediction target period) is already acquired data, but is not used in the time difference model. This is called “unknown item”.

本実施の形態の予測方法では、上述した時間差モデルに加えて、最良予測スキームを採用する。最良予測スキームは、対数変換した信号に対して項目を２次変数変換し、信号及び項目の双方について分散の安定化を図り、Ｔ法計算で総合推定ＳＮ比が最大になる項目選択により予測値を得る方法である。 In the prediction method of the present embodiment, the best prediction scheme is adopted in addition to the time difference model described above. In the best prediction scheme, items are subjected to second-order variable transformation for logarithmically transformed signals, dispersion is stabilized for both signals and items, and predicted values are selected by item selection that maximizes the total estimated S / N ratio in the T method calculation. Is the way to get.

以下、最良予測スキームで実施する変数変換について説明する。従来のＴ法では、項目毎に、項目のデータと信号値との関係に対し、ゼロ点を通る直線が設定され、この直線からのずれに基づく重み付け（ＳＮ比として数値化）が行なわれる。このような線形変換では、直線に対するばらつきが少ない項目ほど、信号値の変化に対し寄与する項目であり、ばらつきが多い、すなわち項目値の変化と信号値の変化とに相関がない項目ほど、信号値の変化に寄与しない項目となる。しかしながら、全ての項目の項目値が、信号値との関係において直線、即ち線形な関係を有しているとは限らない。そこで、最良予測スキームでは、信号について対数変換を行うと共に、対数変換した信号に対して項目を２次変数変換を行う。 In the following, variable conversion performed in the best prediction scheme will be described. In the conventional T method, for each item, a straight line that passes through the zero point is set for the relationship between the item data and the signal value, and weighting based on the deviation from this straight line (numericalization as an SN ratio) is performed. In such a linear transformation, items that have less variation with respect to a straight line are items that contribute to changes in the signal value, and items that have more variation, that is, items that have no correlation between the change in the item value and the change in the signal value, This item does not contribute to the change in value. However, the item values of all items do not necessarily have a straight line, that is, a linear relationship in relation to the signal value. Therefore, in the best prediction scheme, logarithmic transformation is performed on the signal, and the item is subjected to secondary variable transformation on the logarithmically transformed signal.

なお、上述した時間差モデルにおいて、未知項目に対応する未知信号は不明であるため、未知信号の変数変換はできない。このため、本実施の形態では、既知項目について変数変換を行ったときの変換係数（すなわち、２次項の係数、１次項の係数、及び定数）を流用して、未知項目における項目値の変数変換を実施するようにしている。 In the above-described time difference model, the unknown signal corresponding to the unknown item is unknown, so variable conversion of the unknown signal cannot be performed. For this reason, in the present embodiment, the conversion coefficient (that is, the coefficient of the secondary term, the coefficient of the primary term, and the constant) when the variable conversion is performed on the known item is used to convert the variable of the item value in the unknown item. To implement.

次に、項目選択について説明する。図７は項目選択数を決定するための処理を概念的に示すグラフである。まず、時間差モデルを適用し、変数変換処理を行なった信号値と項目毎の項目値について、式１〜式５を適用して各項目の総合推定値のＳＮ比を算出する。そして図４及び図５について説明したように、演算手段が、総合推定値のＳＮ比から、項目毎の要因効果値を算出する。図７に示すように、演算手段はまず、項目毎の要因効果値の最小値を初期的な閾値として設定する。演算手段は、要因効果値が閾値以上である項目を選択し、信号値（例えば１〜ｎ全て）に対する総合推定値のＳＮ比を算出する。閾値が初期値である場合、全項目が選択される。次に演算手段は、閾値に所定値を加算した値を次の閾値として設定し、同様にして要因効果値が閾値以上である項目を選択し、信号値に対する総合推定値のＳＮ比を算出する。演算手段がこのような処理を、閾値が要因効果値の最大値以上、即ち図７のＭＡＸとして示したラインに到達するまで繰り返すことにより、選択した複数の項目の項目値に対する総合推定値のＳＮ比を、項目数毎に算出することが可能である。なお、演算手段は、閾値の初期値を最大値ＭＡＸ以上に設定し、閾値を所定値ずつ小さくし、各閾値以上の要因効果値である項目を選択して総合推定値のＳＮ比を算出していくようにしてもよい。 Next, item selection will be described. FIG. 7 is a graph conceptually showing a process for determining the number of item selections. First, the time difference model is applied, and the signal-to-variable conversion signal value and the item value for each item are applied to calculate the SN ratio of the overall estimated value of each item by applying Equations 1 to 5. Then, as described with reference to FIGS. 4 and 5, the calculation means calculates a factor effect value for each item from the SN ratio of the comprehensive estimated value. As shown in FIG. 7, the computing means first sets the minimum value of the factor effect value for each item as an initial threshold value. The computing means selects an item whose factor effect value is equal to or greater than a threshold value, and calculates the SN ratio of the comprehensive estimated value with respect to the signal value (for example, all 1 to n). If the threshold is the initial value, all items are selected. Next, the calculation means sets a value obtained by adding a predetermined value to the threshold value as the next threshold value, similarly selects an item whose factor effect value is equal to or greater than the threshold value, and calculates the SN ratio of the comprehensive estimated value with respect to the signal value. . The calculation means repeats such processing until the threshold reaches the maximum value of the factor effect value, that is, reaches the line shown as MAX in FIG. The ratio can be calculated for each number of items. The calculation means sets the initial value of the threshold value to the maximum value MAX or more, decreases the threshold value by a predetermined value, selects an item that is a factor effect value above each threshold value, and calculates the SN ratio of the comprehensive estimated value. You may make it go.

図８は項目選択数と総合推定値のＳＮ比との対応を示すグラフである。図８に示すグラフは、縦軸に総合推定値のＳＮ比（ｄｂ）をとり、横軸に項目選択数をとって、項目選択数に対する総合推定値のＳＮ比の推移を示したものである。ＳＮ比の値のうち、白抜きの四角は、要因効果値が正の項目を選択するという両側Ｔ法を用いて項目を選択した場合の総合推定値のＳＮ比である。白抜きの菱形は、全項目を選択した場合の総合推定値のＳＮ比を示す。白丸は総合ＳＮ比が最大となる項目数を示している。図８に示すように、要因効果値が正の項目を選択する方法では、総合推定値のＳＮ比は最大とならない。本実施の形態では、図７に示した方法を実施することにより、総合推定値のＳＮ比が最も高くなる項目数を決定し、要因効果値が大きいものから順に、決定した項目数分の項目を選択する手法を採用する。 FIG. 8 is a graph showing the correspondence between the number of item selections and the SN ratio of the overall estimated value. The graph shown in FIG. 8 shows the transition of the SN ratio of the total estimated value with respect to the number of item selections, with the SN ratio (db) of the total estimated value on the vertical axis and the number of item selections on the horizontal axis. . Among the S / N ratio values, the open square is the S / N ratio of the overall estimated value when an item is selected using the two-sided T method of selecting an item having a positive factorial effect value. Open diamonds indicate the SN ratio of the overall estimated value when all items are selected. White circles indicate the number of items with the maximum total SN ratio. As shown in FIG. 8, in the method of selecting an item having a positive factor effect value, the SN ratio of the comprehensive estimated value does not become maximum. In the present embodiment, by implementing the method shown in FIG. 7, the number of items for which the SN ratio of the overall estimated value is the highest is determined, and the items for the determined number of items in descending order of the factor effect value. The method of selecting is adopted.

本願発明者は上述したように、Ｔ法に時間差モデルを適用し、信号値と各項目の項目値との非線形性を考慮した変換処理を行なった上で総合推定値のＳＮ比を算出し、当該ＳＮ比を最大化する項目の選択方法を適用することによって、予測精度を向上させることができるという知見を得た。以下、本発明の適用例について具体的に開示する。 As described above, the present inventor applies a time difference model to the T method, calculates a signal-to-noise ratio of the overall estimated value after performing a conversion process considering the nonlinearity between the signal value and the item value of each item, The knowledge that prediction accuracy can be improved by applying the selection method of the item which maximizes the S / N ratio was obtained. Hereinafter, application examples of the present invention will be specifically disclosed.

図９は本実施の形態に係る予測装置１の構成を示すブロック図である。予測装置１は、例えば、パーソナルコンピュータ、サーバコンピュータ等のコンピュータであり、制御部１０、記録部１１、一時記憶部１２、入力部１３、及び出力部１４を備える。 FIG. 9 is a block diagram showing a configuration of the prediction apparatus 1 according to the present embodiment. The prediction device 1 is a computer such as a personal computer or a server computer, and includes a control unit 10, a recording unit 11, a temporary storage unit 12, an input unit 13, and an output unit 14.

制御部１０は、例えば、ＣＰＵ（Central Processing Unit）を含む。制御部１０は、以下に説明する予測プログラム２に基づき、上述したハードウェア各部の動作を制御し、本実施の形態に係る予測装置１としての機能を発揮させる。なお、制御部１０は、上述した予測方法を実施する演算手段としての機能を有している。 The control unit 10 includes, for example, a CPU (Central Processing Unit). The control part 10 controls the operation | movement of each part of the hardware mentioned above based on the prediction program 2 demonstrated below, and exhibits the function as the prediction apparatus 1 which concerns on this Embodiment. In addition, the control part 10 has a function as a calculating means which implements the prediction method mentioned above.

記録部１１は、ＲＯＭ（Read Only Memory）、ハードディスクドライブ等の不揮発性メモリを用いる。なお、記録部１１は、外付けのハードディスクドライブ、光学ディスクドライブ、通信網を介して接続される他の記録装置であってもよい。すなわち、記録部１１とは、制御部１０からアクセス可能な１又は複数の情報記録媒体の総称である。 The recording unit 11 uses a nonvolatile memory such as a ROM (Read Only Memory) or a hard disk drive. The recording unit 11 may be an external hard disk drive, an optical disk drive, or another recording device connected via a communication network. That is, the recording unit 11 is a general term for one or a plurality of information recording media accessible from the control unit 10.

記録部１１には、本実施の形態の予測方法を実現するための各種手順を含む予測プログラム２が記録されている。また、記録部１１の記録領域の一部は、信号値及び該値に対応する複数の項目のデータ（各項目の項目値）を記録するデータベース（ＤＢ）１１０として用いられる。制御部１０は、データベース１１０に対し、信号値及び各項目の項目値の読み書きが可能である。データベース１１０は、例えば図３に示した形式にて各メンバーの信号値及び各項目の項目値を時系列に記録している。 The recording unit 11 records a prediction program 2 including various procedures for realizing the prediction method of the present embodiment. A part of the recording area of the recording unit 11 is used as a database (DB) 110 that records signal values and data of a plurality of items corresponding to the values (item values of each item). The control unit 10 can read and write signal values and item values of each item with respect to the database 110. The database 110 records the signal value of each member and the item value of each item in time series, for example, in the format shown in FIG.

一時記憶部１２は、例えば、ＤＲＡＭ（Dynamic Random Access Memory）、ＳＲＡＭ（Static Random Access Memory ）等の不揮発性メモリである。一時記憶部１２は、制御部１０の処理によって発生した情報を一時的に記憶する。 The temporary storage unit 12 is, for example, a nonvolatile memory such as a DRAM (Dynamic Random Access Memory) or an SRAM (Static Random Access Memory). The temporary storage unit 12 temporarily stores information generated by the processing of the control unit 10.

入力部１３は、キーボード、マウス等を用い、ユーザの操作入力を受け付ける。 The input unit 13 receives a user operation input using a keyboard, a mouse, or the like.

出力部１４は、液晶モニタなどの表示部、又はプリンタ等の印刷部を用い、制御部１０による情報の処理結果を出力する。 The output unit 14 uses a display unit such as a liquid crystal monitor or a printing unit such as a printer, and outputs information processing results by the control unit 10.

このように構成される予測装置１にて、制御部１０が予測プログラム２に基づく処理を実行することにより、未来の事象に係る信号値を予測する。図１０は予測装置１による予測処理手順の一例を示すフローチャートである。 In the prediction apparatus 1 configured as described above, the control unit 10 executes a process based on the prediction program 2 to predict a signal value related to a future event. FIG. 10 is a flowchart illustrating an example of a prediction processing procedure performed by the prediction device 1.

制御部１０は、入力部１３から、信号値及びこれに関連する各項目の項目値について入力を受付け、受付けた信号値及び項目毎の項目値を記録部１１のデータベース１１０に記録する（ステップＳ１１）。なお、データベース１１０に記録する信号値及び項目毎の項目値は、入力部１３から入力されるもののみならず、通信網を介して他の装置から入力されてもよいし、他の情報記録媒体から入力されてもよい。 The control unit 10 receives input of the signal value and the item value of each item related thereto from the input unit 13, and records the received signal value and the item value for each item in the database 110 of the recording unit 11 (step S11). ). The signal values and item values for each item to be recorded in the database 110 may be input from other devices via the communication network as well as those input from the input unit 13 or other information recording media. May be input.

制御部１０は、記録部１１のデータベース１１０に記録した信号値及び項目毎の項目値に基づいて、時間差モデルを生成する（ステップＳ１２）。ステップＳ１２にて生成する時間差モデルとは、図６を参照して説明したように、項目毎の項目値と、該項目の項目値の所定期間後の信号値とを対応付けたモデルである。すなわち、制御部１０は、ステップＳ１２において、時系列に記憶された信号値と、項目毎の項目値とを、所定期間ずらして対応付ける。 The control unit 10 generates a time difference model based on the signal value recorded in the database 110 of the recording unit 11 and the item value for each item (step S12). The time difference model generated in step S12 is a model in which the item value for each item is associated with the signal value after a predetermined period of the item value of the item, as described with reference to FIG. That is, in step S12, the control unit 10 associates the signal value stored in time series with the item value for each item while shifting by a predetermined period.

制御部１０は、生成した時間差モデルにおいて対応する信号値と項目毎の項目値との関係に基づき、信号値及び項目毎の項目値を変数変換する（ステップＳ１３）。変数変換の処理手順については、後述の図１１のフローチャートを参照して詳細を説明する。 The control unit 10 performs variable conversion of the signal value and the item value for each item based on the relationship between the corresponding signal value and the item value for each item in the generated time difference model (step S13). The variable conversion processing procedure will be described in detail with reference to the flowchart of FIG.

制御部１０は、時間差モデルにより対応付けられた変数変換後の信号値及び各項目の項目値に基づき、前述の式１及び式２を用いて、項目毎の比例定数β及びＳＮ比η（２乗比）を算出する（ステップＳ１４）。 Based on the signal value after variable conversion and the item value of each item associated with the time difference model, the control unit 10 uses the above-described Equations 1 and 2 to calculate the proportionality constant β and the SN ratio η (2) for each item. (Multiplier ratio) is calculated (step S14).

制御部１０は、式３を用いて項目毎の比例定数βとＳＮ比η（２乗比）を用いて、選択された信号期間における各メンバーについて、式３により出力の推定値を算出する（ステップＳ１５）。 The control unit 10 calculates the estimated value of the output by Expression 3 for each member in the selected signal period using the proportional constant β and the SN ratio η (square ratio) for each item using Expression 3. Step S15).

制御部１０は、式４により、推定値についての推定精度であるＳＮ比（２乗比）を重み付け係数として用いた総合推定値を算出する（ステップＳ１６）。 The control unit 10 calculates an overall estimated value using the SN ratio (square ratio), which is the estimation accuracy of the estimated value, as a weighting coefficient by using Equation 4 (step S16).

次に制御部１０は、式５に基づき、信号値及び総合推定値に基づいて、各項目の総合推定値のＳＮ比（ｄｂ）を算出する（ステップＳ１７）。 Next, based on Equation 5, the control unit 10 calculates the SN ratio (db) of the overall estimated value of each item based on the signal value and the overall estimated value (step S17).

制御部１０は、各項目について、要因効果値を導出する（ステップＳ１８）。要因効果値は、上述したように、各項目について、当該項目を除いた各項目の項目値に対する総合推定値のＳＮ比に対する、当該項目を含む各項目の項目値に対する総合推定値のＳＮ比の差分を求めることによって算出される。総合推定値のＳＮ比とは、信号値に対する当該項目の項目値の相関の強さであり、分散の逆数に比例する値の対数として示した値である。 The control unit 10 derives a factor effect value for each item (step S18). As described above, the factor effect value is the SN ratio of the overall estimated value with respect to the item value of each item including the item with respect to the SN ratio of the overall estimated value with respect to the item value of each item excluding the item. Calculated by obtaining the difference. The S / N ratio of the comprehensive estimated value is the strength of correlation of the item value of the item with respect to the signal value, and is a value expressed as a logarithm of a value proportional to the reciprocal of the variance.

次に制御部１０は、要因効果値が大きいものから順に選択した複数の項目の項目値に対する総合推定値のＳＮ比を、項目数毎に算出する（ステップＳ１９）。ステップＳ１９における処理の詳細は、上述にて図８を参照して説明したものである。 Next, the control unit 10 calculates, for each item number, the SN ratio of the comprehensive estimated value with respect to the item values of the plurality of items selected in descending order of the factor effect value (step S19). Details of the processing in step S19 have been described above with reference to FIG.

制御部１０は、項目数毎の総合推定値のＳＮ比に基づき、当該ＳＮ比を最大にする項目数を決定する（ステップＳ２０）。 The control unit 10 determines the number of items that maximizes the SN ratio based on the SN ratio of the comprehensive estimated value for each number of items (step S20).

制御部１０は、ステップＳ２０にて決定した項目数分の項目を選択する（ステップＳ２１）。そして制御部１０は、時間差モデルにおける予測対象期間に対応付けられている所定期間前の項目毎の項目値、すなわち、予測対象期間の直近の対応期間における項目毎の項目値のうち、選択した項目の項目値を式４に当てはめ、予測値を算出する（ステップＳ２２）。式４におけるβ及びηは、ステップＳ１４にて算出したもの（時間差モデルにより対応付けられ、変数変換された信号値及び各項目の項目値によって算出されたもの）を用いる。ステップＳ２２にて予測値は、出力部１４から出力されるか、記録部１１に記録される。なお、信号値が規準化されている場合は、逆変換を行なって求めればよい。 The controller 10 selects as many items as the number of items determined in step S20 (step S21). Then, the control unit 10 selects the item value among the item values for each item before the predetermined period associated with the prediction target period in the time difference model, that is, the item value for each item in the corresponding period closest to the prediction target period. Is applied to Equation 4 to calculate a predicted value (step S22). Β and η in Equation 4 are those calculated in step S14 (corresponding to the time difference model and calculated by the variable-converted signal value and the item value of each item). In step S <b> 22, the predicted value is output from the output unit 14 or recorded in the recording unit 11. If the signal value is normalized, it can be obtained by performing inverse transformation.

図１１は変数変換の処理手順を示すフローチャートである。制御部１０は、各信号値Ｍ_i （ｉ＝１，２，…，ｎ）について対数変換を行う（ステップＳ１３１）。すなわち、制御部１０は、対数変換後の信号値をｍ_i （ｉ＝１，２，…，ｎ）とした場合、ｍ_i ＝ｌｏｇＭ_i を算出する。 FIG. 11 is a flowchart showing a variable conversion processing procedure. The control unit 10 performs logarithmic conversion on each signal value M _i (i = 1, 2,..., N) (step S131). That is, the control unit 10 calculates m _i = log M _i when the signal value after logarithmic conversion is m _i (i = 1, 2,..., N).

次いで、制御部１０は、対数変換後の信号値ｍ_i に対し、時間差モデルで対応付けられる各項目の項目値Ｘ_{i-t j} （ｊ＝１，２，…，ｋ）を２次変数変換する（ステップＳ１３２）。具体的には、制御部１０は、対数変換後の信号値ｍ_i を各項目の項目値Ｘ_{i-t j} に対して２次回帰し、項目の数だけ変換係数（２次項の係数、１次項の係数、定数）を算出する。そして、制御部１０は、項目毎に求めた回帰曲線上において、各項目の項目値Ｘ_{i-t j} に対応する信号値を求め、その信号値を変換後の項目値ｘ_{i-t j} として一時記憶部１２等に記憶させる処理を行う。 Next, the control unit 10 performs secondary variable conversion of the item values X _{it j} (j = 1, 2,..., K) of each item associated with the time difference model with respect to the signal value m _i after logarithmic conversion ( Step S132). Specifically, the control unit 10, 2 ascribes next signal value m _i after logarithmic transformation for an item value X _{it j} of each item, only the transform coefficients (the second-order terms the number of items coefficients, first-order term of Coefficient, constant). Then, the control unit 10, on a regression curve obtained for each item, obtains a signal value corresponding to the item value X _{it j} of each item, a temporary storage unit that signal value as the item value x _{it j} after conversion 12 The process to memorize to etc. is performed.

次いで、制御部１０は、未知項目の項目値について変数変換を行う（ステップＳ１３３）。ステップＳ１３１で既知信号について変換後の信号値ｍ_i を算出することができ、ステップＳ１３２で既知項目について変換後の各項目の項目値ｘ_{i-t j} を算出することができる。しかしながら、上述した未知項目については、項目値としては存在するものの、対応する未知信号（予測対象の信号）が存在しないため、変数変換後の項目値を求めることができない。このため、本実施の形態では、既知項目について求めた変換係数を流用し、未知項目の項目値について変数変換を行う。
なお、変数変換後の未知項目が必要となるのは、上述したステップＳ２２において、予測値を算出する段階であるため、ステップＳ２１において項目を選択した後に未知項目を変数変換する構成としてもよい。 Next, the control unit 10 performs variable conversion on the item value of the unknown item (step S133). In step S131, the converted signal value m _i can be calculated for the known signal, and in step S132, the converted item value x _{it j} for each item can be calculated. However, although the above-described unknown item exists as an item value, there is no corresponding unknown signal (predicted signal), and thus the item value after variable conversion cannot be obtained. For this reason, in this embodiment, the conversion coefficient obtained for the known item is diverted, and the variable conversion is performed for the item value of the unknown item.
The unknown item after the variable conversion is required in the step of calculating the predicted value in the above-described step S22. Therefore, the unknown item may be converted into a variable after the item is selected in step S21.

図１２は総合推定値のＳＮ比の算出手順を示すフローチャートである。制御部１０は、ステップＳ１６にて導出した要因効果値の最小値以下の値を閾値の初期値として設定する（ステップＳ１７１）。ステップＳ１７１は、図７において、初期値を設定した状態である。なお、図７に示すように、要因効果値の最小値を閾値として設定することにより処理量を削減することができる。 FIG. 12 is a flowchart showing a procedure for calculating the SN ratio of the comprehensive estimated value. The control unit 10 sets a value equal to or smaller than the minimum value of the factor effect value derived in step S16 as an initial value of the threshold value (step S171). Step S171 is a state in which an initial value is set in FIG. As shown in FIG. 7, the processing amount can be reduced by setting the minimum factor effect value as a threshold value.

次いで、制御部１０は、設定した閾値以上の要因効果値が算出された項目を選択する（ステップＳ１７２）。要因効果値の最小値を閾値の初期値として設定した場合、最初のステップＳ１７２の段階では、全ての項目が選択されることになる。 Next, the control unit 10 selects an item for which a factor effect value equal to or greater than the set threshold is calculated (step S172). When the minimum value of the factor effect value is set as the initial value of the threshold value, all items are selected in the first step S172.

次いで、制御部１０は、ステップＳ１７２にて選択した項目のデータに対する予想対象の相関の強さ、即ち、総合推定値のＳＮ比を算出し（ステップＳ１７３）、算出した総合推定値のＳＮ比を、項目数に対応付けて一時記憶部１２に記録する（ステップＳ１７４）。 Next, the control unit 10 calculates the strength of the correlation of the prediction target with respect to the data of the item selected in step S172, that is, the SN ratio of the total estimated value (step S173), and calculates the SN ratio of the calculated total estimated value. Then, it is recorded in the temporary storage unit 12 in association with the number of items (step S174).

次いで、制御部１０は、選択した項目に対する総合推定値のＳＮ比の算出を完了したか否かを判定する（ステップＳ１７５）。ステップＳ１７５は、繰り返し処理の終了判定であり、項目数毎の総合推定値のＳＮ比の算出処理が、各項目数について完了したか否かを判定する処理である。例えば、設定されている閾値が要因効果値の最大値以上の値をとる場合、後述する閾値の再設定により、閾値以上となる要因効果値が存在しなくなる場合、算出した項目数毎の総合推定値の個数が選択の対象となる項目数に一致した場合等の完了条件を適宜設定することができる。 Next, the control unit 10 determines whether or not the calculation of the SN ratio of the comprehensive estimated value for the selected item has been completed (step S175). Step S175 is a process for determining whether to end the repetitive process, and is a process for determining whether or not the calculation process of the SN ratio of the total estimated value for each item number has been completed for each item number. For example, if the set threshold value is greater than or equal to the maximum factor effect value, and if a factor effect value that exceeds the threshold value does not exist due to resetting of the threshold value described later, a comprehensive estimate for each calculated number of items Completion conditions such as when the number of values matches the number of items to be selected can be set as appropriate.

ステップ１７５において、選択した項目に対する総合推定値のＳＮ比の算出を完了しておらず、更に少ない項目数に対する総合推定値のＳＮ比の算出を要すると判定した場合（Ｓ１７５：ＮＯ）、制御部１０は、設定している閾値を所定値分大きい値に再設定し（ステップＳ１７６）、ステップＳ１７２へ進み、以降の処理を繰り返す。 If it is determined in step 175 that calculation of the SN ratio of the comprehensive estimated value for the selected item is not completed and calculation of the SN ratio of the comprehensive estimated value for a smaller number of items is required (S175: NO), the control unit 10 resets the set threshold value to a value larger by a predetermined value (step S176), proceeds to step S172, and repeats the subsequent processing.

ステップＳ１７５において、選択した項目に対する総合推定値のＳＮ比の算出を完了したと判定した場合（Ｓ１７５：ＹＥＳ）、制御部１０は、本フローチャートによる処理を終了する。 If it is determined in step S175 that the calculation of the SN ratio of the comprehensive estimated value for the selected item has been completed (S175: YES), the control unit 10 ends the process according to this flowchart.

なお、ここでは、閾値の初期値を要因効果値の最小値以下に設定し、所定値分ずつ大きくなるように閾値を再設定する形態を示したが、逆の処理を行うようにしても良い。すなわち、閾値の初期値を要因効果値の最大値以上に設定し、設定した閾値以上の要因効果値が算出された項目を選択し、閾値が所定値分ずつ小さくなるように再設定するようにしても良い。 Here, the mode in which the initial value of the threshold value is set to be equal to or less than the minimum value of the factor effect value and the threshold value is reset so as to increase by a predetermined value is shown, but the reverse processing may be performed. . In other words, the initial value of the threshold is set to be equal to or greater than the maximum value of the factor effect value, the item for which the factor effect value equal to or greater than the set threshold is calculated is selected, and the threshold value is reset so as to decrease by a predetermined value. May be.

以下、予測装置１による予測方法を適用した具体的な実施例について説明する。
図１３は本実施例における項目の内容を示す説明図である。図１３に示す項目は、各種農業機械の出荷台数を表す項目であり、月次データとして記録される。例えば、「トラクタ」の項目は、２０ｐｓ未満、２０〜３０ｐｓ、３０〜５０ｐｓ、５０ｐｓ以上のクラス別に分けた出荷台数の総計を表している。本実施例では、予測対象の信号として、「コンバイン」（すなわち、自脱型及び普通型を併せた出荷台数）を選択し、それ以外の２６系列の項目に基づき予測対象の信号を予測する構成とした。 Hereinafter, specific examples to which the prediction method by the prediction device 1 is applied will be described.
FIG. 13 is an explanatory diagram showing the contents of items in this embodiment. The items shown in FIG. 13 are items representing the number of shipments of various agricultural machines, and are recorded as monthly data. For example, the item “tractor” represents the total number of shipments divided into classes of less than 20 ps, 20 to 30 ps, 30 to 50 ps, and 50 ps or more. In the present embodiment, “combine” (that is, the number of shipments including both the self-removal type and the normal type) is selected as the prediction target signal, and the prediction target signal is predicted based on the other 26 series items. It was.

図１４は変数変換前の信号値及び項目値の関係を示すグラフである。図１４の横軸は規準化したコンバイン台数（対数値）を示し、縦軸は規準化した自脱型コンバイン台数（対数値）を示している。すなわち、対数変換した信号値と、変数変換前の特定の項目の項目値（生値）との関係を示したものである。なお、図１４に示す例では単位空間を１つ設定している。具体的には、全ての信号値の平均値を算出し、その平均値に最も近い信号サンプルを単位空間の信号値とし、その信号サンプルに対応する項目値を単位空間の項目値として設定している。 FIG. 14 is a graph showing the relationship between signal values and item values before variable conversion. The horizontal axis of FIG. 14 shows the number of standardized combine machines (logarithmic value), and the vertical axis shows the standardized number of combine harvesters (logarithmic value). That is, the relationship between the logarithmically converted signal value and the item value (raw value) of a specific item before variable conversion is shown. In the example shown in FIG. 14, one unit space is set. Specifically, the average value of all the signal values is calculated, the signal sample closest to the average value is set as the unit space signal value, and the item value corresponding to the signal sample is set as the unit space item value. Yes.

図１５は変数変換後の信号値及び項目値の関係を示すグラフである。図１５の横軸は規準化したコンバイン台数（対数値）を示し、縦軸は規準化した自脱型コンバイン台数（対数値）を示している。図１５に示すグラフは、図１４に示す信号値に対し、項目値を２次変数変換したものである。対数変換した信号値に対し、項目値を２次変数変換しているので、項目値も対数値となる。
本願では、信号に対する項目の３次以上の偶然誤差を除去し、２次成分までを主効果とすることで、信号及び項目の双方について分散の安定化を図ることを特徴の１つとしている。 FIG. 15 is a graph showing the relationship between signal values and item values after variable conversion. The horizontal axis of FIG. 15 indicates the number of standardized combine harvesters (logarithmic value), and the vertical axis indicates the standardized number of combine harvesters (logarithmic value). The graph shown in FIG. 15 is obtained by subjecting item values to secondary variable conversion with respect to the signal values shown in FIG. Since the item value is subjected to secondary variable conversion for the logarithmically converted signal value, the item value is also a logarithmic value.
One feature of the present application is to stabilize the dispersion of both the signal and the item by removing the third or higher order coincidence error of the item with respect to the signal and using the second order component as the main effect.

図１６は項目選択数と総合推定値のＳＮ比との対応を示すグラフである。図１６に示すグラフは、縦軸に総合推定値のＳＮ比（ｄｂ）をとり、横軸に項目選択数をとって、項目選択数に対する総合推定値のＳＮ比の推移を示したものである。信号には図１３に示す項目のうち「コンバイン」の出荷数を選択し、残りの２６系列を項目として選択し、図１２に示すフローチャートに従って総合推定値のＳＮ比（ｄｂ）を算出したものを示している。 FIG. 16 is a graph showing the correspondence between the number of item selections and the SN ratio of the overall estimated value. In the graph shown in FIG. 16, the SN ratio (db) of the total estimated value is taken on the vertical axis, and the number of item selections is taken on the horizontal axis, and the transition of the SN ratio of the total estimated value with respect to the number of item selections is shown. . For the signal, the number of shipments of “combine” is selected from the items shown in FIG. 13, the remaining 26 series are selected as items, and the SN ratio (db) of the total estimated value is calculated according to the flowchart shown in FIG. Show.

図１６のグラフにおいて、白抜きの菱形は、全項目を選択した場合の総合推定値のＳＮ比を示しており、本実施例の場合、１７．２ｄｂであった。
白抜きの四角は、要因効果値が正の項目を選択した場合の総合推定値のＳＮ比を示している。本実施例において、要因効果値が正となったのは、「コンバイン（自脱型）」、「防除機」、「乾燥機」、「耕うん機」、「動力散粉機（防除機）」、「普通型（コンバイン）」、「カッター」、「精米機」、及び「米選機」の９つの項目であり、ＳＮ比は１８．９ｄｂであった。 In the graph of FIG. 16, white diamonds indicate the SN ratio of the overall estimated value when all items are selected, and in this example, it was 17.2 db.
A white square indicates the SN ratio of the comprehensive estimated value when an item having a positive factor effect value is selected. In this example, the factor effect values were positive: “combine (self-removal type)”, “control machine”, “dryer”, “cultivator”, “power duster (control machine)”, There were nine items of “normal type (combine)”, “cutter”, “rice mill”, and “rice selector”, and the S / N ratio was 18.9 db.

白丸は総合推定値のＳＮ比（総合ＳＮ比）が最大となる値を示している。本実施例において、総合推定値のＳＮ比が最大となったのは、「コンバイン（自脱型）」、「防除機」、及び「乾燥機」の３つの項目を選択した場合であり、ＳＮ比は１９．３ｄｂであった。 White circles indicate values at which the S / N ratio (total S / N ratio) of the total estimated value is maximized. In this example, the S / N ratio of the overall estimated value was maximized when three items of “combine (self-removal)”, “control machine”, and “dryer” were selected. The ratio was 19.3 db.

日本農業機械工業会の統計資料に依れば、コンバインは「自脱型」及び「普通型」の合計であり、イネやムギの収穫機である。防除機は夏場の除草や害虫駆除に使用され、乾燥機は収穫後の工程である乾燥に使用される。したがって、総合推定値のＳＮ比を最大化する項目選択方法で上述した３種の農業機械が選択されたことには合理性がある。しかしながら、要因効果が正となる項目を選択する従来の手法では、総合推定値のＳＮ比が低下し、「コンバイン（自脱型）」、「防除機」、及び「乾燥機」以外の６種の農業機械が選択されている。特に、畑作用途である「耕うん機」や飼料の裁断機である「カッター」が選択されていることは不合理である。 According to the statistical data of the Japan Agricultural Machinery Industry Association, the combine is the sum of “self-destructing” and “ordinary”, and is the harvesting machine for rice and wheat. The control machine is used for weeding and pest control in summer, and the dryer is used for drying, which is a post-harvest process. Therefore, it is reasonable that the above-described three types of agricultural machines are selected by the item selection method that maximizes the SN ratio of the overall estimated value. However, in the conventional method of selecting items for which the factor effect is positive, the SN ratio of the overall estimated value decreases, and there are six types other than “combine (self-removal type)”, “control machine”, and “dryer”. Agricultural machinery is selected. In particular, it is unreasonable to select a “cultivator” for field cropping and a “cutter” for feed cutting.

以下、本願による変数変換処理、項目選択方法、単異空間の設定による効果について説明する。図１７は本願の予測方法による予測結果の精度評価を示すグラフである。図１７に示すグラフは、横軸に項目選択方法をとり、左側の縦軸に予測のＳＮ比（ｄｂ）、右側の縦軸に平均絶対パーセント誤差（ＭＡＰＥ）をとって、項目選択方法の違いによる予測のＳＮ比（ｄｂ）及びＭＡＰＥの推移を示したものである。なお、図１７では、単位空間を１つ設定して導出した予測結果の精度を示している。 Hereinafter, the effects of the variable conversion process, the item selection method, and the single space setting according to the present application will be described. FIG. 17 is a graph showing the accuracy evaluation of the prediction result by the prediction method of the present application. In the graph shown in FIG. 17, the horizontal axis represents the item selection method, the left vertical axis represents the predicted SN ratio (db), and the right vertical axis represents the average absolute percentage error (MAPE). The transition of the SN ratio (db) and MAPE of the prediction according to is shown. Note that FIG. 17 shows the accuracy of the prediction result derived by setting one unit space.

図１７に示す黒丸は、対数変換した信号値、及び変数変換しない項目値を用いて導出した予測のＳＮ比を示し、白丸はそのＭＡＰＥを示したものである。なお、図１７では、変数変換をしないで予測を行った場合を０次の変数変換の結果として示している。同様に、黒塗りの三角は、対数変換した信号値、及び対数変換した信号値に対し１次変数変換した項目値を用いて導出した予測のＳＮ比、白抜きの三角はそのＭＡＰＥを示している。
黒塗りの四角は、本願の変数変換により得られた信号値及び項目値（すなわち、対数変換した信号値、及び対数変換した信号値に対し２次変数変換した項目値）を用いて導出した予測のＳＮ比、白抜きの四角はそのＭＡＰＥを示している。 A black circle shown in FIG. 17 indicates the SN ratio of the prediction derived using the logarithmically converted signal value and the item value not subjected to variable conversion, and the white circle indicates the MAPE. In FIG. 17, the case where the prediction is performed without performing the variable conversion is shown as the result of the zeroth-order variable conversion. Similarly, a black triangle indicates a signal value obtained by logarithm conversion and an S / N ratio of a prediction derived using an item value obtained by performing primary variable transformation on the signal value obtained by logarithmic transformation, and a white triangle indicates the MAPE. Yes.
The black squares are predictions derived using the signal values and item values obtained by the variable transformation of the present application (that is, the logarithmically transformed signal values and the logarithmically transformed signal values are the secondary variable transformed item values). The S / N ratio and the open square indicate the MAPE.

図１７に依れば、項目の変数変換の次数を上げる程、予測のＳＮ比及びＭＡＰＥが大幅に向上することが分かる。また、全項目を採用した場合、及び要因効果値が正の項目を採用した場合と比較して、総合ＳＮ比を最大化する項目数で選択した項目を採用した場合には、予測のＳＮ比及びＭＡＰＥが向上することが分かる。 According to FIG. 17, it can be seen that the SN ratio and MAPE of the prediction are greatly improved as the order of the variable conversion of the item is increased. In addition, when all items are adopted, and when items selected with the number of items that maximize the total SN ratio are adopted compared to the case where items with positive factor effect values are adopted, the predicted SN ratio It can be seen that MAPE is improved.

以上のように、項目の変数変換の次数、及び項目選択方法によって予測精度は大きな影響を受ける。したがって、信号の対数変換、単位空間の設定方法等の条件による影響について、予測スキームの検討が必要となる。 As described above, the prediction accuracy is greatly influenced by the order of item variable conversion and the item selection method. Therefore, it is necessary to examine a prediction scheme for the influence of conditions such as signal logarithmic conversion and unit space setting method.

図１８は比較例の予測方法による予測結果の精度評価を示すグラフである。図１８は、信号を生値のまま採用し、両側Ｔ法を適用して予測を行った場合の予測結果の精度を示したものである。横軸に項目選択方法をとり、左側の縦軸に予測のＳＮ比（ｄｂ）、右側の縦軸に平均絶対パーセント誤差（ＭＡＰＥ）をとって、項目選択方法の違いによる予測のＳＮ比（ｄｂ）及びＭＡＰＥの推移を示したものである。なお、図１８では、単位空間を１つ設定して導出した予測結果の精度を示している。 FIG. 18 is a graph showing the accuracy evaluation of the prediction result by the prediction method of the comparative example. FIG. 18 shows the accuracy of the prediction result when the signal is adopted as a raw value and prediction is performed by applying the two-sided T method. Taking the item selection method on the horizontal axis, the SN ratio (db) for prediction on the left vertical axis, and the average absolute percentage error (MAPE) on the vertical axis on the right, the SN ratio (db for prediction depending on the item selection method) ) And MAPE. FIG. 18 shows the accuracy of the prediction result derived by setting one unit space.

図１８に示す黒丸は、生値である信号値及び項目値を用いて導出した予測のＳＮ比を示し、白丸はそのＭＡＰＥを示したものである。同様に、黒塗りの三角は、生値である信号値、及びその信号値に対し１次変数変換した項目値を用いて導出した予測のＳＮ比、白抜きの三角はそのＭＡＰＥを示している。また、黒塗りの四角は、生値である信号値、及びその信号値に対し２次変数変換した項目値を用いて導出した予測のＳＮ比、白抜きの四角はそのＭＡＰＥを示している。 The black circles shown in FIG. 18 indicate the SN ratio of the prediction derived using the raw signal values and item values, and the white circles indicate the MAPE. Similarly, a black triangle indicates a signal value that is a raw value and an S / N ratio of a prediction derived by using an item value obtained by converting the signal value to a primary variable, and a white triangle indicates the MAPE. . A black square indicates a signal value that is a raw value and a predicted SN ratio derived by using an item value obtained by performing secondary variable conversion on the signal value, and a white square indicates the MAPE.

図１８に依れば、生値の信号を採用して両側Ｔ法を適用した場合、項目について１次又は２次の変数変換を行ったとしても、予測精度は向上しないことが分かる。なお、変数変換しないＴ法（０次）では、要因効果正の項目を選択した場合に、予測のＳＮ比及びＭＡＰＥが最も改善する。
ただし、生値の信号を採用して両側Ｔ法を適用した場合には、図１７に示す信号の対数変換を適用した予測スキームと比較し、予測精度が悪化していることが分かる。 According to FIG. 18, it is understood that when the raw value signal is adopted and the two-sided T method is applied, the prediction accuracy is not improved even if the primary or secondary variable conversion is performed on the item. Note that, in the T method (0th order) without variable conversion, the prediction S / N ratio and MAPE are most improved when an item with positive factor effect is selected.
However, it is understood that when the raw value signal is adopted and the two-sided T method is applied, the prediction accuracy is deteriorated as compared with the prediction scheme to which the logarithmic transformation of the signal shown in FIG. 17 is applied.

図１９は比較例の予測方法による予測結果の精度評価を示すグラフである。図１９は、信号値について対数変換を行い、変換後の信号値に対し項目値の変数変換を行った場合の予測結果の精度を示したものである。図１９では、単位空間の値として、信号及び各項目共にその平均値を使用している点が、図１７及び図１８と相違する。 FIG. 19 is a graph showing the accuracy evaluation of the prediction result by the prediction method of the comparative example. FIG. 19 shows the accuracy of the prediction result when the logarithmic conversion is performed on the signal value and the variable conversion of the item value is performed on the converted signal value. FIG. 19 is different from FIGS. 17 and 18 in that the average value of the signal and each item is used as the unit space value.

図１９の横軸は項目選択方法を示し、左側の縦軸は予測のＳＮ比（ｄｂ）、右側の縦軸は平均絶対パーセント誤差（ＭＡＰＥ）を示している。図１９において、黒丸、黒塗りの三角、黒塗りの四角は、対数変換した信号値に対し、それぞれ変数変換しない項目値、１次変数変換した項目値、２次変数変換した項目値を用いて導出した予測のＳＮ比を示しており、白丸、白抜きの三角、白抜きの四角は、それぞれのＭＡＰＥを示している。 The horizontal axis of FIG. 19 shows the item selection method, the vertical axis on the left side shows the predicted SN ratio (db), and the vertical axis on the right side shows the average absolute percentage error (MAPE). In FIG. 19, black circles, black triangles, and black squares use item values that are not subjected to variable conversion, item values that are subjected to primary variable conversion, and item values that are subjected to secondary variable conversion with respect to logarithmically converted signal values. The SN ratio of the derived prediction is shown. White circles, white triangles, and white squares indicate the respective MAPEs.

図１９に依れば、項目の変数変換の次数を上げたことによる予測精度への影響は少ないものの、項目の変数変換の次数が上がると、予測のＳＮ比及びＭＡＰＥが多少改善していることが分かる。ただし、単位空間を平均値に設定した場合には、図１７に示す単位空間を１個設定した場合よりも、予測精度が悪化していることが分かる。 According to FIG. 19, although the effect on the prediction accuracy due to the increase in the order of the variable conversion of the item is small, the SN ratio and the MAPE of the prediction are slightly improved when the order of the variable conversion of the item is increased. I understand. However, when the unit space is set to an average value, it can be seen that the prediction accuracy is worse than when one unit space shown in FIG. 17 is set.

以上から、本実施の形態では、（１）信号について対数変換を行い、対数変換した信号に対して項目を２次変数変換することで、信号及び項目の双方について分散の安定化を図ること、（２）単位空間を１個設定すること、（３）Ｔ法計算で総合推定ＳＮ比が最大になる項目数を決定し、決定した項目数に従って項目選択を行うことを特徴とした予測スキームを採用することとした。
このような予測スキームを採用することで、従来のＴ法と比較して、大幅に予測精度を向上させることが可能となった。 As described above, in the present embodiment, (1) logarithmic conversion is performed on a signal, and second-order variable conversion is performed on the logarithmically converted signal, thereby stabilizing dispersion for both the signal and the item. (2) A prediction scheme characterized in that one unit space is set, and (3) the number of items that maximizes the total estimated SN ratio is determined by T-method calculation, and item selection is performed according to the determined number of items. I decided to adopt it.
By adopting such a prediction scheme, it is possible to significantly improve the prediction accuracy as compared with the conventional T method.

今回開示された実施の形態は、全ての点で例示であって、制限的なものではないと考えられるべきである。本発明の範囲は、上述した意味ではなく、特許請求の範囲によって示され、特許請求の範囲と均等の意味及び範囲内での全ての変更が含まれることが意図される。また、各実施の形態で記載されている技術的特徴は、お互いに組み合わせることが可能である。 The embodiment disclosed this time is to be considered as illustrative in all points and not restrictive. The scope of the present invention is defined by the terms of the claims, rather than the meanings described above, and is intended to include any modifications within the scope and meaning equivalent to the terms of the claims. In addition, the technical features described in each embodiment can be combined with each other.

１予測装置
２予測プログラム
１０制御部
１１記録部
１２一時記憶部
１３入力部
１４出力部
１１０データベース DESCRIPTION OF SYMBOLS 1 Prediction apparatus 2 Prediction program 10 Control part 11 Recording part 12 Temporary storage part 13 Input part 14 Output part 110 Database

Claims

A recording unit that records a value related to an event that changes in time series and an item value of each item that includes each of a plurality of related events related to the event in a time-series manner; Based on the strength of correlation between the value related to the event in the first period within the period recorded in the means and the item value related to the related event in the second period before the first period. A prediction device for predicting a value related to the event in the prediction target period,
Means for logarithmically converting the value related to the event;
Means for secondary variable conversion of the item value related to the item with respect to the logarithmically converted value;
Means for calculating a factor effect value indicating the strength of correlation of each item with respect to a change in the value related to the event, using the value related to the event after conversion and the item value related to the item ;
Means for calculating, for each number of items, the strength of correlation with the prediction target for a plurality of items selected in descending order of the calculated factor effect value;
Means for determining the number of items that maximizes the strength of the correlation with the prediction target based on the calculated strength of the correlation;
Means for selecting items for the determined number of items in descending order of the factor effect value;
A prediction device comprising: a prediction unit that calculates a predicted value related to the event in the prediction target period based on an item value of a selected item .

By a computer provided with a recording means for recording a value related to an event that changes in time series and an item value of each item that includes each of a plurality of related events related to the event in time series And a strong correlation between the value related to the event in the first period within the period recorded in the recording means and the item value related to the related event in the second period before the first period. And a prediction method for predicting a value related to the event in the prediction target period,
The computer
Logarithmically transform the value related to the event,
For the logarithmically converted value, convert the item value related to the item to a secondary variable,
Using the value related to the event after conversion and the item value related to the item , a factor effect value indicating the strength of correlation of each item with respect to a change in the value related to the event is calculated,
Calculate the strength of the correlation with the prediction target for each of the items selected in descending order of the calculated factor effect value for each number of items,
Based on the calculated correlation strength, determine the number of items that maximizes the correlation strength with the prediction target,
In order from the largest factor effect value, select items for the determined number of items,
A prediction method, comprising: calculating a prediction value related to the event in the prediction target period based on an item value of a selected item .

In a computer provided with a recording unit that records a value related to an event that changes in time series and an item value of each item that includes each of a plurality of related events related to the event in time series And a strong correlation between the value related to the event in the first period within the period recorded in the recording means and the item value related to the related event in the second period before the first period. And a computer program for predicting a value related to the event in the prediction target period,
The computer,
Means for logarithmically converting the value relating to the event,
Means for secondary variable conversion of the item value related to the item with respect to the logarithmically converted value;
Means for calculating a factor effect value indicating the strength of correlation of each item with respect to a change in the value related to the event, using the value related to the event after conversion and the item value related to the item ;
Means for calculating, for each number of items, the strength of correlation with the prediction target for a plurality of items selected in descending order of the calculated factor effect value;
Means for determining the number of items that maximizes the strength of the correlation with the prediction target based on the calculated strength of the correlation;
Means for selecting items for the determined number of items in descending order of the factor effect value;
A computer program for causing a computer to function as a prediction unit that calculates a predicted value related to the event in the prediction target period based on an item value of a selected item .