JP2021012649A

JP2021012649A - Information processing device, information processing method, and program

Info

Publication number: JP2021012649A
Application number: JP2019127787A
Authority: JP
Inventors: 真史吉川; Mao Yoshikawa
Original assignee: Preferred Networks Inc
Current assignee: Preferred Networks Inc
Priority date: 2019-07-09
Filing date: 2019-07-09
Publication date: 2021-02-04

Abstract

To allow for efficiently extracting feature quantities.SOLUTION: An information processing device is provided, comprising a memory and a processor. The processor is configured to perform steps of: searching for multiple feature quantities that can be extracted using information contained in multiple tables; granting metadata for the multiple feature quantities; and determining at least one feature quantity from among the multiple feature quantities based on the metadata.SELECTED DRAWING: Figure 2

Description

本発明の実施形態は、情報処理装置、情報処理方法及びプログラムに関する。 Embodiments of the present invention relate to information processing devices, information processing methods and programs.

関連のある複数のテーブルから特徴量を抽出、取得することは、例えば、機械学習の分野等において行われている。これは、複数のテーブルに共通するID（識別子：Identifier）に基づいてデータ等を紐付けし、紐付けされたデータ等から特徴量を算出等することにより実行される。しかしながら、一般的にこのような統計処理、機械学習等に用いられるテーブルは、データ数が膨大なものである。このため、合成に用いるオペレーションの数又はデータそのものの量に起因して、全てのテーブルにおいて特徴量を抽出することは、計算機のリソース不足、又は、コストの増大に繋がり、本来取得したかった特徴量が得られないこともある。 Extracting and acquiring features from a plurality of related tables is performed, for example, in the field of machine learning. This is executed by associating data or the like based on an ID (identifier: Identifier) common to a plurality of tables and calculating a feature amount from the associated data or the like. However, a table generally used for such statistical processing, machine learning, etc. has a huge amount of data. For this reason, extracting features in all tables due to the number of operations used for synthesis or the amount of data itself leads to a shortage of computer resources or an increase in cost, and features that were originally desired to be acquired. The amount may not be available.

J. M. Kanter, et.al., "Deep feature synthesis: Towards automating data science endeavors," 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA), 19-21 Oct. 2015J. M. Kanter, et.al., "Deep feature synthesis: Towards automating data science endeavors," 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA), 19-21 Oct. 2015 H. T. Lam, et.al., "One button machine for automating feature engineering in relational databases," arXiv.org, arXiv:1706.00327, https://arxiv.org.abs/1706.00327v1, 1 Jun., 2017H. T. Lam, et.al., "One button machine for automating feature engineering in relational databases," arXiv.org, arXiv: 1706.0327, https://arxiv.org.abs/1706.00327v1, 1 Jun., 2017

一実施形態は、効率よく特徴量を抽出する情報処理装置、情報処理方法及びプログラムを提供する。 One embodiment provides an information processing device, an information processing method, and a program for efficiently extracting feature quantities.

一実施形態によれば、情報処理装置は、メモリと、プロセッサを備える。プロセッサは、複数のテーブルに含まれる情報を用いて抽出可能な複数の特徴量を探索することと、前記複数の特徴量について、メタデータを付与することと、前記メタデータに基づいて、前記複数の特徴量から少なくとも１つの特徴量を決定することと、を実行するように構成される。 According to one embodiment, the information processing device comprises a memory and a processor. The processor searches for a plurality of feature quantities that can be extracted using information contained in a plurality of tables, adds metadata to the plurality of feature quantities, and the plurality of features based on the metadata. It is configured to determine at least one feature from the features of and to perform.

複数テーブルの一例。An example of multiple tables. 一実施形態に係る特徴量抽出装置の機能を示すブロック図。The block diagram which shows the function of the feature amount extraction apparatus which concerns on one Embodiment. 一実施形態に係る特徴量抽出の処理を示すフローチャート。A flowchart showing a feature amount extraction process according to an embodiment. 一実施形態に係る特徴量抽出装置の実装例を示す図。The figure which shows the implementation example of the feature amount extraction apparatus which concerns on one Embodiment.

以下、図面を参照して実施形態について説明する。説明に用いているテーブルは、一例として示したものであり、本実施形態を限定するものではない。 Hereinafter, embodiments will be described with reference to the drawings. The table used in the description is shown as an example, and does not limit the present embodiment.

例えば、図面におけるテーブルよりも単純なテーブルでもよいし、さらに複雑な相互の関連性を有するテーブルであってもよい。また、行数、列数もこれに限られるものではなく、さらに多くの行数、列数を有するものであってもよい。さらに、製品の購入に対する例を挙げるが、これには限られず、例えば、機械学習で学習の対象となりうるような複数のテーブルであってもよい。また、複数のテーブルに分割されているが、これには限られず、１のテーブル内に複数のテーブルが合成されているようなテーブルであっても構わない。 For example, it may be a simpler table than the table in the drawing, or it may be a table having more complicated mutual relationships. Further, the number of rows and the number of columns are not limited to this, and the number of rows and columns may be further increased. Further, examples for purchasing a product are given, but the present invention is not limited to this, and for example, there may be a plurality of tables that can be learned by machine learning. Further, although it is divided into a plurality of tables, the present invention is not limited to this, and a table may be a table in which a plurality of tables are combined in one table.

図１は、特徴量抽出の対象となる複数テーブル（テーブル群）の一例を示す図である。本実施形態では、このような複数のテーブルから有効な特徴量を抽出する。 FIG. 1 is a diagram showing an example of a plurality of tables (table groups) to be extracted. In the present embodiment, effective features are extracted from such a plurality of tables.

顧客テーブルは、顧客IDと顧客の属性を関連づけるテーブルである。顧客IDは、例えば、顧客１人ごとに付与され、この顧客IDと性別、年齢のデータが紐付けられる。 The customer table is a table that associates customer IDs with customer attributes. A customer ID is assigned to each customer, for example, and the customer ID is associated with gender and age data.

注文テーブルは、注文IDと顧客IDを関連づけるテーブルである。この注文IDは、例えば、顧客が１注文をした場合に、注文ごとに付与されるIDである。注文は、例えば、製品の購入の注文であり、１注文に１又は複数の製品の購入を含んでもよい。 The order table is a table that associates the order ID with the customer ID. This order ID is, for example, an ID given for each order when the customer places one order. The order is, for example, an order for the purchase of a product and may include the purchase of one or more products per order.

処理テーブルは、処理IDと注文ID及び製品IDを関連づけるテーブルである。この処理IDは、注文された製品ごとに付与されるIDであり、注文IDに対して製品IDを１ずつ関連づける。１注文で複数の製品の購入注文が含まれる場合には、処理IDが製品IDごとに割り振られる。例えば、図において、処理ID１、２に対して製品ID３、１が関連づけられ、これら２つの処理に対して、同じ注文ID１が関連づけられる。これは、注文ID１に対する注文が、製品ID１、３の製品の購入注文をしたことを意味する。 The processing table is a table that associates the processing ID with the order ID and the product ID. This processing ID is an ID given to each ordered product, and one product ID is associated with the ordered ID. When one order includes purchase orders for a plurality of products, a processing ID is assigned to each product ID. For example, in the figure, product IDs 3 and 1 are associated with processing IDs 1 and 2, and the same order ID 1 is associated with these two processes. This means that the order for order ID1 has placed a purchase order for the products with product IDs 1 and 3.

製品テーブルは、製品IDと価格とを関連づけるテーブルである。各製品IDに対して、当該製品IDが示す製品の価格が格納されている。 The product table is a table that associates the product ID with the price. For each product ID, the price of the product indicated by the product ID is stored.

このような複数のテーブルから、特徴量を抽出する。特徴量は、例えば、これらのテーブルの情報から統計処理である、平均、最大値、最小値、分散、モード、メディアン、共分散、その他の統計処理、若しくは、微分、積分、FFT（Fast Fourier Transform）等の数値処理、又は、その組み合わせにより取得することができる数値である。 Features are extracted from such a plurality of tables. The feature quantity is, for example, statistical processing from the information in these tables, such as mean, mode, minimum value, variance, mode, median, covariance, and other statistical processing, or differentiation, integration, and FFT (Fast Fourier Transform). ), Etc., or a combination thereof.

図２は、本実施形態に係る特徴量抽出装置の機能を示すブロック図である。特徴量抽出装置１は、入力部１０と、記憶部１２と、特徴量探索部１４と、メタデータ付与部１６と、特徴量決定部１８と、特徴量抽出部２０と、出力部２２と、を備える。 FIG. 2 is a block diagram showing the functions of the feature amount extraction device according to the present embodiment. The feature amount extraction device 1 includes an input unit 10, a storage unit 12, a feature amount search unit 14, a metadata addition unit 16, a feature amount determination unit 18, a feature amount extraction unit 20, and an output unit 22. To be equipped.

入力部１０は、データの入力及びユーザからの要求を受け付ける。入力部１０は、例えば、ネットワーク等と接続されるインタフェース、又は、ユーザからの入力を受け付けるユーザインタフェース等を介して特徴量抽出装置１と外部とを接続する。 The input unit 10 receives data input and a request from the user. The input unit 10 connects the feature amount extraction device 1 and the outside via, for example, an interface connected to a network or the like, or a user interface that accepts input from a user.

記憶部１２は、特徴量抽出装置１における特徴量の抽出について必要となるデータ、プログラム等を格納する。また、記憶部１２は、特徴量の抽出過程における各種数値の合成値等を一時的に格納するメモリを備えていてもよい。 The storage unit 12 stores data, programs, and the like necessary for extracting the feature amount in the feature amount extraction device 1. Further, the storage unit 12 may include a memory for temporarily storing a composite value of various numerical values in the feature amount extraction process.

特徴量探索部１４は、複数のテーブルのデータから取得可能な特徴量を探索する。特徴量の探索は、一般的な手法で実行されてもよい。例えば、複数のテーブルのデータにおいて、準備された統計処理、数値処理等のうちデータに対して実行可能な組み合わせを探索し、抽出可能な特徴量を探索する。 The feature amount search unit 14 searches for a feature amount that can be acquired from the data of a plurality of tables. The search for features may be performed by a general method. For example, in the data of a plurality of tables, a combination that can be executed for the data among the prepared statistical processing, numerical processing, and the like is searched, and the feature amount that can be extracted is searched.

メタデータ付与部１６は、特徴量探索部１４が探索した特徴量に対して、メタデータを付与する。付与されるメタデータは、例えば、特徴量を合成するために用いたオペレーション、合成するために辿るテーブルの数（テーブルまでのパス数）、合成元のテーブルのデータ、例えば、データの一意性、分散等、また、連続値である、カテゴリカル値であると言ったデータの型についての情報である。これらのデータは、複数が組み合わせで付与されてもよい。 The metadata addition unit 16 adds metadata to the feature amount searched by the feature amount search unit 14. The metadata given is, for example, the operation used to synthesize the features, the number of tables to be followed for synthesis (the number of paths to the table), the data of the source table, for example, the uniqueness of the data. It is information about the type of data such as distribution, continuous value, and categorical value. A plurality of these data may be given in combination.

以下、特徴量の「合成」と記載しているが、これは、複数の変数から特徴量を合成するという意味であり、隠れた特徴量の「抽出」と同義である。すなわち、本実施形態に係る装置、方法は、テーブル群のデータから抽出可能な複数の特徴量を探索し、抽出したこれら複数の特徴量について優先度に基づいて絞りこんで、所望の特徴量をさらに抽出する、と言い換えてもよい。 Hereinafter, the feature amount is described as "synthesis", which means that the feature amount is synthesized from a plurality of variables, and is synonymous with the "extraction" of the hidden feature amount. That is, the apparatus and method according to the present embodiment search for a plurality of feature quantities that can be extracted from the data of the table group, narrow down the extracted plurality of feature quantities based on the priority, and obtain a desired feature quantity. In other words, it is extracted further.

特徴量決定部１８は、メタデータ付与部１６が付与したメタデータに基づいて特徴量探索部１４が探索した特徴量のうち、抽出対象となる１又は複数の特徴量を選択、決定する。特徴量の決定（選択）には、メタデータに基づいて算出された優先度を用いてもよい。例えば、抽出過程における演算等に対して、所定のルール、関数等を適用して優先度を算出してもよい。特徴量決定部１８は、あらかじめ設定されている条件で探索された特徴量の優先度を算出し、優先度に基づいて抽出する特徴量を決定する。また、特徴量決定部１８は、算出された優先度の順番に特徴量を抽出してもよい。 The feature amount determination unit 18 selects and determines one or a plurality of feature amounts to be extracted from the feature amounts searched by the feature amount search unit 14 based on the metadata given by the metadata addition unit 16. The priority calculated based on the metadata may be used for determining (selecting) the feature amount. For example, a predetermined rule, function, or the like may be applied to the calculation or the like in the extraction process to calculate the priority. The feature amount determination unit 18 calculates the priority of the feature amount searched under the preset conditions, and determines the feature amount to be extracted based on the priority. Further, the feature amount determination unit 18 may extract the feature amount in the order of the calculated priority.

優先度は、ユーザがあらかじめ設定したものであってもよい。優先度は、例えば、統計処理であれば、平均＞分散＞最大＝最小＞メディアン、等として設定される。別の量にも同様に優先度を設定してもよく、例えば、パスの長さが短いほど優先度を高くする、等としてもよい。記憶部１２は、これらの優先度付与の設定をユーザから入力部１０を介して受信し、格納しておいてもよい。特徴量決定部１８は、これらあらかじめ設定された処理、量ごとの優先度に基づいて、探索した特徴量ごとに優先度を算出する。 The priority may be preset by the user. For example, in the case of statistical processing, the priority is set as average> variance> maximum = minimum> median. The priority may be set for another amount in the same manner. For example, the shorter the path length, the higher the priority. The storage unit 12 may receive and store these priority assignment settings from the user via the input unit 10. The feature amount determination unit 18 calculates the priority for each searched feature amount based on the preset processing and the priority for each amount.

上記の優先度は、例えば、需要予測等の所望のタスクに適した優先度をユーザが決定してもよい。また、以下に説明するように、算出された重要度に基づいて、特徴量決定部１８が選択して抽出対象として決定してもよい。 As for the above priority, the user may determine a priority suitable for a desired task such as demand forecasting. Further, as described below, the feature amount determining unit 18 may select and determine the extraction target based on the calculated importance.

特徴量決定部１８は、探索された特徴量の優先度を算出する関数を学習により取得してもよい。具体的には、探索された特徴量の重要度に基づいて、優先度を取得する。 The feature amount determination unit 18 may acquire a function for calculating the priority of the searched feature amount by learning. Specifically, the priority is acquired based on the importance of the searched features.

例えば、permutation importanceの考え方に基づき、あるテーブル群に対して複数の特徴量を用いたタスク、例えば、統計処理、ニューラルネットワークによる処理が存在する（設定されている）場合に、当該処理において用いられている特徴量について、重要度を算出する。この重要度は、例えば、ある特徴量に着目し、当該特徴量の内容をシャッフルして処理を行う。シャッフル前後で処理の結果があまり変わらないものは重要度を低くし、シャッフル前後で処理の結果が大きく変わるものほど重要度を高くする。この重要度を優先度として、特徴量を抽出してもよい。 For example, based on the concept of permutation importance, when a task using a plurality of features for a certain table group, for example, statistical processing or processing by a neural network exists (is set), it is used in the processing. Calculate the importance of the feature amount. For this importance, for example, a certain feature amount is focused on, and the content of the feature amount is shuffled for processing. Those whose processing results do not change much before and after shuffling are less important, and those whose processing results change significantly before and after shuffling are more important. Features may be extracted with this importance as a priority.

また、シャッフルには限られず、当該特徴量の平均値、分散値等に基づいて、当該特徴量の値として適切ではない値に固定して処理をしてもよい。この場合も同様に、値の固定前後で結果が変化するものほど重要度を高くしてもよい。 Further, the processing is not limited to shuffle, and the processing may be performed by fixing the value to an inappropriate value of the feature amount based on the average value, the dispersion value, or the like of the feature amount. In this case as well, the importance may be higher as the result changes before and after the value is fixed.

なお、上記において、結果があまり変わらない／大きく変わる、とは、例えば、シャッフル前の処理の結果の平均値、分散値等の統計量に基づいて判断してもよい。例えば、シャッフル後の処理の値が、シャッフル前の処理の平均値から±１σ内に入っているか否か、等によってもよい。１σは、一例として挙げたものではなく限定されるものではない。これには限られず、所定の条件により適切に判断されるものであればよく、別の例としては、シャッフル後の処理の平均値、分散等の統計量に基づいてもよい。また、一般的に用いられているpermutation importanceの手法と同等の手法に基づいてもよい。 In the above, it may be determined that the result does not change much / greatly changes, for example, based on statistics such as the average value and the variance value of the processing results before shuffling. For example, it may depend on whether or not the value of the processing after shuffling is within ± 1σ from the average value of the processing before shuffling. 1σ is not given as an example and is not limited. The present invention is not limited to this, as long as it is appropriately determined under predetermined conditions, and as another example, it may be based on statistics such as the average value and variance of the processing after shuffling. Further, it may be based on a method equivalent to the commonly used method of permutation importance.

上記の例は、必ずしも独立したものでなくともよい。すなわち、所定のルール、関数等により算出された優先度に基づいて、抽出候補となる特徴量を選択し、当該選択された特徴量に対して、値をシャッフル等して重要度を求め、この重要度に基づいて優先度を更新してもよい。これには限られず、適切に優先度を算出できる処理であれば、所定ルール等に基づいて算出された値と、上記の重要度と、の両方を参酌して優先度を算出してもよい。 The above example does not necessarily have to be independent. That is, a feature amount to be an extraction candidate is selected based on the priority calculated by a predetermined rule, a function, or the like, and the value is shuffled or the like to obtain the importance of the selected feature amount. The priority may be updated based on the importance. The priority is not limited to this, and the priority may be calculated by taking into consideration both the value calculated based on the predetermined rule and the above-mentioned importance as long as the process can appropriately calculate the priority. ..

特徴量抽出部２０は、特徴量決定部１８が決定した特徴量を複数テーブルから合成して抽出する。特徴量決定部１８が複数の特徴量を抽出対象として決定した場合には、特徴量抽出部２０は、例えば、あらかじめ設定された所定時間において合成できる前記特徴量を合成して抽出してもよいし、あらかじめ設定された所定最大値までの数の特徴量を合成して抽出してもよい。 The feature amount extraction unit 20 synthesizes and extracts the feature amount determined by the feature amount determination unit 18 from a plurality of tables. When the feature amount determining unit 18 determines a plurality of feature amounts as extraction targets, the feature amount extracting unit 20 may synthesize and extract the feature amounts that can be synthesized in a predetermined time set in advance, for example. Then, a number of features up to a predetermined maximum value set in advance may be combined and extracted.

あらかじめ設定された所定時間がある場合には、特徴量抽出部２０は、自己又は他の部において時間を計測しておき、所定時間が経過した時点で取得できている特徴量を抽出対象としてもよいし、所定時間が経過時点で取得中の特徴量までを抽出対象としてもよい。 If there is a predetermined time set in advance, the feature amount extraction unit 20 may measure the time in itself or in another unit, and the feature amount that can be acquired when the predetermined time has elapsed may be extracted. Alternatively, up to the feature amount being acquired after a predetermined time has elapsed may be extracted.

あらかじめ設定された所定最大値がある場合には、特徴量決定部１８がこの最大値以下の特徴量を抽出対象として決定し、特徴量抽出部２０がこれらの特徴量を抽出してもよい。別の例としては、特徴量抽出部２０は、特徴量決定部１８が決定した特徴量について、優先度が高い順番に所定最大値までの特徴量を抽出してもよい。この場合、特徴量決定部１８が優先度を特徴量抽出部２０へと出力してもよいし、特徴量決定部１８において優先度順に抽出する特徴量を並び替え、特徴量抽出部２０は、並び替えられた順番に特徴量を抽出してもよい。 When there is a predetermined maximum value set in advance, the feature amount determination unit 18 may determine the feature amount equal to or less than the maximum value as an extraction target, and the feature amount extraction unit 20 may extract these feature amounts. As another example, the feature amount extraction unit 20 may extract the feature amounts up to a predetermined maximum value in descending order of priority from the feature amounts determined by the feature amount determination unit 18. In this case, the feature amount determination unit 18 may output the priority to the feature amount extraction unit 20, or the feature amount determination unit 18 rearranges the feature amounts to be extracted in order of priority, and the feature amount extraction unit 20 may perform the feature amount extraction unit 20. The features may be extracted in the sorted order.

出力部２２は、特徴量抽出部２０が抽出した特徴量を出力する。ここで、本実施形態において出力とは、特徴量抽出装置１の外部への出力はもちろん、記憶部１２へ格納することを含む概念であるものとする。 The output unit 22 outputs the feature amount extracted by the feature amount extraction unit 20. Here, in the present embodiment, the output is a concept including not only the output to the outside of the feature amount extraction device 1 but also the storage in the storage unit 12.

図３は、本実施形態に係る特徴量抽出装置１の処理の流れを示すフローチャートである。以下、この図３を用いて本実施形態に係る特徴量抽出処理について説明する。 FIG. 3 is a flowchart showing a processing flow of the feature amount extraction device 1 according to the present embodiment. Hereinafter, the feature amount extraction process according to the present embodiment will be described with reference to FIG.

まず、入力部１０を介して特徴量抽出装置１は、テーブルの情報等の入力を受け付ける（Ｓ１００）。受け付ける情報は、テーブルの情報だけではなく、付与するメタデータの種類、メタデータから優先度へと変換する関数、上述した所定時間及び所定最大値のうち必要となる情報である。上記の数値等は、状況に応じて、あらかじめ特徴量抽出装置１に設定されていてもよい。 First, the feature amount extraction device 1 receives input of table information and the like via the input unit 10 (S100). The information to be accepted is not only the information in the table, but also the type of metadata to be given, the function for converting the metadata to the priority, and the required information among the above-mentioned predetermined time and predetermined maximum value. The above numerical values and the like may be set in advance in the feature amount extraction device 1 depending on the situation.

次に、特徴量探索部１４は、複数のテーブル情報から合成可能な特徴量を探索する（Ｓ１０２）。特徴量探索部１４は、例えば、あらかじめ設定されている、統計処理、数値処理等に基づいて、複数のテーブルから合成可能な特徴量を探索する。 Next, the feature amount search unit 14 searches for a feature amount that can be synthesized from a plurality of table information (S102). The feature amount search unit 14 searches for a feature amount that can be synthesized from a plurality of tables based on, for example, preset statistical processing, numerical processing, and the like.

例えば、図１のような４つのテーブルが特徴量を抽出する対象となるテーブルである場合、顧客に対して、１注文ごとの平均価格、１注文における購入製品の平均価格等の特徴量や、製品に対して、購入した顧客の性別、年齢、１注文で一緒に注文された製品に関する情報、等が特徴量として合成されうる。上記は、一例として示したものであり、この他にも特徴量として合成可能である量は、多数ある。各特徴量に関して、パスの数、使用するオペレーションの種類、数等の情報を同じタイミングで取得することも可能である。 For example, when the four tables as shown in FIG. 1 are the tables for which the feature amount is to be extracted, the feature amount such as the average price for each order and the average price of the purchased product in one order for the customer, For the product, the gender, age, information about the product ordered together with one order, etc. of the customer who purchased the product can be synthesized as a feature quantity. The above is shown as an example, and there are many other quantities that can be synthesized as feature quantities. It is also possible to acquire information such as the number of passes, the type of operation to be used, and the number of each feature at the same timing.

図３に戻り、次に、メタデータ付与部１６は、メタデータを付与する（Ｓ１０４）。メタデータ付与部１６は、特徴量探索部１４が探索した特徴量に関する情報に基づいて当該特徴量にメタデータを付与する。メタデータは、上述したように、オペレータ、パス数、テーブルのデータの情報に基づいて、探索された特徴量のそれぞれに付与される。 Returning to FIG. 3, the metadata addition unit 16 then adds metadata (S104). The metadata addition unit 16 adds metadata to the feature amount based on the information about the feature amount searched by the feature amount search unit 14. As described above, the metadata is given to each of the searched features based on the information of the operator, the number of passes, and the data of the table.

次に、特徴量決定部１８は、付与されたメタデータに基づいて、特徴量探索部１４が探索した特徴量について、抽出する優先度を算出する（Ｓ１０６）。上述したように、一例として、優先度は、要素ごとにユーザがあらかじめ設定する。ユーザが設定した要素ごとの優先度を総合して、特徴量に関する優先度を算出する。例えば、特徴量の合成に用いる演算の優先度が、平均＝５、分散＝４、最大＝最小＝３、・・・、と設定され、パスの長さに対して、優先度がmax（６−パスの長さ，０）と設定されているとする。この場合、パスの長さが１であり、平均によって合成される特徴量の優先度は、５＋５＝１０として算出されてもよい。和ではなく、積（２５）と算出してもよいし、各要素についての最大値（５）と算出してもよい。さらに、最大と平均を用いる場合には、これらの和を取って、他の要素との積をとる等、優先度の算出方法は、ユーザにより、又は、機械学習により最適となるように訓練された手法により、自由に決定してもよい。例えば、ある演算同士の組み合わせが重要な場合には、当該演算の組み合わせを備える特徴量の優先度を特に高くするようにしてもよい。 Next, the feature amount determination unit 18 calculates the priority to be extracted for the feature amount searched by the feature amount search unit 14 based on the added metadata (S106). As described above, as an example, the priority is preset by the user for each element. The priority for each feature is calculated by summing the priorities for each element set by the user. For example, the priority of the calculation used for synthesizing the features is set as average = 5, variance = 4, maximum = minimum = 3, ..., And the priority is max (6) with respect to the length of the path. -It is assumed that the path length is set to 0). In this case, the length of the path is 1, and the priority of the feature amount synthesized by the average may be calculated as 5 + 5 = 10. Instead of the sum, it may be calculated as the product (25), or it may be calculated as the maximum value (5) for each element. Furthermore, when using the maximum and the average, the method of calculating the priority, such as taking the sum of these and taking the product with other elements, is trained to be optimal by the user or by machine learning. You may decide freely by the method. For example, when the combination of certain operations is important, the priority of the feature quantity including the combination of the operations may be particularly high.

このように、ユーザが高くしたい優先度がより高くなるように算出できる演算であればどのようなものであっても構わない。もちろん、テーブルのデータの情報についても同様に優先度を設定し、合成方法、パス数、テーブルデータ情報の３つに基づいて優先度が算出されてもよいし、これら３つのうち少なくとも１つから優先度が算出されるようにしてもよい。 In this way, any operation can be used as long as it can be calculated so that the priority that the user wants to increase is higher. Of course, the priority may be set for the table data information in the same manner, and the priority may be calculated based on the composition method, the number of passes, and the table data information, or from at least one of these three. The priority may be calculated.

なお、優先度が高い場合に抽出するようにしたが、低い場合に優先して抽出する設計としてもよい。この場合には、優先したい処理ほど優先度が小さくなるようにユーザが設定し、この設定に基づいて、優先して抽出する特徴量の優先度がより小さくなるように算出方法を決定する。 It should be noted that although the extraction is performed when the priority is high, the design may be such that the extraction is performed with priority when the priority is low. In this case, the user sets the priority so that the process to be prioritized becomes smaller, and based on this setting, the calculation method is determined so that the priority of the feature amount to be extracted with priority becomes smaller.

次に、特徴量決定部１８は、抽出対象となる特徴量を決定する（Ｓ１０８）。例えば、Ｓ１０６で算出された優先度順に、抽出対象となる特徴量を決定する。なお、上述したように、特徴量決定部１８において優先度を算出し、その優先度情報を特徴量抽出部２０へと送信し、特徴量抽出部２０において受信した優先度に基づいて抽出対象となる特徴量を決定してもよい。 Next, the feature amount determination unit 18 determines the feature amount to be extracted (S108). For example, the feature quantities to be extracted are determined in the order of priority calculated in S106. As described above, the feature amount determination unit 18 calculates the priority, transmits the priority information to the feature amount extraction unit 20, and sets the extraction target based on the priority received by the feature amount extraction unit 20. The feature amount may be determined.

次に、特徴量抽出部２０は、決定された順番に優先度を合成して抽出する（Ｓ１１０）。例えば、図１に示した複数のテーブルの場合、顧客ごとの１注文における平均価格を算出する。例えば、顧客ID２の顧客について算出する場合を説明する。 Next, the feature amount extraction unit 20 synthesizes and extracts the priorities in the determined order (S110). For example, in the case of the plurality of tables shown in FIG. 1, the average price for one order for each customer is calculated. For example, the case of calculating the customer with the customer ID 2 will be described.

特徴量抽出部２０は、平均値を求めるために必要なデータを各テーブルから取得する。まず、注文テーブルにおいて顧客が注文した注文IDを抽出する。注文ID１、４が対象となる。次に、処理テーブルから、注文ID１、４において購入された製品を抽出する。注文ID１においては、製品３と製品１が、注文ID４については、製品３が購入されている The feature amount extraction unit 20 acquires data necessary for obtaining an average value from each table. First, the order ID ordered by the customer is extracted from the order table. Order IDs 1 and 4 are eligible. Next, the products purchased under order IDs 1 and 4 are extracted from the processing table. For order ID 1, product 3 and product 1 are purchased, and for order ID 4, product 3 is purchased.

この結果と製品テーブルにおける価格とに基づいて、抽出する特徴量である顧客ID２の購入した１注文ごとの平均値を算出する。この場合、各テーブルを参照して、以下のように算出することができる。

Based on this result and the price in the product table, the average value for each purchased order of the customer ID 2 which is the feature amount to be extracted is calculated. In this case, it can be calculated as follows with reference to each table.

このように、特徴量抽出部２０は、直接他のテーブルの情報を参照して抽出対象の特徴量を計算することにより抽出対象となる特徴量を合成する。すなわち、特徴量抽出部２０は、他の特徴量（上記の例では、注文IDごとの和等の特徴量）をそれぞれのテーブルにおいて算出し、テーブルとして保持することなく、抽出対象となる特徴量を合成する。他の顧客IDについても同様に１注文あたりの平均購入価格を算出し、特徴量として抽出する。 In this way, the feature amount extraction unit 20 synthesizes the feature amount to be extracted by directly referring to the information in other tables and calculating the feature amount to be extracted. That is, the feature amount extraction unit 20 calculates another feature amount (in the above example, a feature amount such as the sum of each order ID) in each table, and the feature amount to be extracted without holding it as a table. Is synthesized. Similarly, for other customer IDs, the average purchase price per order is calculated and extracted as a feature amount.

図３に戻り、次に、特徴量抽出部２０は、出力部２２を介して抽出した特徴量を出力する（Ｓ１１２）。抽出した特徴量は、特徴量抽出装置１の内部又は外部のストレージ等に格納されてもよい。 Returning to FIG. 3, the feature amount extraction unit 20 then outputs the feature amount extracted via the output unit 22 (S112). The extracted feature amount may be stored in a storage or the like inside or outside the feature amount extraction device 1.

特徴量抽出装置１は、特徴量の抽出が終了したか否かを判断し（Ｓ１１４）、特徴量の抽出が終了している場合（Ｓ１１４：ＹＥＳ）には、処理を終了する。特徴量の抽出が終了していない場合（Ｓ１１４：ＮＯ）には、優先度に基づいて、次の特徴量を抽出する。 The feature amount extraction device 1 determines whether or not the feature amount extraction is completed (S114), and if the feature amount extraction is completed (S114: YES), ends the process. When the extraction of the feature amount is not completed (S114: NO), the next feature amount is extracted based on the priority.

以上のように、本実施形態によれば、複数のテーブルにわたりデータを選択し、選択されたデータから特徴量を合成して抽出する場合において、探索された特徴量の全てを計算して一時記憶領域に保持することなく、重要な特徴量を抽出することが可能となる。このように、優先度が高い特徴量を、その合成途中の特徴量を保存することなく合成して抽出することにより、特徴量抽出の間に使用するメモリを大幅に削減することが可能となる。また、他の探索された特徴量を合成することなく、優先度の高い特徴量を選択して合成することができるので、処理時間をも削減することが可能となる。 As described above, according to the present embodiment, when data is selected over a plurality of tables and feature quantities are synthesized and extracted from the selected data, all the searched feature quantities are calculated and temporarily stored. It is possible to extract important features without holding them in the region. In this way, by synthesizing and extracting the features with high priority without preserving the features during the synthesis, it is possible to significantly reduce the memory used during the feature extraction. .. In addition, since it is possible to select and synthesize high-priority feature quantities without synthesizing other searched feature quantities, it is possible to reduce the processing time.

このように、本実施形態に係る特徴量抽出装置１を用いることにより、重要な特徴量抽出について柔軟なスケジュールで実行でき、メモリに乗り切れずに重要な特徴量が抽出できないと言った問題にも対処することが可能となる。また、重要な特徴量に絞って抽出を行うことが可能なため、機械学習においてノイズとなり得る不要な特徴量の抽出を回避することもできる。 As described above, by using the feature amount extraction device 1 according to the present embodiment, the important feature amount extraction can be executed on a flexible schedule, and the problem that the important feature amount cannot be extracted without being able to overcome the memory can be solved. It becomes possible to deal with it. In addition, since it is possible to narrow down the extraction to important features, it is possible to avoid the extraction of unnecessary features that may cause noise in machine learning.

前述した実施形態及び変形例における特徴量抽出装置１において、各機能は、アナログ回路、デジタル回路又はアナログ・デジタル混合回路で構成された回路であってもよい。また、各機能の制御を行う制御回路を備えていてもよい。各回路の実装は、ＡＳＩＣ（Application Specific Integrated Circuit）、ＦＰＧＡ（Field Programmable Gate Array）等によるものであってもよい。 In the feature amount extraction device 1 in the above-described embodiment and modification, each function may be a circuit composed of an analog circuit, a digital circuit, or an analog / digital mixed circuit. Further, a control circuit for controlling each function may be provided. The mounting of each circuit may be by ASIC (Application Specific Integrated Circuit), FPGA (Field Programmable Gate Array) or the like.

上記の全ての記載において、特徴量抽出装置１の少なくとも一部はハードウェアで構成されていてもよいし、ソフトウェアで構成され、ソフトウェアの情報処理によりＣＰＵ等が実施をしてもよい。ソフトウェアで構成される場合には、特徴量抽出装置１及びその少なくとも一部の機能を実現するプログラムをフレキシブルディスクやＣＤ−ＲＯＭ等の記憶媒体に収納し、コンピュータに読み込ませて実行させるものであってもよい。記憶媒体は、磁気ディスクや光ディスク等の着脱可能なものに限定されず、ハードディスク装置やメモリなどの固定型の記憶媒体であってもよい。すなわち、ソフトウェアによる情報処理がハードウェア資源を用いて具体的に実装されるものであってもよい。さらに、ソフトウェアによる処理は、ＦＰＧＡ等の回路に実装され、ハードウェアが実行するものであってもよい。モデルの生成や、モデルに入力をした後の処理は、例えば、ＧＰＵ等のアクセラレータを使用して行ってもよい。 In all the above descriptions, at least a part of the feature amount extraction device 1 may be composed of hardware or software, and may be executed by a CPU or the like by information processing of the software. When it is composed of software, the feature extraction device 1 and a program that realizes at least a part of the functions are stored in a storage medium such as a flexible disk or a CD-ROM, read by a computer, and executed. You may. The storage medium is not limited to a removable one such as a magnetic disk or an optical disk, and may be a fixed storage medium such as a hard disk device or a memory. That is, information processing by software may be concretely implemented using hardware resources. Further, the processing by software may be implemented in a circuit such as FPGA and executed by hardware. The generation of the model and the processing after inputting to the model may be performed by using an accelerator such as a GPU, for example.

例えば、コンピュータが読み取り可能な記憶媒体に記憶された専用のソフトウェアをコンピュータが読み出すことにより、コンピュータを上記の実施形態の装置とすることができる。記憶媒体の種類は特に限定されるものではない。また、通信ネットワークを介してダウンロードされた専用のソフトウェアをコンピュータがインストールすることにより、コンピュータを上記の実施形態の装置とすることができる。こうして、ソフトウェアによる情報処理が、ハードウェア資源を用いて、具体的に実装される。 For example, the computer can be made into the device of the above-described embodiment by reading the dedicated software stored in the storage medium readable by the computer. The type of storage medium is not particularly limited. Further, by installing the dedicated software downloaded via the communication network on the computer, the computer can be used as the device of the above embodiment. In this way, information processing by software is concretely implemented using hardware resources.

図４は、本発明の一実施形態におけるハードウェア構成の一例を示すブロック図である。特徴量抽出装置１は、プロセッサ７１と、主記憶装置７２と、補助記憶装置７３と、ネットワークインタフェース７４と、デバイスインタフェース７５と、を備え、これらがバス７６を介して接続されたコンピュータ装置７として実現できる。 FIG. 4 is a block diagram showing an example of a hardware configuration according to an embodiment of the present invention. The feature amount extraction device 1 includes a processor 71, a main storage device 72, an auxiliary storage device 73, a network interface 74, and a device interface 75, and these are connected as a computer device 7 via a bus 76. realizable.

なお、図４のコンピュータ装置７は、各構成要素を一つ備えているが、同じ構成要素を複数備えていてもよい。また、図４では、１台のコンピュータ装置７が示されているが、ソフトウェアが複数のコンピュータ装置にインストールされて、当該複数のコンピュータ装置それぞれがソフトウェアの異なる一部の処理を実行してもよい。 Although the computer device 7 of FIG. 4 includes one component, the computer device 7 may include a plurality of the same components. Further, although one computer device 7 is shown in FIG. 4, software may be installed on a plurality of computer devices, and each of the plurality of computer devices may execute a part of processing different from the software. ..

プロセッサ７１は、コンピュータの制御装置および演算装置を含む電子回路（処理回路、Processing circuit、Processing circuitry）である。プロセッサ７１は、コンピュータ装置７の内部構成の各装置などから入力されたデータやプログラムに基づいて演算処理を行い、演算結果や制御信号を各装置などに出力する。具体的には、プロセッサ７１は、コンピュータ装置７のＯＳ（オペレーティングシステム）や、アプリケーションなどを実行することにより、コンピュータ装置７を構成する各構成要素を制御する。プロセッサ７１は、上記の処理を行うことができれば特に限られるものではない。特徴量抽出装置１の各構成要素は、プロセッサ７１により実現される。 The processor 71 is an electronic circuit (processing circuit, processing circuitry) including a control device and an arithmetic unit of a computer. The processor 71 performs arithmetic processing based on data and programs input from each apparatus of the internal configuration of the computer apparatus 7, and outputs the arithmetic result and the control signal to each apparatus and the like. Specifically, the processor 71 controls each component constituting the computer device 7 by executing an OS (operating system) of the computer device 7, an application, or the like. The processor 71 is not particularly limited as long as it can perform the above processing. Each component of the feature amount extraction device 1 is realized by the processor 71.

主記憶装置７２は、プロセッサ７１が実行する命令および各種データなどを記憶する記憶装置であり、主記憶装置７２に記憶された情報がプロセッサ７１により直接読み出される。補助記憶装置７３は、主記憶装置７２以外の記憶装置である。なお、これらの記憶装置は、電子情報を格納可能な任意の電子部品を意味するものとし、メモリでもストレージでもよい。また、メモリには、揮発性メモリと、不揮発性メモリがあるが、いずれでもよい。特徴量抽出装置１内において各種データを保存するためのメモリは、主記憶装置７２または補助記憶装置７３により実現されてもよい。例えば、記憶部１２は、この主記憶装置７２又は補助記憶装置７３に実装されていてもよい。別の例として、アクセラレータが備えられている場合には、記憶部１２は、当該アクセラレータに備えられているメモリ内に実装されていてもよい。また、メモリ１つに対して複数のプロセッサが接続されてもよいし、単数のプロセッサが接続されてもよい。 The main storage device 72 is a storage device that stores instructions executed by the processor 71, various data, and the like, and the information stored in the main storage device 72 is directly read by the processor 71. The auxiliary storage device 73 is a storage device other than the main storage device 72. Note that these storage devices mean arbitrary electronic components capable of storing electronic information, and may be memory or storage. Further, the memory includes a volatile memory and a non-volatile memory, but any of them may be used. The memory for storing various data in the feature amount extraction device 1 may be realized by the main storage device 72 or the auxiliary storage device 73. For example, the storage unit 12 may be mounted on the main storage device 72 or the auxiliary storage device 73. As another example, when an accelerator is provided, the storage unit 12 may be mounted in the memory provided in the accelerator. Further, a plurality of processors may be connected to one memory, or a single processor may be connected.

ネットワークインタフェース７４は、無線または有線により、通信ネットワーク８に接続するためのインタフェースである。ネットワークインタフェース７４は、既存の通信規格に適合したものを用いればよい。ネットワークインタフェース７４により、通信ネットワーク８を介して通信接続された外部装置９Ａと情報のやり取りが行われてもよい。 The network interface 74 is an interface for connecting to the communication network 8 wirelessly or by wire. As the network interface 74, one conforming to the existing communication standard may be used. Information may be exchanged by the network interface 74 with the external device 9A which is communicated and connected via the communication network 8.

外部装置９Ａは、例えば、カメラ、モーションキャプチャ、出力先デバイス、外部のセンサ、入力元デバイスなどが含まれる。また、外部装置９Ａは、特徴量抽出装置１の構成要素の一部の機能を有する装置でもよい。そして、コンピュータ装置７は、特徴量抽出装置１の処理結果の一部を、クラウドサービスのように通信ネットワーク８を介して受け取ってもよい。 The external device 9A includes, for example, a camera, motion capture, an output destination device, an external sensor, an input source device, and the like. Further, the external device 9A may be a device having some functions of the components of the feature amount extraction device 1. Then, the computer device 7 may receive a part of the processing result of the feature amount extraction device 1 via the communication network 8 like a cloud service.

デバイスインタフェース７５は、外部装置９Ｂと直接接続するＵＳＢ（Universal Serial Bus）などのインタフェースである。外部装置９Ｂは、外部記憶媒体でもよいし、ストレージ装置でもよい。記憶部１２は、外部装置９Ｂにより実現されてもよい。 The device interface 75 is an interface such as a USB (Universal Serial Bus) that directly connects to the external device 9B. The external device 9B may be an external storage medium or a storage device. The storage unit 12 may be realized by the external device 9B.

外部装置９Ｂは出力装置でもよい。出力装置は、例えば、画像を表示するための表示装置でもよいし、音声などを出力する装置などでもよい。例えば、ＬＣＤ（Liquid Crystal Display）、ＣＲＴ（Cathode Ray Tube）、ＰＤＰ（Plasma Display Panel）、スピーカなどがあるが、これらに限られるものではない。 The external device 9B may be an output device. The output device may be, for example, a display device for displaying an image, a device for outputting audio, or the like. For example, there are LCD (Liquid Crystal Display), CRT (Cathode Ray Tube), PDP (Plasma Display Panel), speaker and the like, but the present invention is not limited to these.

なお、外部装置９Ｂは入力装置でもよい。入力装置は、キーボード、マウス、タッチパネルなどのデバイスを備え、これらのデバイスにより入力された情報をコンピュータ装置７に与える。入力装置からの信号はプロセッサ７１に出力される。また、本開示の処理、手段等の一部又は全部は、ネットワークを介したクラウド上に備えられる制御回路及び記憶回路の少なくとも一方に基づいて実行され又は機能されるものであってもよい。 The external device 9B may be an input device. The input device includes devices such as a keyboard, a mouse, and a touch panel, and gives the information input by these devices to the computer device 7. The signal from the input device is output to the processor 71. In addition, some or all of the processes, means, etc. of the present disclosure may be executed or functioned based on at least one of a control circuit and a storage circuit provided on the cloud via a network.

本明細書において、「a、bおよびcの少なくとも1つ（一方）」または「a、bまたはcの少なくとも1つ（一方）」の表現は、a、b、c、a-b、a-c、b-c、a-b-cのいずれかの組み合わせを含む。また、a-a、a-b-b、a-a-b-b-c-cなどのいずれかの要素の複数のインスタンスとの組み合わせをカバーする。さらに、a-b-c-dを有するなどa、b及び／又はc以外の他の要素を加えることをカバーする。 As used herein, the expressions "at least one of a, b and c (one)" or "at least one of a, b or c (one)" are a, b, c, ab, ac, bc, Includes any combination of abc. It also covers combinations with multiple instances of any element such as a-a, a-b-b, a-a-b-b-c-c. In addition, it covers the addition of other elements other than a, b and / or c, such as having a-b-c-d.

上記の全ての記載に基づいて、実施形態の追加、効果又は種々の変形を想到できるかもしれないが、上記した個々の実施形態に限定されるものではない。特許請求の範囲に規定された内容及びその均等物から導き出される本発明の概念的な思想と趣旨を逸脱しない範囲において種々の追加、変更及び部分的削除が可能である。例えば、前述した全ての実施形態において、説明に用いた数式および数値は、一例として示したものであり、これらに限られるものではない。 Based on all the above descriptions, additions, effects or various modifications of the embodiments may be conceived, but are not limited to the individual embodiments described above. Various additions, changes and partial deletions can be made without departing from the conceptual idea and purpose of the present invention derived from the contents defined in the claims and their equivalents. For example, in all the above-described embodiments, the mathematical formulas and numerical values used in the explanation are shown as examples, and are not limited thereto.

１：特徴量抽出装置、
１０：入力部、
１２：記憶部、
１４：特徴量探索部、
１６：メタデータ付与部、
１８：特徴量決定部、
２０：特徴量抽出部、
２２：出力部 1: Feature extraction device,
10: Input section,
12: Memory,
14: Feature search unit,
16: Metadata assignment department,
18: Feature determination unit,
20: Feature extraction unit,
22: Output section

Claims

With one or more memories
It comprises one or more processors connected to the one or more memories.
The one or more processors
Searching for multiple features that can be extracted using information contained in multiple tables,
Adding metadata for the plurality of features and
Determining at least one feature from the plurality of features based on the metadata
Is configured to run,
Information processing device.

The one or more processors
To calculate the priority for each of the plurality of features based on the metadata,
Determining at least one feature from the plurality of features based on the calculated priority.
Is configured to run further,
The information processing device according to claim 1.

The one or more processors
To calculate the priority for the plurality of features based on a predetermined rule,
Is configured to run further,
The information processing device according to claim 2.

Extracting the determined feature amount from the information contained in the plurality of tables, and
Is configured to run further,
The information processing device according to claim 2 or 3.

Performing machine learning using the extracted features,
Is configured to run further,
The information processing device according to claim 4.

The metadata includes at least the processing used for extracting the feature amount, the number of paths to the table provided with the data necessary for extracting the feature amount, and information about the data in the table from which the feature amount is extracted. Including one,
The information processing device according to claim 4 or 5.

The process used for extracting the feature amount is a statistical process.
The information processing device according to any one of claims 4 to 6.

The one or more processors
To determine the order of extracting the feature quantities in descending order of the calculated priority.
Is configured to run further,
The information processing device according to any one of claims 4 to 7.

The one or more processors
To calculate the priority based on a predetermined value for the extraction process of the plurality of feature quantities.
Is configured to run further,
The information processing device according to claim 8.

The one or more processors
When a process using an extractable feature amount is preset, the priority is calculated based on the result of executing the process by changing the feature amount.
Is configured to run further,
The information processing device according to claim 8 or 9.

The one or more processors
Acquiring information across the plurality of tables based on the identifier given to the information,
Extracting the feature amount based on the acquired information,
Is configured to run further,
The information processing device according to any one of claims 4 to 10.

The one or more processors
Extracting the feature amount that can be extracted in a predetermined time,
Is configured to run,
The information processing device according to any one of claims 4 to 11.

The one or more processors
Extracting the number of features up to a predetermined maximum value,
The information processing apparatus according to any one of claims 4 to 12, which is configured to execute the above.

The one or more processors
Extracting the feature amount without calculating the feature amount other than the feature amount to be extracted and storing it as a table in the memory.
Is configured to run,
The information processing device according to any one of claims 4 to 13.

One or more processors
Search for multiple features that can be extracted using the information contained in multiple tables,
Metadata is added to the plurality of features.
Based on the metadata, at least one feature quantity is determined from the plurality of feature quantities.
Information processing method that prepares for that.

The one or more processors
Based on the metadata, the priority is calculated for each of the plurality of features.
The at least one feature amount is determined from the plurality of feature amounts based on the calculated priority.
The information processing method according to claim 15, further comprising the above.

The one or more processors
The determined feature amount is extracted from the information contained in the plurality of tables.
The information processing method according to claim 15 or 16, further comprising the above.

The metadata includes at least the processing used for extracting the feature amount, the number of paths to the table provided with the data necessary for extracting the feature amount, and information about the data in the table from which the feature amount is extracted. Including one,
The information processing method according to any one of claims 15 to 17.

The one or more processors
The feature amount to be extracted is extracted without calculating the feature amount other than the feature amount and storing it as a table in the memory.
The information processing method according to any one of claims 15 to 18.

On the computer
A means for searching for multiple features that can be extracted using information contained in multiple tables,
A means for imparting metadata for the plurality of features,
A means for determining at least one feature quantity from the plurality of feature quantities based on the metadata.
A program that functions as.

When executed by a computer, the computer
Steps to search for multiple features that can be extracted using information contained in multiple tables,
A step of adding metadata for the plurality of features,
A step of determining at least one feature from the plurality of features based on the metadata.
A non-transitory computer-readable medium that stores a program that runs a program.