JP2016194909A

JP2016194909A - Sectional linear model generation system and generation method

Info

Publication number: JP2016194909A
Application number: JP2016040842A
Authority: JP
Inventors: ホゥワン; Hu Wang; チュンチェンリウ; chun cheng Liu; ルウフオン; Lu Feng; 遼平藤巻; Ryohei Fujimaki
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2015-03-31
Filing date: 2016-03-03
Publication date: 2016-11-17
Anticipated expiration: 2036-03-03
Also published as: JP6164320B2; CN106156858A

Abstract

PROBLEM TO BE SOLVED: To learn both of a model structure and model parameter in a sectional linear model such as a hierarchical mixture of experts (HME).SOLUTION: A sectional linear model generation system includes: an acquisition module 201 that is adapted to acquire data on a plurality of tasks; a first setting module 202 that is adapted to set a task correlation latent variable expressing a correlation among the plurality of tasks, in which each task further correlates to one sectional liner model; a second setting module 203 that sets a model structure to a plurality of sectional linear models; and a model optimization module 207 that is adapted to optimize the model structure and model parameter on the basis of the data on the plurality of tasks, the task correlation latent variable and hierarchical latent variable.SELECTED DRAWING: Figure 2

Description

本開示の様々な実施形態は、機械学習の分野に関し、より具体的には区分的線形モデル生成システム及び生成方法に関する。 Various embodiments of the present disclosure relate to the field of machine learning, and more specifically to piecewise linear model generation systems and methods.

区分的線形モデル（例えば、エキスパートの階層的混合（ｈｉｅｒａｒｃｈｉｃａｌｍｉｘｔｕｒｅｏｆｅｘｐｅｒｔｓ（ＨＭＥ））は、多くの企業レベルの機械学習アプリケーションで広く使用されている。ＨＭＥは、一般的な線形モデルより柔軟で、かつ特徴空間の確率的な規則に基づく分割と区分的ローカル線形エキスパートとを使用する重要な利点は残している。ＨＭＥの学習手順は、個々のエキスパートの分割構造の決定とモデリングから成る。トレーニング・サンプルのノイズが多く、サンプルサイズが十分でない場合、これは挑戦的なものになる。また、柔軟なＨＭＥは、トレーニング・データセットにオーバーフィッティングして、汎化パフォーマンスが悪いものになる可能性がある。 Piecewise linear models (e.g., hierarchical mixture of experts (HME)) are widely used in many enterprise-level machine learning applications. HME is more flexible than general linear models, And the important advantage of using feature space probabilistic rule-based partitioning and piecewise local linear experts remains: The HME learning procedure consists of determining and modeling the partition structure of individual experts. This can be challenging if the sample is noisy and the sample size is not enough, and a flexible HME can overfit the training data set, resulting in poor generalization performance. is there.

実用的なアプリケーションでは、例えば共同体の建物のエネルギー需要を予測するためには、相関した機械学習タスクの集合が、しばしば存在する。単純なアプローチでは、これらの相関したタスクの間の共通性を無視して、これらのタスクを独立して解くことになる（シングルタスク学習（ＳｉｎｇｌｅＴａｓｋＬｅａｒｎｉｎｇ（ＳＴＬ））。マルチタスク学習（Ｍｕｌｔｉ−ＴａｓｋＬｅａｒｎｉｎｇ（ＭＴＬ））では、これらの相関したタスクは、タスクをまたいで適切に情報を共有して利用することによって、共同で学習される。複数の相関したタスクを共同で学習することで、効果的に各タスクのサンプルサイズを大きくし、結果、汎化パフォーマンスを向上させる可能性がある。 In practical applications, there is often a set of correlated machine learning tasks, for example to predict the energy demand of a community building. A simple approach would be to solve these tasks independently, ignoring the commonality between these correlated tasks (Single Task Learning (STL)), Multitask Learning (Multi-). In Task Learning (MTL), these correlated tasks are learned jointly by appropriately sharing and using information across the tasks, by learning multiple correlated tasks jointly, It can effectively increase the sample size of each task and, as a result, improve generalization performance.

マルチタスク学習は、フィッティング・ノイズを防ぎうるように、サンプルの量を増やすために、タスク間で共有した情報を通じて、種々のタスクをまたいでサンプルを共同で処理する。しかし、現在のマルチタスク学習技術は、主に固定モデル構造（例えば、線形回帰）のシナリオに使用される。ＨＭＥのような区分的線形モデルでは、モデル構造とモデルパラメータの両方が学習されるべきである。しかし、現在のマルチタスク学習はそれを実行できない。 Multitask learning collaboratively processes samples across different tasks through information shared between tasks to increase the amount of samples so that fitting noise can be prevented. However, current multitask learning techniques are mainly used for scenarios with a fixed model structure (eg, linear regression). For piecewise linear models such as HME, both model structure and model parameters should be learned. However, current multitask learning cannot do that.

本開示の目的は、従来技術の上記課題の少なくとも一部を解決するために、区分的線形モデル生成システムと生成方法とを提供することである。 An object of the present disclosure is to provide a piecewise linear model generation system and a generation method in order to solve at least a part of the above-described problems of the prior art.

本開示の第１の観点によれば、複数のタスクのデータを取得するように構成された取得モジュールと、前記複数のタスク間の相関性を表すタスク相関潜在変数を設定するように構成された第１の設定モジュールと、を含み、各タスクは１つの区分的線形モデルに相関しており、複数の区分的線形モデルにモデル構造を設定し、対応するモデルパラメータと階層的潜在変数とを初期化するように構成された第２の設定モジュールと、前記複数のタスクの前記データと、前記タスク相関潜在変数と、前記階層的潜在変数とに基づいて、前記モデル構造と前記モデルパラメータとを最適化するように構成されたモデル最適化モジュールと、を含む区分的線形モデル生成システムが提供される。 According to a first aspect of the present disclosure, an acquisition module configured to acquire data of a plurality of tasks and a task correlation latent variable representing correlation between the plurality of tasks are configured. Each of the tasks is correlated to a piecewise linear model, sets the model structure to a plurality of piecewise linear models, and initializes the corresponding model parameters and hierarchical latent variables. Optimizing the model structure and the model parameters based on a second setting module configured to enable, the data of the plurality of tasks, the task correlation latent variables, and the hierarchical latent variables A piecewise linear model generation system comprising: a model optimization module configured to:

本開示の１つの代表的な実施形態によれば、前記タスク相関潜在変数は、前記複数のタスクの各タスクと、前記複数の区分的線形モデルの各区分的線形モデルとの間の対応関係を示すように構成される。 According to one exemplary embodiment of the present disclosure, the task correlation latent variable includes a correspondence relationship between each task of the plurality of tasks and each piecewise linear model of the plurality of piecewise linear models. Configured as shown.

本開示の１つの代表的な実施形態によれば、前記モデル構造は、木構造を含む。 According to one exemplary embodiment of the present disclosure, the model structure includes a tree structure.

本開示の１つの代表的な実施形態によれば、前記システムは、前記複数のタスクの前記データと、前記タスク相関潜在変数と、前記モデル構造と、前記モデルパラメータとに基づいて、前記階層的潜在変数を最適化するように構成された階層的潜在変数最適化モジュールを更に含む。 According to one exemplary embodiment of the present disclosure, the system is configured to generate the hierarchical data based on the data of the plurality of tasks, the task correlation latent variable, the model structure, and the model parameters. Further included is a hierarchical latent variable optimization module configured to optimize the latent variables.

本開示の１つの代表的な実施形態によれば、前記システムは、前記複数のタスクのデータと、前記階層的潜在変数と、前記モデル構造と、前記モデルパラメータとに基づいて、前記タスク相関潜在変数を最適化するように構成されたタスク相関潜在変数最適化モジュールを更に含む。 According to one exemplary embodiment of the present disclosure, the system includes the task correlation latency based on the data of the plurality of tasks, the hierarchical latent variables, the model structure, and the model parameters. It further includes a task correlation latent variable optimization module configured to optimize the variables.

本開示の代表的な実施形態によれば、前記システムは、前記タスク相関潜在変数を最適化した後に、前記タスク相関潜在変数と、更新された前記タスク相関潜在変数とに基づいて、前記複数のタスクのハード・クラスタリングを実行するように構成されたハード・クラスタリング・モジュールを更に含む。 According to an exemplary embodiment of the present disclosure, the system, after optimizing the task correlation latent variable, based on the task correlation latent variable and the updated task correlation latent variable, A hard clustering module configured to perform hard clustering of the task is further included.

本開示の代表的な実施形態によれば、前記システムは、前記モデル構造と前記モデルパラメータとが最適か否か判定するように構成された判定モジュールを更に含む。 According to an exemplary embodiment of the present disclosure, the system further includes a determination module configured to determine whether the model structure and the model parameters are optimal.

本開示の代表的な実施形態によれば、前記判定モジュールは、前記モデル構造と前記モデルパラメータとの間のフィッティング度合に基づいて、前記モデル構造と前記モデルパラメータとが最適か否か判定する。 According to a representative embodiment of the present disclosure, the determination module determines whether or not the model structure and the model parameter are optimal based on a fitting degree between the model structure and the model parameter.

本開示の他の観点によれば、複数のタスクのデータを取得することと、前記複数のタスク間の相関性を表すタスク相関潜在変数を設定することと、を含み、各タスクは１つの区分的線形モデルに相関しており、複数の区分的線形モデルにモデル構造を設定し、対応するモデルパラメータと階層的潜在変数とを初期化することと、前記複数のタスクの前記データと、前記タスク相関潜在変数と、前記階層的潜在変数とに基づいて、前記モデル構造と前記モデルパラメータとを最適化することと、を含む区分的線形モデル生成方法が提供される。 According to another aspect of the present disclosure, the method includes: acquiring data of a plurality of tasks; and setting a task correlation latent variable that represents a correlation between the plurality of tasks, each task having one category Correlating to a linear model, setting a model structure in a plurality of piecewise linear models, initializing corresponding model parameters and hierarchical latent variables, the data of the plurality of tasks, and the tasks A piecewise linear model generation method is provided that includes optimizing the model structure and the model parameters based on correlated latent variables and the hierarchical latent variables.

本開示の１つの代表的な実施形態によれば、前記方法は、前記複数のタスクの前記データと、前記タスク相関潜在変数と、前記モデル構造と、前記モデルパラメータとに基づいて、前記階層的潜在変数を最適化することを更に含む。 According to one exemplary embodiment of the present disclosure, the method includes the hierarchical method based on the data of the plurality of tasks, the task correlation latent variables, the model structure, and the model parameters. It further includes optimizing the latent variable.

本開示の１つの代表的な実施形態によれば、前記方法は、前記複数のタスクの前記データと、前記階層的潜在変数と、前記モデル構造と、前記モデルパラメータとに基づいて、前記タスク相関潜在変数を最適化することを更に含む。 According to one exemplary embodiment of the present disclosure, the method includes the task correlation based on the data of the plurality of tasks, the hierarchical latent variable, the model structure, and the model parameters. It further includes optimizing the latent variable.

本開示の代表的な実施形態によれば、前記方法は、前記タスク相関潜在変数を最適化した後に、前記タスク相関潜在変数と、更新された前記タスク相関潜在変数とに基づいて、前記複数のタスクのハード・クラスタリングを実行することを更に含む。 According to an exemplary embodiment of the present disclosure, the method includes: after optimizing the task correlation latent variable, based on the task correlation latent variable and the updated task correlation latent variable, It further includes performing hard clustering of tasks.

本開示の代表的な実施形態によれば、前記方法は、前記モデル構造と前記モデルパラメータとが最適か否か判定することを更に含む。 According to an exemplary embodiment of the present disclosure, the method further includes determining whether the model structure and the model parameters are optimal.

本開示の代表的な実施形態によれば、前記モデル構造と前記モデルパラメータとが最適か否か前記判定することは、前記モデル構造と前記モデルパラメータとの間のフィッティング度合に基づいて、前記モデル構造と前記モデルパラメータとが最適か否か判定することを含む。 According to a representative embodiment of the present disclosure, the determining whether the model structure and the model parameter are optimal is based on a fitting degree between the model structure and the model parameter. Determining whether the structure and the model parameters are optimal.

本開示の様々な実施形態の技術的解決方法では、区分的線形モデルの生成を容易にするために、モデル生成処理中に、タスク間の相関を示すために新しい潜在変数が追加されるので、複数のタスクのデータをモデリングに利用することができ、それにより、モデル生成の精度が増す。 In the technical solutions of the various embodiments of the present disclosure, new latent variables are added during the model generation process to indicate correlation between tasks to facilitate the generation of piecewise linear models, Data from multiple tasks can be used for modeling, thereby increasing the accuracy of model generation.

また、各タスクが、１つの区分的線形モデルのみに相関するように設定されているので、本開示による区分的線形モデル生成システム及び生成方法では、より大きいデータを処理しうる。 In addition, since each task is set to correlate with only one piecewise linear model, the piecewise linear model generation system and method according to the present disclosure can process larger data.

代表的な実施形態の以下の詳細な説明を、添付図面と合わせて読めば、上記及び他の目的と、特徴と、利点とが明らかになるだろう。 These and other objects, features and advantages will become apparent when the following detailed description of the exemplary embodiments is read in conjunction with the accompanying drawings.

本開示の１つの実施形態による区分的線形モデル生成方法を模式的に示すフロー図である。FIG. 3 is a flow diagram schematically illustrating a piecewise linear model generation method according to one embodiment of the present disclosure. 本開示の１つの実施形態による区分的線形モデル生成システムを模式的に示すブロック図である。1 is a block diagram schematically illustrating a piecewise linear model generation system according to one embodiment of the present disclosure. FIG. 本開示の１つの実施形態による区分的線形モデル生成方法のステップを模式的に示す図表である。6 is a chart schematically illustrating steps of a piecewise linear model generation method according to an embodiment of the present disclosure.

以下で、添付図面のいくつかの代表的な実施形態に関して、本開示の原理と方法とが述べられる。これらの実施形態は、当業者がより理解し、更に本開示を実施可能なように述べられただけであって、どの手法でも本開示の範囲を限定することを意図してはいないと理解されるべきである。 In the following, the principles and methods of the present disclosure will be described with reference to some exemplary embodiments of the accompanying drawings. These embodiments are understood to be better understood by those skilled in the art and are further described to enable the present disclosure, and are not intended to limit the scope of the present disclosure in any way. Should be.

本開示の様々な実施形態による区分的線形モデル生成方法及び生成システムを説明する前に、ここに現れる技術的な概念のいくつかを、まず示す。 Before describing the piecewise linear model generation method and system according to various embodiments of the present disclosure, some of the technical concepts that appear here will first be presented.

区分的線形モデル
区分的線形モデルの前身は回帰木である。回帰木では、分割が規則の連鎖により特定されており、ローカル・エキスパートは一定の価値を有する。回帰木の表現は理解が容易だが、個々のエキスパートの予測能力は高くなく、良い予測パフォーマンスを達成するためには、ツリーの深さが大きくなる傾向がある。 Piecewise Linear Model The predecessor of the piecewise linear model is a regression tree. In the regression tree, the partition is specified by a chain of rules, and the local expert has a certain value. Regression tree representations are easy to understand, but the predictability of individual experts is not high, and the tree depth tends to increase to achieve good prediction performance.

エキスパートの階層的な混在（ＨＩＥＲＡＲＣＨＩＣＡＬＭＩＸＴＵＲＥＯＦＥＸＰＥＲＴＳ（ＨＭＥ））
ＨＭＥは、区分的モデルをつくるために分割統治戦略を採用する。それらのツリーの分割構造は、確率的なソフト分割関数である「ゲーティング」関数で決定される。ＨＭＥｓは、ゲーティング関数の設計により、どのような分割構造でも表すことができるが、学習した分割構造が実際に理解可能なように、規則連鎖の（ツリーの）分割構造のみが、ここでは採用される。 Hierarchical mixture of experts (HIERARCHICAL MIXTURE OF EXPERTS (HME))
HME adopts a divide-and-conquer strategy to create a piecewise model. The division structure of these trees is determined by a “gating” function which is a stochastic soft division function. HMEs can be expressed in any partition structure depending on the design of the gating function, but only a regular chain (tree) partition structure is adopted here so that the learned partition structure can be actually understood. Is done.

ＨＭＥモデルは以下のように定義される。

ここで、θとγとは、ゲーティング・ノードからのパラメータであり、Ｅはエキスパートの数であり、ε_ｊはルート・ノードからｊ番目のエキスパート・ノードまでの唯一のパスのゲーティング・ノードの全ての索引を含む索引セットである。ｇ（ｘ，α_ｉ）は、α_ｉ＝（β_ｉ；γ_ｉ）によりパラメータ化されたゲーティング関数と呼ばれ、ψ_ｇ（ｘ，ｉ，ｊ）は、エキスパートｊとｉの左サブツリーに属するｘの確率であり、ｐ（ｙ｜ｘ，φ_ｊ）は、

によりパラメータ化されるガウス形雑音を有するｊ番目のエキスパートの条件付き分布である。 The HME model is defined as follows.

Where θ and γ are parameters from the gating node, E is the number of experts, and ε _j is the only path gating node from the root node to the jth expert node. Is an index set that includes all of the indexes. g (x, α _i ) is called the gating function parameterized by α _i = (β _i ; γ _i ), and ψ _g (x, i, j) is in the left subtree of experts j and i The probability of x belonging to p (y | x, φ _j ) is

Is a conditional distribution of the j th expert with Gaussian noise parameterized by:

混合モデルにおける因子化漸近ベイジアン（ＦａｃｔｏｒｉｚｅｄＡｓｙｍｐｔｏｔｉｃＢａｙｅｓｉａｎ（ＦＡＢ））
今、以下の観測データが存在すると仮定する。

ｚ^Ｎ＝ｚ^（１），・・・，ｚ^（Ｎ），ｚ^（ｎ）∈｛１，０｝^Ｃは、ｘ^Ｎに対応する潜在変数を示し、どの混合成分がｘ^（ｎ）を生成するかを示す。このような潜在変数モデルは、因子化漸近ベイジアン（ＦａｃｔｏｒｉｚｅｄＡｓｙｍｐｔｏｔｉｃＢａｙｅｓｉａｎ（ＦＡＢ））推論により解ける。ＦＡＢは、因子化情報量基準（ＦａｃｔｏｒｉｚｅｄＩｎｆｏｒｍａｔｉｏｎＣｒｉｔｅｒｉｏｎ（ＦＩＣ））に基づいた、近年開発されたベイジアン近似である。ＦＩＣは、以下のように、周辺対数尤度の漸近近似として導き出される。

ここで、ｑ（ｚ^Ｎ）はｚ^Ｎの変分分布であり、θ⁻は、モデルパラメータの最尤推定量である。ＦＩＣ^ｍｍで最も重要な項は

であり、この項は、無関係な潜在変数の自動的な減少とパラメータ識別可能性という、ＦＡＢ推理における、理論的に望ましい、これらの特性をもたらす。 Factorized Asymptotic Bayesian (FAB) in mixed models
Assume that the following observation data exists.

z ^N = z ⁽¹⁾ ,..., z ^(N) , z ⁽ⁿ⁾ ∈ {1, 0} ^C denotes a latent variable corresponding to x ^N , and which mixture component generates x ⁽ⁿ⁾ Indicates what to do. Such latent variable models can be solved by factorized asymptotic Bayesian (FAB) inference. FAB is a recently developed Bayesian approximation based on the Factorized Information Criterion (FIC). The FIC is derived as an asymptotic approximation of the marginal log likelihood as follows.

Here, q (z ^N ) is a variational distribution of z ^N , and θ ⁻ is a maximum likelihood estimator of model parameters. The most important term in FIC ^mm is

This term provides these theoretically desirable properties in FAB inference: automatic reduction of irrelevant latent variables and parameter identifiability.

ＦＡＢは、ＦＩＣの扱いやすい下限を、最適化する。ｑとθに関する下限を交互に最大化することが、ＦＩＣの下限の単調増加を保証する。 FAB optimizes the manageable lower bound of FIC. Alternately maximizing the lower limits on q and θ guarantees a monotonic increase in the lower limit of FIC.

マルチタスク学習（Ｍｕｌｔｉ−ＴａｓｋＬｅａｒｎｉｎｇ（ＭＴＬ））
機械学習のＭＴＬにおける共通のパラダイムは、ペナルティ経験損失を最小化することである。

ここで、Ｗは、トレーニング・サンプルから推定されるパラメータであり、Ｌ（Ｗ）は、トレーニング・セットの経験的損失であり、Ω（Ｗ）はタスク相関性をコード化する正則項である。タスク相関性に関して異なる仮定をすると、異なる正則項になる。マルチタスク学習の分野では、新しい正則化を使用してタスク間の関係をモデル化する先行研究がたくさんある。 Multi-task learning (Multi-Task Learning (MTL))
A common paradigm in machine learning MTL is to minimize penalty experience loss.

Where W is a parameter estimated from the training sample, L (W) is the empirical loss of the training set, and Ω (W) is a regular term encoding task correlation. Different assumptions about task correlation result in different regular terms. In the field of multitasking learning, there are many previous studies that use new regularization to model the relationships between tasks.

以下で、本発明の概念を、図１−３で示される代表的な実施形態に関して示す。下記の説明では、区分的線形モデルの例であるＨＭＥモデルで本開示の原理を示す。しかし、本開示はＨＭＥモデルに限定されず、他の区分的線形モデルにも適用可能である。 In the following, the concept of the present invention will be illustrated with respect to the exemplary embodiment shown in FIGS. 1-3. In the following description, the principles of the present disclosure are illustrated with an HME model, which is an example of a piecewise linear model. However, the present disclosure is not limited to the HME model and can be applied to other piecewise linear models.

本開示の要旨は、マルチタスク学習を使用する区分的線形モデルを生成するという挑戦的な課題に対処することである。本開示では、因子化漸近ベイジアン（ＦａｃｔｏｒｉｚｅｄＡｓｙｍｐｔｏｔｉｃＢａｙｅｓｉａｎ（ＦＡＢ））の理論と、マルチタスク学習（Ｍｕｌｔｉ−ＴａｓｋＬｅａｒｎｉｎｇ（ＭＴＬ））の理論とが、結合される。まず、タスクのサブセットが同じ分布に従うと仮定する。本開示の文脈では、マルチタスク学習は、同じ分布に属するタスクのクラスタを検出することと、区分的線形モデル（例えば、ＨＭＥモデル）を使用して、各クラスタをフィットさせることとを目的としている。同じクラスタ内のタスクは、互いの強度で、学習性能を向上させうる。 The gist of the present disclosure is to address the challenging task of generating piecewise linear models that use multitask learning. In the present disclosure, the theory of Factorized Asymptotic Bayesian (FAB) and the theory of multi-task learning (Multi-Task Learning (MTL)) are combined. First, assume that a subset of tasks follows the same distribution. In the context of this disclosure, multitask learning is aimed at detecting clusters of tasks belonging to the same distribution and fitting each cluster using a piecewise linear model (eg, HME model). . Tasks in the same cluster can improve learning performance with each other's strength.

図１は、本開示の１つの実施形態による区分的線形モデル生成方法のフロー図を模式的に示す。図１に示すように、区分的線形モデル生成方法は、通常、１０１で複数のタスクのデータを取得することと、１０２で複数のタスク間の相関性を表すタスク相関潜在変数を設定することと、を含み、各タスクは１つの区分的線形モデルに相関しており、１０３で複数の区分的線形モデルにモデル構造を設定し、対応するモデルパラメータと階層的潜在変数とを初期化することと、１０７で複数のタスクのデータと、タスク相関潜在変数と、階層的潜在変数とに基づいて、モデル構造とモデルパラメータとを最適化することと、を含むものとしても良い。以下で、各ステップを詳述に説明する。 FIG. 1 schematically illustrates a flow diagram of a piecewise linear model generation method according to one embodiment of the present disclosure. As shown in FIG. 1, the piecewise linear model generation method generally includes obtaining data of a plurality of tasks at 101 and setting a task correlation latent variable representing the correlation between the plurality of tasks at 102. Each task is correlated to a piecewise linear model, and at 103, a model structure is set for a plurality of piecewise linear models, and corresponding model parameters and hierarchical latent variables are initialized; and , 107 may include optimizing the model structure and model parameters based on a plurality of task data, task correlation latent variables, and hierarchical latent variables. Each step will be described in detail below.

ステップ１０１では、Ｎ個のタスクのデータを取得する。 In step 101, data of N tasks is acquired.

ステップ１０２では、複数のタスク間の相関性を表すタスク相関潜在変数が設定される。ここで、各タスクは１つの区分的線形モデルに相関している。例として、タスク相関潜在変数は、複数のタスクの各タスクと、複数の区分的線形モデルの各区分的線形モデルとの間の対応関係を表すように構成されている。言い換えれば、タスク間の相関性は、各タスクがモデルに属する確率により表されても良い。２つのタスクが同じモデルに属する場合、２つのタスクは相関すると考えられる。本開示では、各タスクは１つの区分的線形モデルのみに対応し、よって、より大きなデータを処理することができる。 In step 102, a task correlation latent variable representing the correlation between a plurality of tasks is set. Here, each task is correlated to one piecewise linear model. As an example, the task correlation latent variable is configured to represent the correspondence between each task of the plurality of tasks and each piecewise linear model of the plurality of piecewise linear models. In other words, the correlation between tasks may be represented by the probability that each task belongs to the model. Two tasks are considered correlated if they belong to the same model. In the present disclosure, each task corresponds to only one piecewise linear model, and thus larger data can be processed.

ステップ１０３では、複数の区分的線形モデルにモデル構造を設定し、それらのモデルパラメータと階層的潜在変数とを初期化する。例として、モデル構造は木構造であっても良く、初期化処理はランダムに実行されるものとしても良い。 In step 103, model structures are set in a plurality of piecewise linear models, and their model parameters and hierarchical latent variables are initialized. As an example, the model structure may be a tree structure, and the initialization process may be executed at random.

１つの代表的な実施形態によれば、区分的線形モデル生成方法は、ステップ１０３の後かつステップ１０７の前に、複数のタスクのデータと、タスク相関潜在変数と、モデル構造と、モデルパラメータとに基づいて、階層的潜在変数を最適化するステップ１０４を更に含んでも良い。階層的潜在変数は、ステップ１０３でランダムに初期化されるので、それらは、このステップで最適化されうる。階層的潜在変数は、データをどのように分割するかを示すためのものであり、そして、特徴空間は領域に分割される。各領域はローカル・エキスパートにより表現される。階層的潜在変数により領域に分割する処理では、特徴ではなくサンプルによりクラスタ化される。すなわち、どのサンプルが１つの領域に配置され、どのサンプルが他の領域に配置されるか、ということである。具体的な例では、各サンプルを、確率に応じてクラスタ化する。具体的な最適化方法では、以下で示される式（１３）で表される処理を採用しても良い。 According to one exemplary embodiment, the piecewise linear model generation method includes a plurality of task data, a task correlation latent variable, a model structure, a model parameter, after step 103 and before step 107. May further include a step 104 of optimizing the hierarchical latent variable. Since hierarchical latent variables are initialized randomly at step 103, they can be optimized at this step. The hierarchical latent variable is for indicating how to divide the data, and the feature space is divided into regions. Each region is represented by a local expert. In the process of dividing into regions by hierarchical latent variables, clustering is performed not by features but by samples. That is, which samples are placed in one area and which samples are placed in other areas. In a specific example, each sample is clustered according to probability. In a specific optimization method, a process represented by the following expression (13) may be adopted.

１つの代表的な実施形態によれば、区分的線形モデル生成方法は、ステップ１０４の後かつステップ１０７の前に、複数のタスクのデータと、階層的潜在変数と、モデル構造と、モデルパラメータとに基づいて、タスク相関潜在変数を最適化するステップ１０５を更に含んでも良い。ステップ１０２で設定されたタスク相関潜在変数は、ランダムな初期化を通じて生成されたものなので、それらは最適化される必要がある。このステップでは、複数のタスクのデータと、モデル構造と、モデルパラメータと共に、ステップ１０４で最適化された階層的潜在変数を使って、タスク相関潜在変数を最適化しても良い。具体的な最適化方法では、以下で示される式（１１）で表される処理を採用しても良い。 According to one exemplary embodiment, the piecewise linear model generation method includes a plurality of task data, a hierarchical latent variable, a model structure, a model parameter, after step 104 and before step 107. May further include a step 105 of optimizing the task correlation latent variable. Since the task correlation latent variables set in step 102 were generated through random initialization, they need to be optimized. In this step, the task correlation latent variable may be optimized using the hierarchical latent variable optimized in step 104 together with the data of a plurality of tasks, the model structure, and the model parameters. In a specific optimization method, a process represented by the following formula (11) may be adopted.

１つの代表的な実施形態によれば、区分的線形モデル生成方法は、ステップ１０５の後かつステップ１０７の前に、タスク相関潜在変数と、更新されたタスク相関潜在変数に基づいて、複数のタスクのハード・クラスタリングを実行するステップ１０６を更に含んでも良い。ハード・クラスタリングは、あるタスクに、１つのモデルのみに属すこと、すなわち、このモデルに属すか、あるいは、あのモデルに属すかのどちらかを強いる。それにより、スペース・オーバーヘッドが大きくなることを回避する。したがって、本開示による本開示の区分的線形モデルは、予測精度を高められるだけでなく、計算リソースも最適化する。したがって、それは、大きなデータ・シナリオに適合されうる。具体的な最適化方法では、以下で示される式（１２）で表される処理を採用しても良い。 According to one exemplary embodiment, the piecewise linear model generation method includes a plurality of tasks based on the task correlation latent variable and the updated task correlation latent variable after step 105 and before step 107. The method may further include a step 106 of performing the hard clustering. Hard clustering forces a task to belong to only one model, either to belong to this model or to that model. This avoids an increase in space overhead. Thus, the piecewise linear model of the present disclosure according to the present disclosure not only increases prediction accuracy, but also optimizes computational resources. It can therefore be adapted to large data scenarios. In a specific optimization method, a process represented by the following formula (12) may be adopted.

１つの代表的な実施形態によれば、ステップ１０７では、ステップ１０４で最適化された階層的潜在変数と、ステップ１０５で最適化されたタスク相関潜在変数とが、複数のタスクのデータと共に、モデル構造とモデルパラメータとを最適化するのに利用されても良い。具体的な最適化方法では、以下で示される式（１４）で表される処理を採用しても良い。 According to one exemplary embodiment, in step 107, the hierarchical latent variable optimized in step 104 and the task correlation latent variable optimized in step 105 are combined with data of a plurality of tasks, as well as a model. It may be used to optimize the structure and model parameters. In a specific optimization method, the process represented by the following formula (14) may be adopted.

１つの代表的な実施形態によれば、区分的線形モデル生成方法は、ステップ１０７の後に、モデル構造とモデルパラメータとが最適か否か判定するステップ１０８を更に含み、もし最適であれば、区分的線形モデルを出力するステップ１０９を実行し、もし最適でなければ、戻ってステップ１０４を実行するものとしても良い。上記の判定の評価指標は、モデルとデータの間のフィッティング度合であっても良い。フィッティング度合は、上限なしに増加することはできない。寧ろ上限がある。インクリメンタル処理において、最適化を継続することによる増加は、ある程度増加した後、非常に小さくなる。増加が十分に小さい場合、最適モデルが導出されたと考えられる。そこで、最適化をストップし、結果として生じた区分的線形モデルを、ステップ１０９で出力する。 According to one exemplary embodiment, the piecewise linear model generation method further includes a step 108 of determining whether the model structure and model parameters are optimal after step 107, and if Step 109 for outputting a linear model may be executed, and if it is not optimal, step 104 may be returned to and executed. The evaluation index for the determination may be a fitting degree between the model and the data. The degree of fitting cannot be increased without an upper limit. On the contrary, there is an upper limit. In incremental processing, the increase due to continuing optimization increases to some extent and then becomes very small. If the increase is small enough, an optimal model is considered to have been derived. Therefore, optimization is stopped and the resulting piecewise linear model is output at step 109.

図１に関して上述した特定の実施形態は、本開示による区分的線形モデルを生成する原理を示す。しかし、当業者は、本開示による区分的線形モデル生成方法は、上記特定のステップに限定されず、請求項で限定された範囲に基づくと理解するだろう。様々なニーズに基づいて、１以上のステップを削除または追加することを、当業者は容易に想像するだろう。 The particular embodiment described above with respect to FIG. 1 illustrates the principle of generating a piecewise linear model according to the present disclosure. However, one of ordinary skill in the art will appreciate that the piecewise linear model generation method according to the present disclosure is not limited to the specific steps described above, but is based on the scope limited in the claims. One skilled in the art will readily imagine deleting or adding one or more steps based on various needs.

次に、ステップ１０４−１０７に適用可能な最適化方法を、具体例に基づいて説明する。 Next, an optimization method applicable to steps 104 to 107 will be described based on a specific example.

まず、いくつかの記号を以下のように定義する。
Ｎ：タスクの数
Ｌ：各タスクにおけるサンプルの数
ｘ^ＮＬ，ｙ^ＮＬ：各々、トレーニング・データセットの特徴とラベル
Ｄ：データ生成のための元のモデル
Ｍ：混合におけるＨＭＥモデルの数
ｚ：タスクからモデルまでの割り当てベクトル
ζ：サンプルからＨＭＥのエキスパートまでの割り当てベクトル First, some symbols are defined as follows.
N: number of tasks L: number of samples in each task x ^NL , y ^NL : each feature and label of the training data set D: original model for data generation M: number of HME models in the mixture z: task Assignment vector ζ to model: assignment vector from sample to HME expert

ＭＴＬは、各タスクにおける、タスクのクラスタリング（ｚにより示される）とサンプルのクラスタリング（ζにより示される）を考慮に入れ、その主な関心は、周辺尤度ｐ（ｙ^ＮＬ｜ｘ^ＮＬ，Ｄ）である。したがって、ｑ（ｚ^Ｎ）とｑ（ζ^ＮＬ）に対する周辺対数尤度の下限は、以下のように導き出される。

ここで、Ｎ_ｃはｃ番目のクラスタにおけるタスクの数である。ここで、

と仮定することによって、ｚ^Ｎ _ｃに基づいてハード・クラスタリングが適用される。 MTL takes into account task clustering (denoted by z) and sample clustering (denoted by ζ) in each task, and its main interest is marginal likelihood p (y ^NL | x ^NL , D) It is. Therefore, the lower limit of the peripheral log likelihood for q (z ^N ) and q (ζ ^NL ) is derived as follows.

Here, N _c is the number of tasks in the c-th cluster. here,

, Hard clustering is applied based on z ^N _c .

次に、式（４）と式（５）とを組み合わせることにより、下限を導出しても良い。

Next, the lower limit may be derived by combining Equation (4) and Equation (5).

ｍａｘ_ｑ｛ＶＬＢ（ｑ，ｘ^ＮＬ，ｙ^ＮＬ，Ｄ）｝は、ｌｏｇｐ（ｙ^ＮＬ｜ｘ^ＮＬ，Ｄ）と一致することが保証されうる。 max _q {VLB (q, x ^NL , y ^NL , D)} can be guaranteed to match logp (y ^NL | x ^NL , D).

次に、因子化漸近近似を説明する。 Next, factorized asymptotic approximation will be described.

タスク・クラスタと、各タスク・クラスタにおけるサンプル・クラスタとの間の相互独立を仮定すると、我々は、ラプラス・メソッドで、因子化情報量基準（ＦａｃｔｏｒｉｚｅｄＩｎｆｏｒｍａｔｉｏｎＣｒｉｔｅｒｉｏｎ（ＦＩＣ））をほぼ得る。

ここで、ρ＝（α，ρ_１，・・・，ρ_Ｍ）であり、αは各クラスタの混合比率ベクトルであり、ρ_ｃ＝（β_ｃ，γ_ｃ，φ_ｃ）はｃ番目のＨＭＥモデルのパラメータであり、ｃ＝１，・・・，Ｍであり、

である。上記処理では、周辺対数尤度の近似に基づいてのみ、ＦＩＣは判定される。 Assuming mutual independence between the task clusters and the sample clusters in each task cluster, we approximately obtain a Factorized Information Criterion (FIC) with the Laplace method.

Here, ρ = (α, ρ ₁ ,..., Ρ _M ), α is a mixture ratio vector of each cluster, and ρ _c = (β _c , γ _c , φ _c ) is the c-th HME. Model parameters, c = 1,..., M,

It is. In the above processing, the FIC is determined only based on the approximation of the peripheral log likelihood.

次に、上記区分的線形モデル生成方法で使用されうるＦＡＢアルゴリズムを説明する。 Next, a FAB algorithm that can be used in the piecewise linear model generation method will be described.

ＦＡＢアルゴリズムは、実際にはＦＩＣの漸近的一致の下限を最大にする。下限は、以下のように導き出される。

ここで、

であり、ｑｎｇ_ｉとｑｎｅ_ｉとは、新しいパラメータ（分布）ｑ^〜を有する、ｎｇ_ｉとｎｅ_ｉとの近似である。

The FAB algorithm actually maximizes the lower bound of the FIC asymptotic agreement. The lower limit is derived as follows.

here,

In it, the QNG _i and qne _i, a new parameter (distribution) having ^{q ~,} approximation of ng _i and ne _i.

それから、次の最適化問題が生じる。

Then the following optimization problem arises.

ＦＡＢアルゴリズムは、ｑとθに関して、Ｅ−ステップとＭ−ステップで、最大化を交互に行う。Ｅ−ステップは、以下のように、変分分布ｑを最適化する。

ここで、

である。 The FAB algorithm alternately performs maximization in E-step and M-step with respect to q and θ. The E-step optimizes the variation distribution q as follows.

here,

It is.

上記したように、式（１１）は、ステップ１０５で、タスク相関潜在変数を最適化するのに使用されうる。 As described above, equation (11) may be used at step 105 to optimize task correlation latent variables.

それから、各タスクに、ハード・クラスタリングが適用される。

Then, hard clustering is applied to each task.

上記したように、式（１２）は、ステップ１０６で、複数のタスクにハード・クラスタリングを実行するのに使用されうる。 As described above, equation (12) may be used at step 106 to perform hard clustering for multiple tasks.

Ｅ−ステップは、ｃ番目のモデルに対し、ｑ（ζ^ｎｌ _ｃｊ）を最適化する。

ここで、

であり、

は、ｘ_ｎｌが、（β_ｃｉ，γ_ｃｉ）によりパラメータ化される所定のＨＭＥモデルのエキスパートｊ及びｉの左サブツリーに属する確率である。 The E-step optimizes q (ζ ^nl _cj ) for the c th model.

here,

And

Is the probability that x _nl belongs to the left subtree of experts j and i of a given HME model parameterized by (β _ci , γ _ci ).

上記したように、式（１３）は、ステップ１０４で、階層的潜在変数の最適化を実現しうる。 As described above, equation (13) may achieve hierarchical latent variable optimization at step 104.

Ｍ−ステップは、以下のように、パラメータを最適化する。

The M-step optimizes the parameters as follows.

上記したように、式（１４）は、ステップ１０７で、モデル構造とモデルパラメータとの最適化を実行するのに使用されうる。 As described above, equation (14) can be used in step 107 to perform optimization of the model structure and model parameters.

図２は、本開示の１つの実施形態による区分的線形モデル生成システムのブロック図を模式的に示す。図２に見られるように、区分的線形モデル生成システムは、一般に、複数のタスクのデータを取得するように構成された取得モジュール２０１と、複数のタスク間の相関性を表すタスク相関潜在変数を設定するように構成された第１の設定モジュール２０２と、を含み、各タスクは１つの区分的線形モデルに相関しており、複数の区分的線形モデルにモデル構造を設定し、対応するモデルパラメータと階層的潜在変数とを初期化するように構成された第２の設定モジュール２０３と、複数のタスクのデータと、タスク相関潜在変数と、階層的潜在変数とに基づいて、モデル構造とモデルパラメータとを最適化するように構成されたモデル最適化モジュール２０７と、を含んでも良い。以下で、各モジュールを詳細に説明する。 FIG. 2 schematically illustrates a block diagram of a piecewise linear model generation system according to one embodiment of the present disclosure. As seen in FIG. 2, the piecewise linear model generation system generally includes an acquisition module 201 configured to acquire data for a plurality of tasks, and a task correlation latent variable representing the correlation between the plurality of tasks. A first setting module 202 configured to set, wherein each task is correlated to one piecewise linear model, setting a model structure for the plurality of piecewise linear models, and corresponding model parameters Model structure and model parameters based on the second setting module 203 configured to initialize and hierarchical latent variables, data of a plurality of tasks, task correlation latent variables, and hierarchical latent variables And a model optimization module 207 configured to optimize. Each module will be described in detail below.

取得モジュール２０１では、Ｎ個のタスクのデータが取得されても良い。 In the acquisition module 201, data of N tasks may be acquired.

第１の設定モジュール２０２では、複数のタスク間の相関性を表すタスク相関潜在変数が設定され、各タスクは１つの区分的線形モデルに相関している。例として、タスク相関潜在変数は、複数のタスクのうちの各タスクと、複数の区分的線形モデルのうちの各区分的線形モデルとの間の対応関係を表すように構成されている。言い換えれば、タスク間の相関性は、各タスクがモデルに属する確率により表されうる。２つのタスクが同じモデルに属する場合、当該２つのタスクは関連があると考えられる。本開示では、各タスクが、１つの区分的線形モデルのみに対応するので、より大きいデータを処理することができる。 In the first setting module 202, a task correlation latent variable representing the correlation between a plurality of tasks is set, and each task is correlated to one piecewise linear model. As an example, the task correlation latent variable is configured to represent a correspondence between each task of the plurality of tasks and each piecewise linear model of the plurality of piecewise linear models. In other words, the correlation between tasks can be represented by the probability that each task belongs to the model. If two tasks belong to the same model, the two tasks are considered related. In this disclosure, each task corresponds to only one piecewise linear model, so larger data can be processed.

第２の設定モジュール２０３では、複数の区分的線形モデルにモデル構造が設定され、それらのモデルパラメータと階層的潜在変数とが初期化される。例として、モデル構造は木構造であっても良く、初期化処理はランダムに実行されても良い。 In the second setting module 203, model structures are set in a plurality of piecewise linear models, and their model parameters and hierarchical latent variables are initialized. As an example, the model structure may be a tree structure, and the initialization process may be executed at random.

１つの代表的な実施形態によれば、区分的線形モデル生成システムは、複数のタスクのデータと、タスク相関潜在変数と、モデル構造と、モデルパラメータとに基づいて、階層的潜在変数を最適化するように構成された階層的潜在変数最適化モジュール２０４を更に含んでも良い。階層的潜在変数は、第２の設定モジュール２０３でランダムに初期化されるので、それらは、このステップで、最適化されうる。階層的潜在変数は、データをどのように分割するかを示すためのものであり、そして、特徴空間は領域に分割される。各領域はローカル・エキスパートにより表現される。階層的潜在変数により領域に分割する処理では、特徴ではなくサンプルによりクラスタ化される。すなわち、どのサンプルが１つの領域に配置され、どのサンプルが他の領域に配置されるか、ということである。具体的な例では、各サンプルを、確率に応じてクラスタ化する。具体的な最適化方法では、上記した式（１３）で表された処理を採用しても良い。 According to one exemplary embodiment, the piecewise linear model generation system optimizes hierarchical latent variables based on data for multiple tasks, task correlation latent variables, model structure, and model parameters. A hierarchical latent variable optimization module 204 configured to do so may further be included. Since hierarchical latent variables are randomly initialized in the second setting module 203, they can be optimized in this step. The hierarchical latent variable is for indicating how to divide the data, and the feature space is divided into regions. Each region is represented by a local expert. In the process of dividing into regions by hierarchical latent variables, clustering is performed not by features but by samples. That is, which samples are placed in one area and which samples are placed in other areas. In a specific example, each sample is clustered according to probability. In a specific optimization method, the process represented by the above equation (13) may be employed.

１つの代表的な実施形態によれば、区分的線形モデル生成システムは、複数のタスクのデータと、階層的潜在変数と、モデル構造と、モデルパラメータとに基づいて、タスク相関潜在変数を最適化するように構成されたタスク相関潜在変数最適化モジュール２０５を更に含んでも良い。第１の設定モジュール１０２におけるタスク相関潜在変数は、ランダムな初期化により生成されているので、それらを最適化する必要がある。タスク相関潜在変数最適化モジュール２０５では、階層的潜在変数最適化モジュール２０４で最適化された階層的潜在変数を、複数のタスクのデータと、モデル構造と、モデルパラメータと共に使用して、タスク相関潜在変数を最適化しても良い。具体的な最適化方法では、上記した式（１１）で表された処理を採用しても良い。 According to one exemplary embodiment, the piecewise linear model generation system optimizes a task correlation latent variable based on data of multiple tasks, hierarchical latent variables, model structure, and model parameters. A task correlation latent variable optimization module 205 may be further included. Since the task correlation latent variables in the first setting module 102 are generated by random initialization, it is necessary to optimize them. In the task correlation latent variable optimization module 205, the hierarchical latent variable optimized by the hierarchical latent variable optimization module 204 is used together with the data of a plurality of tasks, the model structure, and the model parameters to obtain a task correlation latency. Variables may be optimized. In a specific optimization method, the process represented by the above formula (11) may be adopted.

１つの代表的な実施形態によれば、区分的線形モデル生成システムは、タスク相関潜在変数と、更新されたタスク相関潜在変数とに基づいて、複数のタスクのハード・クラスタリングを実行するように構成されたハード・クラスタリング・モジュール２０６を更に含んでも良い。ハード・クラスタリングは、あるタスクに、１つのモデルのみに属すこと、すなわち、このモデルに属すか、あるいは、あのモデルに属すかのどちらかを強いる。それにより、スペース・オーバーヘッドが大きくなることを回避する。したがって、本開示による本開示の区分的線形モデルは、予測精度を高められるだけでなく、計算リソースも最適化する。したがって、それは、大きなデータ・シナリオに適合されうる。具体的な最適化方法では、上記した式（１２）で表される処理を採用しても良い。 According to one exemplary embodiment, the piecewise linear model generation system is configured to perform hard clustering of a plurality of tasks based on the task correlation latent variable and the updated task correlation latent variable. The hard clustering module 206 may be further included. Hard clustering forces a task to belong to only one model, either to belong to this model or to that model. This avoids an increase in space overhead. Thus, the piecewise linear model of the present disclosure according to the present disclosure not only increases prediction accuracy, but also optimizes computational resources. It can therefore be adapted to large data scenarios. In a specific optimization method, the process represented by the above equation (12) may be employed.

１つの代表的な実施形態によれば、モデル最適化モジュール２０７では、階層的潜在変数最適化モジュール２０４で最適化された階層的潜在変数と、タスク相関潜在変数最適化モジュール２０５で最適化されたタスク相関潜在変数とが、複数のタスクのデータと共に、モデル構造とモデルパラメータとを最適化するのに利用されても良い。具体的な最適化方法では、上記した式（１４）で表される処理を採用しても良い。 According to one exemplary embodiment, the model optimization module 207 optimizes the hierarchical latent variables optimized by the hierarchical latent variable optimization module 204 and the task correlation latent variable optimization module 205. Task correlation latent variables may be used to optimize the model structure and model parameters along with data for multiple tasks. In a specific optimization method, the process represented by the above formula (14) may be employed.

１つの代表的な実施形態によれば、区分的線形モデル生成システムは、モデル最適化モジュール２０７により出力されたモデル構造とモデルパラメータとが最適か否か判定するように構成された判定モジュール２０８を更に含んでも良く、もし最適であれば、出力モジュール２０９が区分的線形モデルを出力し、もし最適でなければ、階層的潜在変数最適化モジュール２０４と、タスク相関潜在変数最適化モジュール２０５と、ハード・クラスタリング・モジュール２０６と、モデル最適化モジュール２０７とが、モデル構造とモデルパラメータとの最適化を継続する。上記の判定の評価指標は、モデルとデータの間のフィッティング度合であっても良い。フィッティング度合は、上限なしに増加することはできない。寧ろ上限がある。インクリメンタル処理において、最適化を継続することによる増加は、ある程度増加した後、非常に小さくなる。増加が十分に小さい場合、最適モデルが導出されたと考えられる。そこで、最適化をストップし、出力モジュール２０９は、結果として生じた区分的線形モデルを出力する。 According to one exemplary embodiment, the piecewise linear model generation system includes a determination module 208 configured to determine whether the model structure and model parameters output by the model optimization module 207 are optimal. Further, if optimal, output module 209 outputs a piecewise linear model; if not optimal, hierarchical latent variable optimization module 204, task correlation latent variable optimization module 205, hardware The clustering module 206 and the model optimization module 207 continue to optimize the model structure and model parameters. The evaluation index for the determination may be a fitting degree between the model and the data. The degree of fitting cannot be increased without an upper limit. On the contrary, there is an upper limit. In incremental processing, the increase due to continuing optimization increases to some extent and then becomes very small. If the increase is small enough, an optimal model is considered to have been derived. Therefore, optimization is stopped and the output module 209 outputs the resulting piecewise linear model.

図２で示された区分的線形モデル生成システムは、図１で示された区分的線形モデル生成方法に相当する。したがって、図１による区分的線形モデル生成方法についての上記記載は、図２で示された区分的線形モデル生成システムにも同様に当てはまるので、ここでは詳述しない。 The piecewise linear model generation system shown in FIG. 2 corresponds to the piecewise linear model generation method shown in FIG. Accordingly, the above description of the piecewise linear model generation method according to FIG. 1 applies to the piecewise linear model generation system shown in FIG. 2 as well, and will not be described in detail here.

図３は、本開示の１つの実施形態による区分的線形モデル生成方法のステップを模式的に図表で示す。図３に見られるように、まず、上の図は、複数のタスクの入力データＴ１，Ｔ２，・・・ＴＮを示す。ここで、各タスクのデータは、分離したデータの点で表される。その後、中の図では、図１で示された最適化のステップを使って、モデリングが実行されている。各タスクは、１つのモデルのみをモデリングするために使用されても良いと分かり、例えば、Ｔ１とＴ４とは、各々、モデルＨＭＥ１をモデリングするのに使用されても良く、Ｔ２はモデルＨＭＥ２をモデリングするのに使用されても良く、Ｔ３はモデルＨＭＥ３をモデリングするのに使用されても良い。最後に、複数の最適化されたモデル、例えば、エキスパートの階層的な混合、ＨＭＥ（ＴＩ），ＨＭＥ（Ｔ２），・・・，ＨＭＥ（ＴＮ）が、下の図で出力される。 FIG. 3 schematically illustrates the steps of a piecewise linear model generation method according to one embodiment of the present disclosure. As shown in FIG. 3, first, the above figure shows input data T1, T2,. Here, the data of each task is represented by separated data points. Thereafter, in the middle figure, modeling is performed using the optimization steps shown in FIG. Each task is understood to be used to model only one model, for example, T1 and T4 may each be used to model model HME1, and T2 models model HME2. T3 may be used to model the model HME3. Finally, a plurality of optimized models, such as a hierarchical mixture of experts, HME (TI), HME (T2),..., HME (TN) are output in the diagram below.

本発明によれば、区分的線形モデルの生成を促進するために、モデル生成プロセス中に、タスク間の相関性を表すために、新しい潜在変数が追加されるので、複数のタスクのデータをモデリングに利用可能であり、それにより、モデル生成の精度を増す。また、各タスクが、１つの区分的線形モデルのみに関連して設定されるので、本開示による区分的線形モデル生成システム及び生成方法は、より大きなデータを処理することができる。 According to the present invention, in order to facilitate the generation of piecewise linear models, new latent variables are added during the model generation process to represent the correlation between tasks, thus modeling the data of multiple tasks. To increase the accuracy of model generation. Also, since each task is set in association with only one piecewise linear model, the piecewise linear model generation system and generation method according to the present disclosure can process larger data.

本発明の実施形態は、ソフトウェア、又はハードウェア、又はそれらの組み合わせで実現されうるという点に更に留意すべきである。ハードウェア部分は、特別なロジックにより実現可能であり、ソフトウェア部分は、メモリに格納され、マイクロプロセッサ又は専用に設計されたハードウェアなどの適切な指示実行システムにより実行されうる。上記方法及びシステムは、コンピュータ実行可能な指示、及び／又は、例えば、磁気ディスク又はＣＤ又はＤＶＤ−ＲＯＭのようなベアラ媒体、又は読出し専用メモリ（ファームウェア）又は光学又は電子信号ベアラ等のデータ・ベアラのようなプログラム可能メモリで供給されるコードといった、プロセッサに含まれた制御コードで実現されうる、と通常の当業者は理解しうる。本実施形態における装置及びその構成領域は、例えば、非常に大規模な集積回路又はゲートアレイ、又は論理チップ又はトランジスタのような半導体、又はフィールド・プログラマブル・ゲートアレイのようなプログラム可能なハードウェア・デバイス、又はプログラム可能な論理デバイスといったハードウェア電気回路構成により実現されても良く、又は様々な種類のプロセッサにより実行されるソフトウェアにより実現されても良く、又は例えばファームウェアといった上記ハードウェア電気回路構成とソフトウェアとの組み合わせ、により実現されても良い。 It should further be noted that embodiments of the present invention may be implemented in software, hardware, or a combination thereof. The hardware portion can be implemented with special logic, and the software portion can be stored in memory and executed by a suitable instruction execution system such as a microprocessor or specially designed hardware. The method and system may comprise computer-executable instructions and / or a bearer medium such as, for example, a magnetic disk or CD or DVD-ROM, or a read-only memory (firmware) or a data bearer such as an optical or electronic signal bearer. Those of ordinary skill in the art will appreciate that the code may be implemented with control code included in the processor, such as code supplied in a programmable memory such as The device and its constituent areas in this embodiment include, for example, very large scale integrated circuits or gate arrays, semiconductors such as logic chips or transistors, or programmable hardware such as field programmable gate arrays. A hardware electrical circuit configuration such as a device, or a programmable logic device, or may be implemented by software executed by various types of processors, or the hardware electrical circuit configuration described above, eg, firmware It may be realized by a combination with software.

本発明による方法の動作は、図面中に特定の順序で表されているが、これは、それらの動作が当該特定の順序で実行されなければならないということを、又は望ましい結果は示された全ての動作が実行されてのみ成し遂げられうるということを、要求するものでも又は含むものでもない。寧ろ、フロー図で示されるステップは、異なる順序で実行されても良い。その上に、又は、代わりに、いくつかのステップを省略しても良く、及び/又は実行のために複数のステップを１つのステップに統合しても良く、及び/又は実行のために１つのステップを複数のステップに分解しても良い。 The operations of the method according to the invention are represented in a particular order in the drawings, which means that they must be performed in that particular order, or all the desired results have been shown. It does not require or include that the above operations can only be accomplished. Rather, the steps shown in the flow diagram may be performed in a different order. Additionally or alternatively, some steps may be omitted and / or multiple steps may be combined into one step for execution and / or one step for execution A step may be decomposed into a plurality of steps.

本発明を、目下考慮に入れた実施形態に関して記述したが、本発明は、開示された実施形態に限定されないと理解するべきである。それどころか、添付の請求項の趣旨及び範囲内に入る様々な修正及び等価な並べ替えに及ぶことを、本発明は意図する。添付の請求の範囲は、最も広い解釈に一致し、そのような修正と、等価な構造及び機能の全てに及ぶ。 Although the present invention has been described in terms of presently contemplated embodiments, it should be understood that the invention is not limited to the disclosed embodiments. On the contrary, the invention is intended to cover various modifications and equivalent permutations falling within the spirit and scope of the appended claims. The scope of the appended claims is consistent with the broadest interpretation and covers all such modifications and equivalent structures and functions.

（付記１）
複数のタスクのデータを取得するように構成された取得モジュールと、
前記複数のタスク間の相関性を表すタスク相関潜在変数を設定するように構成された第１の設定モジュールと、
を含み、
各タスクは１つの区分的線形モデルに相関しており、
複数の区分的線形モデルにモデル構造を設定し、対応するモデルパラメータと階層的潜在変数とを初期化するように構成された第２の設定モジュールと、
前記複数のタスクの前記データと、前記タスク相関潜在変数と、前記階層的潜在変数とに基づいて、前記モデル構造と前記モデルパラメータとを最適化するように構成されたモデル最適化モジュールと、
を含む区分的線形モデル生成システム。 (Appendix 1)
An acquisition module configured to acquire data for multiple tasks;
A first setting module configured to set a task correlation latent variable representing the correlation between the plurality of tasks;
Including
Each task is correlated to a piecewise linear model,
A second setting module configured to set a model structure for a plurality of piecewise linear models and initialize corresponding model parameters and hierarchical latent variables;
A model optimization module configured to optimize the model structure and the model parameters based on the data of the plurality of tasks, the task correlation latent variables, and the hierarchical latent variables;
A piecewise linear model generation system including

（付記２）
前記タスク相関潜在変数は、前記複数のタスクの各タスクと、前記複数の区分的線形モデルの各区分的線形モデルとの間の対応関係を示すように構成されている付記１に記載の区分的線形モデル生成システム。 (Appendix 2)
The piecewise correlation variable of claim 1, wherein the task correlation latent variable is configured to indicate a correspondence relationship between each task of the plurality of tasks and each piecewise linear model of the plurality of piecewise linear models. Linear model generation system.

（付記３）
前記モデル構造は、木構造を含む付記１に記載の区分的線形モデル生成システム。 (Appendix 3)
The piecewise linear model generation system according to attachment 1, wherein the model structure includes a tree structure.

（付記４）
前記複数のタスクの前記データと、前記タスク相関潜在変数と、前記モデル構造と、前記モデルパラメータとに基づいて、前記階層的潜在変数を最適化するように構成された階層的潜在変数最適化モジュールを更に含む付記１に記載の区分的線形モデル生成システム。 (Appendix 4)
A hierarchical latent variable optimization module configured to optimize the hierarchical latent variable based on the data of the plurality of tasks, the task correlation latent variable, the model structure, and the model parameter. The piecewise linear model generation system according to claim 1, further comprising:

（付記５）
前記複数のタスクのデータと、前記階層的潜在変数と、前記モデル構造と、前記モデルパラメータとに基づいて、前記タスク相関潜在変数を最適化するように構成されたタスク相関潜在変数最適化モジュールを更に含む付記１に記載の区分的線形モデル生成システム。 (Appendix 5)
A task correlation latent variable optimization module configured to optimize the task correlation latent variable based on the data of the plurality of tasks, the hierarchical latent variable, the model structure, and the model parameter; The piecewise linear model generation system according to appendix 1, further including:

（付記６）
前記タスク相関潜在変数を最適化した後に、前記タスク相関潜在変数と、更新された前記タスク相関潜在変数とに基づいて、前記複数のタスクのハード・クラスタリングを実行するように構成されたハード・クラスタリング・モジュールを更に含む付記５に記載の区分的線形モデル生成システム。 (Appendix 6)
Hard clustering configured to perform hard clustering of the plurality of tasks based on the task correlation latent variable and the updated task correlation latent variable after optimizing the task correlation latent variable The piecewise linear model generation system according to appendix 5, further including a module.

（付記７）
前記モデル構造と前記モデルパラメータとが最適か否か判定するように構成された判定モジュールを更に含む付記１に記載の区分的線形モデル生成システム。 (Appendix 7)
The piecewise linear model generation system of claim 1, further comprising a determination module configured to determine whether the model structure and the model parameters are optimal.

（付記８）
前記判定モジュールは、前記モデル構造と前記モデルパラメータとの間のフィッティング度合に基づいて、前記モデル構造と前記モデルパラメータとが最適か否か判定する付記７に記載の区分的線形モデル生成システム。 (Appendix 8)
The piecewise linear model generation system according to appendix 7, wherein the determination module determines whether or not the model structure and the model parameter are optimal based on a fitting degree between the model structure and the model parameter.

（付記９）
複数のタスクのデータを取得することと、
前記複数のタスク間の相関性を表すタスク相関潜在変数を設定することと、
を含み、
各タスクは１つの区分的線形モデルに相関しており、
複数の区分的線形モデルにモデル構造を設定し、対応するモデルパラメータと階層的潜在変数とを初期化することと、
前記複数のタスクの前記データと、前記タスク相関潜在変数と、前記階層的潜在変数とに基づいて、前記モデル構造と前記モデルパラメータとを最適化することと、
を含む区分的線形モデル生成方法。 (Appendix 9)
Getting data for multiple tasks,
Setting a task correlation latent variable representing the correlation between the plurality of tasks;
Including
Each task is correlated to a piecewise linear model,
Setting up model structures for multiple piecewise linear models and initializing corresponding model parameters and hierarchical latent variables;
Optimizing the model structure and the model parameters based on the data of the plurality of tasks, the task correlation latent variables, and the hierarchical latent variables;
A piecewise linear model generation method including:

（付記１０）
前記タスク相関潜在変数は、前記複数のタスクの各タスクと、前記複数の区分的線形モデルの各区分的線形モデルとの間の対応関係を示すように構成されている付記９に記載の区分的線形モデル生成方法。 (Appendix 10)
The piecewise piecewise variable of claim 9, wherein the task correlation latent variable is configured to indicate a correspondence relationship between each task of the plurality of tasks and each piecewise linear model of the plurality of piecewise linear models. Linear model generation method.

（付記１１）
前記モデル構造は、木構造を含む付記９に記載の区分的線形モデル生成方法。 (Appendix 11)
The piecewise linear model generation method according to appendix 9, wherein the model structure includes a tree structure.

（付記１２）
前記複数のタスクの前記データと、前記タスク相関潜在変数と、前記モデル構造と、前記モデルパラメータとに基づいて、前記階層的潜在変数を最適化することを更に含む付記９に記載の区分的線形モデル生成方法。 (Appendix 12)
The piecewise linear of clause 9, further comprising optimizing the hierarchical latent variable based on the data of the plurality of tasks, the task correlation latent variable, the model structure, and the model parameter. Model generation method.

（付記１３）
前記複数のタスクの前記データと、前記階層的潜在変数と、前記モデル構造と、前記モデルパラメータとに基づいて、前記タスク相関潜在変数を最適化することを更に含む付記９に記載の区分的線形モデル生成方法。 (Appendix 13)
The piecewise linear of clause 9, further comprising optimizing the task correlation latent variable based on the data of the plurality of tasks, the hierarchical latent variable, the model structure, and the model parameter. Model generation method.

（付記１４）
前記タスク相関潜在変数を最適化した後に、前記タスク相関潜在変数と、更新された前記タスク相関潜在変数とに基づいて、前記複数のタスクのハード・クラスタリングを実行することを更に含む付記１３に記載の区分的線形モデル生成方法。 (Appendix 14)
14. The method of claim 13, further comprising performing hard clustering of the plurality of tasks based on the task correlation latent variable and the updated task correlation latent variable after optimizing the task correlation latent variable. A piecewise linear model generation method.

（付記１５）
前記モデル構造と前記モデルパラメータとが最適か否か判定することを更に含む付記９に記載の区分的線形モデル生成方法。 (Appendix 15)
The piecewise linear model generation method according to appendix 9, further comprising determining whether or not the model structure and the model parameters are optimal.

（付記１６）
前記モデル構造と前記モデルパラメータとが最適か否か前記判定することは、前記モデル構造と前記モデルパラメータとの間のフィッティング度合に基づいて、前記モデル構造と前記モデルパラメータとが最適か否か判定することを含む付記１５に記載の区分的線形モデル生成方法。 (Appendix 16)
The determination as to whether or not the model structure and the model parameter are optimal is based on a fitting degree between the model structure and the model parameter, whether or not the model structure and the model parameter are optimal The piecewise linear model generation method according to supplementary note 15, including:

Claims

An acquisition module configured to acquire data for multiple tasks;
A first setting module configured to set a task correlation latent variable representing the correlation between the plurality of tasks;
Including
Each task is correlated to a piecewise linear model,
A second setting module configured to set a model structure for a plurality of piecewise linear models and initialize corresponding model parameters and hierarchical latent variables;
A model optimization module configured to optimize the model structure and the model parameters based on the data of the plurality of tasks, the task correlation latent variables, and the hierarchical latent variables;
A piecewise linear model generation system including

The category of claim 1, wherein the task correlation latent variable is configured to indicate a correspondence relationship between each task of the plurality of tasks and each piecewise linear model of the plurality of piecewise linear models. Linear model generation system.

The piecewise linear model generation system according to claim 1, wherein the model structure includes a tree structure.

A hierarchical latent variable optimization module configured to optimize the hierarchical latent variable based on the data of the plurality of tasks, the task correlation latent variable, the model structure, and the model parameter. The piecewise linear model generation system according to claim 1, further comprising:

A task correlation latent variable optimization module configured to optimize the task correlation latent variable based on the data of the plurality of tasks, the hierarchical latent variable, the model structure, and the model parameter; The piecewise linear model generation system according to claim 1, further comprising:

Hard clustering configured to perform hard clustering of the plurality of tasks based on the task correlation latent variable and the updated task correlation latent variable after optimizing the task correlation latent variable 6. The piecewise linear model generation system of claim 5, further comprising a module.

The piecewise linear model generation system of claim 1, further comprising a determination module configured to determine whether the model structure and the model parameters are optimal.

The piecewise linear model generation system according to claim 7, wherein the determination module determines whether the model structure and the model parameter are optimal based on a fitting degree between the model structure and the model parameter.

Getting data for multiple tasks,
Setting a task correlation latent variable representing the correlation between the plurality of tasks;
Including
Each task is correlated to a piecewise linear model,
Setting up model structures for multiple piecewise linear models and initializing corresponding model parameters and hierarchical latent variables;
Optimizing the model structure and the model parameters based on the data of the plurality of tasks, the task correlation latent variables, and the hierarchical latent variables;
A piecewise linear model generation method including: