JP2004030628A

JP2004030628A - Information processing apparatus and method, program storage medium, and program

Info

Publication number: JP2004030628A
Application number: JP2003132606A
Authority: JP
Inventors: Masato Ito; 伊藤　真人; Atsushi Tani; 谷　淳
Original assignee: Sony Corp; RIKEN Institute of Physical and Chemical Research
Current assignee: Sony Corp; RIKEN Institute of Physical and Chemical Research
Priority date: 2002-05-10
Filing date: 2003-05-12
Publication date: 2004-01-29

Abstract

<P>PROBLEM TO BE SOLVED: To produce a novel pattern which is not learnt. <P>SOLUTION: Data x<SB>t</SB>corresponding to a prescribed time series pattern are inputted to an input layer 11 of a recurrent neural network 1, and a predictive value x*<SB>t+1</SB>is obtained from an output layer 13. A difference between data x<SB>t+1</SB>as a teacher and the predictive value x*<SB>t+1</SB>is learnt by a back propagation method, and a weighting coefficient of the intermediate layer 12 is set to a prescribed value. After a plurality of time series patterns are learnt, a parameter different from a value during learning is inputted to a parametric bias node 11-2, and a non-learnt time series pattern corresponding to the parameter is produced from the output layer 13. The invention is applied to a robot. <P>COPYRIGHT: (C)2004,JPO

Description

【０００１】
【発明の属する技術分野】
本発明は、情報処理装置および方法、プログラム格納媒体、並びにプログラムに関し、特に、学習していない新規なパターンを出力することができるようにした情報処理装置および方法、プログラム格納媒体、並びにプログラムに関する。
【０００２】
【従来の技術】
最近、人間や動物の脳に関する研究が盛んに行われている。脳のモデルとして、ニューラルネットワークを用いることができることが知られている。
【０００３】
【発明が解決しようとする課題】
しかしながら、ニューラルネットワークにおいては、所定のパターンを学習させると、その学習したパターンを識別することが可能であるが、新たなパターンを生成することができない課題があった。
【０００４】
本発明は、このような状況に鑑みてなされたものであり、学習していない新規なパターンを生成することができるようにするものである。
【０００５】
【課題を解決するための手段】
本発明の情報処理装置は、時系列パターンを入力する入力手段と、入力手段から入力された複数の時系列パターンのそれぞれについて、１つ以上の外部から操作可能な特徴量パラメータを有する共通の非線形力学系によるモデルを決定するモデル決定手段と、決定されたモデルに基づいて、特徴量パラメータの値を算出する演算手段と、特徴量パラメータとして、演算手段により算出された値とは異なる値を設定し、特徴量パラメータの値を算出した演算の逆演算を行うことにより、新たな時系列パターンを出力する出力手段とを有することを特徴とする。
【０００６】
上記非線形力学系は、操作パラメータ付きリカレント型ニューラルネットワークであるようにすることができる。
【０００７】
上記特徴量パラメータは、時系列パターンの非線形力学系における力学構造を表すようにすることができる。
【０００８】
上記出力手段は、入力された複数の時系列パターンと共有可能な力学構造を有する新たな時系列パターンを出力するようにすることができる。
【０００９】
本発明の情報処理方法は、時系列パターンを入力する入力ステップと、入力ステップの処理で入力された複数の時系列パターンのそれぞれについて、１つ以上の外部から操作可能な特徴量パラメータを有する共通の非線形力学系によるモデルを決定するモデル決定ステップと、決定されたモデルに基づいて、特徴量パラメータの値を算出する演算ステップと、特徴量パラメータとして、演算ステップの処理により算出された値とは異なる値を設定し、特徴量パラメータの値を算出した演算の逆演算を行うことにより、新たな時系列パターンを出力する出力ステップとを含むことを特徴とする。
【００１０】
本発明のプログラム格納媒体のプログラムは、時系列パターンを入力する入力ステップと、入力ステップの処理で入力された複数の時系列パターンのそれぞれについて、１つ以上の外部から操作可能な特徴量パラメータを有する共通の非線形力学系によるモデルを決定するモデル決定ステップと、決定されたモデルに基づいて、特徴量パラメータの値を算出する演算ステップと、特徴量パラメータとして、演算ステップの処理により算出された値とは異なる値を設定し、特徴量パラメータの値を算出した演算の逆演算を行うことにより、新たな時系列パターンを出力する出力ステップとを含むことを特徴とする。
【００１１】
本発明のプログラムは、時系列パターンを入力する入力ステップと、入力ステップの処理で入力された複数の時系列パターンのそれぞれについて、１つ以上の外部から操作可能な特徴量パラメータを有する共通の非線形力学系によるモデルを決定するモデル決定ステップと、決定されたモデルに基づいて、特徴量パラメータの値を算出する演算ステップと、特徴量パラメータとして、演算ステップの処理により算出された値とは異なる値を設定し、特徴量パラメータの値を算出した演算の逆演算を行うことにより、新たな時系列パターンを出力する出力ステップとを含むことを特徴とする。
【００１２】
本発明の情報処理装置および方法、プログラム格納媒体、並びにプログラムにおいては、入力された時系列パターンに対応する新たな時系列パターンが出力される。
【００１３】
【発明の実施の形態】
図１は、本発明を適用したリカレント型ニューラルネットワークの構成例を表している。このリカレント型ニューラルネットワーク（ＲＮＮ）１は、入力層１１、中間層（隠れ層）１２、および出力層１３により構成されている。これらの入力層１１、中間層１２、および出力層１３は、それぞれ任意の数のニューロンにより構成されている。
【００１４】
入力層１１の一部のニューロン１１−１には、時系列パターンに関するデータｘ_ｔが入力される。具体的には例えば、カメラ画像等を基に画像処理により得られる人間の身体運動パターン（例えば、手先位置の運動軌道等）などの時系列パターンに関するデータである。Ｐ_ｔはベクトルであり次元は時系列パターンにより任意である。入力層１１の一部のニューロンであるパラメトリックバイアスノード１１−２には、パラメータＰ_ｔが入力される。パラメトリック・バイアス・ノードの数は、１つ以上である。そのノード数は、リカレント・ニューラル・ネットを構成し、かつ、モデル決定手段のパラメータであるウェイト・マトリックスの数を決定するニューロンの総数に対して、十分に小さいことが望ましい。本実施の形態では、前記ニューロン総数約５０個に対して、パラメトリック・バイアス・ノードの数は約１〜２個である。ただし、本願発明がこの数に限定されないことは言うまでもない。パラメトリックバイアスノードは、非線形力学系における力学構造をモジュレーションするものであり、本実施の形態においては、リカレント型ニューラルネットワークが保持する力学構造をモジュレーションする働きをするノードである。ただし、本発明がリカレント型ニューラルネットワークに限定されるものではない。さらに、入力層１１の一部のニューロン１１−３には、出力層１３の一部のニューロン１３−２より出力されたデータが、ＲＮＮ１の内部の状態を表すコンテキストＣｔとしてフィードバックされている。コンテキストＣｔについては、リカレント型ニューラルネットワークに関する一般的用語であり、参考文献（Ｅｌｍａｎ，　Ｊ．Ｌ．　（１９９０）．　Ｆｉｎｄｉｎｇ　ｓｔｒｕｃｔｕｒｅ　ｉｎ　ｔｉｍｅ．　Ｃｏｇｎｉｔｉｖｅ
Ｓｃｉｅｎｃｅ，　１４，　１７９−２１１）等を参照されたい。
【００１５】
中間層１２のニューロンは、入力されたデータに対して重み付け加算処理を行い、順次後段に出力する処理を実行する。すなわち、データｘ_ｔ，Ｐ_ｔ，ｃ_ｔに対して所定の重み付け係数に対する演算処理（非線形関数に基づく演算処理）を行った後、出力層１３に出力する。本実施の形態では例えば、データｘ_ｔ，Ｐ_ｔ，ｃ_ｔの所定の重み付け和の入力に対して、シグモイド関数等の非線形出力特性を有する関数に基づく演算処理を行った後、出力層１３に出力する。
【００１６】
出力層１３を構成する一部のニューロン１３−１は、入力データに対応するデータｘ^＊ _ｔ＋１を出力する。
【００１７】
また、ＲＮＮ１は、バックプロパケーションによる学習のため、演算器２１を有している。演算部２２は、ＲＮＮ１に対する重み付け係数の設定処理を行う。
【００１８】
次に、図２のフローチャートを参照して、ＲＮＮ１の学習処理について説明する。
【００１９】
図２のフローチャートに示される処理は、学習させる時系列パターン毎に実行される。換言すれば、学習する時系列パターンの数だけ仮想的なＲＮＮが用意され、各仮想ＲＮＮ毎に図２の処理が実行される。
【００２０】
仮想的なＲＮＮ毎に図２のフローチャートに示される処理が実行され、仮想ＲＮＮ毎に時系列パターンが学習された後、実際のＲＮＮ１に対して、係数を設定する処理が実行される。ただし、以下の説明では、仮想的なＲＮＮも、実際のＲＮＮ１として説明する。
【００２１】
最初に、ステップＳ１１において、ＲＮＮ１の入力層１１のニューロン１１−１は、所定の時刻ｔの入力ｘ_ｔを取り込む。ステップＳ１２において、ＲＮＮ１の中間層１２は、入力ｘ_ｔに対して、重み付け係数に対応する演算処理を行い、出力層１３のニューロン１３−１から、入力された時系列パターンにおける時系列ｔ＋１の値の予測値ｘ^＊ _ｔ＋１を出力する。
【００２２】
ステップＳ１３において、演算部２１は、次の時刻ｔ＋１の入力ｘ_ｔ＋１を教師データとして取り込む。ステップＳ１４において、演算部２１は、ステップＳ１３の処理で取り込んだ教師入力ｘ_ｔ＋１と、ステップＳ１２の処理で演算して得た予測値ｘ^＊ _ｔ＋１の誤差を演算する。
【００２３】
ステップＳ１５において、ＲＮＮ１は、ステップＳ１４の処理で演算して得た誤差を出力層１３のニューロン１３−１から入力し、中間層１２、さらに入力層１１の順に伝搬することで、学習処理を行い、演算結果ｄＸ_ｂｐｔを得る。
【００２４】
ステップＳ１６において、中間層１２は、式（１）に基づいて、内部状態の修正値ｄＸＵを得る。
【００２５】
【数１】

【００２６】
さらに、中間層１２は、式（２）乃至式（４）に基づいて、修正値ｄＸＵを修正する。
【数２】

【数３】

【００２７】
ステップＳ１７において、パラメトリックノード１１−２は、その内部状態の値を保存する処理を実行する。
【００２８】
次に、ステップＳ１８において、ＲＮＮ１は、学習処理を終了するか否かを判定し、まだ学習処理を終了しない場合には、ステップＳ１１に戻り、それ以降の処理を繰り返し実行する。
【００２９】
ステップＳ１８において、学習処理を終了すると判定された場合、学習処理が終了される。
【００３０】
以上のような学習処理を行うことで、仮想ＲＮＮに対して１つの時系列パターンが学習される。
【００３１】
以上のようにして、学習パターンの数に対応する仮想ＲＮＮの学習処理が行われた後、その学習処理により得られた重み付け係数を、実ＲＮＮ３１に設定する処理が行われる。図３は、この場合の処理を表している。
【００３２】
演算部２２は、ステップＳ２１において、仮想ＲＮＮ毎に図２のフローチャートに示される処理を実行した結果得られた係数の合成値を演算する。この合成値としては、例えば、平均値を用いることができる。すなわち、各仮想ＲＮＮの重み付け係数の平均値がここで演算される。
【００３３】
次に、ステップＳ２２において、演算部２２は、ステップＳ２１の処理で演算した合成値（平均値）を実ＲＮＮ１のニューロンに対して、重み付け係数として設定する処理を実行する。
【００３４】
これにより、実際のＲＮＮ１の中間層１２の各ニューロンに、複数の時系列パターンを学習して得た係数が設定されることになる。
【００３５】
中間層１２の各ニューロンの重み付け係数には、複数の教示時系列パターンを生成する上で、共有可能な力学構造に関する情報が保持され、パラメトリックバイアスノードには、共有可能な力学構造を各教示時系列パターンの生成に適した力学構造に切り替えるために、必要な情報が保持されることになる。ここで、「共有可能な力学構造」をさらに例をあげて説明する。例えば、図４Ａ乃至図４Ｃに示されるように、振幅は異なるが周期が同じ時系列パターンＡと時系列パターンＢが入力された場合には、出力時系列パターンＣの周期が共有可能な力学構造にあたり、また、図５Ａ乃至図５Ｃに示されるように、周期は異なるが振幅は同じ時系列パターンＡと時系列パターンＢが入力された場合には、出力時系列パターンＣの振幅が共有可能な力学構造に該当する。ただし、本願発明がこれに限定されるものでないのは言うまでもない。
【００３６】
例えば、図６に示されるように、第１のデータを入力し、学習させることで、比較的大きな振幅を有する曲線Ｌ１で表される時系列パターンが学習される。
【００３７】
同様に、図７に示されるように、第２のデータを入力し、学習させることで、曲線Ｌ２で示される、比較的小さい振幅を有する時系列パターンが学習される。
【００３８】
このような時系列パターンの学習を行った後、ＲＮＮ１に新たな時系列パターンを発生させる場合、図８のフローチャートに示されるような処理が実行される。
【００３９】
すなわち、最初に、ステップＳ３１において、パタメトリックバイアスノード１１−２は、学習時と異なるパラメータを入力する。ステップＳ３２において、中間層１２は、ステップＳ３１の処理で、パタメトリックバイアスノード１１−２に入力されたパラメータに対して、重み付け係数に基づく演算を行う。具体的には、学習時にパラメータの値を算出するときに行った演算の逆演算にあたる。そして、ステップＳ３３において、ＲＮＮ１のニューロン１３−１は、ステップＳ３１の処理で入力されたパラメータに対応するパターンを出力する。
【００４０】
図９は、ＲＮＮ１に対して、図６と図７に示される時系列パターンを学習させた後、ＲＮＮ１のパラメトリックバイアスノード１１−２にパラメータＰ_ｔとしてパラメータＰ_Ｎを入力した場合の例を表している。このパラメータＰ_Ｎは、図６のパターン学習時においてパラメトリックバイアスノード１１−２に出力されるパラメータＰ_Ａ、並びに図７に示される時系列パターン学習時に出力されるパラメータＰ_Ｂとは異なる値とされている。すなわち、この例の場合、パラメータＰ_Ｎの値は、パラメータＰ_ＡとパラメータＰ_Ｂの値の中間の値とされている。
【００４１】
このような場合、出力層１３のニューロン１３−１から出力される時系列パターンは、図９に曲線Ｌ３で示される時系列パターンとなる。この曲線Ｌ３の振幅は、図６に示される時系列パターンＡの曲線Ｌ１の振幅より小さく、図７に示される時系列パターンＢの曲線Ｌ２の振幅より大きい値となっている。換言すれば、曲線Ｌ３の振幅は、曲線Ｌ１の振幅と曲線Ｌ２の振幅の中間の値となっている。すなわち、この例においては、図６と図７に示される曲線Ｌ１と曲線Ｌ２の中間の曲線Ｌ３が線形的に補間されたことになる。
【００４２】
以下に、実験の結果について、図１０乃至図２８を参照して説明する。
【００４３】
図１０乃至図１３は、第１の時系列パターンを学習させた場合のエラー（図１０）、ターゲット（入力データ）（図１１）、出力（図１２）、およびパラメトリックバイアス（パラメータ）（図１３）を、それぞれ表している。各図の縦軸は、それぞれの値（正規化された値）を表し、横軸は、ステップを表している。
【００４４】
第１のパターン学習時においては、図１０に示されるように、エラーは直線Ｌ１１で示されるように、ほぼ０．０である。図１１に示されるように、ターゲット（入力パターン）は、直線Ｌ１２で示される正弦波である。
【００４５】
図１１に示されるターゲットに対応する出力は、図１２に曲線Ｌ１３で示されるように、ターゲットとしての曲線Ｌ１２にほぼ対応する曲線（正弦波）となっている。
【００４６】
図１３に示されるように、２つのパラメトリックバイアス（パラメータ）の値の一方は、曲線Ｌ１４で示されるように、ほぼ０．３７の値で収束し、他方は、曲線Ｌ１５で示されるように、ほぼ０．０で一定となっている。
【００４７】
図１４乃至図１７は、第２の時系列パターンを学習した場合におけるエラー（図１４）、ターゲット（図１５）、出力（図１６）、およびパラメトリックバイアス（パラメータ（図１７））をそれぞれ表している。
【００４８】
図１４に示されるように、エラーは、直線Ｌ２１で示されるように、ほぼ０．０で一定である。ターゲットは、図１５に曲線Ｌ２２で示されるように、ほぼ正弦波状の時系列パターンとなっている。図１５は、図１１と比較して明かなように、曲線Ｌ２２の振幅は、曲線Ｌ１２の振幅とほぼ同一であるが、曲線Ｌ２２の周期は、曲線Ｌ１２の周期より長くなっている（周波数が低くなっている）。
【００４９】
図１５に示されるターゲットに対応して、図１６に示されるように、曲線Ｌ２２にほぼ対応する曲線２３の出力が得られている。
【００５０】
図１７に示されるように、第２の時系列パターンを学習した場合における２つのパラメトリックバイアスのうちの一方の値は、曲線Ｌ２４で示されるように、ほぼ０．３６の値に収束し、他方の値は、曲線Ｌ２５で示されるように、０．６７の値にほぼ収束する。
【００５１】
図１８乃至図２１は、第３の時系列パターンを学習した場合におけるエラー（図１８）、ターゲット（図１９）、出力（図２０）、およびパラメトリックバイアス（図２１）を、それぞれ表している。
【００５２】
図１８に示されるように、学習時におけるエラーは、曲線Ｌ３１で示されるように、ほぼ０．０である。ターゲットは、曲線Ｌ３２で示されるように、ほぼ正弦波状の信号とされる。曲線Ｌ３２を図１１の曲線Ｌ１２、および図１５の曲線Ｌ２２と比較して明かなように、曲線Ｌ３２の振幅は、曲線Ｌ１２および曲線Ｌ２２の振幅とほぼ同一であるが、その周期は、曲線Ｌ２２における場合よりさらに長くなっている（周波数はＬ２２における場合より低くなっている）。
【００５３】
図２０に示されるように、出力の曲線Ｌ３３は、図１９のターゲットの曲線ＬＬ３２にほぼ対応した正弦波状となっている。
【００５４】
図２１に示されるように、第３の時系列パターンを学習した場合におけるパラメトリックバイアスの値の一方の値は、曲線Ｌ３４で示されるように、ほぼ０．２１となり、他方の値は、曲線Ｌ３５で示されるように、１．００でほぼ一定となっている。
【００５５】
図１１、図１５、および図１９に示されるような３つの時系列パターンをＲＮＮ１に学習させた後、ＲＮＮ１のパラメトリックバイアスノード１１−２にパラメトリックバイアス（パラメータ）として（０．３６，０３６）の２つの値を入力した場合、出力層１３を構成するニューロン１３−１から、図２２に示されるように、曲線Ｌ４１で示される時系列パターンが出力された。
【００５６】
同様に、パラメータとして（０．８０，０．２５）の値を入力したとき、図２３に示されるように、曲線Ｌ５１のパターンが得られた。比較のために、図２４乃至図２６に、図１１、図１５、および図１９に示されるターゲットの時系列パターンを示している。
【００５７】
これらの図を比較して明かなように、図２２の曲線Ｌ４１の振幅は、図２４乃至図２６の曲線Ｌ１３，Ｌ２３，Ｌ３３の新福とほぼ同一である。しかしながら、曲線Ｌ４１の周期は、図２４の曲線Ｌ１３の周期より長く、図２５の曲線Ｌ２２の周期より短くなっている。
【００５８】
これは、図２２のパラメータ（０．３６，０．３６）が、図２４のパラメータ（０．００，０．３７）と図２５のパラメータ（０．６７，０．３６）の中間の値であることに起因する。
【００５９】
また、図２３の曲線Ｌ５１は、その振幅は、図２４乃至図２６の曲線Ｌ１３，Ｌ２３，Ｌ３３とほぼ同一であるが、その周期は、図２５の曲線Ｌ２２のそれより長く、図２６の曲線Ｌ３２のそれより短い値となっている。
【００６０】
これは、図２３のパラメータ（０．８０，０．２５）の値が、図２５のパラメータ（０．６７，０．３６）と図２６のパラメータ（１．００，０．０１）の中間の値であることに起因する。
【００６１】
図２７は、パラメトリックバイアスのうちの一方の値を横軸に、他方の値を縦軸に取った場合の周期のグラフを表している。同図における点Ａ１，Ａ２，Ａ３，Ｂ１，Ｂ２は、それぞれ図２４（Ａ１）、図２５（Ａ２）、図２６（Ａ３）、図２２（Ｂ１）、図２３（Ｂ２）における、それぞれのパラメータの値をプロットしたものである。図２７において、同じ濃度は、同じ周期であることを表している。色の差がグラデーションしていることは、周期がパラメトリックバイアスの値に応じて、滑らかに変化していることを意味する。
【００６２】
図２８は、２つのパラメトリックバイアスに対応する振幅の変化を表している。この図からも明かなように、パラメトリックバイアスに対応して、振幅がほぼ滑らかに変化している。
【００６３】
以上においては、線形的な時系列パターンを学習させるようにしたが、非線形の時系列パターンを学習させた場合にも、同様の結果が得られた。
【００６４】
すなわち、図２９に示されるように、曲線Ｌを６１で示される時系列パターンと、図３０に示されるように、曲線Ｌ６２で示される時系列パターンとを、ＲＮＮ１に学習させた後、図３１に示されるように、パラメータとして、図２９の曲線Ｌ６１を学習したとき得られるパラメータＰ_Ｃと、図３０の曲線Ｌ６２のパターンを学習させた場合に得られるパターンＰ_Ｄと異なるパラメータＰ_Ｍを、ＲＮＮ１のパラメトリックバイアスノード１１−２に入力すると、曲線Ｌ６３で示されるように、図２９の曲線Ｌ６１、並びに図３０に示される曲線Ｌ６２のいずれとも異なる新規なパターンを生成することができる。
【００６５】
以下に、具体的な実験例について説明する。
【００６６】
図３２乃至図３５は、第４のパターンを学習させた場合におけるエラー（図３２）、ターゲット（図３３）、出力（図３４）、およびパラメトリックバイアス（図３５）を、それぞれ表している。
【００６７】
第４の時系列パターン学習時におけるエラーは、曲線Ｌ７１で示されるように、ほぼ０．０である。ターゲットは、曲線Ｌ７２で示されるように、正弦波とされる。対応する出力は、曲線Ｌ７３で示される。この曲線Ｌ７３は、ほぼ曲線Ｌ７２に対応している。
【００６８】
この場合に得られるパラメトリックバイアスのうちの一方の値は、曲線Ｌ７４で示されるように、１．００であり、他方の値は、曲線Ｌ７５で示されるように、０．７９である。
【００６９】
図３６乃至図３９は、第５のパターン学習時におけるエラー（図３６）、ターゲット（図３７）、出力（図３８）、およびパラメトリックバイアス（図３９）を、それぞれ表している。
【００７０】
図３６に示されるように、学習時におけるエラーは、曲線Ｌ８１で示されるように、ほぼ０．０である。
【００７１】
ターゲットは、図３７の曲線Ｌ８２で示されるように、図３３の曲線Ｌ７２で示される綺麗な正弦波とは異なる変形した（歪んだ）正弦波となっている。
【００７２】
図３８に示されるように、図３７で示されるターゲットに対応する曲線Ｌ８３で示される出力が得られている。
【００７３】
この学習時において、得られるパラメトリックバイアス値は、一方の値は、曲線Ｌ８４で示されるように、０．１２に収束し、他方の値は、曲線Ｌ８５で示されるように、ほぼ０．６０で収束している。
【００７４】
ＲＮＮ１に対して、図３３と図３７に示される２つの時系列パターンを学習させた後、パラメトリックバイアスノード１１−２に、パラメトリックバイアス（０．１２，０．６８）を入力した場合、図４０に曲線Ｌ９１で示される出力が得られた。
【００７５】
同様に、パラメトリックバイアス（０．２５，０．５９）を与えた場合、図４１に曲線Ｌ９２で示される出力が得られ、パラメトリックバイアス（０．５０，０．６０）を与えた場合、図４２に曲線Ｌ９３で示される出力が得られた。
【００７６】
図４０乃至図４２の出力と比較するために、パラメトリックバイアス（１．００，０．７９）またはパラメトリックバイアス（０．１２，０．６０）を与えた場合の出力が、図４３と図４４に示されている。すなわち、図４３の出力は、図３４の出力と同一であり、図４４の出力は、図３８の出力と同一である。
【００７７】
このように、この例の場合においては、非線形的な演算処理がパラメータに対応して行われていることになる。
【００７８】
図４５は、パラメトリックバイアスに対応する周期の変化を表している。図４６は、同様に、パラメトリックバイアスに対応する振幅の変化を表している。これらの図においても、図２７と図２８における場合と同様に、同一の濃度は、同一の周期または振幅を表している。また、図４５と図４６において、Ａ１，Ａ２，Ｂ１，Ｂ２，Ｂ３は、それぞれ図４３、図４４、図３０、図４１、および図４２のパラメータをプロットしたものである。
【００７９】
図４５と図４６からも明かなように、非線形の演算処理が行われる場合にも、パラメータに対応して、ほぼ滑らかに周期と振幅が変化している。
【００８０】
このように、複数の時系列パターンを学習させた後、所定のパラメータを入力することで、学習させた時系列パターンとは異なる新たな時系列パターンを、線形的または非線形的補間により生成、出力することができる。
【００８１】
なお、パラメトリックバイアスは、ＲＮＮ１が生成する時空間パターンを切り替えるパラメータである。このパラメータと、それにより生成される時空間パターンとの関係は、予め与えられるものではなく、教示時空間パターンの学習により決定される。
【００８２】
本発明は、例えば、ヒューマロイド型のロボットにおける「ものまね運動学習」に応用することができる。これにより、ロボットが人から教示された複数の運動パターン（ダンスなど）を学習し、それらの運動を再現することができるだけでなく、複数の教示運動パターンの間に共通する特徴を抽出し、それに基づく新規な運動パターンを生成することが可能となる。
【００８３】
上述した一連の処理は、ハードウエアにより実行させることもできるが、ソフトウエアにより実行させることもできる。この場合、例えば、図４７に示されるようなパーソナルコンピュータ１６０が用いられる。
【００８４】
図４７において、ＣＰＵ（Ｃｅｎｔｒａｌ　Ｐｒｏｃｅｓｓｉｎｇ　Ｕｎｉｔ）１６１は、ＲＯＭ（Ｒｅａｄ　ＯｎｌｙＭｅｍｏｒｙ）１６２に記憶されているプログラム、または記憶部１６８からＲＡＭ（Ｒａｎｄｏｍ　Ａｃｃｅｓｓ　Ｍｅｍｏｒｙ）１６３にロードされたプログラムに従って各種の処理を実行する。ＲＡＭ１６３にはまた、ＣＰＵ１６１が各種の処理を実行する上において必要なデータなども適宜記憶される。
【００８５】
ＣＰＵ１６１、ＲＯＭ１６２、およびＲＡＭ１６３は、バス１６４を介して相互に接続されている。このバス１６４にはまた、入出力インタフェース１６５も接続されている。
【００８６】
入出力インタフェース１６５には、キーボード、マウスなどよりなる入力部１６６、ＣＲＴ，ＬＣＤなどよりなるディスプレイ、並びにスピーカなどよりなる出力部１６７、ハードディスクなどより構成される記憶部１６８、モデム、ターミナルアダプタなどより構成される通信部１６９が接続されている。通信部１６９は、ネットワークを介しての通信処理を行う。
【００８７】
入出力インタフェース１６５にはまた、必要に応じてドライブ１７０が接続され、磁気ディスク１７１、光ディスク１７２、光磁気ディスク１７３、或いは半導体メモリ１７４などが適宜装着され、それらから読み出されたコンピュータプログラムが、必要に応じて記憶部１６８にインストールされる。
【００８８】
一連の処理をソフトウエアにより実行させる場合には、そのソフトウエアを構成するプログラムが、パーソナルコンピュータ１６０に、ネットワークや記録媒体からインストールされる。
【００８９】
この記録媒体は、図４７に示されるように、装置本体とは別に、ユーザにプログラムを提供するために配布される、プログラムが記録されている磁気ディスク１７１（フロッピディスクを含む）、光ディスク１７２（ＣＤ−ＲＯＭ（Ｃｏｍｐａｃｔ　Ｄｉｓｋ−Ｒｅａｄ　Ｏｎｌｙ　Ｍｅｍｏｒｙ），ＤＶＤ（Ｄｉｇｉｔａｌ　Ｖｅｒｓａｔｉｌｅ　Ｄｉｓｋ）を含む）、光磁気ディスク１７３（ＭＤ（Ｍｉｎｉ−Ｄｉｓｋ）を含む）、もしくは半導体メモリ１７４などよりなるパッケージメディアにより構成されるだけでなく、装置本体に予め組み込まれた状態でユーザに提供される、プログラムが記録されているＲＯＭ１６２や、記憶部１６８に含まれるハードディスクなどで構成される。
【００９０】
なお、本明細書において、記録媒体に記録されるプログラムを記述するステップは、記載された順序に沿って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理をも含むものである。
【００９１】
また、本明細書において、システムとは、複数の装置により構成される装置全体を表すものである。
【００９２】
【発明の効果】
以上の如く、本発明によれば、時系列パターンを発生させることができる。また、発生させる時系列パターンを、新たな時系列パターンとすることができる。
【図面の簡単な説明】
【図１】本発明を適用したリカレント型ニューラルネットワークの構成を示す図である。
【図２】図１のリカレント型ニューラルネットワークの学習処理を説明するフローチャートである。
【図３】図１のリカレント型ニューラルネットワークの係数設定処理を説明するフローチャートである。
【図４】振幅が異なり、同期が同じ時系列パターンの例を示す図である。
【図５】周期が異なり、振幅が同じ時系列パターンの例を示す図である。
【図６】学習パターンの例を示す図である。
【図７】学習パターンの例を示す図である。
【図８】図１のリカレント型ニューラルネットワークの時系列パターン生成処理を説明するフローチャートである。
【図９】生成する時系列パターンの例を示す図である。
【図１０】第１のパターンを学習させる場合におけるエラーの変化を示す図である。
【図１１】第１のパターンを学習させる場合におけるターゲットを示す図である。
【図１２】第１のパターンを学習させる場合における出力を示す図である。
【図１３】第１のパターンを学習させる場合におけるパラメトリックバイアスの変化を示す図である。
【図１４】第２のパターンを学習させる場合におけるエラーの変化を示す図である。
【図１５】第２のパターンを学習させる場合におけるターゲットを示す図である。
【図１６】第２のパターンを学習させる場合における出力を示す図である。
【図１７】第２のパターンを学習させる場合におけるパラメトリックバイアスの変化を示す図である。
【図１８】第３のパターンを学習させる場合におけるエラーの変化を示す図である。
【図１９】第３のパターンを学習させる場合におけるターゲットを示す図である。
【図２０】第３のパターンを学習させる場合における出力を示す図である。
【図２１】第３のパターンを学習させる場合におけるパラメトリックバイアスの変化を示す図である。
【図２２】生成したパターンの例を示す図である。
【図２３】生成したパターンの他の例を示す図である。
【図２４】図１２に対応する出力を示す図である。
【図２５】図１６に対応する出力を示す図である。
【図２６】図２０に対応する出力を示す図である。
【図２７】図２２乃至図２６のパターンの周期の変化とパラメトリックバイアスの関係を示す図である。
【図２８】図２２乃至図２６のパターンの振幅の変化とパラメトリックバイアスの関係を示す図である。
【図２９】学習させるパターンの例を示す図である。
【図３０】学習させるパターンの他の例を示す図である。
【図３１】図２９と図３０に示すパターンを学習させた場合に生成されるパターンを示す図である。
【図３２】第４のパターンを学習させる場合におけるエラーの変化を示す図である。
【図３３】第４のパターンを学習させる場合におけるターゲットを示す図である。
【図３４】第４のパターンを学習させる場合における出力を示す図である。
【図３５】第４のパターンを学習させる場合におけるパラメトリックバイアスの変化を示す図である。
【図３６】第５のパターンを学習させる場合におけるエラーの変化を示す図である。
【図３７】第５のパターンを学習させる場合におけるターゲットを示す図である。
【図３８】第５のパターンを学習させる場合における出力を示す図である。
【図３９】第５のパターンを学習させる場合におけるパラメトリックバイアスの変化を示す図である。
【図４０】生成されたパターンの例を示す図である。
【図４１】生成されたパターンの例を示す図である。
【図４２】生成されたパターンの例を示す図である。
【図４３】図３４に対応する出力を示す図である。
【図４４】図３８に対応する出力を示す図である。
【図４５】図４０乃至図４４のパターンの周期のパラメトリックバイアスとの関係を示す図である。
【図４６】図４０乃至図４４のパターンの振幅とパラメトリックバイアスとの関係を示す図である。
【図４７】本発明を適用したパーソナルコンピュータの構成例を示すブロック図である。
【符号の説明】
１　リカレント型ニューラルネットワーク，　１１　入力層，　１２　中間層，　１３　出力層，　２１　演算部[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to an information processing apparatus and method, a program storage medium, and a program, and more particularly, to an information processing apparatus and method, a program storage medium, and a program capable of outputting a new pattern that has not been learned.
[0002]
[Prior art]
Recently, research on the brains of humans and animals has been actively conducted. It is known that a neural network can be used as a brain model.
[0003]
[Problems to be solved by the invention]
However, in a neural network, when a predetermined pattern is learned, it is possible to identify the learned pattern, but there is a problem that a new pattern cannot be generated.
[0004]
The present invention has been made in view of such a situation, and is intended to generate a new pattern that has not been learned.
[0005]
[Means for Solving the Problems]
An information processing apparatus according to the present invention includes an input unit for inputting a time-series pattern, and a common non-linear having at least one externally operable feature parameter for each of the plurality of time-series patterns input from the input unit. Model determining means for determining a model based on a dynamical system, calculating means for calculating a value of a feature parameter based on the determined model, and setting a value different from the value calculated by the calculating means as the feature parameter And output means for outputting a new time-series pattern by performing an inverse operation of the operation of calculating the value of the feature parameter.
[0006]
The above-mentioned nonlinear dynamic system may be a recurrent neural network with operation parameters.
[0007]
The feature parameter may represent a dynamic structure of the time-series pattern in the nonlinear dynamic system.
[0008]
The output means can output a new time-series pattern having a dynamic structure that can be shared with a plurality of input time-series patterns.
[0009]
An information processing method according to the present invention provides an input step of inputting a time series pattern, and a common step having at least one externally operable feature parameter for each of the plurality of time series patterns input in the processing of the input step. A model determining step of determining a model based on the nonlinear dynamical system, a calculating step of calculating a value of the feature parameter based on the determined model, and a value calculated by the processing of the calculating step as the feature parameter. An output step of outputting a new time-series pattern by setting a different value and performing an inverse operation of the operation for calculating the value of the feature parameter.
[0010]
The program of the program storage medium of the present invention includes an input step of inputting a time-series pattern, and one or more externally operable feature amount parameters for each of the plurality of time-series patterns input in the processing of the input step. A model determining step of determining a model based on a common nonlinear dynamical system having a calculation step of calculating a value of a feature parameter based on the determined model; and a value calculated by the processing of the calculation step as a feature parameter. An output step of outputting a new time-series pattern by setting a value different from the above and performing an inverse operation of the operation of calculating the value of the feature parameter.
[0011]
A program according to the present invention includes an input step of inputting a time-series pattern, and a common non-linear function having one or more externally operable feature parameters for each of the plurality of time-series patterns input in the processing of the input step. A model determining step of determining a model based on a dynamical system, an operation step of calculating a value of a feature parameter based on the determined model, and a value different from the value calculated by the processing of the operation step as the feature parameter And performing an inverse operation of the operation of calculating the value of the feature parameter, thereby outputting a new time-series pattern.
[0012]
In the information processing apparatus and method, the program storage medium, and the program according to the present invention, a new time series pattern corresponding to the input time series pattern is output.
[0013]
BEST MODE FOR CARRYING OUT THE INVENTION
FIG. 1 shows a configuration example of a recurrent neural network to which the present invention is applied. The recurrent neural network (RNN) 1 includes an input layer 11, an intermediate layer (hidden layer) 12, and an output layer 13. Each of the input layer 11, the intermediate layer 12, and the output layer 13 includes an arbitrary number of neurons.
[0014]
Some neurons 11-1 of the input layer 11, the data _{x t} is input about the time series pattern. More specifically, the data is, for example, data relating to a time-series pattern such as a human body movement pattern (for example, a movement trajectory of a hand position) obtained by image processing based on a camera image or the like. _Pt is a vector, and the dimension is arbitrary according to the time series pattern. The parameter _Pt is input to the parametric bias node 11-2 which is a part of the neuron of the input layer 11. The number of parametric bias nodes is one or more. It is desirable that the number of nodes is sufficiently smaller than the total number of neurons that form the recurrent neural net and determine the number of weight matrices that are parameters of the model determining means. In the present embodiment, the number of parametric bias nodes is about 1-2 for the total number of about 50 neurons. However, it goes without saying that the present invention is not limited to this number. The parametric bias node modulates a dynamic structure in a nonlinear dynamic system. In the present embodiment, the parametric bias node is a node that functions to modulate a dynamic structure held by a recurrent neural network. However, the present invention is not limited to a recurrent neural network. Further, to some neurons 11-3 in the input layer 11, data output from some neurons 13-2 in the output layer 13 is fed back as a context Ct representing an internal state of the RNN1. The context Ct is a general term for a recurrent neural network and is described in a reference (Elman, JL (1990). Finding structure in time. Cognitive.
Science, 14, 179-211).
[0015]
The neuron of the intermediate layer 12 performs a weighted addition process on the input data, and sequentially executes a process of outputting the data to a subsequent stage. That is, the data x _t , P _t , and _ct are subjected to arithmetic processing (operation processing based on a nonlinear function) for a predetermined weighting coefficient, and then output to the output layer 13. In this embodiment, for example, data x _t, P _t, for the input of a predetermined weighted sum of c _t, after the calculation processing based on a function having a nonlinear output characteristic such as a sigmoid function, the output layer 13 Output.
[0016]
Some neurons 13-1 forming the output layer 13 output data x ^* _{t + 1} corresponding to the input data.
[0017]
Further, the RNN 1 has an arithmetic unit 21 for learning by back propagation. The calculation unit 22 performs a setting process of a weighting coefficient for the RNN1.
[0018]
Next, the learning process of the RNN 1 will be described with reference to the flowchart of FIG.
[0019]
The process shown in the flowchart of FIG. 2 is executed for each time-series pattern to be learned. In other words, as many virtual RNNs as the number of time-series patterns to be learned are prepared, and the process in FIG. 2 is executed for each virtual RNN.
[0020]
The process shown in the flowchart of FIG. 2 is executed for each virtual RNN, and after a time-series pattern is learned for each virtual RNN, a process for setting a coefficient is executed for the actual RNN 1. However, in the following description, the virtual RNN is also described as the actual RNN1.
[0021]
First, in step S11, the neuron 11-1 of the input layer 11 of RNN1 takes in the input _{x t} of a predetermined time t. In step S12, the intermediate layer 12 of RNN1 on the input _{x t,} performs arithmetic processing corresponding to the weighting coefficients from the neurons 13-1 of the output layer 13, the time series t + 1 in the time series pattern input value Is output as x ^* _{t + 1} .
[0022]
In step S13, the calculation unit 21 takes in the input xt + 1 at the next time _{t + 1} as teacher data. In step S14, the calculation unit 21 calculates an error between the teacher input _{xt + 1} captured in the processing in step S13 and the predicted value x ^* _{t + 1} calculated in the processing in step S12.
[0023]
In step S15, the RNN 1 performs a learning process by inputting the error obtained by the calculation in step S14 from the neuron 13-1 of the output layer 13 and transmitting the error in the order of the intermediate layer 12 and the input layer 11. , And the operation result dX _bpt is obtained.
[0024]
In step S16, the intermediate layer 12 obtains the correction value dXU of the internal state based on equation (1).
[0025]
(Equation 1)

[0026]
Further, the intermediate layer 12 corrects the correction value dXU based on Expressions (2) to (4).
(Equation 2)

[Equation 3]

[0027]
In step S17, the parametric node 11-2 executes a process of storing the value of the internal state.
[0028]
Next, in step S18, the RNN 1 determines whether or not to end the learning process. If the learning process has not been ended yet, the process returns to step S11 and repeats the subsequent processes.
[0029]
If it is determined in step S18 that the learning process is to be ended, the learning process is ended.
[0030]
By performing the learning process as described above, one time-series pattern is learned for the virtual RNN.
[0031]
As described above, after the learning process of the virtual RNN corresponding to the number of the learning patterns is performed, the process of setting the weighting coefficient obtained by the learning process to the real RNN 31 is performed. FIG. 3 shows the processing in this case.
[0032]
In step S21, the calculation unit 22 calculates a composite value of coefficients obtained as a result of executing the processing illustrated in the flowchart of FIG. 2 for each virtual RNN. As the combined value, for example, an average value can be used. That is, the average value of the weighting coefficients of each virtual RNN is calculated here.
[0033]
Next, in step S22, the calculation unit 22 performs a process of setting the composite value (average value) calculated in the process of step S21 as a weighting factor for the neuron of the real RNN1.
[0034]
As a result, coefficients obtained by learning a plurality of time-series patterns are set in each neuron of the intermediate layer 12 of the actual RNN 1.
[0035]
The weighting coefficient of each neuron in the intermediate layer 12 holds information about a sharable dynamic structure when generating a plurality of teaching time-series patterns, and a parametric bias node stores a sharable dynamic structure at each teaching time. In order to switch to a dynamic structure suitable for generating a sequence pattern, necessary information is held. Here, the “sharable mechanical structure” will be described with further examples. For example, as shown in FIGS. 4A to 4C, when the time series pattern A and the time series pattern B having different amplitudes but the same cycle are input, the dynamic structure in which the cycle of the output time series pattern C can be shared. 5A to 5C, when the time series pattern A and the time series pattern B having different periods but the same amplitude are input, the amplitude of the output time series pattern C can be shared. It corresponds to a mechanical structure. However, it goes without saying that the present invention is not limited to this.
[0036]
For example, as shown in FIG. 6, by inputting and learning the first data, a time-series pattern represented by a curve L1 having a relatively large amplitude is learned.
[0037]
Similarly, as shown in FIG. 7, by inputting and learning the second data, a time-series pattern having a relatively small amplitude represented by a curve L2 is learned.
[0038]
When a new time-series pattern is generated in the RNN 1 after learning such a time-series pattern, a process as shown in the flowchart of FIG. 8 is executed.
[0039]
That is, first, in step S31, the parameter bias node 11-2 inputs parameters different from those at the time of learning. In step S32, the intermediate layer 12 performs an operation based on the weighting coefficient for the parameter input to the parameter bias node 11-2 in the process of step S31. Specifically, it corresponds to the inverse operation of the operation performed when calculating the parameter value during learning. Then, in step S33, the neuron 13-1 of the RNN 1 outputs a pattern corresponding to the parameter input in the processing in step S31.
[0040]
9, with respect RNN1, after training the time series pattern shown in FIGS. 6 and 7, shows an example of entering the parameter _{P N} parametrically bias node 11-2 RNN1 as a parameter _{P t} ing. The parameter P _N is a value different from the parameter P _B, which is output at the time of pattern learning 6 parameters P _A to be output to the parametric bias node _11-2, and the time series pattern during learning shown in FIG. 7 ing. That is, in this case, the value of the parameter P _N, are an intermediate value of values of the parameters P _A and the parameter P _B.
[0041]
In such a case, the time-series pattern output from the neuron 13-1 of the output layer 13 is a time-series pattern indicated by a curve L3 in FIG. The amplitude of the curve L3 is smaller than the amplitude of the curve L1 of the time series pattern A shown in FIG. 6 and larger than the amplitude of the curve L2 of the time series pattern B shown in FIG. In other words, the amplitude of the curve L3 is an intermediate value between the amplitude of the curve L1 and the amplitude of the curve L2. That is, in this example, the intermediate curve L3 between the curves L1 and L2 shown in FIGS. 6 and 7 is linearly interpolated.
[0042]
Hereinafter, the results of the experiment will be described with reference to FIGS.
[0043]
FIGS. 10 to 13 show errors (FIG. 10), targets (input data) (FIG. 11), outputs (FIG. 12), and parametric biases (parameters) (FIG. 13) when the first time-series pattern is learned. ) Respectively. The vertical axis of each figure represents each value (normalized value), and the horizontal axis represents a step.
[0044]
At the time of the first pattern learning, as shown in FIG. 10, the error is almost 0.0 as shown by the straight line L11. As shown in FIG. 11, the target (input pattern) is a sine wave indicated by a straight line L12.
[0045]
The output corresponding to the target shown in FIG. 11 is a curve (sine wave) almost corresponding to the curve L12 as the target, as shown by the curve L13 in FIG.
[0046]
As shown in FIG. 13, one of the two parametric bias (parameter) values converges at a value of approximately 0.37, as shown by curve L14, and the other, as shown by curve L15, It is almost constant at 0.0.
[0047]
14 to 17 show errors (FIG. 14), targets (FIG. 15), outputs (FIG. 16), and parametric biases (parameters (FIG. 17)) when the second time-series pattern is learned, respectively. I have.
[0048]
As shown in FIG. 14, the error is substantially constant at 0.0, as indicated by the straight line L21. The target has a substantially sinusoidal time-series pattern as indicated by a curve L22 in FIG. As is clear from FIG. 11, the amplitude of the curve L22 is almost the same as the amplitude of the curve L12, but the cycle of the curve L22 is longer than the cycle of the curve L12 (when the frequency is Lower).
[0049]
As shown in FIG. 16, corresponding to the target shown in FIG. 15, an output of a curve 23 substantially corresponding to the curve L22 is obtained.
[0050]
As shown in FIG. 17, one of the two parametric biases when the second time-series pattern is learned converges to a value of approximately 0.36, as shown by the curve L24, and the other one. Value substantially converges to a value of 0.67 as shown by the curve L25.
[0051]
FIGS. 18 to 21 show an error (FIG. 18), a target (FIG. 19), an output (FIG. 20), and a parametric bias (FIG. 21), respectively, when the third time-series pattern is learned.
[0052]
As shown in FIG. 18, the error at the time of learning is approximately 0.0 as shown by the curve L31. The target is a substantially sinusoidal signal as indicated by the curve L32. As is clear from the comparison of the curve L32 with the curve L12 of FIG. 11 and the curve L22 of FIG. 15, the amplitude of the curve L32 is almost the same as the amplitude of the curve L12 and the curve L22, but the period is the same as the curve L22 (The frequency is lower than in L22).
[0053]
As shown in FIG. 20, the output curve L33 has a sine waveform substantially corresponding to the target curve LL32 of FIG.
[0054]
As shown in FIG. 21, when the third time-series pattern is learned, one of the values of the parametric bias is approximately 0.21, as shown by the curve L34, and the other value is the curve L35. As shown by, it is almost constant at 1.00.
[0055]
After making the RNN 1 learn three time-series patterns as shown in FIGS. 11, 15, and 19, a parametric bias (parameter) of (0.36, 036) is applied to the parametric bias node 11-2 of the RNN 1. When two values are input, the neuron 13-1 forming the output layer 13 outputs a time-series pattern indicated by a curve L41 as shown in FIG.
[0056]
Similarly, when a value of (0.80, 0.25) was input as a parameter, a pattern of a curve L51 was obtained as shown in FIG. For comparison, FIGS. 24 to 26 show the time-series patterns of the targets shown in FIGS. 11, 15, and 19. FIG.
[0057]
As is clear from comparison of these figures, the amplitude of the curve L41 in FIG. 22 is almost the same as that of the curves L13, L23, and L33 in FIGS. However, the cycle of the curve L41 is longer than the cycle of the curve L13 in FIG. 24 and shorter than the cycle of the curve L22 in FIG.
[0058]
This is because the parameters (0.36, 0.36) in FIG. 22 are intermediate values between the parameters (0.00, 0.37) in FIG. 24 and the parameters (0.67, 0.36) in FIG. Due to being.
[0059]
The curve L51 in FIG. 23 has substantially the same amplitude as the curves L13, L23, and L33 in FIGS. 24 to 26, but has a longer period than that of the curve L22 in FIG. The value is shorter than that of L32.
[0060]
This is because the value of the parameter (0.80, 0.25) in FIG. 23 is intermediate between the parameter (0.67, 0.36) in FIG. 25 and the parameter (1.00, 0.01) in FIG. Value.
[0061]
FIG. 27 shows a graph of the period when one value of the parametric bias is plotted on the horizontal axis and the other value is plotted on the vertical axis. Points A1, A2, A3, B1, and B2 in the same figure are the respective parameters in FIGS. 24 (A1), 25 (A2), 26 (A3), 22 (B1), and 23 (B2), respectively. Is plotted. In FIG. 27, the same density indicates the same period. The gradation of the color difference means that the period changes smoothly in accordance with the value of the parametric bias.
[0062]
FIG. 28 shows a change in amplitude corresponding to two parametric biases. As is clear from this figure, the amplitude changes almost smoothly in response to the parametric bias.
[0063]
In the above, a linear time-series pattern is learned, but the same result was obtained when a nonlinear time-series pattern was learned.
[0064]
That is, as shown in FIG. 29, after the RNN 1 learns the time series pattern indicated by the curve L 61 as shown in FIG. 30 and the time series pattern indicated by the curve L 62 as shown in FIG. as shown in, as a parameter, and the parameter P _C obtained when the learning curve L61 in FIG. 29, the different parameters P _M pattern P _D obtained when train the pattern of the curve L62 of Fig. 30, When input to the parametric bias node 11-2 of the RNN1, a new pattern different from both the curve L61 in FIG. 29 and the curve L62 shown in FIG. 30 can be generated as shown by the curve L63.
[0065]
Hereinafter, specific experimental examples will be described.
[0066]
FIGS. 32 to 35 respectively show an error (FIG. 32), a target (FIG. 33), an output (FIG. 34), and a parametric bias (FIG. 35) when the fourth pattern is learned.
[0067]
The error at the time of learning the fourth time-series pattern is substantially 0.0 as shown by the curve L71. The target is a sine wave as shown by a curve L72. The corresponding output is shown by curve L73. This curve L73 substantially corresponds to the curve L72.
[0068]
One value of the parametric bias obtained in this case is 1.00 as shown by the curve L74, and the other value is 0.79 as shown by the curve L75.
[0069]
FIGS. 36 to 39 show an error (FIG. 36), a target (FIG. 37), an output (FIG. 38), and a parametric bias (FIG. 39), respectively, during the fifth pattern learning.
[0070]
As shown in FIG. 36, the error at the time of learning is almost 0.0 as shown by the curve L81.
[0071]
As shown by the curve L82 in FIG. 37, the target is a deformed (distorted) sine wave different from the beautiful sine wave shown by the curve L72 in FIG.
[0072]
As shown in FIG. 38, an output represented by a curve L83 corresponding to the target shown in FIG. 37 is obtained.
[0073]
At the time of this learning, one of the obtained parametric bias values converges to 0.12 as shown by the curve L84, and the other value becomes approximately 0.60 as shown by the curve L85. Has converged.
[0074]
After learning the two time-series patterns shown in FIGS. 33 and 37 for the RNN1, a parametric bias (0.12, 0.68) is input to the parametric bias node 11-2. As a result, an output represented by a curve L91 was obtained.
[0075]
Similarly, when a parametric bias (0.25, 0.59) is applied, an output indicated by a curve L92 in FIG. 41 is obtained, and when a parametric bias (0.50, 0.60) is applied, FIG. As a result, an output represented by a curve L93 was obtained.
[0076]
For comparison with the outputs of FIGS. 40 to 42, the outputs when parametric bias (1.00, 0.79) or parametric bias (0.12, 0.60) are given are shown in FIGS. 43 and 44. It is shown. That is, the output of FIG. 43 is the same as the output of FIG. 34, and the output of FIG. 44 is the same as the output of FIG.
[0077]
Thus, in the case of this example, the non-linear calculation processing is performed in accordance with the parameters.
[0078]
FIG. 45 shows a change in the period corresponding to the parametric bias. FIG. 46 similarly shows a change in amplitude corresponding to a parametric bias. In these figures, as in the case of FIGS. 27 and 28, the same density represents the same cycle or amplitude. In FIGS. 45 and 46, A1, A2, B1, B2, and B3 plot the parameters of FIGS. 43, 44, 30, 41, and 42, respectively.
[0079]
As is clear from FIGS. 45 and 46, even when the non-linear calculation processing is performed, the period and the amplitude change almost smoothly in accordance with the parameters.
[0080]
In this way, after learning a plurality of time-series patterns, by inputting predetermined parameters, a new time-series pattern different from the learned time-series pattern is generated and output by linear or nonlinear interpolation. can do.
[0081]
Note that the parametric bias is a parameter for switching the spatiotemporal pattern generated by the RNN 1. The relationship between this parameter and the spatiotemporal pattern generated thereby is not given in advance, but is determined by learning the teaching spatiotemporal pattern.
[0082]
INDUSTRIAL APPLICABILITY The present invention can be applied to, for example, "imitation movement learning" in a humanoid robot. This allows the robot to not only learn multiple movement patterns (such as dance) taught by humans and reproduce those movements, but also extract features common to multiple teaching movement patterns, It is possible to generate a new motion pattern based on the motion pattern.
[0083]
The series of processes described above can be executed by hardware, but can also be executed by software. In this case, for example, a personal computer 160 as shown in FIG. 47 is used.
[0084]
In FIG. 47, a CPU (Central Processing Unit) 161 executes various processes according to a program stored in a ROM (Read Only Memory) 162 or a program loaded from a storage unit 168 into a RAM (Random Access Memory) 163. . The RAM 163 also appropriately stores data necessary for the CPU 161 to execute various processes.
[0085]
The CPU 161, the ROM 162, and the RAM 163 are interconnected via a bus 164. An input / output interface 165 is also connected to the bus 164.
[0086]
The input / output interface 165 includes an input unit 166 including a keyboard and a mouse, a display including a CRT and an LCD, an output unit 167 including a speaker, a storage unit 168 including a hard disk, a modem, a terminal adapter, and the like. The configured communication unit 169 is connected. The communication unit 169 performs communication processing via a network.
[0087]
A drive 170 is connected to the input / output interface 165 as necessary, and a magnetic disk 171, an optical disk 172, a magneto-optical disk 173, a semiconductor memory 174, or the like is appropriately mounted. It is installed in the storage unit 168 as needed.
[0088]
When a series of processing is executed by software, a program constituting the software is installed in the personal computer 160 from a network or a recording medium.
[0089]
As shown in FIG. 47, the recording medium is a magnetic disk 171 (including a floppy disk) on which the program is recorded and an optical disk 172 (including a floppy disk) which are distributed to provide the program to the user separately from the apparatus main body. It is configured by a package medium including a CD-ROM (Compact Disk-Read Only Memory), a DVD (including a Digital Versatile Disk), a magneto-optical disk 173 (including an MD (Mini-Disk)), or a semiconductor memory 174. In addition, it is configured by a ROM 162 in which a program is recorded, which is provided to the user in a state where the program is incorporated in the apparatus main body in advance, a hard disk included in the storage unit 168, and the like.
[0090]
In this specification, the steps of describing a program recorded on a recording medium may be performed in chronological order according to the described order, but may not be performed in chronological order. This also includes processes executed individually.
[0091]
Also, in this specification, a system refers to an entire device including a plurality of devices.
[0092]
【The invention's effect】
As described above, according to the present invention, a time-series pattern can be generated. Further, the time series pattern to be generated can be a new time series pattern.
[Brief description of the drawings]
FIG. 1 is a diagram showing a configuration of a recurrent neural network to which the present invention is applied.
FIG. 2 is a flowchart illustrating a learning process of the recurrent neural network of FIG. 1;
FIG. 3 is a flowchart illustrating a coefficient setting process of the recurrent neural network of FIG. 1;
FIG. 4 is a diagram illustrating an example of a time-series pattern having different amplitudes and the same synchronization.
FIG. 5 is a diagram showing an example of a time-series pattern having different periods and the same amplitude.
FIG. 6 is a diagram illustrating an example of a learning pattern.
FIG. 7 is a diagram illustrating an example of a learning pattern.
FIG. 8 is a flowchart illustrating a time-series pattern generation process of the recurrent neural network of FIG. 1;
FIG. 9 is a diagram illustrating an example of a time-series pattern to be generated.
FIG. 10 is a diagram illustrating a change in an error when a first pattern is learned.
FIG. 11 is a diagram showing a target in the case of learning a first pattern.
FIG. 12 is a diagram showing an output when a first pattern is learned.
FIG. 13 is a diagram showing a change in a parametric bias when the first pattern is learned.
FIG. 14 is a diagram illustrating a change in an error when a second pattern is learned.
FIG. 15 is a diagram showing a target in the case of learning a second pattern.
FIG. 16 is a diagram showing an output when a second pattern is learned.
FIG. 17 is a diagram showing a change in a parametric bias when a second pattern is learned.
FIG. 18 is a diagram showing a change in an error when a third pattern is learned.
FIG. 19 is a diagram showing a target in the case of learning a third pattern.
FIG. 20 is a diagram showing an output when a third pattern is learned.
FIG. 21 is a diagram illustrating a change in a parametric bias when a third pattern is learned.
FIG. 22 is a diagram illustrating an example of a generated pattern.
FIG. 23 is a diagram showing another example of the generated pattern.
FIG. 24 is a diagram showing an output corresponding to FIG. 12;
FIG. 25 is a diagram showing an output corresponding to FIG. 16;
FIG. 26 is a diagram showing an output corresponding to FIG. 20;
FIG. 27 is a diagram showing a relationship between a change in the period of the patterns of FIGS. 22 to 26 and a parametric bias.
FIG. 28 is a diagram showing a relationship between a change in the amplitude of the patterns of FIGS. 22 to 26 and a parametric bias.
FIG. 29 is a diagram showing an example of a pattern to be learned.
FIG. 30 is a diagram showing another example of a pattern to be learned.
FIG. 31 is a diagram showing a pattern generated when the patterns shown in FIGS. 29 and 30 are learned.
FIG. 32 is a diagram showing a change in an error when a fourth pattern is learned.
FIG. 33 is a diagram showing targets when learning a fourth pattern.
FIG. 34 is a diagram showing an output when a fourth pattern is learned.
FIG. 35 is a diagram showing a change in a parametric bias when a fourth pattern is learned.
FIG. 36 is a diagram showing a change in an error when a fifth pattern is learned.
FIG. 37 is a diagram showing a target in the case of learning a fifth pattern.
FIG. 38 is a diagram showing an output when a fifth pattern is learned.
FIG. 39 is a diagram showing a change in a parametric bias when a fifth pattern is learned.
FIG. 40 is a diagram illustrating an example of a generated pattern.
FIG. 41 is a diagram illustrating an example of a generated pattern.
FIG. 42 is a diagram illustrating an example of a generated pattern.
FIG. 43 is a diagram showing an output corresponding to FIG. 34;
FIG. 44 is a diagram showing an output corresponding to FIG. 38.
FIG. 45 is a diagram showing the relationship between the period of the patterns of FIGS. 40 to 44 and the parametric bias.
FIG. 46 is a diagram showing the relationship between the amplitudes of the patterns of FIGS. 40 to 44 and the parametric bias.
FIG. 47 is a block diagram illustrating a configuration example of a personal computer to which the present invention has been applied.
[Explanation of symbols]
Reference Signs List 1 recurrent neural network, 11 input layer, 12 hidden layer, 13 output layer, 21 operation unit

Claims

In an information processing device that outputs a time-series pattern,
Input means for inputting a time-series pattern;
Model determination means for determining a model based on a common non-linear dynamical system having one or more externally operable feature amount parameters for each of the plurality of time-series patterns input from the input means;
Calculating means for calculating a value of the feature parameter based on the determined model;
An output that outputs a new time-series pattern by setting a value different from the value calculated by the calculation unit as the feature amount parameter and performing a reverse operation of the calculation that calculates the value of the feature amount parameter And an information processing apparatus.

The information processing apparatus according to claim 1, wherein the nonlinear dynamic system is a recurrent neural network with operation parameters.

The information processing apparatus according to claim 1, wherein the feature parameter represents a dynamic structure of the time-series pattern in a nonlinear dynamic system.

The information processing apparatus according to claim 3, wherein the output unit outputs a new time-series pattern having a dynamic structure that can be shared with the plurality of input time-series patterns.

In an information processing method of an information processing device that outputs a time-series pattern,
An input step for inputting a time-series pattern;
For each of the plurality of time-series patterns input in the processing of the input step, a model determining step of determining a model by a common nonlinear dynamical system having one or more externally operable feature amount parameters,
A calculating step of calculating a value of the feature amount parameter based on the determined model;
A new time series pattern is output by setting a value different from the value calculated by the processing in the calculation step as the feature amount parameter and performing the inverse calculation of the calculation in which the value of the feature amount parameter is calculated. An information processing method.

A program of an information processing device that outputs a time-series pattern,
An input step for inputting a time-series pattern;
For each of the plurality of time-series patterns input in the processing of the input step, a model determining step of determining a model by a common nonlinear dynamical system having one or more externally operable feature amount parameters,
A calculating step of calculating a value of the feature amount parameter based on the determined model;
A new time series pattern is output by setting a value different from the value calculated by the processing in the calculation step as the feature amount parameter and performing the inverse calculation of the calculation in which the value of the feature amount parameter is calculated. And a program storage medium storing a computer-readable program.

A computer program for controlling an information processing device that outputs a time-series pattern,
An input step for inputting a time-series pattern;
For each of the plurality of time-series patterns input in the processing of the input step, a model determining step of determining a model by a common nonlinear dynamical system having one or more externally operable feature amount parameters,
A calculating step of calculating a value of the feature amount parameter based on the determined model;
A new time series pattern is output by setting a value different from the value calculated by the processing in the calculation step as the feature amount parameter and performing the inverse calculation of the calculation in which the value of the feature amount parameter is calculated. And an output step of performing the following.